1
|
Sun J, Kulandaisamy A, Ru J, Gromiha MM, Cribbs AP. TMKit: a Python interface for computational analysis of transmembrane proteins. Brief Bioinform 2023; 24:bbad288. [PMID: 37594311 PMCID: PMC10516361 DOI: 10.1093/bib/bbad288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 07/07/2023] [Accepted: 07/18/2023] [Indexed: 08/19/2023] Open
Abstract
Transmembrane proteins are receptors, enzymes, transporters and ion channels that are instrumental in regulating a variety of cellular activities, such as signal transduction and cell communication. Despite tremendous progress in computational capacities to support protein research, there is still a significant gap in the availability of specialized computational analysis toolkits for transmembrane protein research. Here, we introduce TMKit, an open-source Python programming interface that is modular, scalable and specifically designed for processing transmembrane protein data. TMKit is a one-stop computational analysis tool for transmembrane proteins, enabling users to perform database wrangling, engineer features at the mutational, domain and topological levels, and visualize protein-protein interaction interfaces. In addition, TMKit includes seqNetRR, a high-performance computing library that allows customized construction of a large number of residue connections. This library is particularly well suited for assigning correlation matrix-based features at a fast speed. TMKit should serve as a useful tool for researchers in assisting the study of transmembrane protein sequences and structures. TMKit is publicly available through https://github.com/2003100127/tmkit and https://tmkit-guide.herokuapp.com/doc/overview.
Collapse
Affiliation(s)
- Jianfeng Sun
- Nuffield Department of Orthopedics, Rheumatology, and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Headington, Oxford OX3 7LD, UK
| | - Arulsamy Kulandaisamy
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Jinlong Ru
- Chair of Prevention of Microbial Diseases, School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of BioSciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Adam P Cribbs
- Nuffield Department of Orthopedics, Rheumatology, and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Headington, Oxford OX3 7LD, UK
| |
Collapse
|
2
|
Lin P, Yan Y, Tao H, Huang SY. Deep transfer learning for inter-chain contact predictions of transmembrane protein complexes. Nat Commun 2023; 14:4935. [PMID: 37582780 PMCID: PMC10427616 DOI: 10.1038/s41467-023-40426-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 07/21/2023] [Indexed: 08/17/2023] Open
Abstract
Membrane proteins are encoded by approximately a quarter of human genes. Inter-chain residue-residue contact information is important for structure prediction of membrane protein complexes and valuable for understanding their molecular mechanism. Although many deep learning methods have been proposed to predict the intra-protein contacts or helix-helix interactions in membrane proteins, it is still challenging to accurately predict their inter-chain contacts due to the limited number of transmembrane proteins. Addressing the challenge, here we develop a deep transfer learning method for predicting inter-chain contacts of transmembrane protein complexes, named DeepTMP, by taking advantage of the knowledge pre-trained from a large data set of non-transmembrane proteins. DeepTMP utilizes a geometric triangle-aware module to capture the correct inter-chain interaction from the coevolution information generated by protein language models. DeepTMP is extensively evaluated on a test set of 52 self-associated transmembrane protein complexes, and compared with state-of-the-art methods including DeepHomo2.0, CDPred, GLINTER, DeepHomo, and DNCON2_Inter. It is shown that DeepTMP considerably improves the precision of inter-chain contact prediction and outperforms the existing approaches in both accuracy and robustness.
Collapse
Affiliation(s)
- Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Huanyu Tao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| |
Collapse
|
3
|
Paradis JS, Feng X, Murat B, Jefferson RE, Sokrat B, Szpakowska M, Hogue M, Bergkamp ND, Heydenreich FM, Smit MJ, Chevigné A, Bouvier M, Barth P. Computationally designed GPCR quaternary structures bias signaling pathway activation. Nat Commun 2022; 13:6826. [PMID: 36369272 PMCID: PMC9652377 DOI: 10.1038/s41467-022-34382-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 10/24/2022] [Indexed: 11/13/2022] Open
Abstract
Communication across membranes controls critical cellular processes and is achieved by receptors translating extracellular signals into selective cytoplasmic responses. While receptor tertiary structures can be readily characterized, receptor associations into quaternary structures are challenging to study and their implications in signal transduction remain poorly understood. Here, we report a computational approach for predicting receptor self-associations, and designing receptor oligomers with various quaternary structures and signaling properties. Using this approach, we designed chemokine receptor CXCR4 dimers with reprogrammed binding interactions, conformations, and abilities to activate distinct intracellular signaling proteins. In agreement with our predictions, the designed CXCR4s dimerized through distinct conformations and displayed different quaternary structural changes upon activation. Consistent with the active state models, all engineered CXCR4 oligomers activated the G protein Gi, but only specific dimer structures also recruited β-arrestins. Overall, we demonstrate that quaternary structures represent an important unforeseen mechanism of receptor biased signaling and reveal the existence of a bias switch at the dimer interface of several G protein-coupled receptors including CXCR4, mu-Opioid and type-2 Vasopressin receptors that selectively control the activation of G proteins vs β-arrestin-mediated pathways. The approach should prove useful for predicting and designing receptor associations to uncover and reprogram selective cellular signaling functions.
Collapse
Affiliation(s)
- Justine S Paradis
- Department of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, H3T 1J4, Canada
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montréal, QC, H3T 1J4, Canada
| | - Xiang Feng
- Interfaculty Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
- Department of Structural Biology, Van Andel Institute, Grand Rapids, MI, USA
| | - Brigitte Murat
- Department of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, H3T 1J4, Canada
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montréal, QC, H3T 1J4, Canada
| | - Robert E Jefferson
- Interfaculty Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland
| | - Badr Sokrat
- Department of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, H3T 1J4, Canada
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montréal, QC, H3T 1J4, Canada
| | - Martyna Szpakowska
- Department of Infection and Immunity, Immuno-Pharmacology and Interactomics, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg
| | - Mireille Hogue
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montréal, QC, H3T 1J4, Canada
| | - Nick D Bergkamp
- Amsterdam Institute for Molecules, Medicines and Systems (AIMMS), Division of Medicinal Chemistry, Faculty of Sciences, Vrije Universiteit, Amsterdam, The Netherlands
| | - Franziska M Heydenreich
- Department of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, H3T 1J4, Canada
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montréal, QC, H3T 1J4, Canada
| | - Martine J Smit
- Amsterdam Institute for Molecules, Medicines and Systems (AIMMS), Division of Medicinal Chemistry, Faculty of Sciences, Vrije Universiteit, Amsterdam, The Netherlands
| | - Andy Chevigné
- Department of Infection and Immunity, Immuno-Pharmacology and Interactomics, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg
| | - Michel Bouvier
- Department of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, H3T 1J4, Canada.
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montréal, QC, H3T 1J4, Canada.
| | - Patrick Barth
- Interfaculty Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne, Lausanne, CH-1015, Switzerland.
| |
Collapse
|
4
|
Molecular mechanism of interaction between fatty acid delta 6 desaturase and acyl-CoA by computational prediction. AMB Express 2022; 12:69. [PMID: 35680699 PMCID: PMC9184693 DOI: 10.1186/s13568-022-01410-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 05/27/2022] [Indexed: 11/17/2022] Open
Abstract
Enzyme catalyzed desaturation of intracellular fatty acids plays an important role in various physiological and pathological processes related to lipids. Limited to the multiple transmembrane domains, it is difficult to obtain their three-dimensional structure of fatty acid desaturases. So how they interact with their substrates is unclear. Here, we predicted the complex of Micromonas pusilla delta 6 desaturase (MpFADS6) with the substrate linoleinyl-CoA (ALA-CoA) by trRosetta software and docking poses by Dock 6 software. The potential enzyme–substrate binding sites were anchored by analysis of the complex. Then, site-directed mutagenesis and activity verification clarified that W290, W224, and F352 were critical residues of the substrate tunnel and directly bonded to ALA-CoA. H94 and H69 were indispensable for transporting electrons with heme. H452, N445, and H358 significantly influenced the recognition and attraction of MpFADS6 to the substrate. These findings provide new insights and methods to determine the structure, mechanisms and directed transformation of membrane-bound desaturases. The structure of the Δ6 fatty acid desaturase and substrate complex is modeled. The substrate tunnel and key residues of MpFADS6 catalytic activity are determined. The new insights to determine the mechanism of the membrane-bound desaturases.
Collapse
|
5
|
Rudden LSP, Hijazi M, Barth P. Deep learning approaches for conformational flexibility and switching properties in protein design. Front Mol Biosci 2022; 9:928534. [PMID: 36032687 PMCID: PMC9399439 DOI: 10.3389/fmolb.2022.928534] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 07/15/2022] [Indexed: 11/30/2022] Open
Abstract
Following the hugely successful application of deep learning methods to protein structure prediction, an increasing number of design methods seek to leverage generative models to design proteins with improved functionality over native proteins or novel structure and function. The inherent flexibility of proteins, from side-chain motion to larger conformational reshuffling, poses a challenge to design methods, where the ideal approach must consider both the spatial and temporal evolution of proteins in the context of their functional capacity. In this review, we highlight existing methods for protein design before discussing how methods at the forefront of deep learning-based design accommodate flexibility and where the field could evolve in the future.
Collapse
Affiliation(s)
| | | | - Patrick Barth
- *Correspondence: Lucas S. P. Rudden, ; Patrick Barth,
| |
Collapse
|
6
|
Sica MP, Smulski CR. Coarse Grained Molecular Dynamic Simulations for the Study of TNF Receptor Family Members' Transmembrane Organization. Front Cell Dev Biol 2021; 8:577278. [PMID: 33553138 PMCID: PMC7859260 DOI: 10.3389/fcell.2020.577278] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Accepted: 12/21/2020] [Indexed: 11/21/2022] Open
Abstract
The Tumor Necrosis Factor (TNF) and the TNF receptor (TNFR) superfamilies are composed of 19 ligands and 30 receptors, respectively. The oligomeric properties of ligands, both membrane bound and soluble, has been studied most. However, less is known about the oligomeric properties of TNFRs. Earlier reports identified the extracellular, membrane-distal, cysteine-rich domain as a pre-ligand assembly domain which stabilizes receptor dimers and/or trimers in the absence of ligand. Nevertheless, recent reports based on structural nuclear magnetic resonance (NMR) highlight the intrinsic role of the transmembrane domains to form dimers (p75NTR), trimers (Fas), or dimers of trimers (DR5). Thus, understanding the structural basis of transmembrane oligomerization may shed light on the mechanism for signal transduction and the impact of disease-associated mutations in this region. To this end, here we used an in silico coarse grained molecular dynamics approach with Martini force field to study TNFR transmembrane homotypic interactions. We have first validated this approach studying the three TNFR described by NMR (p75NTR, Fas, and DR5). We have simulated membrane patches composed of 36 helices of the same receptor equidistantly distributed in order to get unbiassed information on spontaneous proteins assemblies. Good agreement was found in the specific residues involved in homotypic interactions and we were able to observe dimers, trimers, and higher-order oligomers corresponding to those reported in NMR experiments. We have, applied this approach to study the assembly of disease-related mutations being able to assess their impact on oligomerization stability. In conclusion, our results showed the usefulness of coarse grained simulations with Martini force field to study in an unbiased manner higher order transmembrane oligomerization.
Collapse
Affiliation(s)
- Mauricio P Sica
- Instituto de Energía y Desarrollo Sustentable, Centro Atómico Bariloche, Comisión Nacional de Energía Atómica (CNEA), San Carlos de Bariloche, Argentina.,Medical Physics Department, Centro Atómico Bariloche, Comisión Nacional de Energía Atómica (CNEA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), San Carlos de Bariloche, Argentina
| | - Cristian R Smulski
- Medical Physics Department, Centro Atómico Bariloche, Comisión Nacional de Energía Atómica (CNEA), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), San Carlos de Bariloche, Argentina
| |
Collapse
|
7
|
Xiao Y, Zeng B, Berner N, Frishman D, Langosch D, George Teese M. Experimental determination and data-driven prediction of homotypic transmembrane domain interfaces. Comput Struct Biotechnol J 2020; 18:3230-3242. [PMID: 33209210 PMCID: PMC7649602 DOI: 10.1016/j.csbj.2020.09.035] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 09/22/2020] [Accepted: 09/24/2020] [Indexed: 12/22/2022] Open
Abstract
Homotypic TMD interfaces identified by different techniques share strong similarities. The GxxxG motif is the feature most strongly associated with interfaces. Other features include conservation, polarity, coevolution, and depth in the membrane The role of each of each feature strongly depends on the individual protein. Machine-learning helps predict interfaces from evolutionary sequence data
Interactions between their transmembrane domains (TMDs) frequently support the assembly of single-pass membrane proteins to non-covalent complexes. Yet, the TMD-TMD interactome remains largely uncharted. With a view to predicting homotypic TMD-TMD interfaces from primary structure, we performed a systematic analysis of their physical and evolutionary properties. To this end, we generated a dataset of 50 self-interacting TMDs. This dataset contains interfaces of nine TMDs from bitopic human proteins (Ire1, Armcx6, Tie1, ATP1B1, PTPRO, PTPRU, PTPRG, DDR1, and Siglec7) that were experimentally identified here and combined with literature data. We show that interfacial residues of these homotypic TMD-TMD interfaces tend to be more conserved, coevolved and polar than non-interfacial residues. Further, we suggest for the first time that interface positions are deficient in β-branched residues, and likely to be located deep in the hydrophobic core of the membrane. Overrepresentation of the GxxxG motif at interfaces is strong, but that of (small)xxx(small) motifs is weak. The multiplicity of these features and the individual character of TMD-TMD interfaces, as uncovered here, prompted us to train a machine learning algorithm. The resulting prediction method, THOIPA (www.thoipa.org), excels in the prediction of key interface residues from evolutionary sequence data.
Collapse
Affiliation(s)
- Yao Xiao
- Center for Integrated Protein Science Munich (CIPSM) at the Lehrstuhl für Chemie der Biopolymere, Technische Universität München, Weihenstephaner Berg 3, 85354 Freising, Germany
| | - Bo Zeng
- Department of Bioinformatics, Wissenschaftszentrum, Weihenstephan, Maximus-von-Imhof-Forum 3, Freising 85354, Germany
| | - Nicola Berner
- Center for Integrated Protein Science Munich (CIPSM) at the Lehrstuhl für Chemie der Biopolymere, Technische Universität München, Weihenstephaner Berg 3, 85354 Freising, Germany
| | - Dmitrij Frishman
- Department of Bioinformatics, Wissenschaftszentrum, Weihenstephan, Maximus-von-Imhof-Forum 3, Freising 85354, Germany.,Department of Bioinformatics, Peter the Great Saint Petersburg Polytechnic University, St. Petersburg 195251, Russian Federation
| | - Dieter Langosch
- Center for Integrated Protein Science Munich (CIPSM) at the Lehrstuhl für Chemie der Biopolymere, Technische Universität München, Weihenstephaner Berg 3, 85354 Freising, Germany
| | - Mark George Teese
- Center for Integrated Protein Science Munich (CIPSM) at the Lehrstuhl für Chemie der Biopolymere, Technische Universität München, Weihenstephaner Berg 3, 85354 Freising, Germany.,TNG Technology Consulting GmbH, Beta-Straße 13a, 85774 Unterföhring, Germany
| |
Collapse
|
8
|
Šimčíková D, Heneberg P. Refinement of evolutionary medicine predictions based on clinical evidence for the manifestations of Mendelian diseases. Sci Rep 2019; 9:18577. [PMID: 31819097 PMCID: PMC6901466 DOI: 10.1038/s41598-019-54976-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 11/21/2019] [Indexed: 12/28/2022] Open
Abstract
Prediction methods have become an integral part of biomedical and biotechnological research. However, their clinical interpretations are largely based on biochemical or molecular data, but not clinical data. Here, we focus on improving the reliability and clinical applicability of prediction algorithms. We assembled and curated two large non-overlapping large databases of clinical phenotypes. These phenotypes were caused by missense variations in 44 and 63 genes associated with Mendelian diseases. We used these databases to establish and validate the model, allowing us to improve the predictions obtained from EVmutation, SNAP2 and PoPMuSiC 2.1. The predictions of clinical effects suffered from a lack of specificity, which appears to be the common constraint of all recently used prediction methods, although predictions mediated by these methods are associated with nearly absolute sensitivity. We introduced evidence-based tailoring of the default settings of the prediction methods; this tailoring substantially improved the prediction outcomes. Additionally, the comparisons of the clinically observed and theoretical variations led to the identification of large previously unreported pools of variations that were under negative selection during molecular evolution. The evolutionary variation analysis approach described here is the first to enable the highly specific identification of likely disease-causing missense variations that have not yet been associated with any clinical phenotype.
Collapse
Affiliation(s)
- Daniela Šimčíková
- Charles University, Third Faculty of Medicine, Prague, Czech Republic
| | - Petr Heneberg
- Charles University, Third Faculty of Medicine, Prague, Czech Republic.
| |
Collapse
|
9
|
Rajagopal N, Irudayanathan FJ, Nangia S. Computational Nanoscopy of Tight Junctions at the Blood-Brain Barrier Interface. Int J Mol Sci 2019; 20:E5583. [PMID: 31717316 PMCID: PMC6888702 DOI: 10.3390/ijms20225583] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 11/05/2019] [Accepted: 11/06/2019] [Indexed: 12/16/2022] Open
Abstract
The selectivity of the blood-brain barrier (BBB) is primarily maintained by tight junctions (TJs), which act as gatekeepers of the paracellular space by blocking blood-borne toxins, drugs, and pathogens from entering the brain. The BBB presents a significant challenge in designing neurotherapeutics, so a comprehensive understanding of the TJ architecture can aid in the design of novel therapeutics. Unraveling the intricacies of TJs with conventional experimental techniques alone is challenging, but recently developed computational tools can provide a valuable molecular-level understanding of TJ architecture. We employed the computational methods toolkit to investigate claudin-5, a highly expressed TJ protein at the BBB interface. Our approach started with the prediction of claudin-5 structure, evaluation of stable dimer conformations and nanoscale assemblies, followed by the impact of lipid environments, and posttranslational modifications on these claudin-5 assemblies. These led to the study of TJ pores and barriers and finally understanding of ion and small molecule transport through the TJs. Some of these in silico, molecular-level findings, will need to be corroborated by future experiments. The resulting understanding can be advantageous towards the eventual goal of drug delivery across the BBB. This review provides key insights gleaned from a series of state-of-the-art nanoscale simulations (or computational nanoscopy studies) performed on the TJ architecture.
Collapse
Affiliation(s)
| | | | - Shikha Nangia
- Department of Biomedical and Chemical Engineering, Syracuse University, Syracuse, NY 13244, USA
| |
Collapse
|
10
|
Wozniak PP, Pelc J, Skrzypecki M, Vriend G, Kotulska M. Bio-knowledge-based filters improve residue-residue contact prediction accuracy. Bioinformatics 2019; 34:3675-3683. [PMID: 29850768 DOI: 10.1093/bioinformatics/bty416] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 05/19/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation Residue-residue contact prediction through direct coupling analysis has reached impressive accuracy, but yet higher accuracy will be needed to allow for routine modelling of protein structures. One way to improve the prediction accuracy is to filter predicted contacts using knowledge about the particular protein of interest or knowledge about protein structures in general. Results We focus on the latter and discuss a set of filters that can be used to remove false positive contact predictions. Each filter depends on one or a few cut-off parameters for which the filter performance was investigated. Combining all filters while using default parameters resulted for a test set of 851 protein domains in the removal of 29% of the predictions of which 92% were indeed false positives. Availability and implementation All data and scripts are available at http://comprec-lin.iiar.pwr.edu.pl/FPfilter/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- P P Wozniak
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - J Pelc
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - M Skrzypecki
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - G Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - M Kotulska
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| |
Collapse
|
11
|
Weinstein JY, Elazar A, Fleishman SJ. A lipophilicity-based energy function for membrane-protein modelling and design. PLoS Comput Biol 2019; 15:e1007318. [PMID: 31461441 PMCID: PMC6736313 DOI: 10.1371/journal.pcbi.1007318] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 09/10/2019] [Accepted: 08/01/2019] [Indexed: 01/14/2023] Open
Abstract
Membrane-protein design is an exciting and increasingly successful research area which has led to landmarks including the design of stable and accurate membrane-integral proteins based on coiled-coil motifs. Design of topologically more complex proteins, such as most receptors, channels, and transporters, however, demands an energy function that balances contributions from intra-protein contacts and protein-membrane interactions. Recent advances in water-soluble all-atom energy functions have increased the accuracy in structure-prediction benchmarks. The plasma membrane, however, imposes different physical constraints on protein solvation. To understand these constraints, we recently developed a high-throughput experimental screen, called dsTβL, and inferred apparent insertion energies for each amino acid at dozens of positions across the bacterial plasma membrane. Here, we express these profiles as lipophilicity energy terms in Rosetta and demonstrate that the new energy function outperforms previous ones in modelling and design benchmarks. Rosetta ab initio simulations starting from an extended chain recapitulate two-thirds of the experimentally determined structures of membrane-spanning homo-oligomers with <2.5Å root-mean-square deviation within the top-predicted five models (available online: http://tmhop.weizmann.ac.il). Furthermore, in two sequence-design benchmarks, the energy function improves discrimination of stabilizing point mutations and recapitulates natural membrane-protein sequences of known structure, thereby recommending this new energy function for membrane-protein modelling and design.
Collapse
Affiliation(s)
| | - Assaf Elazar
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Sarel Jacob Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
12
|
Schott-Verdugo S, Müller L, Classen E, Gohlke H, Groth G. Structural Model of the ETR1 Ethylene Receptor Transmembrane Sensor Domain. Sci Rep 2019; 9:8869. [PMID: 31222090 PMCID: PMC6586836 DOI: 10.1038/s41598-019-45189-w] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Accepted: 06/03/2019] [Indexed: 01/14/2023] Open
Abstract
The structure, mechanism of action and copper stoichiometry of the transmembrane sensor domain of the plant ethylene receptor ETR1 and homologs have remained elusive, hampering the understanding on how the perception of the plant hormone ethylene is transformed into a downstream signal. We generated the first structural model of the transmembrane sensor domain of ETR1 by integrating ab initio structure prediction and coevolutionary information. To refine and independently validate the model, we determined protein-related copper stoichiometries on purified receptor preparations and explored the helix arrangement by tryptophan scanning mutagenesis. All-atom molecular dynamics simulations of the dimeric model reveal how ethylene can bind proximal to the copper ions in the receptor, illustrating the initial stages of the ethylene perception process.
Collapse
Affiliation(s)
- Stephan Schott-Verdugo
- Institute for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Centro de Bioinformática y Simulación Molecular (CBSM), Facultad de Ingeniería, Universidad de Talca, Talca, Chile
| | - Lena Müller
- Institute of Biochemical Plant Physiology, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Elisa Classen
- Institute of Biochemical Plant Physiology, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute of Hydraulic Engineering and Water Resources Management, RWTH Aachen University, Aachen, Germany
| | - Holger Gohlke
- Institute for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
- John von Neumann Institute for Computing (NIC), Jülich Supercomputing Centre (JSC) & Institute for Complex Systems - Structural Biochemistry (ICS 6), Forschungszentrum Jülich GmbH, Jülich, Germany.
- Bioeconomy Science Center, Forschungszentrum Jülich GmbH, Jülich, Germany.
| | - Georg Groth
- Institute of Biochemical Plant Physiology, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
- Bioeconomy Science Center, Forschungszentrum Jülich GmbH, Jülich, Germany.
| |
Collapse
|
13
|
Interfaces Between Alpha-helical Integral Membrane Proteins: Characterization, Prediction, and Docking. Comput Struct Biotechnol J 2019; 17:699-711. [PMID: 31303974 PMCID: PMC6603304 DOI: 10.1016/j.csbj.2019.05.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Revised: 05/20/2019] [Accepted: 05/21/2019] [Indexed: 11/28/2022] Open
Abstract
Protein-protein interaction (PPI) is an essential mechanism by which proteins perform their biological functions. For globular proteins, the molecular characteristics of such interactions have been well analyzed, and many computational tools are available for predicting PPI sites and constructing structural models of the complex. In contrast, little is known about the molecular features of the interaction between integral membrane proteins (IMPs) and few methods exist for constructing structural models of their complexes. Here, we analyze the interfaces from a non-redundant set of complexes of α-helical IMPs whose structures have been determined to a high resolution. We find that the interface is not significantly different from the rest of the surface in terms of average hydrophobicity. However, the interface is significantly better conserved and, on average, inter-subunit contacting residue pairs correlate more strongly than non-contacting pairs, especially in obligate complexes. We also develop a neural network-based method, with an area under the receiver operating characteristic curve of 0.75 and a Pearson correlation coefficient of 0.70, for predicting interface residues and their weighted contact numbers (WCNs). We further show that predicted interface residues and their WCNs can be used as restraints to reconstruct the structure α-helical IMP dimers through docking for fourteen out of a benchmark set of sixteen complexes. The RMSD100 values of the best-docked ligand subunit to its native structure are <2.5 Å for these fourteen cases. The structural analysis conducted in this work provides molecular details about the interface between α-helical IMPs and the WCN restraints represent an efficient means to score α-helical IMP docking candidates.
Collapse
Key Words
- AUC, Area under the ROC curve
- IMP, Integral membrane protein
- MAE, Mean absolute error
- MSA, Multiple sequence alignment
- Membrane protein docking
- Membrane protein interfaces
- Neural networks
- OPM, Orientations of proteins in membranes
- PCC, Pearson correlation coefficient
- PDB, Protein data bank
- PPI, Protein-protein interaction
- PPM, Positioning of proteins in membrane.
- PPV, Positive predictive value
- PSSM, Position-specific scoring matrix
- RMSD, Root-mean-square distance
- ROC, Receiver operating characteristic curve
- RSA, Relative solvent accessibility
- TNR, True negative rate
- TPR, True positive rate
- WCN, Weighted contact number
- Weighted contact numbers
Collapse
|
14
|
Zeng B, Hönigschmid P, Frishman D. Residue co-evolution helps predict interaction sites in α-helical membrane proteins. J Struct Biol 2019; 206:156-169. [DOI: 10.1016/j.jsb.2019.02.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2018] [Revised: 01/30/2019] [Accepted: 02/13/2019] [Indexed: 11/29/2022]
|
15
|
Mravic M, Hu H, Lu Z, Bennett JS, Sanders CR, Orr AW, DeGrado WF. De novo designed transmembrane peptides activating the α5β1 integrin. Protein Eng Des Sel 2018; 31:181-190. [PMID: 29992271 PMCID: PMC6151875 DOI: 10.1093/protein/gzy014] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Accepted: 05/30/2018] [Indexed: 11/12/2022] Open
Abstract
Computationally designed transmembrane α-helical peptides (CHAMP) have been used to compete for helix-helix interactions within the membrane, enabling the ability to probe the activation of the integrins αIIbβ3 and αvβ3. Here, this method is extended towards the design of CHAMP peptides that inhibit the association of the α5β1 transmembrane (TM) domains, targeting the Ala-X3-Gly motif within α5. Our previous design algorithm was performed alongside a new workflow implemented within the widely used Rosetta molecular modeling suite. Peptides from each computational approach activated integrin α5β1 but not αVβ3 in human endothelial cells. Two CHAMP peptides were shown to directly associate with an α5 TM domain peptide in detergent micelles to a similar degree as a β1 TM peptide does. By solution-state nuclear magnetic resonance, one of these CHAMP peptides was shown to bind primarily the integrin β1 TM domain, which itself has a Gly-X3-Gly motif. The second peptide associated modestly with both α5 and β1 constructs, with slight preference for α5. Although the design goal was not fully realized, this work characterizes novel CHAMP peptides activating α5β1 that can serve as useful reagents for probing integrin biology.
Collapse
Affiliation(s)
- Marco Mravic
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | - Hailin Hu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
- School of Medicine, Tsinghua University, Beijing, China
| | - Zhenwei Lu
- Department of Biochemistry, Vanderbilt University School of Medicine Basic Sciences, Nashville, Tennessee, USA
| | - Joel S Bennett
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Charles R Sanders
- Department of Biochemistry, Vanderbilt University School of Medicine Basic Sciences, Nashville, Tennessee, USA
| | - A Wayne Orr
- Departments of Pathology and Translational Pathobiology, Cell Biology and Anatomy, and Physiology, Louisiana State University Health Sciences Center, Shreveport, LA, USA
| | - William F DeGrado
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| |
Collapse
|
16
|
Lesovoy DM, Mineev KS, Bragin PE, Bocharova OV, Bocharov EV, Arseniev AS. NMR relaxation parameters of methyl groups as a tool to map the interfaces of helix-helix interactions in membrane proteins. JOURNAL OF BIOMOLECULAR NMR 2017; 69:165-179. [PMID: 29063258 DOI: 10.1007/s10858-017-0146-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 10/14/2017] [Indexed: 06/07/2023]
Abstract
In the case of soluble proteins, chemical shift mapping is used to identify the intermolecular interfaces when the NOE-based calculations of spatial structure of the molecular assembly are impossible or impracticable. However, the reliability of the membrane protein interface mapping based on chemical shifts or other relevant parameters was never assessed. In the present work, we investigate the predictive power of various NMR parameters that can be used for mapping of helix-helix interfaces in dimeric TM domains. These parameters are studied on a dataset containing three structures of helical dimers obtained for two different proteins in various membrane mimetics. We conclude that the amide chemical shifts have very little predictive value, while the methyl chemical shifts could be used to predict interfaces, though with great care. We suggest an approach based on conversion of the carbon NMR relaxation parameters of methyl groups into parameters of motion, and one of such values, the characteristic time of methyl rotation, appears to be a reliable sensor of interhelix contacts in transmembrane domains. The carbon NMR relaxation parameters of methyl groups can be measured accurately and with high sensitivity and resolution, making the proposed parameter a useful tool for investigation of protein-protein interfaces even in large membrane proteins. An approach to build the models of transmembrane dimers based on perturbations of methyl parameters and TMDOCK software is suggested.
Collapse
Affiliation(s)
- D M Lesovoy
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences RAS, Str. Miklukho-Maklaya 16/10, Moscow, Russian Federation, 117997
| | - K S Mineev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences RAS, Str. Miklukho-Maklaya 16/10, Moscow, Russian Federation, 117997
- Moscow Institute of Physics and Technology, Institutsky per., 9, Dolgoprudny, Russian Federation, 141700
| | - P E Bragin
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences RAS, Str. Miklukho-Maklaya 16/10, Moscow, Russian Federation, 117997
- Lomonosov Moscow State University, Leninskiye Gory, 1, Moscow, Russian Federation, 119991
| | - O V Bocharova
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences RAS, Str. Miklukho-Maklaya 16/10, Moscow, Russian Federation, 117997
- Moscow Institute of Physics and Technology, Institutsky per., 9, Dolgoprudny, Russian Federation, 141700
| | - E V Bocharov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences RAS, Str. Miklukho-Maklaya 16/10, Moscow, Russian Federation, 117997.
- Moscow Institute of Physics and Technology, Institutsky per., 9, Dolgoprudny, Russian Federation, 141700.
- National Research Centre "Kurchatov Institute", Kurchatov Complex of NBICS-technologies, Akad. Kurchatova Sqr., 1, Moscow, Russian Federation, 123182.
| | - A S Arseniev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences RAS, Str. Miklukho-Maklaya 16/10, Moscow, Russian Federation, 117997
- Moscow Institute of Physics and Technology, Institutsky per., 9, Dolgoprudny, Russian Federation, 141700
| |
Collapse
|
17
|
Wozniak PP, Konopka BM, Xu J, Vriend G, Kotulska M. Forecasting residue-residue contact prediction accuracy. Bioinformatics 2017; 33:3405-3414. [PMID: 29036497 DOI: 10.1093/bioinformatics/btx416] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 06/22/2017] [Indexed: 11/14/2022] Open
Abstract
Motivation Apart from meta-predictors, most of today's methods for residue-residue contact prediction are based entirely on Direct Coupling Analysis (DCA) of correlated mutations in multiple sequence alignments (MSAs). These methods are on average ∼40% correct for the 100 strongest predicted contacts in each protein. The end-user who works on a single protein of interest will not know if predictions are either much more or much less correct than 40%, which is especially a problem if contacts are predicted to steer experimental research on that protein. Results We designed a regression model that forecasts the accuracy of residue-residue contact prediction for individual proteins with an average error of 7 percentage points. Contacts were predicted with two DCA methods (gplmDCA and PSICOV). The models were built on parameters that describe the MSA, the predicted secondary structure, the predicted solvent accessibility and the contact prediction scores for the target protein. Results show that our models can be also applied to the meta-methods, which was tested on RaptorX. Availability and implementation All data and scripts are available from http://comprec-lin.iiar.pwr.edu.pl/dcaQ/. Contact malgorzata.kotulska@pwr.edu.pl. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- P P Wozniak
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - B M Konopka
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - J Xu
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - G Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Centre, GA 6525, Nijmegen, The Netherlands
| | - M Kotulska
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| |
Collapse
|
18
|
Shamsi Z, Moffett AS, Shukla D. Enhanced unbiased sampling of protein dynamics using evolutionary coupling information. Sci Rep 2017; 7:12700. [PMID: 28983093 PMCID: PMC5629199 DOI: 10.1038/s41598-017-12874-7] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2016] [Accepted: 09/14/2017] [Indexed: 12/25/2022] Open
Abstract
One of the major challenges in atomistic simulations of proteins is efficient sampling of pathways associated with rare conformational transitions. Recent developments in statistical methods for computation of direct evolutionary couplings between amino acids within and across polypeptide chains have allowed for inference of native residue contacts, informing accurate prediction of protein folds and multimeric structures. In this study, we assess the use of distances between evolutionarily coupled residues as natural choices for reaction coordinates which can be incorporated into Markov state model-based adaptive sampling schemes and potentially used to predict not only functional conformations but also pathways of conformational change, protein folding, and protein-protein association. We demonstrate the utility of evolutionary couplings in sampling and predicting activation pathways of the β 2-adrenergic receptor (β 2-AR), folding of the FiP35 WW domain, and dimerization of the E. coli molybdopterin synthase subunits. We find that the time required for β 2-AR activation and folding of the WW domain are greatly diminished using evolutionary couplings-guided adaptive sampling. Additionally, we were able to identify putative molybdopterin synthase association pathways and near-crystal structure complexes from protein-protein association simulations.
Collapse
Affiliation(s)
- Zahra Shamsi
- Department of Chemical and Biomolecular Engineering, University of Illinois, Urbana, IL, 61801, USA
| | - Alexander S Moffett
- Center for Biophysics and Quantitative Biology, University of Illinois, Urbana, IL, 61801, USA
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois, Urbana, IL, 61801, USA.
- Center for Biophysics and Quantitative Biology, University of Illinois, Urbana, IL, 61801, USA.
- Department of Plant Biology, University of Illinois, Urbana, IL, 61801, USA.
- National Center for Supercomputing Applications, University of Illinois, Urbana, IL, 61801, USA.
| |
Collapse
|
19
|
Latek D. Rosetta Broker for membrane protein structure prediction: concentrative nucleoside transporter 3 and corticotropin-releasing factor receptor 1 test cases. BMC STRUCTURAL BIOLOGY 2017; 17:8. [PMID: 28774292 PMCID: PMC5543540 DOI: 10.1186/s12900-017-0078-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2017] [Accepted: 07/26/2017] [Indexed: 02/12/2023]
Abstract
Background Membrane proteins are difficult targets for structure prediction due to the limited structural data deposited in Protein Data Bank. Most computational methods for membrane protein structure prediction are based on the comparative modeling. There are only few de novo methods targeting that distinct protein family. In this work an example of such de novo method was used to structurally and functionally characterize two representatives of distinct membrane proteins families of solute carrier transporters and G protein-coupled receptors. The well-known Rosetta program and one of its protocols named Broker was used in two test cases. The first case was de novo structure prediction of three N-terminal transmembrane helices of the human concentrative nucleoside transporter 3 (hCNT3) homotrimer belonging to the solute carrier 28 family of transporters (SLC28). The second case concerned the large scale refinement of transmembrane helices of a homology model of the corticotropin-releasing factor receptor 1 (CRFR1) belonging to the G protein-coupled receptors family. Results The inward-facing model of the hCNT3 homotrimer was used to propose the functional impact of its single nucleotide polymorphisms. Additionally, the 100 ns molecular dynamics simulation of the unliganded hCNT3 model confirmed its validity and revealed mobility of the selected binding site and homotrimer interface residues. The large scale refinement of transmembrane helices of the CRFR1 homology model resulted in the significant improvement of its accuracy with respect to the crystal structure of CRFR1, especially in the binding site area. Consequently, the antagonist CP-376395 could be docked with Autodock VINA to the CRFR1 model without any steric clashes. Conclusions The presented work demonstrated that Rosetta Broker can be a versatile tool for solving various issues referring to protein biology. Two distinct examples of de novo membrane protein structure prediction presented here provided important insights into three major areas of protein biology. Namely, the dynamics of the inward-facing hCNT3 homotrimer system, the structural changes of the CRFR1 receptor upon the antagonist binding and finally, the role of single nucleotide polymorphisms in both, hCNT3 and CRFR1 proteins, were investigated. Electronic supplementary material The online version of this article (doi:10.1186/s12900-017-0078-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Dorota Latek
- Faculty of Chemistry, University of Warsaw, Pasteur St. 1, 02-093, Warsaw, Poland.
| |
Collapse
|
20
|
Alford RF, Leaver-Fay A, Jeliazkov JR, O’Meara MJ, DiMaio FP, Park H, Shapovalov MV, Renfrew PD, Mulligan VK, Kappel K, Labonte JW, Pacella MS, Bonneau R, Bradley P, Dunbrack RL, Das R, Baker D, Kuhlman B, Kortemme T, Gray JJ. The Rosetta All-Atom Energy Function for Macromolecular Modeling and Design. J Chem Theory Comput 2017; 13:3031-3048. [PMID: 28430426 PMCID: PMC5717763 DOI: 10.1021/acs.jctc.7b00125] [Citation(s) in RCA: 795] [Impact Index Per Article: 113.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Over the past decade, the Rosetta biomolecular modeling suite has informed diverse biological questions and engineering challenges ranging from interpretation of low-resolution structural data to design of nanomaterials, protein therapeutics, and vaccines. Central to Rosetta's success is the energy function: a model parametrized from small-molecule and X-ray crystal structure data used to approximate the energy associated with each biomolecule conformation. This paper describes the mathematical models and physical concepts that underlie the latest Rosetta energy function, called the Rosetta Energy Function 2015 (REF15). Applying these concepts, we explain how to use Rosetta energies to identify and analyze the features of biomolecular models. Finally, we discuss the latest advances in the energy function that extend its capabilities from soluble proteins to also include membrane proteins, peptides containing noncanonical amino acids, small molecules, carbohydrates, nucleic acids, and other macromolecules.
Collapse
Affiliation(s)
- Rebecca F. Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, Maryland 21218, United States
| | - Andrew Leaver-Fay
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, North Carolina 27599, United States
| | - Jeliazko R. Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, Maryland 21218, United States
| | - Matthew J. O’Meara
- Department of Pharmaceutical Chemistry, University of California at San Francisco, 1700 Fourth Street, San Francisco, California 94158, United States
| | - Frank P. DiMaio
- Department of Biochemistry, University of Washington, J-Wing Health Sciences Building, Box 357350, Seattle, Washington 98195, United States
| | - Hahnbeom Park
- Department of Biochemistry, University of Washington, Molecular Engineering and Sciences, Box 357350, 4000 15 Ave NE, Seattle, Washington 98195, United States
| | - Maxim V. Shapovalov
- Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, Pennsylvania 19111, United States
| | - P. Douglas Renfrew
- Department of Biology, Center for Genomics and Systems Biology, New York University, 100 Washington Square East, New York, New York 10003
- Center for Computational Biology, Flatiron Institute, Simons Foundation, 162 5 Avenue, New York, New York 10010, United States
| | - Vikram K. Mulligan
- Department of Biochemistry, University of Washington, Molecular Engineering and Sciences, Box 357350, 4000 15 Ave NE, Seattle, Washington 98195, United States
| | - Kalli Kappel
- Biophysics Program, Stanford University, 450 Serra Mall, Stanford, California 94305, United States
| | - Jason W. Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, Maryland 21218, United States
| | - Michael S. Pacella
- Department of Biomedical Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, Maryland 21218, United States
| | - Richard Bonneau
- Department of Biology, Center for Genomics and Systems Biology, New York University, 100 Washington Square East, New York, New York 10003
- Center for Computational Biology, Flatiron Institute, Simons Foundation, 162 5 Avenue, New York, New York 10010, United States
| | - Philip Bradley
- Computational Biology Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, United States
| | - Roland L. Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, Pennsylvania 19111, United States
| | - Rhiju Das
- Biophysics Program, Stanford University, 450 Serra Mall, Stanford, California 94305, United States
| | - David Baker
- Department of Biochemistry, University of Washington, Molecular Engineering and Sciences, Box 357350, 4000 15 Ave NE, Seattle, Washington 98195, United States
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, North Carolina 27599, United States
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, California 94158, United States
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 North Charles Street, Baltimore, Maryland 21218, United States
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
21
|
Toward high-resolution computational design of the structure and function of helical membrane proteins. Nat Struct Mol Biol 2017; 23:475-80. [PMID: 27273630 DOI: 10.1038/nsmb.3231] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 04/20/2016] [Indexed: 02/07/2023]
Abstract
The computational design of α-helical membrane proteins is still in its infancy but has already made great progress. De novo design allows stable, specific and active minimal oligomeric systems to be obtained. Computational reengineering can improve the stability and function of naturally occurring membrane proteins. Currently, the major hurdle for the field is the experimental characterization of the designs. The emergence of new structural methods for membrane proteins will accelerate progress.
Collapse
|
22
|
Simkovic F, Ovchinnikov S, Baker D, Rigden DJ. Applications of contact predictions to structural biology. IUCRJ 2017; 4:291-300. [PMID: 28512576 PMCID: PMC5414403 DOI: 10.1107/s2052252517005115] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Accepted: 04/03/2017] [Indexed: 06/07/2023]
Abstract
Evolutionary pressure on residue interactions, intramolecular or intermolecular, that are important for protein structure or function can lead to covariance between the two positions. Recent methodological advances allow much more accurate contact predictions to be derived from this evolutionary covariance signal. The practical application of contact predictions has largely been confined to structural bioinformatics, yet, as this work seeks to demonstrate, the data can be of enormous value to the structural biologist working in X-ray crystallo-graphy, cryo-EM or NMR. Integrative structural bioinformatics packages such as Rosetta can already exploit contact predictions in a variety of ways. The contribution of contact predictions begins at construct design, where structural domains may need to be expressed separately and contact predictions can help to predict domain limits. Structure solution by molecular replacement (MR) benefits from contact predictions in diverse ways: in difficult cases, more accurate search models can be constructed using ab initio modelling when predictions are available, while intermolecular contact predictions can allow the construction of larger, oligomeric search models. Furthermore, MR using supersecondary motifs or large-scale screens against the PDB can exploit information, such as the parallel or antiparallel nature of any β-strand pairing in the target, that can be inferred from contact predictions. Contact information will be particularly valuable in the determination of lower resolution structures by helping to assign sequence register. In large complexes, contact information may allow the identity of a protein responsible for a certain region of density to be determined and then assist in the orientation of an available model within that density. In NMR, predicted contacts can provide long-range information to extend the upper size limit of the technique in a manner analogous but complementary to experimental methods. Finally, predicted contacts can distinguish between biologically relevant interfaces and mere lattice contacts in a final crystal structure, and have potential in the identification of functionally important regions and in foreseeing the consequences of mutations.
Collapse
Affiliation(s)
- Felix Simkovic
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Box 357370, Seattle, WA 98195, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Box 357370, Seattle, WA 98195, USA
| | - Daniel J. Rigden
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| |
Collapse
|
23
|
TMDOCK: An Energy-Based Method for Modeling α-Helical Dimers in Membranes. J Mol Biol 2017; 429:390-398. [DOI: 10.1016/j.jmb.2016.09.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Revised: 08/26/2016] [Accepted: 09/02/2016] [Indexed: 11/22/2022]
|
24
|
Wozniak PP, Vriend G, Kotulska M. Correlated mutations select misfolded from properly folded proteins. Bioinformatics 2017; 33:1497-1504. [PMID: 28203707 DOI: 10.1093/bioinformatics/btx013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2016] [Accepted: 01/11/2017] [Indexed: 11/14/2022] Open
Affiliation(s)
- P P Wozniak
- Faculty of Fundamental Problems of Technology, Department of Biomedical Engineering, Wrocław University of Science and Technology, Wrocław, Poland
| | - G Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - M Kotulska
- Faculty of Fundamental Problems of Technology, Department of Biomedical Engineering, Wrocław University of Science and Technology, Wrocław, Poland
| |
Collapse
|
25
|
Lomize AL, Lomize MA, Krolicki SR, Pogozheva ID. Membranome: a database for proteome-wide analysis of single-pass membrane proteins. Nucleic Acids Res 2017; 45:D250-D255. [PMID: 27510400 PMCID: PMC5210604 DOI: 10.1093/nar/gkw712] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Revised: 07/31/2016] [Accepted: 08/04/2016] [Indexed: 12/29/2022] Open
Abstract
The Membranome database was developed to assist analysis and computational modeling of single-pass (bitopic) transmembrane (TM) proteins and their complexes by providing structural information about these proteins on a genomic scale. The database currently collects data on >6000 bitopic proteins from Homo sapiens, Arabidopsis thaliana, Dictyostelium discoideum, Saccharomyces cerevisiae, Escherichia coli and Methanocaldococcus jannaschii It presents the following data: (i) hierarchical classification of bitopic proteins into 15 functional classes, 689 structural superfamilies and 1404 families; (ii) 446 complexes of bitopic proteins with known three-dimensional (3D) structures classified into 129 families; (iii) computationally generated three-dimensional models of TM α-helices positioned in membranes; (iv) amino acid sequences, domain architecture, functional annotation and available experimental structures of bitopic proteins; (v) TM topology and intracellular localization, (vi) physical interactions between proteins from the database along with links to other resources. The database is freely accessible at http://membranome.org There is a variety of options for browsing, sorting, searching and retrieval of the content, including downloadable coordinate files of TM domains with calculated membrane boundaries.
Collapse
Affiliation(s)
- Andrei L Lomize
- Department of Medicinal Chemistry, College of Pharmacy, University of Michigan, Ann Arbor, MI 48109-1065, USA
| | | | - Shean R Krolicki
- Department of Computational Science and Engineering, University of Michigan, Ann Arbor, MI 48109-1065, USA
| | - Irina D Pogozheva
- Department of Medicinal Chemistry, College of Pharmacy, University of Michigan, Ann Arbor, MI 48109-1065, USA
| |
Collapse
|
26
|
Alcock F, Stansfeld PJ, Basit H, Habersetzer J, Baker MA, Palmer T, Wallace MI, Berks BC. Assembling the Tat protein translocase. eLife 2016; 5. [PMID: 27914200 PMCID: PMC5201420 DOI: 10.7554/elife.20718] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 11/29/2016] [Indexed: 12/18/2022] Open
Abstract
The twin-arginine protein translocation system (Tat) transports folded proteins across the bacterial cytoplasmic membrane and the thylakoid membranes of plant chloroplasts. The Tat transporter is assembled from multiple copies of the membrane proteins TatA, TatB, and TatC. We combine sequence co-evolution analysis, molecular simulations, and experimentation to define the interactions between the Tat proteins of Escherichia coli at molecular-level resolution. In the TatBC receptor complex the transmembrane helix of each TatB molecule is sandwiched between two TatC molecules, with one of the inter-subunit interfaces incorporating a functionally important cluster of interacting polar residues. Unexpectedly, we find that TatA also associates with TatC at the polar cluster site. Our data provide a structural model for assembly of the active Tat translocase in which substrate binding triggers replacement of TatB by TatA at the polar cluster site. Our work demonstrates the power of co-evolution analysis to predict protein interfaces in multi-subunit complexes. DOI:http://dx.doi.org/10.7554/eLife.20718.001
Collapse
Affiliation(s)
- Felicity Alcock
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | | | - Hajra Basit
- Department of Chemistry, University of Oxford, Oxford, United Kingdom
| | - Johann Habersetzer
- Division of Molecular Microbiology, College of Life Sciences, University of Dundee, Dundee, United Kingdom
| | - Matthew Ab Baker
- Department of Chemistry, University of Oxford, Oxford, United Kingdom
| | - Tracy Palmer
- Division of Molecular Microbiology, College of Life Sciences, University of Dundee, Dundee, United Kingdom
| | - Mark I Wallace
- Department of Chemistry, University of Oxford, Oxford, United Kingdom
| | - Ben C Berks
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
27
|
Abstract
The majority of therapeutics target membrane proteins, accessible on the surface of cells, to alter cellular signaling. Cells use membrane proteins to transduce signals into cells, transport ions and molecules, bind cells to a surface or substrate, and catalyze reactions. Newly devised technologies allow us to drug conventionally "undruggable" regions of membrane proteins, enabling modulation of protein-protein, protein-lipid, and protein-nucleic acid interactions. In this review, we survey the state of the art of high-throughput screening and rational design in drug discovery, and we evaluate the advances in biological understanding and technological capacity that will drive pharmacotherapy forward against unorthodox membrane protein targets.
Collapse
Affiliation(s)
- Hang Yin
- Department of Chemistry and Biochemistry.,BioFrontiers Institute, and.,Center of Basic Molecular Science, Department of Chemistry, Tsinghua University, Beijing 100082, China
| | - Aaron D Flynn
- BioFrontiers Institute, and.,Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, Colorado 80309; ,
| |
Collapse
|
28
|
Noel JK, Morcos F, Onuchic JN. Sequence co-evolutionary information is a natural partner to minimally-frustrated models of biomolecular dynamics. F1000Res 2016; 5. [PMID: 26918164 PMCID: PMC4755392 DOI: 10.12688/f1000research.7186.1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/21/2016] [Indexed: 11/25/2022] Open
Abstract
Experimentally derived structural constraints have been crucial to the implementation of computational models of biomolecular dynamics. For example, not only does crystallography provide essential starting points for molecular simulations but also high-resolution structures permit for parameterization of simplified models. Since the energy landscapes for proteins and other biomolecules have been shown to be minimally frustrated and therefore funneled, these structure-based models have played a major role in understanding the mechanisms governing folding and many functions of these systems. Structural information, however, may be limited in many interesting cases. Recently, the statistical analysis of residue co-evolution in families of protein sequences has provided a complementary method of discovering residue-residue contact interactions involved in functional configurations. These functional configurations are often transient and difficult to capture experimentally. Thus, co-evolutionary information can be merged with that available for experimentally characterized low free-energy structures, in order to more fully capture the true underlying biomolecular energy landscape.
Collapse
Affiliation(s)
- Jeffrey K Noel
- Center for Theoretical Biological Physics, Rice University, Houston, TX, USA; Kristallographie, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| | - Faruck Morcos
- Department of Biological Sciences, University of Texas at Dallas, Richardson, TX, USA
| | - Jose N Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, TX, USA
| |
Collapse
|
29
|
A topological and conformational stability alphabet for multipass membrane proteins. Nat Chem Biol 2016; 12:167-73. [PMID: 26780406 DOI: 10.1038/nchembio.2001] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2015] [Accepted: 11/13/2015] [Indexed: 12/27/2022]
Abstract
Multipass membrane proteins perform critical signal transduction and transport across membranes. How transmembrane helix (TMH) sequences encode the topology and conformational flexibility regulating these functions remains poorly understood. Here we describe a comprehensive analysis of the sequence-structure relationships at multiple interacting TMHs from all membrane proteins with structures in the Protein Data Bank (PDB). We found that membrane proteins can be deconstructed in interacting TMH trimer units, which mostly fold into six distinct structural classes of topologies and conformations. Each class is enriched in recurrent sequence motifs from functionally unrelated proteins, revealing unforeseen consensus and evolutionary conserved networks of stabilizing interhelical contacts. Interacting TMHs' topology and local protein conformational flexibility were remarkably well predicted in a blinded fashion from the identified binding-hotspot motifs. Our results reveal universal sequence-structure principles governing the complex anatomy and plasticity of multipass membrane proteins that may guide de novo structure prediction, design, and studies of folding and dynamics.
Collapse
|
30
|
Alford RF, Koehler Leman J, Weitzner BD, Duran AM, Tilley DC, Elazar A, Gray JJ. An Integrated Framework Advancing Membrane Protein Modeling and Design. PLoS Comput Biol 2015; 11:e1004398. [PMID: 26325167 PMCID: PMC4556676 DOI: 10.1371/journal.pcbi.1004398] [Citation(s) in RCA: 111] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 06/09/2015] [Indexed: 11/19/2022] Open
Abstract
Membrane proteins are critical functional molecules in the human body, constituting more than 30% of open reading frames in the human genome. Unfortunately, a myriad of difficulties in overexpression and reconstitution into membrane mimetics severely limit our ability to determine their structures. Computational tools are therefore instrumental to membrane protein structure prediction, consequently increasing our understanding of membrane protein function and their role in disease. Here, we describe a general framework facilitating membrane protein modeling and design that combines the scientific principles for membrane protein modeling with the flexible software architecture of Rosetta3. This new framework, called RosettaMP, provides a general membrane representation that interfaces with scoring, conformational sampling, and mutation routines that can be easily combined to create new protocols. To demonstrate the capabilities of this implementation, we developed four proof-of-concept applications for (1) prediction of free energy changes upon mutation; (2) high-resolution structural refinement; (3) protein-protein docking; and (4) assembly of symmetric protein complexes, all in the membrane environment. Preliminary data show that these algorithms can produce meaningful scores and structures. The data also suggest needed improvements to both sampling routines and score functions. Importantly, the applications collectively demonstrate the potential of combining the flexible nature of RosettaMP with the power of Rosetta algorithms to facilitate membrane protein modeling and design.
Collapse
Affiliation(s)
- Rebecca F. Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Julia Koehler Leman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Brian D. Weitzner
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Amanda M. Duran
- Center for Structural Biology, Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Drew C. Tilley
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, California, United States of America
| | - Assaf Elazar
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|
31
|
Abstract
Transmembrane (TM) helices of integral membrane proteins can facilitate strong and specific noncovalent protein-protein interactions. Mutagenesis and structural analyses have revealed numerous examples in which the interaction between TM helices of single-pass membrane proteins is dependent on a GxxxG or (small)xxx(small) motif. It is therefore tempting to use the presence of these simple motifs as an indicator of TM helix interactions. In this Current Topic review, we point out that these motifs are quite common, with more than 50% of single-pass TM domains containing a (small)xxx(small) motif. However, the actual interaction strength of motif-containing helices depends strongly on sequence context and membrane properties. In addition, recent studies have revealed several GxxxG-containing TM domains that interact via alternative interfaces involving hydrophobic, polar, aromatic, or even ionizable residues that do not form recognizable motifs. In multipass membrane proteins, GxxxG motifs can be important for protein folding, and not just oligomerization. Our current knowledge thus suggests that the presence of a GxxxG motif alone is a weak predictor of protein dimerization in the membrane.
Collapse
Affiliation(s)
- Mark G Teese
- Lehrstuhl für Chemie der Biopolymere, Technische Universität München , 85354 Freising, Germany.,Center for Integrated Protein Science Munich (CIPSM) , 81377 Munich, Germany
| | - Dieter Langosch
- Lehrstuhl für Chemie der Biopolymere, Technische Universität München , 85354 Freising, Germany.,Center for Integrated Protein Science Munich (CIPSM) , 81377 Munich, Germany
| |
Collapse
|