1
|
Yue Y, Li S, Cheng Y, Wang L, Hou T, Zhu Z, He S. Integration of molecular coarse-grained model into geometric representation learning framework for protein-protein complex property prediction. Nat Commun 2024; 15:9629. [PMID: 39511202 PMCID: PMC11544137 DOI: 10.1038/s41467-024-53583-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 10/16/2024] [Indexed: 11/15/2024] Open
Abstract
Structure-based machine learning algorithms have been utilized to predict the properties of protein-protein interaction (PPI) complexes, such as binding affinity, which is critical for understanding biological mechanisms and disease treatments. While most existing algorithms represent PPI complex graph structures at the atom-scale or residue-scale, these representations can be computationally expensive or may not sufficiently integrate finer chemical-plausible interaction details for improving predictions. Here, we introduce MCGLPPI, a geometric representation learning framework that combines graph neural networks (GNNs) with MARTINI molecular coarse-grained (CG) models to predict PPI overall properties accurately and efficiently. Extensive experiments on three types of downstream PPI property prediction tasks demonstrate that at the CG-scale, MCGLPPI achieves competitive performance compared with the counterparts at the atom- and residue-scale, but with only a third of computational resource consumption. Furthermore, CG-scale pre-training on protein domain-domain interaction structures enhances its predictive capabilities for PPI tasks. MCGLPPI offers an effective and efficient solution for PPI overall property predictions, serving as a promising tool for the large-scale analysis of biomolecular interactions.
Collapse
Affiliation(s)
- Yang Yue
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, UK
| | - Shu Li
- Macao Polytechnic University, Macao, China
| | - Yihua Cheng
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, UK
| | - Lie Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, Institute of Immunology, Zhejiang University School of Medicine, Hangzhou, China
| | - Tingjun Hou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Zexuan Zhu
- National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China.
| | - Shan He
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, UK.
- Macao Polytechnic University, Macao, China.
| |
Collapse
|
2
|
Newaz K, Schaefers C, Weisel K, Baumbach J, Frishman D. Prognostic importance of splicing-triggered aberrations of protein complex interfaces in cancer. NAR Genom Bioinform 2024; 6:lqae133. [PMID: 39328266 PMCID: PMC11426328 DOI: 10.1093/nargab/lqae133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2024] [Revised: 08/30/2024] [Accepted: 09/13/2024] [Indexed: 09/28/2024] Open
Abstract
Aberrant alternative splicing (AS) is a prominent hallmark of cancer. AS can perturb protein-protein interactions (PPIs) by adding or removing interface regions encoded by individual exons. Identifying prognostic exon-exon interactions (EEIs) from PPI interfaces can help discover AS-affected cancer-driving PPIs that can serve as potential drug targets. Here, we assessed the prognostic significance of EEIs across 15 cancer types by integrating RNA-seq data with three-dimensional (3D) structures of protein complexes. By analyzing the resulting EEI network we identified patient-specific perturbed EEIs (i.e., EEIs present in healthy samples but absent from the paired cancer samples or vice versa) that were significantly associated with survival. We provide the first evidence that EEIs can be used as prognostic biomarkers for cancer patient survival. Our findings provide mechanistic insights into AS-affected PPI interfaces. Given the ongoing expansion of available RNA-seq data and the number of 3D structurally-resolved (or confidently predicted) protein complexes, our computational framework will help accelerate the discovery of clinically important cancer-promoting AS events.
Collapse
Affiliation(s)
- Khalique Newaz
- Institute for Computational Systems Biology and Center for Data and Computing in Natural Sciences, Universität Hamburg, 22761 Hamburg, Germany
| | - Christoph Schaefers
- Department of Oncology, Hematology and Bone Marrow Transplantation with Division of Pneumology, Universitätsklinikum Hamburg-Eppendorf, 20251 Hamburg, Germany
| | - Katja Weisel
- Department of Oncology, Hematology and Bone Marrow Transplantation with Division of Pneumology, Universitätsklinikum Hamburg-Eppendorf, 20251 Hamburg, Germany
| | - Jan Baumbach
- Institute for Computational Systems Biology and Center for Data and Computing in Natural Sciences, Universität Hamburg, 22761 Hamburg, Germany
- Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - Dmitrij Frishman
- Department of Bioinformatics, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| |
Collapse
|
3
|
Yamamoto K, Nagatoishi S, Nakakido M, Kuroda D, Tsumoto K. Functional insights of Tyr37 in framework region 2 directly contributing to the binding affinities and dissociation kinetics in single-domain V HH antibodies. Biochem Biophys Res Commun 2024; 709:149839. [PMID: 38564943 DOI: 10.1016/j.bbrc.2024.149839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Accepted: 03/26/2024] [Indexed: 04/04/2024]
Abstract
Single-domain VHH antibody is regarded as one of the promising antibody classes for therapeutic and diagnostic applications. VHH antibodies have amino acids in framework region 2 that are distinct from those in conventional antibodies, such as the Val37Phe/Tyr (V37F/Y) substitution. Correlations between the residue type at position 37 and the conformation of the CDR3 in VHH antigen recognition have been previously reported. However, few studies focused on the meaning of harboring two residue types in position 37 of VHH antibodies, and the concrete roles of Y37 have been little to be elucidated. Here, we investigated the functional states of position 37 in co-crystal structures and performed analyses of three model antibodies with either F or Y at position 37. Our analysis indicates that Y at position 37 enhances the dissociation rate, which is highly correlated with drug efficacy. Our findings help to explain the molecular mechanisms that distinguish VHH antibodies from conventional antibodies.
Collapse
Affiliation(s)
- Koichi Yamamoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Satoru Nagatoishi
- The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan; Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.
| | - Makoto Nakakido
- Department of Bioengineering, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
| | - Daisuke Kuroda
- Department of Bioengineering, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan; Research Center for Drug and Vaccine Development, National Institute of Infectious Diseases, 1-23-1 Toyama, Shinjuku-ku, Tokyo, 162-8640, Japan
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan; The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan; Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.
| |
Collapse
|
4
|
Graef J, Ehrt C, Reim T, Rarey M. Database-Driven Identification of Structurally Similar Protein-Protein Interfaces. J Chem Inf Model 2024; 64:3332-3349. [PMID: 38470439 DOI: 10.1021/acs.jcim.3c01462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Analyzing the similarity of protein interfaces in protein-protein interactions gives new insights into protein function and assists in discovering new drugs. Usually, tools that assess the similarity focus on the interactions between two protein interfaces, while sometimes we only have one predicted interface. Herein, we present PiMine, a database-driven protein interface similarity search. It compares interface residues of one or two interacting chains by calculating and searching tetrahedral geometric patterns of α-carbon atoms and calculating physicochemical and shape-based similarity. On a dedicated, tailor-made dataset, we show that PiMine outperforms commonly used comparison tools in terms of early enrichment when considering interfaces of sequentially and structurally unrelated proteins. In an application example, we demonstrate its usability for protein interaction partner prediction by comparing predicted interfaces to known protein-protein interfaces.
Collapse
Affiliation(s)
- Joel Graef
- Universität Hamburg, ZBH─Center for Bioinformatics , Albert-Einstein-Ring 8-10, 22761 Hamburg, Germany
| | - Christiane Ehrt
- Universität Hamburg, ZBH─Center for Bioinformatics , Albert-Einstein-Ring 8-10, 22761 Hamburg, Germany
| | - Thorben Reim
- Universität Hamburg, ZBH─Center for Bioinformatics , Albert-Einstein-Ring 8-10, 22761 Hamburg, Germany
| | - Matthias Rarey
- Universität Hamburg, ZBH─Center for Bioinformatics , Albert-Einstein-Ring 8-10, 22761 Hamburg, Germany
| |
Collapse
|
5
|
Wang Z, Brand R, Adolf-Bryfogle J, Grewal J, Qi Y, Combs SA, Golovach N, Alford R, Rangwala H, Clark PM. EGGNet, a Generalizable Geometric Deep Learning Framework for Protein Complex Pose Scoring. ACS OMEGA 2024; 9:7471-7479. [PMID: 38405499 PMCID: PMC10882658 DOI: 10.1021/acsomega.3c04889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 01/19/2024] [Accepted: 01/23/2024] [Indexed: 02/27/2024]
Abstract
Computational prediction of molecule-protein interactions has been key for developing new molecules to interact with a target protein for therapeutics development. Previous work includes two independent streams of approaches: (1) predicting protein-protein interactions (PPIs) between naturally occurring proteins and (2) predicting binding affinities between proteins and small-molecule ligands [also known as drug-target interaction (DTI)]. Studying the two problems in isolation has limited the ability of these computational models to generalize across the PPI and DTI tasks, both of which ultimately involve noncovalent interactions with a protein target. In this work, we developed Equivariant Graph of Graphs neural Network (EGGNet), a geometric deep learning (GDL) framework, for molecule-protein binding predictions that can handle three types of molecules for interacting with a target protein: (1) small molecules, (2) synthetic peptides, and (3) natural proteins. EGGNet leverages a graph of graphs (GoG) representation constructed from the molecular structures at atomic resolution and utilizes a multiresolution equivariant graph neural network to learn from such representations. In addition, EGGNet leverages the underlying biophysics and makes use of both atom- and residue-level interactions, which improve EGGNet's ability to rank candidate poses from blind docking. EGGNet achieves competitive performance on both a public protein-small-molecule binding affinity prediction task (80.2% top 1 success rate on CASF-2016) and a synthetic protein interface prediction task (88.4% area under the precision-recall curve). We envision that the proposed GDL framework can generalize to many other protein interaction prediction problems, such as binding site prediction and molecular docking, helping accelerate protein engineering and structure-based drug development.
Collapse
Affiliation(s)
- Zichen Wang
- Amazon
Web Services, Amazon, Seattle, Washington 98109-5210, United
States
| | - Ryan Brand
- Amazon
Web Services, Amazon, Seattle, Washington 98109-5210, United
States
| | - Jared Adolf-Bryfogle
- Janssen
Biotherapeutics, Janssen Pharmaceutical
Companies of Johnson & Johnson, Spring House, Titusville, New Jersey 08560-1504, United States
| | - Jasleen Grewal
- Amazon
Web Services, Amazon, Seattle, Washington 98109-5210, United
States
| | - Yanjun Qi
- Amazon
Web Services, Amazon, Seattle, Washington 98109-5210, United
States
| | - Steven A. Combs
- Janssen
Biotherapeutics, Janssen Pharmaceutical
Companies of Johnson & Johnson, Spring House, Titusville, New Jersey 08560-1504, United States
| | - Nataliya Golovach
- Janssen
Biotherapeutics, Janssen Pharmaceutical
Companies of Johnson & Johnson, Spring House, Titusville, New Jersey 08560-1504, United States
| | - Rebecca Alford
- Janssen
Biotherapeutics, Janssen Pharmaceutical
Companies of Johnson & Johnson, Spring House, Titusville, New Jersey 08560-1504, United States
| | - Huzefa Rangwala
- Amazon
Web Services, Amazon, Seattle, Washington 98109-5210, United
States
| | - Peter M. Clark
- Janssen
Biotherapeutics, Janssen Pharmaceutical
Companies of Johnson & Johnson, Spring House, Titusville, New Jersey 08560-1504, United States
| |
Collapse
|
6
|
Xu X, Bonvin AMJJ. DeepRank-GNN-esm: a graph neural network for scoring protein-protein models using protein language model. BIOINFORMATICS ADVANCES 2024; 4:vbad191. [PMID: 38213822 PMCID: PMC10782804 DOI: 10.1093/bioadv/vbad191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 12/19/2023] [Indexed: 01/13/2024]
Abstract
Motivation Protein-Protein interactions (PPIs) play critical roles in numerous cellular processes. By modelling the 3D structures of the correspond protein complexes valuable insights can be obtained, providing, e.g. starting points for drug and protein design. One challenge in the modelling process is however the identification of near-native models from the large pool of generated models. To this end we have previously developed DeepRank-GNN, a graph neural network that integrates structural and sequence information to enable effective pattern learning at PPI interfaces. Its main features are related to the Position Specific Scoring Matrices (PSSMs), which are computationally expensive to generate, significantly limits the algorithm's usability. Results We introduce here DeepRank-GNN-esm that includes as additional features protein language model embeddings from the ESM-2 model. We show that the ESM-2 embeddings can actually replace the PSSM features at no cost in-, or even better performance on two PPI-related tasks: scoring docking poses and detecting crystal artifacts. This new DeepRank version bypasses thus the need of generating PSSM, greatly improving the usability of the software and opening new application opportunities for systems for which PSSM profiles cannot be obtained or are irrelevant (e.g. antibody-antigen complexes). Availability and implementation DeepRank-GNN-esm is freely available from https://github.com/DeepRank/DeepRank-GNN-esm.
Collapse
Affiliation(s)
- Xiaotong Xu
- Department of Chemistry, Faculty of Science, Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Utrecht 3584 CS, The Netherlands
| | - Alexandre M J J Bonvin
- Department of Chemistry, Faculty of Science, Computational Structural Biology Group, Bijvoet Centre for Biomolecular Research, Utrecht 3584 CS, The Netherlands
| |
Collapse
|
7
|
Son HF, Park W, Kim S, Kim IK, Kim KJ. Structure-based functional analysis of a novel NADPH-producing glyceraldehyde-3-phosphate dehydrogenase from Corynebacterium glutamicum. Int J Biol Macromol 2024; 255:128103. [PMID: 37992937 DOI: 10.1016/j.ijbiomac.2023.128103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 11/12/2023] [Accepted: 11/13/2023] [Indexed: 11/24/2023]
Abstract
Corynebacterium glutamicum is an industrial workhorse applied in the production of valuable biochemicals. In the process of bio-based chemical production, improving cofactor recycling and mitigating cofactor imbalance are considered major solutions for enhancing the production yield and efficiency. Although, glyceraldehyde-3-phosphate dehydrogenase (GapDH), a glycolytic enzyme, can be a promising candidate for a sufficient NADPH cofactor supply, however, most microorganisms have only NAD-dependent GapDHs. In this study, we performed functional characterization and structure determination of novel NADPH-producing GapDH from C. glutamicum (CgGapX). Based on the crystal structure of CgGapX in complex with NADP cofactor, the unique structural features of CgGapX for NADP stabilization were elucidated. Also, N-terminal additional region (Auxiliary domain, AD) appears to have an effect on enzyme stabilization. In addition, through structure-guided enzyme engineering, we developed a CgGapX variant that exhibited 4.3-fold higher kcat, and 1.2-fold higher kcat/KM values when compared with wild-type. Furthermore, a bioinformatic analysis of 100 GapX-like enzymes from 97 microorganisms in the KEGG database revealed that the GapX-like enzymes possess a variety of AD, which seem to determine enzyme stability. Our findings are expected to provide valuable information for supplying NADPH cofactor pools in bio-based value-added chemical production.
Collapse
Affiliation(s)
- Hyeoncheol Francis Son
- Clean Energy Research Center, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea
| | - Woojin Park
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, Kyungpook National University, Daegu 41566, Republic of Korea
| | - Sangwoo Kim
- KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea
| | - Il-Kwon Kim
- KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea
| | - Kyung-Jin Kim
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, Kyungpook National University, Daegu 41566, Republic of Korea; KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea.
| |
Collapse
|
8
|
Appasamy SD, Berrisford J, Gaborova R, Nair S, Anyango S, Grudinin S, Deshpande M, Armstrong D, Pidruchna I, Ellaway JIJ, Leines GD, Gupta D, Harrus D, Varadi M, Velankar S. Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data. Sci Data 2023; 10:853. [PMID: 38040737 PMCID: PMC10692154 DOI: 10.1038/s41597-023-02778-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 11/23/2023] [Indexed: 12/03/2023] Open
Abstract
Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names to over 90% of unique assemblies in the PDB and provided persistent identifiers for each assembly. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of macromolecular complexes. Furthermore, the data standardisation improves the PDB's FAIR attributes, fostering more effective basic and translational research and scientific education.
Collapse
Affiliation(s)
- Sri Devan Appasamy
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | - John Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Romana Gaborova
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Sreenath Nair
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stephen Anyango
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sergei Grudinin
- Univ. Grenoble Alpes, CNRS, Grenoble INP, LJK, 38000, Grenoble, France
| | - Mandar Deshpande
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - David Armstrong
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Ivanna Pidruchna
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Joseph I J Ellaway
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Grisell Díaz Leines
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Deepti Gupta
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Deborah Harrus
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
9
|
Ozden B, Kryshtafovych A, Karaca E. The impact of AI-based modeling on the accuracy of protein assembly prediction: Insights from CASP15. Proteins 2023; 91:1636-1657. [PMID: 37861057 PMCID: PMC10873090 DOI: 10.1002/prot.26598] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 09/12/2023] [Accepted: 09/14/2023] [Indexed: 10/21/2023]
Abstract
In CASP15, 87 predictors submitted around 11 000 models on 41 assembly targets. The community demonstrated exceptional performance in overall fold and interface contact predictions, achieving an impressive success rate of 90% (compared to 31% in CASP14). This remarkable accomplishment is largely due to the incorporation of DeepMind's AF2-Multimer approach into custom-built prediction pipelines. To evaluate the added value of participating methods, we compared the community models to the baseline AF2-Multimer predictor. In over 1/3 of cases, the community models were superior to the baseline predictor. The main reasons for this improved performance were the use of custom-built multiple sequence alignments, optimized AF2-Multimer sampling, and the manual assembly of AF2-Multimer-built subcomplexes. The best three groups, in order, are Zheng, Venclovas, and Wallner. Zheng and Venclovas reached a 73.2% success rate over all (41) cases, while Wallner attained 69.4% success rate over 36 cases. Nonetheless, challenges remain in predicting structures with weak evolutionary signals, such as nanobody-antigen, antibody-antigen, and viral complexes. Expectedly, modeling large complexes also remains challenging due to their high memory compute demands. In addition to the assembly category, we assessed the accuracy of modeling interdomain interfaces in the tertiary structure prediction targets. Models on seven targets featuring 17 unique interfaces were analyzed. Best predictors achieved a 76.5% success rate, with the UM-TBM group being the leader. In the interdomain category, we observed that the predictors faced challenges, as in the case of the assembly category, when the evolutionary signal for a given domain pair was weak or the structure was large. Overall, CASP15 witnessed unprecedented improvement in interface modeling, reflecting the AI revolution seen in CASP14.
Collapse
Affiliation(s)
- Burcu Ozden
- Izmir Biomedicine and Genome Center, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Türkiye
| | - Andriy Kryshtafovych
- Protein Structure Prediction Center, Genome and Biomedical Sciences Facilities, University of California, Davis, California, USA
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Türkiye
| |
Collapse
|
10
|
Avraham O, Tsaban T, Ben-Aharon Z, Tsaban L, Schueler-Furman O. Protein language models can capture protein quaternary state. BMC Bioinformatics 2023; 24:433. [PMID: 37964216 PMCID: PMC10647083 DOI: 10.1186/s12859-023-05549-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 10/27/2023] [Indexed: 11/16/2023] Open
Abstract
BACKGROUND Determining a protein's quaternary state, i.e. the number of monomers in a functional unit, is a critical step in protein characterization. Many proteins form multimers for their activity, and over 50% are estimated to naturally form homomultimers. Experimental quaternary state determination can be challenging and require extensive work. To complement these efforts, a number of computational tools have been developed for quaternary state prediction, often utilizing experimentally validated structural information. Recently, dramatic advances have been made in the field of deep learning for predicting protein structure and other characteristics. Protein language models, such as ESM-2, that apply computational natural-language models to proteins successfully capture secondary structure, protein cell localization and other characteristics, from a single sequence. Here we hypothesize that information about the protein quaternary state may be contained within protein sequences as well, allowing us to benefit from these novel approaches in the context of quaternary state prediction. RESULTS We generated ESM-2 embeddings for a large dataset of proteins with quaternary state labels from the curated QSbio dataset. We trained a model for quaternary state classification and assessed it on a non-overlapping set of distinct folds (ECOD family level). Our model, named QUEEN (QUaternary state prediction using dEEp learNing), performs worse than approaches that include information from solved crystal structures. However, it successfully learned to distinguish multimers from monomers, and predicts the specific quaternary state with moderate success, better than simple sequence similarity-based annotation transfer. Our results demonstrate that complex, quaternary state related information is included in such embeddings. CONCLUSIONS QUEEN is the first to investigate the power of embeddings for the prediction of the quaternary state of proteins. As such, it lays out strengths as well as limitations of a sequence-based protein language model approach, compared to structure-based approaches. Since it does not require any structural information and is fast, we anticipate that it will be of wide use both for in-depth investigation of specific systems, as well as for studies of large sets of protein sequences. A simple colab implementation is available at: https://colab. RESEARCH google.com/github/Furman-Lab/QUEEN/blob/main/QUEEN_prediction_notebook.ipynb .
Collapse
Affiliation(s)
- Orly Avraham
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Tomer Tsaban
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ziv Ben-Aharon
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Linoy Tsaban
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
- The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
11
|
Ozden B, Kryshtafovych A, Karaca E. The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.10.548341. [PMID: 37503072 PMCID: PMC10369898 DOI: 10.1101/2023.07.10.548341] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
In CASP15, 87 predictors submitted around 11,000 models on 41 assembly targets. The community demonstrated exceptional performance in overall fold and interface contact prediction, achieving an impressive success rate of 90% (compared to 31% in CASP14). This remarkable accomplishment is largely due to the incorporation of DeepMind's AF2-Multimer approach into custom-built prediction pipelines. To evaluate the added value of participating methods, we compared the community models to the baseline AF2-Multimer predictor. In over 1/3 of cases the community models were superior to the baseline predictor. The main reasons for this improved performance were the use of custom-built multiple sequence alignments, optimized AF2-Multimer sampling, and the manual assembly of AF2-Multimer-built subcomplexes. The best three groups, in order, are Zheng, Venclovas and Wallner. Zheng and Venclovas reached a 73.2% success rate over all (41) cases, while Wallner attained 69.4% success rate over 36 cases. Nonetheless, challenges remain in predicting structures with weak evolutionary signals, such as nanobody-antigen, antibody-antigen, and viral complexes. Expectedly, modeling large complexes remains also challenging due to their high memory compute demands. In addition to the assembly category, we assessed the accuracy of modeling interdomain interfaces in the tertiary structure prediction targets. Models on seven targets featuring 17 unique interfaces were analyzed. Best predictors achieved the 76.5% success rate, with the UM-TBM group being the leader. In the interdomain category, we observed that the predictors faced challenges, as in the case of the assembly category, when the evolutionary signal for a given domain pair was weak or the structure was large. Overall, CASP15 witnessed unprecedented improvement in interface modeling, reflecting the AI revolution seen in CASP14.
Collapse
Affiliation(s)
- Burcu Ozden
- Izmir Biomedicine and Genome Center, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Türkiye
| | - Andriy Kryshtafovych
- Protein Structure Prediction Center, Genome and Biomedical Sciences Facilities, University of California, Davis, California, USA
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Türkiye
| |
Collapse
|
12
|
McFee M, Kim PM. GDockScore: a graph-based protein-protein docking scoring function. BIOINFORMATICS ADVANCES 2023; 3:vbad072. [PMID: 37359726 PMCID: PMC10290236 DOI: 10.1093/bioadv/vbad072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/30/2023] [Accepted: 06/10/2023] [Indexed: 06/28/2023]
Abstract
Summary Protein complexes play vital roles in a variety of biological processes, such as mediating biochemical reactions, the immune response and cell signalling, with 3D structure specifying function. Computational docking methods provide a means to determine the interface between two complexed polypeptide chains without using time-consuming experimental techniques. The docking process requires the optimal solution to be selected with a scoring function. Here, we propose a novel graph-based deep learning model that utilizes mathematical graph representations of proteins to learn a scoring function (GDockScore). GDockScore was pre-trained on docking outputs generated with the Protein Data Bank biounits and the RosettaDock protocol, and then fine-tuned on HADDOCK decoys generated on the ZDOCK Protein Docking Benchmark. GDockScore performs similarly to the Rosetta scoring function on docking decoys generated using the RosettaDock protocol. Furthermore, state-of-the-art is achieved on the CAPRI score set, a challenging dataset for developing docking scoring functions. Availability and implementation The model implementation is available at https://gitlab.com/mcfeemat/gdockscore. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Matthew McFee
- Department of Molecular Genetics, The University of Toronto, Toronto, ON M5S 1A8, Canada
- Donnelly Centre for Cellular and Biomolecular Research, The University of Toronto, Toronto, ON M5S 3E1, Canada
| | | |
Collapse
|
13
|
Ki D, Hong H, Kim IK, Kim KJ. Crystal Structure and Functional Characterization of Acetylornithine Aminotransferase from Corynebacterium glutamicum. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023. [PMID: 37230944 DOI: 10.1021/acs.jafc.3c00659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
The amino acids l-arginine and l-ornithine are widely used in animal feed and as health supplements and pharmaceutical compounds. In arginine biosynthesis, acetylornithine aminotransferase (AcOAT) uses pyridoxal-5'-phosphate (PLP) as a cofactor for amino group transfer. Here, we determined the crystal structures of the apo and PLP complex forms of AcOAT from Corynebacterium glutamicum (CgAcOAT). Our structural observations revealed that CgAcOAT undergoes an order-to-disorder conformational change upon binding with PLP. Additionally, we observed that unlike other AcOATs, CgAcOAT exists as a tetramer. Subsequently, we identified the key residues involved in PLP and substrate binding based on structural analysis and site-directed mutagenesis. This study might provide structural insights on CgAcOAT, which can be utilized for the development of improved l-arginine production enzymes.
Collapse
Affiliation(s)
- Dongwoo Ki
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea
| | - Hwaseok Hong
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea
| | - Il-Kwon Kim
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea
| | - Kyung-Jin Kim
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, KNU Institute for Microorganisms, Kyungpook National University, Daegu 41566, Republic of Korea
| |
Collapse
|
14
|
Réau M, Renaud N, Xue LC, Bonvin AMJJ. DeepRank-GNN: a graph neural network framework to learn patterns in protein-protein interfaces. Bioinformatics 2022; 39:6845451. [PMID: 36420989 PMCID: PMC9805592 DOI: 10.1093/bioinformatics/btac759] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 10/19/2022] [Accepted: 11/23/2022] [Indexed: 11/25/2022] Open
Abstract
MOTIVATION Gaining structural insights into the protein-protein interactome is essential to understand biological phenomena and extract knowledge for rational drug design or protein engineering. We have previously developed DeepRank, a deep-learning framework to facilitate pattern learning from protein-protein interfaces using convolutional neural network (CNN) approaches. However, CNN is not rotation invariant and data augmentation is required to desensitize the network to the input data orientation which dramatically impairs the computation performance. Representing protein-protein complexes as atomic- or residue-scale rotation invariant graphs instead enables using graph neural networks (GNN) approaches, bypassing those limitations. RESULTS We have developed DeepRank-GNN, a framework that converts protein-protein interfaces from PDB 3D coordinates files into graphs that are further provided to a pre-defined or user-defined GNN architecture to learn problem-specific interaction patterns. DeepRank-GNN is designed to be highly modularizable, easily customized and is wrapped into a user-friendly python3 package. Here, we showcase DeepRank-GNN's performance on two applications using a dedicated graph interaction neural network: (i) the scoring of docking poses and (ii) the discriminating of biological and crystal interfaces. In addition to the highly competitive performance obtained in those tasks as compared to state-of-the-art methods, we show a significant improvement in speed and storage requirement using DeepRank-GNN as compared to DeepRank. AVAILABILITY AND IMPLEMENTATION DeepRank-GNN is freely available from https://github.com/DeepRank/DeepRank-GNN. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Li C Xue
- Center for Molecular and Biomolecular Informatics, Radboudumc, Nijmegen 6525 GA, The Netherlands
| | | |
Collapse
|
15
|
Goepfert A, Barske C, Lehmann S, Wirth E, Willemsen J, Gudjonsson JE, Ward NL, Sarkar MK, Hemmig R, Kolbinger F, Rondeau JM. IL-17-induced dimerization of IL-17RA drives the formation of the IL-17 signalosome to potentiate signaling. Cell Rep 2022; 41:111489. [PMID: 36260993 PMCID: PMC9637376 DOI: 10.1016/j.celrep.2022.111489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 08/20/2022] [Accepted: 09/21/2022] [Indexed: 11/30/2022] Open
Abstract
Signaling through innate immune receptors such as the Toll-like receptor (TLR)/interleukin-1 receptor (IL-1R) superfamily proceeds via the assembly of large membrane-proximal complexes or “signalosomes.” Although structurally distinct, the IL-17 receptor family triggers cellular responses that are typical of innate immune receptors. The IL-17RA receptor subunit is shared by several members of the IL-17 family. Using a combination of crystallographic, biophysical, and mutational studies, we show that IL-17A, IL-17F, and IL-17A/F induce IL-17RA dimerization. X-ray analysis of the heteromeric IL-17A complex with the extracellular domains of the IL-17RA and IL-17RC receptors reveals that cytokine-induced IL-17RA dimerization leads to the formation of a 2:2:2 hexameric signaling assembly. Furthermore, we demonstrate that the formation of the IL-17 signalosome potentiates IL-17-induced IL-36γ and CXCL1 mRNA expression in human keratinocytes, compared with a dimerization-defective IL-17RA variant. IL-17RA is the shared co-receptor for several IL-17 family members. Goepfert et al. show that IL-17 induces IL-17RA dimerization, which then drives the formation of a 2:2:2 hexameric signaling assembly with IL-17RC. Furthermore, IL-17RA dimerization potentiates IL-17 signaling in immortalized primary human keratinocytes, compared with cells expressing a dimerization-defective IL-17RA variant.
Collapse
Affiliation(s)
- Arnaud Goepfert
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | - Carmen Barske
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | - Sylvie Lehmann
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | - Emmanuelle Wirth
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | - Joschka Willemsen
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | | | - Nicole L Ward
- Department of Dermatology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Mrinal K Sarkar
- Department of Dermatology, University of Michigan, Ann Arbor, MI, USA
| | - René Hemmig
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | - Frank Kolbinger
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland
| | - Jean-Michel Rondeau
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, 4002 Basel, Switzerland.
| |
Collapse
|
16
|
Pollet L, Lambourne L, Xia Y. Structural Determinants of Yeast Protein-Protein Interaction Interface Evolution at the Residue Level. J Mol Biol 2022; 434:167750. [PMID: 35850298 DOI: 10.1016/j.jmb.2022.167750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 06/09/2022] [Accepted: 07/12/2022] [Indexed: 12/01/2022]
Abstract
Interfaces of contact between proteins play important roles in determining the proper structure and function of protein-protein interactions (PPIs). Therefore, to fully understand PPIs, we need to better understand the evolutionary design principles of PPI interfaces. Previous studies have uncovered that interfacial sites are more evolutionarily conserved than other surface protein sites. Yet, little is known about the nature and relative importance of evolutionary constraints in PPI interfaces. Here, we explore constraints imposed by the structure of the microenvironment surrounding interfacial residues on residue evolutionary rate using a large dataset of over 700 structural models of baker's yeast PPIs. We find that interfacial residues are, on average, systematically more conserved than all other residues with a similar degree of total burial as measured by relative solvent accessibility (RSA). Besides, we find that RSA of the residue when the PPI is formed is a better predictor of interfacial residue evolutionary rate than RSA in the monomer state. Furthermore, we investigate four structure-based measures of residue interfacial involvement, including change in RSA upon binding (ΔRSA), number of residue-residue contacts across the interface, and distance from the center or the periphery of the interface. Integrated modeling for evolutionary rate prediction in interfaces shows that ΔRSA plays a dominant role among the four measures of interfacial involvement, with minor, but independent contributions from other measures. These results yield insight into the evolutionary design of interfaces, improving our understanding of the role that structure plays in the molecular evolution of PPIs at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Luke Lambourne
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA; Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA; Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
17
|
Guven-Maiorov E, Sakakibara N, Ponnamperuma RM, Dong K, Matar H, King KE, Weinberg WC. Delineating functional mechanisms of the p53/p63/p73 family of transcription factors through identification of protein-protein interactions using interface mimicry. Mol Carcinog 2022; 61:629-642. [PMID: 35560453 PMCID: PMC9949960 DOI: 10.1002/mc.23405] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 01/27/2022] [Accepted: 01/31/2022] [Indexed: 11/08/2022]
Abstract
Members of the p53 family of transcription factors-p53, p63, and p73-share a high degree of homology; however, members can be activated in response to different stimuli, perform distinct (sometimes opposing) roles and are expressed in different tissues. The level of complexity is increased further by the transcription of multiple isoforms of each homolog, which may interact or interfere with each other and can impact cellular outcome. Proteins perform their functions through interacting with other proteins (and/or with nucleic acids). Therefore, identification of the interactors of a protein and how they interact in 3D is essential to fully comprehend their roles. By utilizing an in silico protein-protein interaction prediction method-HMI-PRED-we predicted interaction partners of p53 family members and modeled 3D structures of these protein interaction complexes. This method recovered experimentally known interactions while identifying many novel candidate partners. We analyzed the similarities and differences observed among the interaction partners to elucidate distinct functions of p53 family members and provide examples of how this information may yield mechanistic insight to explain their overlapping versus distinct/opposing outcomes in certain contexts. While some interaction partners are common to p53, p63, and p73, the majority are unique to each member. Nevertheless, most of the enriched pathways associated with these partners are common to all members, indicating that the members target the same biological pathways but through unique mediators. p63 and p73 have more common enriched pathways compared to p53, supporting their similar developmental roles in different tissues.
Collapse
Affiliation(s)
- Emine Guven-Maiorov
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States.,National Cancer Institute, Bethesda, MD, United States.,Postal and email addresses of corresponding authors FDA/CDER/OPQ/OBP, Building 52-72/2306, 10903 New Hampshire Avenue, Silver Spring, MD 20993, United States, ,
| | - Nozomi Sakakibara
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Roshini M. Ponnamperuma
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Kun Dong
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States.,National Cancer Institute, Bethesda, MD, United States
| | - Hector Matar
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Kathryn E. King
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Wendy C. Weinberg
- Laboratory of Molecular Oncology, Office of Biotechnology Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States.,Postal and email addresses of corresponding authors FDA/CDER/OPQ/OBP, Building 52-72/2306, 10903 New Hampshire Avenue, Silver Spring, MD 20993, United States, ,
| |
Collapse
|
18
|
Renaud N, Geng C, Georgievska S, Ambrosetti F, Ridder L, Marzella DF, Réau MF, Bonvin AMJJ, Xue LC. DeepRank: a deep learning framework for data mining 3D protein-protein interfaces. Nat Commun 2021; 12:7068. [PMID: 34862392 PMCID: PMC8642403 DOI: 10.1038/s41467-021-27396-0] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 11/12/2021] [Indexed: 11/08/2022] Open
Abstract
Three-dimensional (3D) structures of protein complexes provide fundamental information to decipher biological processes at the molecular scale. The vast amount of experimentally and computationally resolved protein-protein interfaces (PPIs) offers the possibility of training deep learning models to aid the predictions of their biological relevance. We present here DeepRank, a general, configurable deep learning framework for data mining PPIs using 3D convolutional neural networks (CNNs). DeepRank maps features of PPIs onto 3D grids and trains a user-specified CNN on these 3D grids. DeepRank allows for efficient training of 3D CNNs with data sets containing millions of PPIs and supports both classification and regression. We demonstrate the performance of DeepRank on two distinct challenges: The classification of biological versus crystallographic PPIs, and the ranking of docking models. For both problems DeepRank is competitive with, or outperforms, state-of-the-art methods, demonstrating the versatility of the framework for research in structural biology.
Collapse
Affiliation(s)
- Nicolas Renaud
- Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands
| | - Cunliang Geng
- Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands
- Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584, Utrecht, CH, The Netherlands
| | - Sonja Georgievska
- Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands
| | - Francesco Ambrosetti
- Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584, Utrecht, CH, The Netherlands
| | - Lars Ridder
- Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands
| | - Dario F Marzella
- Center for Molecular and Biomolecular Informatics, Radboudumc, Greet Grooteplein 26-28, 6525, Nijmegen, GA, The Netherlands
| | - Manon F Réau
- Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584, Utrecht, CH, The Netherlands
| | - Alexandre M J J Bonvin
- Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584, Utrecht, CH, The Netherlands.
| | - Li C Xue
- Bijvoet Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584, Utrecht, CH, The Netherlands.
- Center for Molecular and Biomolecular Informatics, Radboudumc, Greet Grooteplein 26-28, 6525, Nijmegen, GA, The Netherlands.
| |
Collapse
|
19
|
Martinez OE, Mahoney BJ, Goring AK, Yi SW, Tran DP, Cascio D, Phillips ML, Muthana MM, Chen X, Jung ME, Loo JA, Clubb RT. Insight into the molecular basis of substrate recognition by the wall teichoic acid glycosyltransferase TagA. J Biol Chem 2021; 298:101464. [PMID: 34864059 PMCID: PMC8784642 DOI: 10.1016/j.jbc.2021.101464] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 11/18/2021] [Accepted: 11/22/2021] [Indexed: 11/30/2022] Open
Abstract
Wall teichoic acid (WTA) polymers are covalently affixed to the Gram-positive bacterial cell wall and have important functions in cell elongation, cell morphology, biofilm formation, and β-lactam antibiotic resistance. The first committed step in WTA biosynthesis is catalyzed by the TagA glycosyltransferase (also called TarA), a peripheral membrane protein that produces the conserved linkage unit, which joins WTA to the cell wall peptidoglycan. TagA contains a conserved GT26 core domain followed by a C-terminal polypeptide tail that is important for catalysis and membrane binding. Here, we report the crystal structure of the Thermoanaerobacter italicus TagA enzyme bound to UDP-N-acetyl-d-mannosamine, revealing the molecular basis of substrate binding. Native MS experiments support the model that only monomeric TagA is enzymatically active and that it is stabilized by membrane binding. Molecular dynamics simulations and enzyme activity measurements indicate that the C-terminal polypeptide tail facilitates catalysis by encapsulating the UDP-N-acetyl-d-mannosamine substrate, presenting three highly conserved arginine residues to the active site that are important for catalysis (R214, R221, and R224). From these data, we present a mechanistic model of catalysis that ascribes functions for these residues. This work could facilitate the development of new antimicrobial compounds that disrupt WTA biosynthesis in pathogenic bacteria.
Collapse
Affiliation(s)
- Orlando E Martinez
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, USA
| | - Brendan J Mahoney
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, USA
| | - Andrew K Goring
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, USA
| | - Sung-Wook Yi
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA
| | - Denise P Tran
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA
| | - Duilio Cascio
- UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, USA
| | - Martin L Phillips
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA
| | - Musleh M Muthana
- Department of Chemistry, University of California, Davis, California, USA
| | - Xi Chen
- Department of Chemistry, University of California, Davis, California, USA
| | - Michael E Jung
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA; Molecular Biology Institute, University of California, Los Angeles, Los Angeles, California, USA
| | - Joseph A Loo
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, USA; Molecular Biology Institute, University of California, Los Angeles, Los Angeles, California, USA
| | - Robert T Clubb
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California, USA; Molecular Biology Institute, University of California, Los Angeles, Los Angeles, California, USA.
| |
Collapse
|
20
|
PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry. Structure 2021; 29:1303-1311.e3. [PMID: 34520740 PMCID: PMC8575123 DOI: 10.1016/j.str.2021.07.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 03/22/2021] [Accepted: 07/23/2021] [Indexed: 11/21/2022]
Abstract
An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies.
Collapse
|
21
|
Khazen G, Gyulkhandanian A, Issa T, Maroun RC. Getting to know each other: PPIMem, a novel approach for predicting transmembrane protein-protein complexes. Comput Struct Biotechnol J 2021; 19:5184-5197. [PMID: 34630938 PMCID: PMC8476896 DOI: 10.1016/j.csbj.2021.09.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/23/2021] [Accepted: 09/12/2021] [Indexed: 02/03/2023] Open
Abstract
Because of their considerable number and diversity, membrane proteins and their macromolecular complexes represent the functional units of cells. Their quaternary structure may be stabilized by interactions between the α-helices of different proteins in the hydrophobic region of the cell membrane. Membrane proteins equally represent potential pharmacological targets par excellence for various diseases. Unfortunately, their experimental 3D structure and that of their complexes with other intramembrane protein partners are scarce due to technical difficulties. To overcome this key problem, we devised PPIMem, a computational approach for the specific prediction of higher-order structures of α-helical transmembrane proteins. The novel approach involves proper identification of the amino acid residues at the interface of molecular complexes with a 3D structure. The identified residues compose then nonlinear interaction motifs that are conveniently expressed as mathematical regular expressions. These are efficiently implemented for motif search in amino acid sequence databases, and for the accurate prediction of intramembrane protein-protein complexes. Our template interface-based approach predicted 21,544 binary complexes between 1,504 eukaryotic plasma membrane proteins across 39 species. We compare our predictions to experimental datasets of protein-protein interactions as a first validation method. The online database that results from the PPIMem algorithm with the annotated predicted interactions are implemented as a web server and can be accessed directly at https://transint.univ-evry.fr.
Collapse
Affiliation(s)
- Georges Khazen
- Computer Science and Mathematics Department, Lebanese American University, Byblos, Lebanon
| | - Aram Gyulkhandanian
- Inserm U1204/Université d'Evry/Université Paris-Saclay, Structure-Activité des Biomolécules Normales et Pathologiques, 91025 Evry, France
| | - Tina Issa
- Computer Science and Mathematics Department, Lebanese American University, Byblos, Lebanon
| | - Rachid C Maroun
- Inserm U1204/Université d'Evry/Université Paris-Saclay, Structure-Activité des Biomolécules Normales et Pathologiques, 91025 Evry, France
| |
Collapse
|
22
|
Gaber A, Pavšič M. Modeling and Structure Determination of Homo-Oligomeric Proteins: An Overview of Challenges and Current Approaches. Int J Mol Sci 2021; 22:9081. [PMID: 34445785 PMCID: PMC8396596 DOI: 10.3390/ijms22169081] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 08/20/2021] [Accepted: 08/20/2021] [Indexed: 12/12/2022] Open
Abstract
Protein homo-oligomerization is a very common phenomenon, and approximately half of proteins form homo-oligomeric assemblies composed of identical subunits. The vast majority of such assemblies possess internal symmetry which can be either exploited to help or poses challenges during structure determination. Moreover, aspects of symmetry are critical in the modeling of protein homo-oligomers either by docking or by homology-based approaches. Here, we first provide a brief overview of the nature of protein homo-oligomerization. Next, we describe how the symmetry of homo-oligomers is addressed by crystallographic and non-crystallographic symmetry operations, and how biologically relevant intermolecular interactions can be deciphered from the ordered array of molecules within protein crystals. Additionally, we describe the most important aspects of protein homo-oligomerization in structure determination by NMR. Finally, we give an overview of approaches aimed at modeling homo-oligomers using computational methods that specifically address their internal symmetry and allow the incorporation of other experimental data as spatial restraints to achieve higher model reliability.
Collapse
|
23
|
Agamennone M, Nicoli A, Bayer S, Weber V, Borro L, Gupta S, Fantacuzzi M, Di Pizio A. Protein-protein interactions at a glance: Protocols for the visualization of biomolecular interactions. Methods Cell Biol 2021; 166:271-307. [PMID: 34752337 DOI: 10.1016/bs.mcb.2021.06.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Protein-protein interactions (PPIs) play a key role in many biological processes and are intriguing targets for drug discovery campaigns. Advancements in experimental and computational techniques are leading to a growth of data accessibility, and, with it, an increased need for the analysis of PPIs. In this respect, visualization tools are essential instruments to represent and analyze biomolecular interactions. In this chapter, we reviewed some of the available tools, highlighting their features, and describing their functions with practical information on their usage.
Collapse
Affiliation(s)
| | - Alessandro Nicoli
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany
| | - Sebastian Bayer
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany
| | - Verena Weber
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany
| | - Luca Borro
- Department of Imaging, Advanced Cardiovascular Imaging Unit, Bambino Gesù Children's Hospital, IRCCS, Rome, Italy
| | - Shailendra Gupta
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| | | | - Antonella Di Pizio
- Leibniz-Institute for Food Systems Biology at the Technical University of Munich, Freising, Germany.
| |
Collapse
|
24
|
Karakulak T, Rifaioglu AS, Rodrigues JPGLM, Karaca E. Predicting the Specificity- Determining Positions of Receptor Tyrosine Kinase Axl. Front Mol Biosci 2021; 8:658906. [PMID: 34195226 PMCID: PMC8236827 DOI: 10.3389/fmolb.2021.658906] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 04/20/2021] [Indexed: 11/22/2022] Open
Abstract
Owing to its clinical significance, modulation of functionally relevant amino acids in protein-protein complexes has attracted a great deal of attention. To this end, many approaches have been proposed to predict the partner-selecting amino acid positions in evolutionarily close complexes. These approaches can be grouped into sequence-based machine learning and structure-based energy-driven methods. In this work, we assessed these methods’ ability to map the specificity-determining positions of Axl, a receptor tyrosine kinase involved in cancer progression and immune system diseases. For sequence-based predictions, we used SDPpred, Multi-RELIEF, and Sequence Harmony. For structure-based predictions, we utilized HADDOCK refinement and molecular dynamics simulations. As a result, we observed that (i) sequence-based methods overpredict partner-selecting residues of Axl and that (ii) combining Multi-RELIEF with HADDOCK-based predictions provides the key Axl residues, covered by the extensive molecular dynamics simulations. Expanding on these results, we propose that a sequence-structure-based approach is necessary to determine specificity-determining positions of Axl, which can guide the development of therapeutic molecules to combat Axl misregulation.
Collapse
Affiliation(s)
- Tülay Karakulak
- Izmir Biomedicine and Genome Center, Izmir, Turkey.,Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Turkey.,Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.,Department of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Ahmet Sureyya Rifaioglu
- Department of Electrical - Electronics Engineering, İskenderun Technical University, Hatay, Turkey
| | - João P G L M Rodrigues
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA, United States
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Izmir, Turkey.,Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Turkey
| |
Collapse
|
25
|
CAVIAR: a method for automatic cavity detection, description and decomposition into subcavities. J Comput Aided Mol Des 2021; 35:737-750. [PMID: 34050420 DOI: 10.1007/s10822-021-00390-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 05/11/2021] [Indexed: 10/21/2022]
Abstract
The accurate description of protein binding sites is essential to the determination of similarity and the application of machine learning methods to relate the binding sites to observed functions. This work describes CAVIAR, a new open source tool for generating descriptors for binding sites, using protein structures in PDB and mmCIF format as well as trajectory frames from molecular dynamics simulations as input. The applicability of CAVIAR descriptors is showcased by computing machine learning predictions of binding site ligandability. The method can also automatically assign subcavities, even in the absence of a bound ligand. The defined subpockets mimic the empirical definitions used in medicinal chemistry projects. It is shown that the experimental binding affinity scales relatively well with the number of subcavities filled by the ligand, with compounds binding to more than three subcavities having nanomolar or better affinities to the target. The CAVIAR descriptors and methods can be used in any machine learning-based investigations of problems involving binding sites, from protein engineering to hit identification. The full software code is available on GitHub and a conda package is hosted on Anaconda cloud.
Collapse
|
26
|
Chen YF, Xia Y. Structural Profiling of Bacterial Effectors Reveals Enrichment of Host-Interacting Domains and Motifs. Front Mol Biosci 2021; 8:626600. [PMID: 34012977 PMCID: PMC8126662 DOI: 10.3389/fmolb.2021.626600] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 04/21/2021] [Indexed: 11/13/2022] Open
Abstract
Effector proteins are bacterial virulence factors secreted directly into host cells and, through extensive interactions with host proteins, rewire host signaling pathways to the advantage of the pathogen. Despite the crucial role of globular domains as mediators of protein-protein interactions (PPIs), previous structural studies of bacterial effectors are primarily focused on individual domains, rather than domain-mediated PPIs, which limits their ability to uncover systems-level molecular recognition principles governing host-bacteria interactions. Here, we took an interaction-centric approach and systematically examined the potential of structural components within bacterial proteins to engage in or target eukaryote-specific domain-domain interactions (DDIs). Our results indicate that: 1) effectors are about six times as likely as non-effectors to contain host-like domains that mediate DDIs exclusively in eukaryotes; 2) the average domain in effectors is about seven times as likely as that in non-effectors to co-occur with DDI partners in eukaryotes rather than in bacteria; and 3) effectors are about nine times as likely as non-effectors to contain bacteria-exclusive domains that target host domains mediating DDIs exclusively in eukaryotes. Moreover, in the absence of host-like domains or among pathogen proteins without domain assignment, effectors harbor a higher variety and density of short linear motifs targeting host domains that mediate DDIs exclusively in eukaryotes. Our study lends novel quantitative insight into the structural basis of effector-induced perturbation of host-endogenous PPIs and may aid in the design of selective inhibitors of host-pathogen interactions.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, QC, Canada
| |
Collapse
|
27
|
Hashemi ZS, Zarei M, Fath MK, Ganji M, Farahani MS, Afsharnouri F, Pourzardosht N, Khalesi B, Jahangiri A, Rahbar MR, Khalili S. In silico Approaches for the Design and Optimization of Interfering Peptides Against Protein-Protein Interactions. Front Mol Biosci 2021; 8:669431. [PMID: 33996914 PMCID: PMC8113820 DOI: 10.3389/fmolb.2021.669431] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/06/2021] [Indexed: 01/01/2023] Open
Abstract
Large contact surfaces of protein-protein interactions (PPIs) remain to be an ongoing issue in the discovery and design of small molecule modulators. Peptides are intrinsically capable of exploring larger surfaces, stable, and bioavailable, and therefore bear a high therapeutic value in the treatment of various diseases, including cancer, infectious diseases, and neurodegenerative diseases. Given these promising properties, a long way has been covered in the field of targeting PPIs via peptide design strategies. In silico tools have recently become an inevitable approach for the design and optimization of these interfering peptides. Various algorithms have been developed to scrutinize the PPI interfaces. Moreover, different databases and software tools have been created to predict the peptide structures and their interactions with target protein complexes. High-throughput screening of large peptide libraries against PPIs; "hotspot" identification; structure-based and off-structure approaches of peptide design; 3D peptide modeling; peptide optimization strategies like cyclization; and peptide binding energy evaluation are among the capabilities of in silico tools. In the present study, the most recent advances in the field of in silico approaches for the design of interfering peptides against PPIs will be reviewed. The future perspective of the field and its advantages and limitations will also be pinpointed.
Collapse
Affiliation(s)
- Zahra Sadat Hashemi
- ATMP Department, Breast Cancer Research Center, Motamed Cancer Institute, Academic Center for Education, Culture and Research, Tehran, Iran
| | - Mahboubeh Zarei
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Mohsen Karami Fath
- Department of Cellular and Molecular Biology, Faculty of Biological Sciences, Kharazmi University, Tehran, Iran
| | - Mahmoud Ganji
- Department of Medical Biotechnology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Mahboube Shahrabi Farahani
- Department of Medical Biotechnology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Fatemeh Afsharnouri
- Department of Medical Biotechnology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Navid Pourzardosht
- Cellular and Molecular Research Center, Faculty of Medicine, Guilan University of Medical Sciences, Rasht, Iran
- Department of Biochemistry, Guilan University of Medical Sciences, Rasht, Iran
| | - Bahman Khalesi
- Department of Research and Production of Poultry Viral Vaccine, Razi Vaccine and Serum Research Institute, Agricultural Research Education and Extension Organization, Karaj, Iran
| | - Abolfazl Jahangiri
- Applied Microbiology Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - Mohammad Reza Rahbar
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Saeed Khalili
- Department of Biology Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran
| |
Collapse
|
28
|
Abrami L, Audagnotto M, Ho S, Marcaida MJ, Mesquita FS, Anwar MU, Sandoz PA, Fonti G, Pojer F, Peraro MD, van der Goot FG. Palmitoylated acyl protein thioesterase APT2 deforms membranes to extract substrate acyl chains. Nat Chem Biol 2021; 17:438-447. [PMID: 33707782 PMCID: PMC7610442 DOI: 10.1038/s41589-021-00753-2] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 12/27/2020] [Accepted: 01/26/2021] [Indexed: 01/31/2023]
Abstract
Many biochemical reactions require controlled recruitment of proteins to membranes. This is largely regulated by posttranslational modifications. A frequent one is S-acylation, which consists of the addition of acyl chains and can be reversed by poorly understood acyl protein thioesterases (APTs). Using a panel of computational and experimental approaches, we dissect the mode of action of the major cellular thioesterase APT2 (LYPLA2). We show that soluble APT2 is vulnerable to proteasomal degradation, from which membrane binding protects it. Interaction with membranes requires three consecutive steps: electrostatic attraction, insertion of a hydrophobic loop and S-acylation by the palmitoyltransferases ZDHHC3 or ZDHHC7. Once bound, APT2 is predicted to deform the lipid bilayer to extract the acyl chain bound to its substrate and capture it in a hydrophobic pocket to allow hydrolysis. This molecular understanding of APT2 paves the way to understand the dynamics of APT2-mediated deacylation of substrates throughout the endomembrane system.
Collapse
Affiliation(s)
- Laurence Abrami
- Global Health Institute, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Martina Audagnotto
- Institute of Bioengineering, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Sylvia Ho
- Global Health Institute, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Maria Jose Marcaida
- Institute of Bioengineering, School of Life Sciences, EPFL, Lausanne, Switzerland
| | | | - Muhammad U. Anwar
- Global Health Institute, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Patrick A. Sandoz
- Global Health Institute, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Giulia Fonti
- Institute of Bioengineering, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Florence Pojer
- Protein Production and Structure Core Facility, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Matteo Dal Peraro
- Institute of Bioengineering, School of Life Sciences, EPFL, Lausanne, Switzerland,Corresponding Authors: F. Gisou van der Goot () and Matteo Dal Peraro ()
| | - F. Gisou van der Goot
- Global Health Institute, School of Life Sciences, EPFL, Lausanne, Switzerland,Corresponding Authors: F. Gisou van der Goot () and Matteo Dal Peraro ()
| |
Collapse
|
29
|
Bianchetti L, Sinar D, Depenveiller C, Dejaegere A. Insights into mineralocorticoid receptor homodimerization from a combined molecular modeling and bioinformatics study. Proteins 2021; 89:952-965. [PMID: 33713045 DOI: 10.1002/prot.26073] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 02/25/2021] [Accepted: 03/04/2021] [Indexed: 01/01/2023]
Abstract
In vertebrates, the mineralocorticoid receptor (MR) is a steroid-activated nuclear receptor (NR) that plays essential roles in water-electrolyte balance and blood pressure homeostasis. It belongs to the group of oxo-steroidian NRs, together with the glucocorticoid (GR), progesterone (PR), and androgen (AR) receptors. Classically, these oxo-steroidian NRs homodimerize and bind to specific genomic sequences to activate gene expression. NRs are multi-domain proteins, and dimerization is mediated by both the DNA (DBD) and ligand binding domains (LBDs), with the latter thought to provide the largest dimerization interface. However, at the structural level, the dimerization of oxo-steroidian receptors LBDs has remained largely a matter of debate and, despite their sequence homology, there is currently no consensus on a common homodimer assembly across the four receptors, that is, GR, PR, AR, and MR. Here, we examined all available MR LBD crystals using different computational methods (protein common interface database, proteins, interfaces, structures and assemblies, protein-protein interaction prediction by structural matching, and evolutionary protein-protein interface classifier, and the molecular mechanics Poisson-Boltzmann surface area method). A consensus is reached by all methods and singles out an interface mediated by helices H9, H10 and the C-terminal F domain as having characteristics of a biologically relevant assembly. Interestingly, a similar assembly was previously identified for GRα, MR closest homolog. Alternative architectures that were proposed for GRα were not observed for MR. These data call for further experimental investigations of oxo-steroid dimer architectures.
Collapse
Affiliation(s)
- Laurent Bianchetti
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| | - Deniz Sinar
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| | - Camille Depenveiller
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| | - Annick Dejaegere
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| |
Collapse
|
30
|
Dhawanjewar AS, Roy AA, Madhusudhan MS. A knowledge-based scoring function to assess quaternary associations of proteins. Bioinformatics 2020; 36:3739-3748. [PMID: 32246820 DOI: 10.1093/bioinformatics/btaa207] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 03/01/2020] [Accepted: 03/30/2020] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION The elucidation of all inter-protein interactions would significantly enhance our knowledge of cellular processes at a molecular level. Given the enormity of the problem, the expenses and limitations of experimental methods, it is imperative that this problem is tackled computationally. In silico predictions of protein interactions entail sampling different conformations of the purported complex and then scoring these to assess for interaction viability. In this study, we have devised a new scheme for scoring protein-protein interactions. RESULTS Our method, PIZSA (Protein Interaction Z-Score Assessment), is a binary classification scheme for identification of native protein quaternary assemblies (binders/nonbinders) based on statistical potentials. The scoring scheme incorporates residue-residue contact preference on the interface with per residue-pair atomic contributions and accounts for clashes. PIZSA can accurately discriminate between native and non-native structural conformations from protein docking experiments and outperform other contact-based potential scoring functions. The method has been extensively benchmarked and is among the top 6 methods, outperforming 31 other statistical, physics based and machine learning scoring schemes. The PIZSA potentials can also distinguish crystallization artifacts from biological interactions. AVAILABILITY AND IMPLEMENTATION PIZSA is implemented as a web server at http://cospi.iiserpune.ac.in/pizsa and can be downloaded as a standalone package from http://cospi.iiserpune.ac.in/pizsa/Download/Download.html. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Abhilesh S Dhawanjewar
- Indian Institute of Science Education and Research, Pashan, Pune 411008, India.,School of Biological Sciences, University of Nebraska, Lincoln, NE 68588, USA
| | - Ankit A Roy
- Indian Institute of Science Education and Research, Pashan, Pune 411008, India
| | | |
Collapse
|
31
|
Abstract
The continued development of X-ray free-electron lasers and serial crystallography techniques has opened up new experimental frontiers. Nanoscale dynamical processes such as crystal growth can now be probed at unprecedented time and spatial resolutions. Pair-angle distribution function (PADF) analysis is a correlation-based technique that has the potential to extend the limits of current serial crystallography experiments, by relaxing the requirements for crystal order, size and number density per exposure. However, unlike traditional crystallographic methods, the PADF technique does not recover the electron density directly. Instead it encodes substantial information about local three-dimensional structure in the form of three- and four-body correlations. It is not yet known how protein structure maps into the many-body PADF correlations. In this paper, we explore the relationship between the PADF and protein conformation. We calculate correlations in reciprocal and real space for model systems exhibiting increasing degrees of order and secondary structural complexity, from disordered polypeptides, single alpha helices, helix bundles and finally a folded 100 kilodalton protein. These models systems inform us about the distinctive angular correlations generated by bonding, polypeptide chains, secondary structure and tertiary structure. They further indicate the potential to use angular correlations as a sensitive measure of conformation change that is complementary to existing structural analysis techniques.
Collapse
|
32
|
Jiménez-García B, Elez K, Koukos PI, Bonvin AM, Vangone A. PRODIGY-crystal: a web-tool for classification of biological interfaces in protein complexes. Bioinformatics 2020; 35:4821-4823. [PMID: 31141126 PMCID: PMC9186318 DOI: 10.1093/bioinformatics/btz437] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 05/06/2019] [Accepted: 05/25/2019] [Indexed: 01/11/2023] Open
Abstract
Summary Distinguishing biologically relevant interfaces from crystallographic ones in
biological complexes is fundamental in order to associate cellular functions to the
correct macromolecular assemblies. Recently, we described a detailed study reporting the
differences in the type of intermolecular residue–residue contacts between biological
and crystallographic interfaces. Our findings allowed us to develop a fast predictor of
biological interfaces reaching an accuracy of 0.92 and competitive to the current state
of the art. Here we present its web-server implementation, PRODIGY-CRYSTAL, aimed at the
classification of biological and crystallographic interfaces. PRODIGY-CRYSTAL has the
advantage of being fast, accurate and simple. This, together with its user-friendly
interface and user support forum, ensures its broad accessibility. Availability and implementation PRODIGY-CRYSTAL is freely available without registration requirements at https://haddock.science.uu.nl/services/PRODIGY-CRYSTAL.
Collapse
Affiliation(s)
- Brian Jiménez-García
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht 3512 JE, The Netherlands
| | - Katarina Elez
- Present address: University of Bologna, Via Selmi 3 40126, Bologna, Italy
| | - Panagiotis I Koukos
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht 3512 JE, The Netherlands
| | - Alexandre Mjj Bonvin
- To whom correspondence should be addressed. E-mail: or . Present address: Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Nonnenwald 2, Penzberg 82377, Germany
| | - Anna Vangone
- To whom correspondence should be addressed. E-mail: or . Present address: Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Nonnenwald 2, Penzberg 82377, Germany
| |
Collapse
|
33
|
Guven-Maiorov E, Hakouz A, Valjevac S, Keskin O, Tsai CJ, Gursoy A, Nussinov R. HMI-PRED: A Web Server for Structural Prediction of Host-Microbe Interactions Based on Interface Mimicry. J Mol Biol 2020; 432:3395-3403. [PMID: 32061934 PMCID: PMC7261632 DOI: 10.1016/j.jmb.2020.01.025] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 11/28/2019] [Accepted: 01/14/2020] [Indexed: 02/07/2023]
Abstract
Microbes, commensals, and pathogens, control the numerous functions in the host cells. They can alter host signaling and modulate immune surveillance by interacting with the host proteins. For shedding light on the contribution of microbes to health and disease, it is vital to discern how microbial proteins rewire host signaling and through which host proteins they do this. Host-Microbe Interaction PREDictor (HMI-PRED) is a user-friendly web server for structural prediction of protein-protein interactions (PPIs) between the host and a microbial species, including bacteria, viruses, fungi, and protozoa. HMI-PRED relies on "interface mimicry" through which the microbial proteins hijack host binding surfaces. Given the structure of a microbial protein of interest, HMI-PRED will return structural models of potential host-microbe interaction (HMI) complexes, the list of host endogenous and exogenous PPIs that can be disrupted, and tissue expression of the microbe-targeted host proteins. The server also allows users to upload homology models of microbial proteins. Broadly, it aims at large-scale, efficient identification of HMIs. The prediction results are stored in a repository for community access. HMI-PRED is free and available at https://interactome.ku.edu.tr/hmi.
Collapse
Affiliation(s)
- Emine Guven-Maiorov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, 21702, USA.
| | - Asma Hakouz
- Department of Computer Engineering, Koc University, Istanbul, 34450, Turkey.
| | - Sukejna Valjevac
- Department of Computer Engineering, Koc University, Istanbul, 34450, Turkey.
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, Koc University, Istanbul, 34450, Turkey.
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, 21702, USA.
| | - Attila Gursoy
- Department of Computer Engineering, Koc University, Istanbul, 34450, Turkey.
| | - Ruth Nussinov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, 21702, USA; Sackler Inst. of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, 69978, Israel.
| |
Collapse
|
34
|
Biological vs. Crystallographic Protein Interfaces: An Overview of Computational Approaches for Their Classification. CRYSTALS 2020. [DOI: 10.3390/cryst10020114] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Complexes between proteins are at the basis of almost every process in cells. Their study, from a structural perspective, has a pivotal role in understanding biological functions and, importantly, in drug development. X-ray crystallography represents the broadest source for the experimental structural characterization of protein-protein complexes. Correctly identifying the biologically relevant interface from the crystallographic ones is, however, not trivial and can be prone to errors. Over the past two decades, computational methodologies have been developed to study the differences of those interfaces and automatically classify them as biological or crystallographic. Overall, protein-protein interfaces show differences in terms of composition, energetics and evolutionary conservation between biological and crystallographic ones. Based on those observations, a number of computational methods have been developed for this classification problem, which can be grouped into three main categories: Energy-, empirical knowledge- and machine learning-based approaches. In this review, we give a comprehensive overview of the training datasets and methods so far implemented, providing useful links and a brief description of each method.
Collapse
|
35
|
Liu H, Cao M, Wang Y, Lv B, Li C. Bioengineering oligomerization and monomerization of enzymes: learning from natural evolution to matching the demands for industrial applications. Crit Rev Biotechnol 2020; 40:231-246. [PMID: 31914816 DOI: 10.1080/07388551.2019.1711014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
It is generally accepted that oligomeric enzymes evolve from their monomeric ancestors, and the evolution process generates superior structural benefits for functional advantages. Furthermore, adjusting the transition between different oligomeric states is an important mechanism for natural enzymes to regulate their catalytic functions for adapting environmental fluctuations in nature, which inspires researchers to mimic such a strategy to develop artificially oligomerized enzymes through protein engineering for improved performance under specific conditions. On the other hand, transforming oligomeric enzymes into their monomers is needed in fundamental research for deciphering catalytic mechanisms as well as exploring their catalytic capacities for better industrial applications. In this article, strategies for developing artificially oligomerized and monomerized enzymes are reviewed and highlighted by their applications. Furthermore, advances in the computational prediction of oligomeric structures are introduced, which would accelerate the systematic design of oligomeric and monomeric enzymes. Finally, the current challenges and future directions in this field are discussed.
Collapse
Affiliation(s)
- Hu Liu
- Institute for Synthetic Biosystem, Department of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, China
| | - Mingming Cao
- Institute for Synthetic Biosystem, Department of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, China
| | - Ying Wang
- Institute for Synthetic Biosystem, Department of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, China
| | - Bo Lv
- Institute for Synthetic Biosystem, Department of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, China
| | - Chun Li
- Institute for Synthetic Biosystem, Department of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, China
| |
Collapse
|
36
|
Motta FN, Azevedo CDS, Neves BP, Araújo CND, Grellier P, Santana JMD, Bastos IMD. Oligopeptidase B, a missing enzyme in mammals and a potential drug target for trypanosomatid diseases. Biochimie 2019; 167:207-216. [DOI: 10.1016/j.biochi.2019.10.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Accepted: 10/15/2019] [Indexed: 12/21/2022]
|
37
|
Guven-Maiorov E, Tsai CJ, Nussinov R. Oncoviruses Can Drive Cancer by Rewiring Signaling Pathways Through Interface Mimicry. Front Oncol 2019; 9:1236. [PMID: 31803618 PMCID: PMC6872517 DOI: 10.3389/fonc.2019.01236] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 10/28/2019] [Indexed: 01/17/2023] Open
Abstract
Oncoviruses rewire host pathways to subvert host immunity and promote their survival and proliferation. However, exactly how is challenging to understand. Here, by employing the first and to date only interface-based host-microbe interaction (HMI) prediction method, we explore a pivotal strategy oncoviruses use to drive cancer: mimicking binding surfaces-interfaces-of human proteins. We show that oncoviruses can target key human network proteins and transform cells by acquisition of cancer hallmarks. Experimental large-scale mapping of HMIs is difficult and individual HMIs do not permit in-depth grasp of tumorigenic virulence mechanisms. Our computational approach is tractable and 3D structural HMI models can help elucidate pathogenesis mechanisms and facilitate drug design. We observe that many host proteins are unique targets for certain oncoviruses, whereas others are common to several, suggesting similar infectious strategies. A rough estimation of our false discovery rate based on the tissue expression of oncovirus-targeted human proteins is 25%.
Collapse
Affiliation(s)
- Emine Guven-Maiorov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| | - Ruth Nussinov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
- Department of Human Genetics and Molecular Medicine, Sackler Institute of Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
38
|
Hadi-Alijanvand H. Soft regions of protein surface are potent for stable dimer formation. J Biomol Struct Dyn 2019; 38:3587-3598. [PMID: 31476974 DOI: 10.1080/07391102.2019.1662328] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
By having knowledge about the characteristics of protein interaction interfaces, we will be able to manipulate protein complexes for therapies. Dimer state is considered as the primary alphabet of the most proteins' quaternary structure. The properties of binding interface between subunits and of noninterface region define the specificity and stability of the intended protein complex. Considering some topological properties and amino acids' affinity for binding in interfaces of protein dimers, we construct the interface-specific recurrence plots. The data obtained from recurrence quantitative analysis, and accessibility-related metrics help us to classify the protein dimers into four distinct classes. Some mechanical properties of binding interfaces are computed for each predefined class of the dimers. The computed mechanical characteristics of binding patch region are compared with those of nonbinding region of proteins. Our observations indicate that the mechanical properties of protein binding sites have a decisive impact on determining the dimer stability. We introduce a new concept in analyzing protein structure by considering mechanical properties of protein structure. We conclude that the interface region between subunits of stable dimers is usually mechanically softer than the interface of unstable protein dimers. AbbreviationsAABaverage affinity for bindingANManisotropic network modelAPCaffinity propagation clusteringASAaccessible surface areaCCDinter residues distanceCSCcomplex stability codeDMdistance matrixΔGdissPISA-computed dissociation free energyGNMGaussian normal mode analysisNMAnormal mode analysisPBPprotein binding patchPISAproteins, interfaces, structures and assembliesrASArelative accessible area in respect to unfolded state of residuesRMrecurrence matrixrPrelative protrusionRPrecurrence plotRQArecurrence quantitative analysisSEMstandard error of meanCommunicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Hamid Hadi-Alijanvand
- Department of Biological Sciences, Institute for Advanced Studies in Basic Sciences (IASBS), Zanjan, Iran
| |
Collapse
|
39
|
Accurate Classification of Biological and non-Biological Interfaces in Protein Crystal Structures using Subtle Covariation Signals. Sci Rep 2019; 9:12603. [PMID: 31471543 PMCID: PMC6717244 DOI: 10.1038/s41598-019-48913-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 08/14/2019] [Indexed: 11/08/2022] Open
Abstract
Proteins often work as oligomers or multimers in vivo. Therefore, elucidating their oligomeric or multimeric form (quaternary structure) is crucially important to ascertain their function. X-ray crystal structures of numerous proteins have been accumulated, providing information related to their biological units. Extracting information of biological units from protein crystal structures represents a meaningful task for modern biology. Nevertheless, although many methods have been proposed for identifying biological units appearing in protein crystal structures, it is difficult to distinguish biological protein-protein interfaces from crystallographic ones. Therefore, our simple but highly accurate classifier was developed to infer biological units in protein crystal structures using large amounts of protein sequence information and a modern contact prediction method to exploit covariation signals (CSs) in proteins. We demonstrate that our proposed method is promising even for weak signals of biological interfaces. We also discuss the relation between classification accuracy and conservation of biological units, and illustrate how the selection of sequences included in multiple sequence alignments as sources for obtaining CSs affects the results. With increased amounts of sequence data, the proposed method is expected to become increasingly useful.
Collapse
|
40
|
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 2019; 46:W296-W303. [PMID: 29788355 PMCID: PMC6030848 DOI: 10.1093/nar/gky427] [Citation(s) in RCA: 7774] [Impact Index Per Article: 1295.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Accepted: 05/07/2018] [Indexed: 11/13/2022] Open
Abstract
Homology modelling has matured into an important technique in structural biology, significantly contributing to narrowing the gap between known protein sequences and experimentally determined structures. Fully automated workflows and servers simplify and streamline the homology modelling process, also allowing users without a specific computational expertise to generate reliable protein models and have easy access to modelling results, their visualization and interpretation. Here, we present an update to the SWISS-MODEL server, which pioneered the field of automated modelling 25 years ago and been continuously further developed. Recently, its functionality has been extended to the modelling of homo- and heteromeric complexes. Starting from the amino acid sequences of the interacting proteins, both the stoichiometry and the overall structure of the complex are inferred by homology modelling. Other major improvements include the implementation of a new modelling engine, ProMod3 and the introduction a new local model quality estimation method, QMEANDisCo. SWISS-MODEL is freely available at https://swissmodel.expasy.org.
Collapse
Affiliation(s)
- Andrew Waterhouse
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Martino Bertoni
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rafal Gumienny
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Florian T Heer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Tjaart A P de Beer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Christine Rempfer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rosalba Lepore
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| |
Collapse
|
41
|
Geometric description of self-interaction potential in symmetric protein complexes. Sci Data 2019; 6:64. [PMID: 31101822 PMCID: PMC6525250 DOI: 10.1038/s41597-019-0058-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 03/29/2019] [Indexed: 01/23/2023] Open
Abstract
Proteins can self-associate with copies of themselves to form symmetric complexes called homomers. Homomers are widespread in all kingdoms of life and allow for unique geometric and functional properties, as reflected in viral capsids or allostery. Once a protein forms a homomer, however, its internal symmetry can compound the effect of point mutations and trigger uncontrolled self-assembly into high-order structures. We identified mutation hot spots for supramolecular assembly, which are predictable by geometry. Here, we present a dataset of descriptors that characterize these hot spot positions both geometrically and chemically, as well as computer scripts allowing the calculation and visualization of these properties for homomers of choice. Since the biological relevance of homomers is not readily available from their X-ray crystallographic structure, we also provide reliability estimates obtained by methods we recently developed. These data have implications in the study of disease-causing mutations, protein evolution and can be exploited in the design of biomaterials. Design Type(s) | protein interaction analysis objective • protein structure prediction objective • modeling and simulation objective | Measurement Type(s) | protein complex | Technology Type(s) | computational modeling technique | Factor Type(s) | Filtering • source | Sample Characteristic(s) | laboratory environment |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Collapse
|
42
|
Principles and characteristics of biological assemblies in experimentally determined protein structures. Curr Opin Struct Biol 2019; 55:34-49. [PMID: 30965224 DOI: 10.1016/j.sbi.2019.03.006] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/01/2019] [Indexed: 12/27/2022]
Abstract
More than half of all structures in the PDB are assemblies of two or more proteins, including both homooligomers and heterooligomers. Structural information on these assemblies comes from X-ray crystallography, NMR, and cryo-EM spectroscopy. The correct assembly in an X-ray structure is often ambiguous, and computational methods have been developed to identify the most likely biologically relevant assembly based on physical properties of assemblies and sequence conservation in interfaces. Taking advantage of the large number of structures now available, some of the most recent methods have relied on similarity of interfaces and assemblies across structures of homologous proteins.
Collapse
|
43
|
Guven-Maiorov E, Tsai CJ, Ma B, Nussinov R. Interface-Based Structural Prediction of Novel Host-Pathogen Interactions. Methods Mol Biol 2019; 1851:317-335. [PMID: 30298406 PMCID: PMC8192064 DOI: 10.1007/978-1-4939-8736-8_18] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
About 20% of the cancer incidences worldwide have been estimated to be associated with infections. However, the molecular mechanisms of exactly how they contribute to host tumorigenesis are still unknown. To evade host defense, pathogens hijack host proteins at different levels: sequence, structure, motif, and binding surface, i.e., interface. Interface similarity allows pathogen proteins to compete with host counterparts to bind to a target protein, rewire physiological signaling, and result in persistent infections, as well as cancer. Identification of host-pathogen interactions (HPIs)-along with their structural details at atomic resolution-may provide mechanistic insight into pathogen-driven cancers and innovate therapeutic intervention. HPI data including structural details is scarce and large-scale experimental detection is challenging. Therefore, there is an urgent and mounting need for efficient and robust computational approaches to predict HPIs and their complex (bound) structures. In this chapter, we review the first and currently only interface-based computational approach to identify novel HPIs. The concept of interface mimicry promises to identify more HPIs than complete sequence or structural similarity. We illustrate this concept with a case study on Kaposi's sarcoma herpesvirus (KSHV) to elucidate how it subverts host immunity and helps contribute to malignant transformation of the host cells.
Collapse
Affiliation(s)
- Emine Guven-Maiorov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc. Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD, USA
| | - Chung-Jung Tsai
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc. Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD, USA
| | - Buyong Ma
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc. Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD, USA
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc. Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD, USA.
- Department of Human Genetics and Molecular Medicine, Sackler Inst. of Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
44
|
Elez K, Bonvin AMJJ, Vangone A. Distinguishing crystallographic from biological interfaces in protein complexes: role of intermolecular contacts and energetics for classification. BMC Bioinformatics 2018; 19:438. [PMID: 30497368 PMCID: PMC6266931 DOI: 10.1186/s12859-018-2414-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Study of macromolecular assemblies is fundamental to understand functions in cells. X-ray crystallography is the most common technique to solve their 3D structure at atomic resolution. In a crystal, however, both biologically-relevant interfaces and non-specific interfaces resulting from crystallographic packing are observed. Due to the complexity of the biological assemblies currently tackled, classifying those interfaces, i.e. distinguishing biological from crystal lattice interfaces, is not trivial and often prone to errors. In this context, analyzing the physico-chemical characteristics of biological/crystal interfaces can help researchers identify possible features that distinguish them and gain a better understanding of the systems. RESULTS In this work, we are providing new insights into the differences between biological and crystallographic complexes by focusing on "pair-properties" of interfaces that have not yet been fully investigated. We investigated properties such intermolecular residue-residue contacts (already successfully applied to the prediction of binding affinities) and interaction energies (electrostatic, Van der Waals and desolvation). By using the XtalMany and BioMany interface datasets, we show that interfacial residue contacts, classified as a function of their physico-chemical properties, can distinguish between biological and crystallographic interfaces. The energetic terms show, on average, higher values for crystal interfaces, reflecting a less stable interface due to crystal packing compared to biological interfaces. By using a variety of machine learning approaches, we trained a new interface classification predictor based on contacts and interaction energetic features. Our predictor reaches an accuracy in classifying biological vs crystal interfaces of 0.92, compared to 0.88 for EPPIC (one of the main state-of-the-art classifiers reporting same performance as PISA). CONCLUSION In this work we have gained insights into the nature of intermolecular contacts and energetics terms distinguishing biological from crystallographic interfaces. Our findings might have a broader applicability in structural biology, for example for the identification of near native poses in docking. We implemented our classification approach into an easy-to-use and fast software, freely available to the scientific community from http://github.com/haddocking/interface-classifier .
Collapse
Affiliation(s)
- Katarina Elez
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands
- Present address: University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.
| | - Anna Vangone
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.
- present address: Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Nonnenwald 2, Penzberg, Germany.
| |
Collapse
|
45
|
Lazar T, Guharoy M, Schad E, Tompa P. Unique Physicochemical Patterns of Residues in Protein–Protein Interfaces. J Chem Inf Model 2018; 58:2164-2173. [DOI: 10.1021/acs.jcim.8b00270] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Mainak Guharoy
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Eva Schad
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| | - Peter Tompa
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| |
Collapse
|
46
|
Nöldeke ER, Muckenfuss LM, Niemann V, Müller A, Störk E, Zocher G, Schneider T, Stehle T. Structural basis of cell wall peptidoglycan amidation by the GatD/MurT complex of Staphylococcus aureus. Sci Rep 2018; 8:12953. [PMID: 30154570 PMCID: PMC6113224 DOI: 10.1038/s41598-018-31098-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 08/06/2018] [Indexed: 01/09/2023] Open
Abstract
The peptidoglycan of Staphylococcus aureus is highly amidated. Amidation of α-D-isoglutamic acid in position 2 of the stem peptide plays a decisive role in the polymerization of cell wall building blocks. S. aureus mutants with a reduced degree of amidation are less viable and show increased susceptibility to methicillin, indicating that targeting the amidation reaction could be a useful strategy to combat this pathogen. The enzyme complex that catalyzes the formation of α-D-isoglutamine in the Lipid II stem peptide was identified recently and shown to consist of two subunits, the glutamine amidotransferase-like protein GatD and the Mur ligase homolog MurT. We have solved the crystal structure of the GatD/MurT complex at high resolution, revealing an open, boomerang-shaped conformation in which GatD is docked onto one end of MurT. Putative active site residues cluster at the interface between GatD and MurT and are contributed by both proteins, thus explaining the requirement for the assembled complex to carry out the reaction. Site-directed mutagenesis experiments confirm the validity of the observed interactions. Small-angle X-ray scattering data show that the complex has a similar conformation in solution, although some movement at domain interfaces can occur, allowing the two proteins to approach each other during catalysis. Several other Gram-positive pathogens, including Streptococcus pneumoniae, Clostridium perfringens and Mycobacterium tuberculosis have homologous enzyme complexes. Combined with established biochemical assays, the structure of the GatD/MurT complex provides a solid basis for inhibitor screening in S. aureus and other pathogens.
Collapse
Affiliation(s)
- Erik R Nöldeke
- Interfaculty Institute of Biochemistry, University of Tübingen, D-72076, Tübingen, Germany
| | - Lena M Muckenfuss
- Interfaculty Institute of Biochemistry, University of Tübingen, D-72076, Tübingen, Germany.,Department of Biochemistry, University of Zurich, CH-8057, Zurich, Switzerland
| | - Volker Niemann
- Interfaculty Institute of Biochemistry, University of Tübingen, D-72076, Tübingen, Germany.,Hain Lifescience GmbH, D-72147, Nehren, Germany
| | - Anna Müller
- Institute for Pharmaceutical Microbiology, University of Bonn, D-53115, Bonn, Germany
| | - Elena Störk
- Interfaculty Institute of Biochemistry, University of Tübingen, D-72076, Tübingen, Germany
| | - Georg Zocher
- Interfaculty Institute of Biochemistry, University of Tübingen, D-72076, Tübingen, Germany
| | - Tanja Schneider
- Institute for Pharmaceutical Microbiology, University of Bonn, D-53115, Bonn, Germany
| | - Thilo Stehle
- Interfaculty Institute of Biochemistry, University of Tübingen, D-72076, Tübingen, Germany. .,Vanderbilt University School of Medicine, Nashville, Tennessee, 37232, USA.
| |
Collapse
|
47
|
Yogavel M, Chhibber-Goel J, Jamwal A, Gupta S, Sharma A. Engagement Rules That Underpin DBL-DARC Interactions for Ingress of Plasmodium knowlesi and Plasmodium vivax Into Human Erythrocytes. Front Mol Biosci 2018; 5:78. [PMID: 30211170 PMCID: PMC6120517 DOI: 10.3389/fmolb.2018.00078] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 08/03/2018] [Indexed: 11/21/2022] Open
Abstract
Malaria parasite erythrocytic stages comprise of repeated bursts of parasites via cyclical invasion of host erythrocytes using dedicated receptor-ligand interactions. A family of erythrocyte-binding proteins from Plasmodium knowlesi (Pk) and Plasmodium vivax (Pv) attach to human Duffy antigen receptor for chemokines (DARC) via their Duffy binding-like domains (DBLs) for invasion. Here we provide a novel, testable and overarching interaction model that rationalizes even contradictory pieces of evidence that have so far existed in the literature on Pk/Pv-DBL/DARC binding determinants. We further address the conundrum of how parasite-encoded Pk/Pv-DBLs recognize human DARC and collate evidence for two distinct DARC integration sites on Pk/Pv-DBLs.
Collapse
Affiliation(s)
- Manickam Yogavel
- Molecular Medicine - Structural Parasitology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| | - Jyoti Chhibber-Goel
- Molecular Medicine - Structural Parasitology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| | - Abhishek Jamwal
- Molecular Medicine - Structural Parasitology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| | - Swati Gupta
- Molecular Medicine - Structural Parasitology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| | - Amit Sharma
- Molecular Medicine - Structural Parasitology Group, International Centre for Genetic Engineering and Biotechnology, New Delhi, India
| |
Collapse
|
48
|
Hu J, Liu HF, Sun J, Wang J, Liu R. Integrating co-evolutionary signals and other properties of residue pairs to distinguish biological interfaces from crystal contacts. Protein Sci 2018; 27:1723-1735. [PMID: 29931702 DOI: 10.1002/pro.3448] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 04/21/2018] [Accepted: 05/16/2018] [Indexed: 12/25/2022]
Abstract
It remains challenging to accurately discriminate between biological and crystal interfaces. Most existing analyses and algorithms focused on the features derived from a single side of the interface. However, less attention has been paid to the properties of residue pairs across protein interfaces. To address this problem, we defined a novel co-evolutionary feature for homodimers through integrating direct coupling analysis and image processing techniques. The residue pairs across biological homodimeric interfaces were significantly enriched in co-evolving residues compared to those across crystal contacts, resulting in a promising classification accuracy with area under the curves (AUCs) of >0.85. Considering the availability of co-evolutionary feature, we also designed other residue pair based features that were useful for both homodimers and heterodimers. The most informative residue pairs were identified to reflect the interaction preferences across protein interfaces. Regarding the other extant properties, we designed the new descriptors at the interface residue level as well as at the pairwise contact level. Extensive validation showed that these single properties can be used to identify biological interfaces with AUCs ranging from 0.60 to 0.88. By integrating co-evolutionary feature with other residue pair based properties, our final prediction model output excellent performance with AUCs of >0.91 on different datasets. Compared to existing methods, our algorithm not only yielded better or comparable results but also provided complementary information. An easy-to-use web server is freely accessible at http://liulab.hzau.edu.cn/RPAIAnalyst.
Collapse
Affiliation(s)
- Jian Hu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China.,College of Biomedical Engineering, South-Central University for Nationalities, Wuhan, 430074, P. R. China
| | - Hui-Fang Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jun Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jia Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Rong Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| |
Collapse
|
49
|
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 2018. [PMID: 29788355 DOI: 10.1093/nar/gky427.pmid:29788355] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2023] Open
Abstract
Homology modelling has matured into an important technique in structural biology, significantly contributing to narrowing the gap between known protein sequences and experimentally determined structures. Fully automated workflows and servers simplify and streamline the homology modelling process, also allowing users without a specific computational expertise to generate reliable protein models and have easy access to modelling results, their visualization and interpretation. Here, we present an update to the SWISS-MODEL server, which pioneered the field of automated modelling 25 years ago and been continuously further developed. Recently, its functionality has been extended to the modelling of homo- and heteromeric complexes. Starting from the amino acid sequences of the interacting proteins, both the stoichiometry and the overall structure of the complex are inferred by homology modelling. Other major improvements include the implementation of a new modelling engine, ProMod3 and the introduction a new local model quality estimation method, QMEANDisCo. SWISS-MODEL is freely available at https://swissmodel.expasy.org.
Collapse
Affiliation(s)
- Andrew Waterhouse
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Martino Bertoni
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rafal Gumienny
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Florian T Heer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Tjaart A P de Beer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Christine Rempfer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Rosalba Lepore
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| |
Collapse
|
50
|
Stenkamp RE. Identifying G protein-coupled receptor dimers from crystal packings. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY 2018; 74:655-670. [PMID: 29968675 DOI: 10.1107/s2059798318008136] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Accepted: 06/01/2018] [Indexed: 12/20/2022]
Abstract
Dimers of G protein-coupled receptors (GPCRs) are believed to be important for signaling with their associated G proteins. Low-resolution electron microscopy has shown rhodopsin dimers in native retinal membranes, and CXCR4 dimers have been found in several different crystal structures. Evidence for dimers of other GPCRs is more indirect. An alternative to computational modeling studies is to search for parallel dimers in the packing environments of the reported crystal structures of GPCRs. Two major structural types of GPCR dimers exist (as predicted by others), but there is considerable structural variation within each cluster. The different structural variants described here might reflect different functional properties and should provide a range of model structures for computational and experimental examination.
Collapse
Affiliation(s)
- Ronald E Stenkamp
- Departments of Biological Structure and Biochemistry, Biomolecular Structure Center, University of Washington, Box 357420, Seattle, WA 98195, USA
| |
Collapse
|