1
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A suite of designed protein cages using machine learning and protein fragment-based protocols. Structure 2024; 32:751-765.e11. [PMID: 38513658 PMCID: PMC11162342 DOI: 10.1016/j.str.2024.02.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/22/2024] [Accepted: 02/23/2024] [Indexed: 03/23/2024]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, but their creation remains challenging. Here, we apply computational approaches to design a suite of tetrahedrally symmetric, self-assembling protein cages. For the generation of docked conformations, we emphasize a protein fragment-based approach, while for sequence design of the de novo interface, a comparison of knowledge-based and machine learning protocols highlights the power and increased experimental success achieved using ProteinMPNN. An analysis of design outcomes provides insights for improving interface design protocols, including prioritizing fragment-based motifs, balancing interface hydrophobicity and polarity, and identifying preferred polar contact patterns. In all, we report five structures for seven protein cages, along with two structures of intermediate assemblies, with the highest resolution reaching 2.0 Å using cryo-EM. This set of designed cages adds substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Michael R Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Mark A Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA; UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA.
| |
Collapse
|
2
|
Nandigrami P, Fiser A. Assessing the functional impact of protein binding site definition. Protein Sci 2024; 33:e5026. [PMID: 38757384 PMCID: PMC11099757 DOI: 10.1002/pro.5026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 05/01/2024] [Accepted: 05/03/2024] [Indexed: 05/18/2024]
Abstract
Many biomedical applications, such as classification of binding specificities or bioengineering, depend on the accurate definition of protein binding interfaces. Depending on the choice of method used, substantially different sets of residues can be classified as belonging to the interface of a protein. A typical approach used to verify these definitions is to mutate residues and measure the impact of these changes on binding. Besides the lack of exhaustive data, this approach also suffers from the fundamental problem that a mutation introduces an unknown amount of alteration into an interface, which potentially alters the binding characteristics of the interface. In this study we explore the impact of alternative binding site definitions on the ability of a protein to recognize its cognate ligand using a pharmacophore approach, which does not affect the interface. The study also shows that methods for protein binding interface predictions should perform above approximately F-score = 0.7 accuracy level to capture the biological function of a protein.
Collapse
Affiliation(s)
- Prithviraj Nandigrami
- Departments of Systems and Computational Biology, and BiochemistryAlbert Einstein College of MedicineBronxNew YorkUSA
| | - Andras Fiser
- Departments of Systems and Computational Biology, and BiochemistryAlbert Einstein College of MedicineBronxNew YorkUSA
| |
Collapse
|
3
|
Grassmann G, Miotto M, Desantis F, Di Rienzo L, Tartaglia GG, Pastore A, Ruocco G, Monti M, Milanetti E. Computational Approaches to Predict Protein-Protein Interactions in Crowded Cellular Environments. Chem Rev 2024; 124:3932-3977. [PMID: 38535831 PMCID: PMC11009965 DOI: 10.1021/acs.chemrev.3c00550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 02/20/2024] [Accepted: 02/21/2024] [Indexed: 04/11/2024]
Abstract
Investigating protein-protein interactions is crucial for understanding cellular biological processes because proteins often function within molecular complexes rather than in isolation. While experimental and computational methods have provided valuable insights into these interactions, they often overlook a critical factor: the crowded cellular environment. This environment significantly impacts protein behavior, including structural stability, diffusion, and ultimately the nature of binding. In this review, we discuss theoretical and computational approaches that allow the modeling of biological systems to guide and complement experiments and can thus significantly advance the investigation, and possibly the predictions, of protein-protein interactions in the crowded environment of cell cytoplasm. We explore topics such as statistical mechanics for lattice simulations, hydrodynamic interactions, diffusion processes in high-viscosity environments, and several methods based on molecular dynamics simulations. By synergistically leveraging methods from biophysics and computational biology, we review the state of the art of computational methods to study the impact of molecular crowding on protein-protein interactions and discuss its potential revolutionizing effects on the characterization of the human interactome.
Collapse
Affiliation(s)
- Greta Grassmann
- Department
of Biochemical Sciences “Alessandro Rossi Fanelli”, Sapienza University of Rome, Rome 00185, Italy
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
| | - Mattia Miotto
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
| | - Fausta Desantis
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- The
Open University Affiliated Research Centre at Istituto Italiano di
Tecnologia, Genoa 16163, Italy
| | - Lorenzo Di Rienzo
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
| | - Gian Gaetano Tartaglia
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- Department
of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Genoa 16163, Italy
- Center
for Human Technologies, Genoa 16152, Italy
| | - Annalisa Pastore
- Experiment
Division, European Synchrotron Radiation
Facility, Grenoble 38043, France
| | - Giancarlo Ruocco
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- Department
of Physics, Sapienza University, Rome 00185, Italy
| | - Michele Monti
- RNA
System Biology Lab, Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Genoa 16163, Italy
| | - Edoardo Milanetti
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- Department
of Physics, Sapienza University, Rome 00185, Italy
| |
Collapse
|
4
|
Ladias C, Papakotoulas P, Papaioannou M, Papanikolaou NA. Overcoming phenotypic switching: targeting protein-protein interactions in cancer. EXPLORATION OF TARGETED ANTI-TUMOR THERAPY 2023; 4:1071-1081. [PMID: 38023990 PMCID: PMC10651353 DOI: 10.37349/etat.2023.00181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 08/02/2023] [Indexed: 12/01/2023] Open
Abstract
Alternative protein-protein interactions (PPIs) arising from mutations or post-translational modifications (PTMs), termed phenotypic switching (PS), are critical for the transmission of alternative pathogenic signals and are particularly significant in cancer. In recent years, PPIs have emerged as promising targets for rational drug design, primarily because their high specificity facilitates targeting of disease-related signaling pathways. However, obstacles exist at the molecular level that arise from the properties of the interaction interfaces and the propensity of small molecule drugs to interact with more than one cleft surface. The difficulty in identifying small molecules that act as activators or inhibitors to counteract the biological effects of mutations raises issues that have not been encountered before. For example, small molecules can bind tightly but may not act as drugs or bind to multiple sites (interaction promiscuity). Another reason is the absence of significant clefts on protein surfaces; if a pocket is present, it may be too small, or its geometry may prevent binding. PS, which arises from oncogenic (alternative) signaling, causes drug resistance and forms the basis for the systemic robustness of tumors. In this review, the properties of PPI interfaces relevant to the design and development of targeting drugs are examined. In addition, the interactions between three tyrosine kinase inhibitors (TKIs) employed as drugs are discussed. Finally, potential novel targets of one of these drugs were identified in silico.
Collapse
Affiliation(s)
- Christos Ladias
- Department of Biological Applications and Technology, University of Ioannina, 45110 Ioannina, Epirus, Greece
| | - Pavlos Papakotoulas
- First Department of Clinical Oncology, Theageneio Cancer Hospital, 54639 Thessaloniki, Macedonia, Greece
| | - Maria Papaioannou
- Laboratory of Biological Chemistry, Department of Medicine, Section of Biological Sciences and Preventive Medicine, Aristotle University of Thessaloniki School of Medicine, 54124 Thessaloniki, Macedonia, Greece
| | - Nikolaos A. Papanikolaou
- Laboratory of Biological Chemistry, Department of Medicine, Section of Biological Sciences and Preventive Medicine, Aristotle University of Thessaloniki School of Medicine, 54124 Thessaloniki, Macedonia, Greece
| |
Collapse
|
5
|
Sidhanta SPD, Sowdhamini R, Srinivasan N. Comparative analysis of permanent and transient domain-domain interactions in multi-domain proteins. Proteins 2023. [PMID: 37828826 DOI: 10.1002/prot.26581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 08/09/2023] [Accepted: 08/11/2023] [Indexed: 10/14/2023]
Abstract
Protein domains are structural, functional, and evolutionary units. These domains bring out the diversity of functionality by means of interactions with other co-existing domains and provide stability. Hence, it is important to study intra-protein inter-domain interactions from the perspective of types of interactions. Domains within a chain could interact over short timeframes or permanently, rather like protein-protein interactions (PPIs). However, no systematic study has been carried out between two classes, namely permanent and transient domain-domain interactions. In this work, we studied 263 two-domain proteins, belonging to either of these classes and their interfaces on the basis of several factors, such as interface area and details of interactions (number, strength, and types of interactions). We also characterized them based on residue conservation at the interface, correlation of residue motions across domains, its involvement in repeat formation, and their involvement in particular molecular processes. Finally, we could analyze the interactions arising from domains in two-domain monomeric proteins, and we observed significant differences between these two classes of domain interactions and a few similarities. This study will help to obtain a better understanding of structure-function and folding principles of multi-domain proteins.
Collapse
Affiliation(s)
| | - Ramanathan Sowdhamini
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- Computational Approaches to Protein Science, National Centre for Biological Sciences, Bangalore, India
- Computational Biology, Institute of Bioinformatics and Applied Biotechnology, Bangalore, India
| | | |
Collapse
|
6
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A Suite of Designed Protein Cages Using Machine Learning Algorithms and Protein Fragment-Based Protocols. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561468. [PMID: 37873110 PMCID: PMC10592684 DOI: 10.1101/2023.10.09.561468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, while methods for their creation remain challenging and unpredictable. In the present study, we apply new computational approaches to design a suite of new tetrahedrally symmetric, self-assembling protein cages. For the generation of docked poses, we emphasize a protein fragment-based approach, while for de novo interface design, a comparison of computational protocols highlights the power and increased experimental success achieved using the machine learning program ProteinMPNN. In relating information from docking and design, we observe that agreement between fragment-based sequence preferences and ProteinMPNN sequence inference correlates with experimental success. Additional insights for designing polar interactions are highlighted by experimentally testing larger and more polar interfaces. In all, using X-ray crystallography and cryo-EM, we report five structures for seven protein cages, with atomic resolution in the best case reaching 2.0 Å. We also report structures of two incompletely assembled protein cages, providing unique insights into one type of assembly failure. The new set of designed cages and their structures add substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Michael R. Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Mark A. Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Todd O. Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| |
Collapse
|
7
|
Badonyi M, Marsh JA. Buffering of genetic dominance by allele-specific protein complex assembly. SCIENCE ADVANCES 2023; 9:eadf9845. [PMID: 37256959 PMCID: PMC10413657 DOI: 10.1126/sciadv.adf9845] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Accepted: 04/24/2023] [Indexed: 06/02/2023]
Abstract
Protein complex assembly often occurs while subunits are being translated, resulting in complexes whose subunits were translated from the same mRNA in an allele-specific manner. It has thus been hypothesized that such cotranslational assembly may counter the assembly-mediated dominant-negative effect, whereby co-assembly of mutant and wild-type subunits "poisons" complex activity. Here, we show that cotranslationally assembling subunits are much less likely to be associated with autosomal dominant relative to recessive disorders, and that subunits with dominant-negative disease mutations are significantly depleted in cotranslational assembly compared to those associated with loss-of-function mutations. We also find that complexes with known dominant-negative effects tend to expose their interfaces late during translation, lessening the likelihood of cotranslational assembly. Finally, by combining complex properties with other features, we trained a computational model for predicting proteins likely to be associated with non-loss-of-function disease mechanisms, which we believe will be of considerable utility for protein variant interpretation.
Collapse
Affiliation(s)
- Mihaly Badonyi
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
| | | |
Collapse
|
8
|
Simanshu DK, Philips MR, Hancock JF. Consensus on the RAS dimerization hypothesis: Strong evidence for lipid-mediated clustering but not for G-domain-mediated interactions. Mol Cell 2023; 83:1210-1215. [PMID: 36990093 PMCID: PMC10150945 DOI: 10.1016/j.molcel.2023.03.008] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 02/24/2023] [Accepted: 03/07/2023] [Indexed: 03/30/2023]
Abstract
One of the open questions in RAS biology is the existence of RAS dimers and their role in RAF dimerization and activation. The idea of RAS dimers arose from the discovery that RAF kinases function as obligate dimers, which generated the hypothesis that RAF dimer formation might be nucleated by G-domain-mediated RAS dimerization. Here, we review the evidence for RAS dimerization and describe a recent discussion among RAS researchers that led to a consensus that the clustering of two or more RAS proteins is not due to the stable association of G-domains but, instead, is a consequence of RAS C-terminal membrane anchors and the membrane phospholipids with which they interact.
Collapse
Affiliation(s)
- Dhirendra K Simanshu
- NCI RAS Initiative, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA.
| | - Mark R Philips
- Perlmutter Cancer Center, NYU Grossman School of Medicine, New York, NY, USA.
| | - John F Hancock
- Department of Integrative Biology and Pharmacology, McGovern Medical School, University of Texas Health Science Center, Houston, TX, USA.
| |
Collapse
|
9
|
Nandigrami P, Fiser A. Assessing the functional impact of protein binding site definition. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.26.525812. [PMID: 36747792 PMCID: PMC9900911 DOI: 10.1101/2023.01.26.525812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Many biomedical applications, such as classification of binding specificities or bioengineering, depend on the accurate definition of protein binding interfaces. Depending on the choice of method used, substantially different sets of residues can be classified as belonging to the interface of a protein. A typical approach used to verify these definitions is to mutate residues and measure the impact of these changes on binding. Besides the lack of exhaustive data this approach generates, it also suffers from the fundamental problem that a mutation introduces an unknown amount of alteration into an interface, which potentially alters the binding characteristics of the interface. In this study we explore the impact of alternative binding site definitions on the ability of a protein to recognize its cognate ligand using a pharmacophore approach, which does not affect the interface. The study also provides guidance on the minimum expected accuracy of interface definition that is required to capture the biological function of a protein.
Collapse
Affiliation(s)
- Prithviraj Nandigrami
- Departments of Systems & Computational Biology, and Biochemistry, Albert Einstein College of Medicine 1300 Morris Park Ave, Bronx, NY 10461, USA
| | - Andras Fiser
- Departments of Systems & Computational Biology, and Biochemistry, Albert Einstein College of Medicine 1300 Morris Park Ave, Bronx, NY 10461, USA
| |
Collapse
|
10
|
Thiers KLL, da Silva JHM, Vasconcelos DCA, Aziz S, Noceda C, Arnholdt-Schmitt B, Costa JH. Polymorphisms in alternative oxidase genes from ecotypes of Arabidopsis and rice revealed an environment-induced linkage to altitude and rainfall. PHYSIOLOGIA PLANTARUM 2023; 175:e13847. [PMID: 36562612 DOI: 10.1111/ppl.13847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 12/07/2022] [Accepted: 12/19/2022] [Indexed: 06/17/2023]
Abstract
We investigated SNPs in alternative oxidase (AOX) genes and their connection to ecotype origins (climate, altitude, and rainfall) by using genomic data sets of Arabidopsis and rice populations from 1190 and 90 ecotypes, respectively. Parameters were defined to detect non-synonymous SNPs in the AOX ORF, which revealed amino acid (AA) changes in AOX1c, AOX1d, and AOX2 from Arabidopsis and AOX1c from rice in comparison to AOX references from Columbia-0 and Japonica ecotypes, respectively. Among these AA changes, Arabidopsis AOX1c_A161E&G165R and AOX1c_R242S revealed a link to high rainfall and high altitude, respectively, while all other changes in Arabidopsis and rice AOX was connected to high altitude and rainfall. Comparative 3D modeling showed that all mutant AOX presented structural differences in relation to the respective references. Molecular docking analysis uncovered lower binding affinity values between AOX and the substrate ubiquinol for most of the identified structures compared to their reference, indicating better enzyme-substrate binding affinities. Thus, our in silico data suggest that the majority of the AA changes found in the available ecotypes will confer better enzyme-subtract interactions and thus indicate environment-related, more efficient AOX activity.
Collapse
Affiliation(s)
- Karine Leitão Lima Thiers
- Functional Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, Federal University of Ceara, Fortaleza, Brazil
- Non-Institutional Competence Focus (NICFocus) 'Functional Cell Reprogramming and Organism Plasticity' (FunCROP), coordinated from Foros de Vale de Figueira, Alentejo, Portugal
| | | | | | - Shahid Aziz
- Functional Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, Federal University of Ceara, Fortaleza, Brazil
- Non-Institutional Competence Focus (NICFocus) 'Functional Cell Reprogramming and Organism Plasticity' (FunCROP), coordinated from Foros de Vale de Figueira, Alentejo, Portugal
| | - Carlos Noceda
- Non-Institutional Competence Focus (NICFocus) 'Functional Cell Reprogramming and Organism Plasticity' (FunCROP), coordinated from Foros de Vale de Figueira, Alentejo, Portugal
- Cell and Molecular Biology of Plants (BIOCEMP)/Industrial Biotechnology and Bioproducts, Departamento de Ciencias de la Vida y de la Agricultura, Universidad de las Fuerzas Armadas-ESPE, Sangolquí, Ecuador
- Facultad de Ciencias de la ingeniería, Universidad Estatal de Milagro, Milagro, Ecuador
| | - Birgit Arnholdt-Schmitt
- Functional Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, Federal University of Ceara, Fortaleza, Brazil
- Non-Institutional Competence Focus (NICFocus) 'Functional Cell Reprogramming and Organism Plasticity' (FunCROP), coordinated from Foros de Vale de Figueira, Alentejo, Portugal
| | - José Hélio Costa
- Functional Genomics and Bioinformatics, Department of Biochemistry and Molecular Biology, Federal University of Ceara, Fortaleza, Brazil
- Non-Institutional Competence Focus (NICFocus) 'Functional Cell Reprogramming and Organism Plasticity' (FunCROP), coordinated from Foros de Vale de Figueira, Alentejo, Portugal
| |
Collapse
|
11
|
Tubiana T, Sillitoe I, Orengo C, Reuter N. Dissecting peripheral protein-membrane interfaces. PLoS Comput Biol 2022; 18:e1010346. [PMID: 36516231 PMCID: PMC9797079 DOI: 10.1371/journal.pcbi.1010346] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 12/28/2022] [Accepted: 11/24/2022] [Indexed: 12/15/2022] Open
Abstract
Peripheral membrane proteins (PMPs) include a wide variety of proteins that have in common to bind transiently to the chemically complex interfacial region of membranes through their interfacial binding site (IBS). In contrast to protein-protein or protein-DNA/RNA interfaces, peripheral protein-membrane interfaces are poorly characterized. We collected a dataset of PMP domains representative of the variety of PMP functions: membrane-targeting domains (Annexin, C1, C2, discoidin C2, PH, PX), enzymes (PLA, PLC/D) and lipid-transfer proteins (START). The dataset contains 1328 experimental structures and 1194 AphaFold models. We mapped the amino acid composition and structural patterns of the IBS of each protein in this dataset, and evaluated which were more likely to be found at the IBS compared to the rest of the domains' accessible surface. In agreement with earlier work we find that about two thirds of the PMPs in the dataset have protruding hydrophobes (Leu, Ile, Phe, Tyr, Trp and Met) at their IBS. The three aromatic amino acids Trp, Tyr and Phe are a hallmark of PMPs IBS regardless of whether they protrude on loops or not. This is also the case for lysines but not arginines suggesting that, unlike for Arg-rich membrane-active peptides, the less membrane-disruptive lysine is preferred in PMPs. Another striking observation was the over-representation of glycines at the IBS of PMPs compared to the rest of their surface, possibly procuring IBS loops a much-needed flexibility to insert in-between membrane lipids. The analysis of the 9 superfamilies revealed amino acid distribution patterns in agreement with their known functions and membrane-binding mechanisms. Besides revealing novel amino acids patterns at protein-membrane interfaces, our work contributes a new PMP dataset and an analysis pipeline that can be further built upon for future studies of PMPs properties, or for developing PMPs prediction tools using for example, machine learning approaches.
Collapse
Affiliation(s)
- Thibault Tubiana
- Department of Chemistry, University of Bergen, Bergen, Norway
- Computational Biology Unit, University of Bergen, Bergen, Norway
| | - Ian Sillitoe
- Department of Structural and Molecular Biology, University College London, London, United Kingdom
| | - Christine Orengo
- Department of Structural and Molecular Biology, University College London, London, United Kingdom
| | - Nathalie Reuter
- Department of Chemistry, University of Bergen, Bergen, Norway
- Computational Biology Unit, University of Bergen, Bergen, Norway
| |
Collapse
|
12
|
Pal A, Chakrabarti P, Dey S. ProDFace: A web-tool for the dissection of protein-DNA interfaces. Front Mol Biosci 2022; 9:978310. [PMID: 36148013 PMCID: PMC9486321 DOI: 10.3389/fmolb.2022.978310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2022] [Accepted: 08/09/2022] [Indexed: 11/30/2022] Open
Abstract
Protein-DNA interactions play a crucial role in gene expression and regulation. Identifying the DNA binding surface of proteins has long been a challenge–in comparison to protein-protein interactions, limited progress has been made in the development of efficient DNA binding site prediction and protein-DNA docking methods. Here we present ProDFace, a web tool that characterizes the binding region of a protein-DNA complex based on amino acid propensity, hydrogen bond (HB) donor capacity (number of solvent accessible HB donor groups), sequence conservation at the interface core and rim region, and geometry. The program takes as input the structure of a protein-DNA complex in PDB (Protein Data Bank) format, and outputs various physicochemical and geometric parameters of the interface, as well as conservation of the interface residues in the protein component. Values are provided for the whole interface, and after dissecting it into core and rim regions. Details of water mediated HBs between protein and DNA, potential HB donor groups present at the binding surface of protein, and conserved interface residues are also provided as downloadable text files. These parameters can be useful in evaluating and validating protein-DNA docking solutions, structures derived from simulation as well as solutions from the available prediction tools, and facilitate the development of more efficient prediction methods. The web-tool is freely available at structbioinfo.iitj.ac.in/resources/bioinfo/pd_interface.
Collapse
Affiliation(s)
- Arumay Pal
- School of Bioengineering, Vellore Institute of Technology, Bhopal, India
| | | | - Sucharita Dey
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, Karwar, India
- *Correspondence: Sucharita Dey,
| |
Collapse
|
13
|
Kleiner D, Shapiro Tuchman Z, Shmulevich F, Shahar A, Zarivach R, Kosloff M, Bershtein S. Evolution of homo-oligomerization of methionine S-adenosyltransferases is replete with structure-function constrains. Protein Sci 2022; 31:e4352. [PMID: 35762725 PMCID: PMC9202080 DOI: 10.1002/pro.4352] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 04/14/2022] [Accepted: 05/07/2022] [Indexed: 11/16/2022]
Abstract
Homomers are prevalent in bacterial proteomes, particularly among core metabolic enzymes. Homomerization is often key to function and regulation, and interfaces that facilitate the formation of homomeric enzymes are subject to intense evolutionary change. However, our understanding of the molecular mechanisms that drive evolutionary variation in homomeric complexes is still lacking. How is the diversification of protein interfaces linked to variation in functional regulation and structural integrity of homomeric complexes? To address this question, we studied quaternary structure evolution of bacterial methionine S-adenosyltransferases (MATs)-dihedral homotetramers formed along a large and conserved dimeric interface harboring two active sites, and a small, recently evolved, interdimeric interface. Here, we show that diversity in the physicochemical properties of small interfaces is directly linked to variability in the kinetic stability of MAT quaternary complexes and in modes of their functional regulation. Specifically, hydrophobic interactions within the small interface of Escherichia coli MAT render the functional homotetramer kinetically stable yet impose severe aggregation constraints on complex assembly. These constraints are alleviated by electrostatic interactions that accelerate dimer-dimer assembly. In contrast, Neisseria gonorrhoeae MAT adopts a nonfunctional dimeric state due to the low hydrophobicity of its small interface and the high flexibility of its active site loops, which perturbs small interface integrity. Remarkably, in the presence of methionine and ATP, N. gonorrhoeae MAT undergoes substrate-induced assembly into a functional tetrameric state. We suggest that evolution acts on the interdimeric interfaces of MATs to tailor the regulation of their activity and stability to unique organismal needs.
Collapse
Affiliation(s)
- Daniel Kleiner
- Department of Life SciencesBen‐Gurion University of the NegevBeer‐ShevaIsrael
| | - Ziva Shapiro Tuchman
- The Department of Human Biology, Faculty of Natural SciencesUniversity of HaifaHaifaIsrael
| | - Fannia Shmulevich
- Department of Life SciencesBen‐Gurion University of the NegevBeer‐ShevaIsrael
| | - Anat Shahar
- Ilse Katz Institute for Nanoscale Science & TechnologyBen‐Gurion University of the NegevBeer‐ShevaIsrael
| | - Raz Zarivach
- Department of Life SciencesBen‐Gurion University of the NegevBeer‐ShevaIsrael
- Macromolecular Crystallography and Cryo‐EM Research Center, The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer‐ShevaIsrael
| | - Mickey Kosloff
- The Department of Human Biology, Faculty of Natural SciencesUniversity of HaifaHaifaIsrael
| | - Shimon Bershtein
- Department of Life SciencesBen‐Gurion University of the NegevBeer‐ShevaIsrael
| |
Collapse
|
14
|
Dey S, Prilusky J, Levy ED. QSalignWeb: A Server to Predict and Analyze Protein Quaternary Structure. Front Mol Biosci 2022; 8:787510. [PMID: 35071324 PMCID: PMC8769216 DOI: 10.3389/fmolb.2021.787510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 12/02/2021] [Indexed: 11/16/2022] Open
Abstract
The identification of physiologically relevant quaternary structures (QSs) in crystal lattices is challenging. To predict the physiological relevance of a particular QS, QSalign searches for homologous structures in which subunits interact in the same geometry. This approach proved accurate but was limited to structures already present in the Protein Data Bank (PDB). Here, we introduce a webserver (www.QSalign.org) allowing users to submit homo-oligomeric structures of their choice to the QSalign pipeline. Given a user-uploaded structure, the sequence is extracted and used to search homologs based on sequence similarity and PFAM domain architecture. If structural conservation is detected between a homolog and the user-uploaded QS, physiological relevance is inferred. The web server also generates alternative QSs with PISA and processes them the same way as the query submitted to widen the predictions. The result page also shows representative QSs in the protein family of the query, which is informative if no QS conservation was detected or if the protein appears monomeric. These representative QSs can also serve as a starting point for homology modeling.
Collapse
Affiliation(s)
- Sucharita Dey
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Jaime Prilusky
- Department of Life Sciences and Core Facilities, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D. Levy
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
15
|
Zhu J, Avakyan N, Kakkis AA, Hoffnagle AM, Han K, Li Y, Zhang Z, Choi TS, Na Y, Yu CJ, Tezcan FA. Protein Assembly by Design. Chem Rev 2021; 121:13701-13796. [PMID: 34405992 PMCID: PMC9148388 DOI: 10.1021/acs.chemrev.1c00308] [Citation(s) in RCA: 112] [Impact Index Per Article: 37.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Proteins are nature's primary building blocks for the construction of sophisticated molecular machines and dynamic materials, ranging from protein complexes such as photosystem II and nitrogenase that drive biogeochemical cycles to cytoskeletal assemblies and muscle fibers for motion. Such natural systems have inspired extensive efforts in the rational design of artificial protein assemblies in the last two decades. As molecular building blocks, proteins are highly complex, in terms of both their three-dimensional structures and chemical compositions. To enable control over the self-assembly of such complex molecules, scientists have devised many creative strategies by combining tools and principles of experimental and computational biophysics, supramolecular chemistry, inorganic chemistry, materials science, and polymer chemistry, among others. Owing to these innovative strategies, what started as a purely structure-building exercise two decades ago has, in short order, led to artificial protein assemblies with unprecedented structures and functions and protein-based materials with unusual properties. Our goal in this review is to give an overview of this exciting and highly interdisciplinary area of research, first outlining the design strategies and tools that have been devised for controlling protein self-assembly, then describing the diverse structures of artificial protein assemblies, and finally highlighting the emergent properties and functions of these assemblies.
Collapse
Affiliation(s)
| | | | - Albert A. Kakkis
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Alexander M. Hoffnagle
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Kenneth Han
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Yiying Li
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Zhiyin Zhang
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Tae Su Choi
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Youjeong Na
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Chung-Jui Yu
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - F. Akif Tezcan
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| |
Collapse
|
16
|
Béreux S, Delmas B, Cazals F. Boosting the analysis of protein interfaces with multiple interface string alignments: Illustration on the spikes of coronaviruses. Proteins 2021; 90:848-857. [PMID: 34779026 DOI: 10.1002/prot.26279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 06/24/2021] [Accepted: 10/19/2021] [Indexed: 11/11/2022]
Abstract
We introduce multiple interface string alignment (MISA), a visualization tool to display coherently various sequence and structure based statistics at protein-protein interfaces (SSE elements, buried surface area, Δ ASA , B factor values, etc). The amino acids supporting these annotations are obtained from Voronoi interface models. The benefit of MISA is to collate annotated sequences of (homologous) chains found in different biological contexts, that is, bound with different partners or unbound. The aggregated views MISA/SSE, MISA/BSA, MISA/ΔASA, and so forth, make it trivial to identify commonalities and differences between chains, to infer key interface residues, and to understand where conformational changes occur upon binding. As such, they should prove of key relevance for knowledge-based annotations of protein databases such as the Protein Data Bank. Illustrations are provided on the receptor binding domain of coronaviruses, in complex with their cognate partner or (neutralizing) antibodies. MISA computed with a minimal number of structures complement and enrich findings previously reported. The corresponding package is available from the Structural Bioinformatics Library (http://sbl.inria.frand https://sbl.inria.fr/doc/Multiple_interface_string_alignment-user-manual.html).
Collapse
|
17
|
Influence of surface termination of ultrananocrystalline diamond films coated on titanium on response of human osteoblast cells: A proteome study. MATERIALS SCIENCE & ENGINEERING. C, MATERIALS FOR BIOLOGICAL APPLICATIONS 2021; 128:112289. [PMID: 34474840 DOI: 10.1016/j.msec.2021.112289] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 06/15/2021] [Accepted: 06/25/2021] [Indexed: 12/26/2022]
Abstract
Successful osseointegration, i.e. the fully functional connection of patient's bone and artificial implant depends on the response of the cells to the direct contact with the surface of the implant. The surface properties of the implant which trigger cell responses leading to its integration into the surrounding bone can be tailored by surface modifications or coating with thin layers. One potential material for such applications is ultrananocrystalline diamond (UNCD). It combines the exceptional mechanical properties of diamond with good biocompatibility and possibility of coating as thin uniform films on different substrates of biological interest. In the current work we firstly deposited UNCD films on titanium-coated substrates and applied oxygen or ammonia plasma to modify their surface properties. The as-grown and modified UNCD exhibited relatively smooth surfaces with topography dominated by rounded features. The modifications induced oxygen- or amino-terminated surfaces with increased hydrophilicity. In addition, the UNCD coatings exhibited very low coefficient of friction when diamond was used as a counterpart. As-grown and modified UNCD samples were applied to study the responses of human osteoblast MG63 cells triggered by surfaces with various terminations assessed by proteomic analysis. The results revealed that the coating of Ti with UNCD as well as the plasma modifications resulting in O- or NH2-terminated UNCD induced upregulation of proteins specific for cytoskeleton, cell membrane, and extracellular matrix (ECM) involved in the cell-ECM-surface interactions. Proteins from each of these groups, namely, vimentin, cadherin and fibronectin were further studied immunocytochemically and the results confirmed their increased abundance leading to improved cell-to-surface adhesion and cell-to-cell interactions. These findings demonstrate the potential of implant coating with UNCD and its surface modifications for better osseointegration and bone formation.
Collapse
|
18
|
Kanapeckaitė A, Beaurivage C, Jančorienė L, Mažeikienė A. In silico drug discovery for a complex immunotherapeutic target - human c-Rel protein. Biophys Chem 2021; 276:106593. [PMID: 34087524 DOI: 10.1016/j.bpc.2021.106593] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 03/28/2021] [Accepted: 04/12/2021] [Indexed: 12/22/2022]
Abstract
Target evaluation and rational drug design rely on identifying and characterising small-molecule binding sites on therapeutically relevant target proteins. Immunotherapeutics development is especially challenging because of complex disease etiology and heterogenous nature of targets. c-Rel protein, a promising target in many human inflammatory and cancer pathologies, was selected as a case study for an effective in silico screening platform development since this transcription factor currently has no successful therapeutic inhibitors or modulators. This study introduces a novel in silico screening approach to probe binding sites using structural validation sets, molecular modelling and describes a method of a computer-aided drug design when a crystal structure is not available for the target of interest. In addition, we showed that binding sites can be analysed with the machine learning as well as molecular simulation approaches to help assess and systematically analyse how drug candidates can exert their mode of action. Finally, this cutting-edge approach was subjected to a high through-put virtual screen of selected 34 M drug-like compounds filtered from a library of 659 M compounds by identifying the most promising structures and proposing potential action mechanisms for the future development of highly selective human c-Rel inhibitors and/or modulators.
Collapse
Affiliation(s)
| | | | - Ligita Jančorienė
- Vilnius University Medical Faculty InsTtute of Clinical Medicine, Clinic of InfecTous Diseases and Dermatovenerology, Santariškių str. 14, 08406 Vilnius, Lithuania
| | - Asta Mažeikienė
- Department of Physiology, Biochemistry, Microbiology and Laboratory Medicine, Institute of Biomedical Sciences, Faculty of Medicine, Vilnius University, M. K. Čiurlionio g. 21, LT-03101, Vilnius, Lithuania
| |
Collapse
|
19
|
Fogha J, Bayry J, Diharce J, de Brevern AG. Structural and evolutionary exploration of the IL-3 family and its alpha subunit receptors. Amino Acids 2021; 53:1211-1227. [PMID: 34196789 DOI: 10.1007/s00726-021-03026-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 06/21/2021] [Indexed: 12/14/2022]
Abstract
Interleukin-3 (IL-3) is a cytokine belonging to the family of common β (βc) and is involved in various biological systems. Its activity is mediated by the interaction with its receptor (IL-3R), a heterodimer composed of two distinct subunits: IL-3Rα and βc. IL-3 and its receptor, especially IL-3Rα, play a crucial role in pathologies like inflammatory diseases and therefore are interesting therapeutic targets. Here, we have performed an analysis of these proteins and their interaction based on structural and evolutionary information. We highlighted that IL-3 and IL-3Rα structural architectures are conserved across evolution and shared with other proteins belonging to the same βc family interleukin-5 (IL-5) and granulocyte-macrophage colony-stimulating factor (GM-CSF). The IL-3Rα/IL-3 interaction is mediated by a large interface in which most residues are surprisingly not conserved during evolution and across family members. In spite of this high variability, we suggested small regions constituted by few residues conserved during the evolution in both proteins that could be important for the binding affinity.
Collapse
Affiliation(s)
- Jade Fogha
- UMR_S 1134, DSIMB, Université de Paris, Inserm, Biologie Intégrée du Globule Rouge, 75739, Paris, France
- Institut National de La Transfusion Sanguine (INTS), 75739, Paris, France
- Laboratoire D'Excellence GR-Ex, 75739, Paris, France
| | - Jagadeesh Bayry
- Centre de Recherche Des Cordeliers, Institut National de La Santé Et de La Recherche Médicale, Sorbonne Université, Université de Paris, 75006, Paris, France
- Indian Institute of Technology Palakkad, Kozhippara, Palakkad, 678 557, India
| | - Julien Diharce
- UMR_S 1134, DSIMB, Université de Paris, Inserm, Biologie Intégrée du Globule Rouge, 75739, Paris, France.
- Institut National de La Transfusion Sanguine (INTS), 75739, Paris, France.
- Laboratoire D'Excellence GR-Ex, 75739, Paris, France.
| | - Alexandre G de Brevern
- UMR_S 1134, DSIMB, Université de Paris, Inserm, Biologie Intégrée du Globule Rouge, 75739, Paris, France.
- Institut National de La Transfusion Sanguine (INTS), 75739, Paris, France.
- Laboratoire D'Excellence GR-Ex, 75739, Paris, France.
- UMR_S 1134, DSIMB, Université de La Réunion, Inserm, Biologie Intégrée du Globule Rouge, La Réunion, 97744, Saint-Denis, France.
| |
Collapse
|
20
|
Frutiger A, Tanno A, Hwu S, Tiefenauer RF, Vörös J, Nakatsuka N. Nonspecific Binding-Fundamental Concepts and Consequences for Biosensing Applications. Chem Rev 2021; 121:8095-8160. [PMID: 34105942 DOI: 10.1021/acs.chemrev.1c00044] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Nature achieves differentiation of specific and nonspecific binding in molecular interactions through precise control of biomolecules in space and time. Artificial systems such as biosensors that rely on distinguishing specific molecular binding events in a sea of nonspecific interactions have struggled to overcome this issue. Despite the numerous technological advancements in biosensor technologies, nonspecific binding has remained a critical bottleneck due to the lack of a fundamental understanding of the phenomenon. To date, the identity, cause, and influence of nonspecific binding remain topics of debate within the scientific community. In this review, we discuss the evolution of the concept of nonspecific binding over the past five decades based upon the thermodynamic, intermolecular, and structural perspectives to provide classification frameworks for biomolecular interactions. Further, we introduce various theoretical models that predict the expected behavior of biosensors in physiologically relevant environments to calculate the theoretical detection limit and to optimize sensor performance. We conclude by discussing existing practical approaches to tackle the nonspecific binding challenge in vitro for biosensing platforms and how we can both address and harness nonspecific interactions for in vivo systems.
Collapse
Affiliation(s)
- Andreas Frutiger
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Alexander Tanno
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Stephanie Hwu
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Raphael F Tiefenauer
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - János Vörös
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| | - Nako Nakatsuka
- Laboratory of Biosensors and Bioelectronics, Institute for Biomedical Engineering, ETH Zürich, Zürich CH-8092, Switzerland
| |
Collapse
|
21
|
Controlling Protein Crystallization by Free Energy Guided Design of Interactions at Crystal Contacts. CRYSTALS 2021. [DOI: 10.3390/cryst11060588] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Protein crystallization can function as an effective method for protein purification or formulation. Such an application requires a comprehensive understanding of the intermolecular protein–protein interactions that drive and stabilize protein crystal formation to ensure a reproducible process. Using alcohol dehydrogenase from Lactobacillus brevis (LbADH) as a model system, we probed in our combined experimental and computational study the effect of residue substitutions at the protein crystal contacts on the crystallizability and the contact stability. Increased or decreased contact stability was calculated using molecular dynamics (MD) free energy simulations and showed excellent qualitative correlation with experimentally determined increased or decreased crystallizability. The MD simulations allowed us to trace back the changes to their physical origins at the atomic level. Engineered charge–charge interactions as well as engineered hydrophobic effects could be characterized and were found to improve crystallizability. For example, the simulations revealed a redesigning of a water mediated electrostatic interaction (“wet contact”) into a water depleted hydrophobic effect (“dry contact”) and the optimization of a weak hydrogen bonding contact towards a strong one. These findings explained the experimentally found improved crystallizability. Our study emphasizes that it is difficult to derive simple rules for engineering crystallizability but that free energy simulations could be a very useful tool for understanding the contribution of crystal contacts for stability and furthermore could help guide protein engineering strategies to enhance crystallization for technical purposes.
Collapse
|
22
|
Kyrilis FL, Semchonok DA, Skalidis I, Tüting C, Hamdi F, O'Reilly FJ, Rappsilber J, Kastritis PL. Integrative structure of a 10-megadalton eukaryotic pyruvate dehydrogenase complex from native cell extracts. Cell Rep 2021; 34:108727. [PMID: 33567276 DOI: 10.1016/j.celrep.2021.108727] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 12/02/2020] [Accepted: 01/14/2021] [Indexed: 12/29/2022] Open
Abstract
The pyruvate dehydrogenase complex (PDHc) is a giant enzymatic assembly involved in pyruvate oxidation. PDHc components have been characterized in isolation, but the complex's quaternary structure has remained elusive due to sheer size, heterogeneity, and plasticity. Here, we identify fully assembled Chaetomium thermophilum α-keto acid dehydrogenase complexes in native cell extracts and characterize their domain arrangements utilizing mass spectrometry, activity assays, crosslinking, electron microscopy (EM), and computational modeling. We report the cryo-EM structure of the PDHc core and observe unique features of the previously unknown native state. The asymmetric reconstruction of the 10-MDa PDHc resolves spatial proximity of its components, agrees with stoichiometric data (60 E2p:12 E3BP:∼20 E1p: ≤ 12 E3), and proposes a minimum reaction path among component enzymes. The PDHc shows the presence of a dynamic pyruvate oxidation compartment, organized by core and peripheral protein species. Our data provide a framework for further understanding PDHc and α-keto acid dehydrogenase complex structure and function.
Collapse
Affiliation(s)
- Fotis L Kyrilis
- Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany; Institute of Biochemistry and Biotechnology, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3, Halle/Saale, Germany
| | - Dmitry A Semchonok
- Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany
| | - Ioannis Skalidis
- Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany; Institute of Biochemistry and Biotechnology, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3, Halle/Saale, Germany
| | - Christian Tüting
- Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany
| | - Farzad Hamdi
- Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany
| | - Francis J O'Reilly
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 13355 Berlin, Germany
| | - Juri Rappsilber
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 13355 Berlin, Germany; Wellcome Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, Scotland, United Kingdom
| | - Panagiotis L Kastritis
- Interdisciplinary Research Center HALOmem, Charles Tanford Protein Center, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3a, Halle/Saale, Germany; Institute of Biochemistry and Biotechnology, Martin Luther University Halle-Wittenberg, Kurt-Mothes-Straße 3, Halle/Saale, Germany; Biozentrum, Martin Luther University Halle-Wittenberg, Weinbergweg 22, Halle/Saale, Germany.
| |
Collapse
|
23
|
Slater O, Miller B, Kontoyianni M. Decoding Protein-protein Interactions: An Overview. Curr Top Med Chem 2021; 20:855-882. [PMID: 32101126 DOI: 10.2174/1568026620666200226105312] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2019] [Revised: 11/27/2019] [Accepted: 11/27/2019] [Indexed: 12/24/2022]
Abstract
Drug discovery has focused on the paradigm "one drug, one target" for a long time. However, small molecules can act at multiple macromolecular targets, which serves as the basis for drug repurposing. In an effort to expand the target space, and given advances in X-ray crystallography, protein-protein interactions have become an emerging focus area of drug discovery enterprises. Proteins interact with other biomolecules and it is this intricate network of interactions that determines the behavior of the system and its biological processes. In this review, we briefly discuss networks in disease, followed by computational methods for protein-protein complex prediction. Computational methodologies and techniques employed towards objectives such as protein-protein docking, protein-protein interactions, and interface predictions are described extensively. Docking aims at producing a complex between proteins, while interface predictions identify a subset of residues on one protein that could interact with a partner, and protein-protein interaction sites address whether two proteins interact. In addition, approaches to predict hot spots and binding sites are presented along with a representative example of our internal project on the chemokine CXC receptor 3 B-isoform and predictive modeling with IP10 and PF4.
Collapse
Affiliation(s)
- Olivia Slater
- Department of Pharmaceutical Sciences, Southern Illinois University, Edwardsville, IL 62026, United States
| | - Bethany Miller
- Department of Pharmaceutical Sciences, Southern Illinois University, Edwardsville, IL 62026, United States
| | - Maria Kontoyianni
- Department of Pharmaceutical Sciences, Southern Illinois University, Edwardsville, IL 62026, United States
| |
Collapse
|
24
|
Seychell BC, Beck T. Molecular basis for protein-protein interactions. Beilstein J Org Chem 2021; 17:1-10. [PMID: 33488826 PMCID: PMC7801801 DOI: 10.3762/bjoc.17.1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 12/07/2020] [Indexed: 01/11/2023] Open
Abstract
This minireview provides an overview on the current knowledge of protein-protein interactions, common characterisation methods to characterise them, and their role in protein complex formation with some examples. A deep understanding of protein-protein interactions and their molecular interactions is important for a number of applications, including drug design. Protein-protein interactions and their discovery are thus an interesting avenue for understanding how protein complexes, which make up the majority of proteins, work.
Collapse
Affiliation(s)
- Brandon Charles Seychell
- Universität Hamburg, Department of Chemistry, Institute of Physical Chemistry, Grindelallee 117, 20146 Hamburg, Germany
| | - Tobias Beck
- Universität Hamburg, Department of Chemistry, Institute of Physical Chemistry, Grindelallee 117, 20146 Hamburg, Germany
- The Hamburg Centre for Ultrafast Imaging, Hamburg, Germany
| |
Collapse
|
25
|
Dhawanjewar AS, Roy AA, Madhusudhan MS. A knowledge-based scoring function to assess quaternary associations of proteins. Bioinformatics 2020; 36:3739-3748. [PMID: 32246820 DOI: 10.1093/bioinformatics/btaa207] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 03/01/2020] [Accepted: 03/30/2020] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION The elucidation of all inter-protein interactions would significantly enhance our knowledge of cellular processes at a molecular level. Given the enormity of the problem, the expenses and limitations of experimental methods, it is imperative that this problem is tackled computationally. In silico predictions of protein interactions entail sampling different conformations of the purported complex and then scoring these to assess for interaction viability. In this study, we have devised a new scheme for scoring protein-protein interactions. RESULTS Our method, PIZSA (Protein Interaction Z-Score Assessment), is a binary classification scheme for identification of native protein quaternary assemblies (binders/nonbinders) based on statistical potentials. The scoring scheme incorporates residue-residue contact preference on the interface with per residue-pair atomic contributions and accounts for clashes. PIZSA can accurately discriminate between native and non-native structural conformations from protein docking experiments and outperform other contact-based potential scoring functions. The method has been extensively benchmarked and is among the top 6 methods, outperforming 31 other statistical, physics based and machine learning scoring schemes. The PIZSA potentials can also distinguish crystallization artifacts from biological interactions. AVAILABILITY AND IMPLEMENTATION PIZSA is implemented as a web server at http://cospi.iiserpune.ac.in/pizsa and can be downloaded as a standalone package from http://cospi.iiserpune.ac.in/pizsa/Download/Download.html. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Abhilesh S Dhawanjewar
- Indian Institute of Science Education and Research, Pashan, Pune 411008, India.,School of Biological Sciences, University of Nebraska, Lincoln, NE 68588, USA
| | - Ankit A Roy
- Indian Institute of Science Education and Research, Pashan, Pune 411008, India
| | | |
Collapse
|
26
|
Modeling Epac1 interactions with the allosteric inhibitor AM-001 by co-solvent molecular dynamics. J Comput Aided Mol Des 2020; 34:1171-1179. [PMID: 32700175 PMCID: PMC7533256 DOI: 10.1007/s10822-020-00332-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 07/13/2020] [Indexed: 12/13/2022]
Abstract
The exchange proteins activated by cAMP (EPAC) are implicated in a large variety of physiological processes and they are considered as promising targets for a wide range of therapeutic applications. Several recent reports provided evidence for the therapeutic effectiveness of the inhibiting EPAC1 activity cardiac diseases. In that context, we recently characterized a selective EPAC1 antagonist named AM-001. This compound was featured by a non-competitive mechanism of action but the localization of its allosteric site to EPAC1 structure has yet to be investigated. Therefore, we performed cosolvent molecular dynamics with the aim to identify a suitable allosteric binding site. Then, the docking and molecular dynamics were used to determine the binding of the AM-001 to the regions highlighted by cosolvent molecular dynamics for EPAC1. These analyses led us to the identification of a suitable allosteric AM-001 binding pocket at EPAC1. As a model validation, we also evaluated the binding poses of the available AM-001 analogues, with a different biological potency. Finally, the complex EPAC1 with AM-001 bound at the putative allosteric site was further refined by molecular dynamics. The principal component analysis led us to identify the protein motion that resulted in an inactive like conformation upon the allosteric inhibitor binding.
Collapse
|
27
|
Zhang J, Kurgan L. SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences. Bioinformatics 2020; 35:i343-i353. [PMID: 31510679 PMCID: PMC6612887 DOI: 10.1093/bioinformatics/btz324] [Citation(s) in RCA: 76] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Motivation Accurate predictions of protein-binding residues (PBRs) enhances understanding of molecular-level rules governing protein–protein interactions, helps protein–protein docking and facilitates annotation of protein functions. Recent studies show that current sequence-based predictors of PBRs severely cross-predict residues that interact with other types of protein partners (e.g. RNA and DNA) as PBRs. Moreover, these methods are relatively slow, prohibiting genome-scale use. Results We propose a novel, accurate and fast sequence-based predictor of PBRs that minimizes the cross-predictions. Our SCRIBER (SeleCtive pRoteIn-Binding rEsidue pRedictor) method takes advantage of three innovations: comprehensive dataset that covers multiple types of binding residues, novel types of inputs that are relevant to the prediction of PBRs, and an architecture that is tailored to reduce the cross-predictions. The dataset includes complete protein chains and offers improved coverage of binding annotations that are transferred from multiple protein–protein complexes. We utilize innovative two-layer architecture where the first layer generates a prediction of protein-binding, RNA-binding, DNA-binding and small ligand-binding residues. The second layer re-predicts PBRs by reducing overlap between PBRs and the other types of binding residues produced in the first layer. Empirical tests on an independent test dataset reveal that SCRIBER significantly outperforms current predictors and that all three innovations contribute to its high predictive performance. SCRIBER reduces cross-predictions by between 41% and 69% and our conservative estimates show that it is at least 3 times faster. We provide putative PBRs produced by SCRIBER for the entire human proteome and use these results to hypothesize that about 14% of currently known human protein domains bind proteins. Availability and implementation SCRIBER webserver is available at http://biomine.cs.vcu.edu/servers/SCRIBER/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jian Zhang
- School of Computer and Information Technology, Xinyang Normal University, Xinyang, China.,Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
28
|
Biological vs. Crystallographic Protein Interfaces: An Overview of Computational Approaches for Their Classification. CRYSTALS 2020. [DOI: 10.3390/cryst10020114] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Complexes between proteins are at the basis of almost every process in cells. Their study, from a structural perspective, has a pivotal role in understanding biological functions and, importantly, in drug development. X-ray crystallography represents the broadest source for the experimental structural characterization of protein-protein complexes. Correctly identifying the biologically relevant interface from the crystallographic ones is, however, not trivial and can be prone to errors. Over the past two decades, computational methodologies have been developed to study the differences of those interfaces and automatically classify them as biological or crystallographic. Overall, protein-protein interfaces show differences in terms of composition, energetics and evolutionary conservation between biological and crystallographic ones. Based on those observations, a number of computational methods have been developed for this classification problem, which can be grouped into three main categories: Energy-, empirical knowledge- and machine learning-based approaches. In this review, we give a comprehensive overview of the training datasets and methods so far implemented, providing useful links and a brief description of each method.
Collapse
|
29
|
Basu S, Bahadur RP. Do sequence neighbours of intrinsically disordered regions promote structural flexibility in intrinsically disordered proteins? J Struct Biol 2020; 209:107428. [DOI: 10.1016/j.jsb.2019.107428] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Revised: 11/14/2019] [Accepted: 11/17/2019] [Indexed: 10/25/2022]
|
30
|
Nithin C, Mukherjee S, Bahadur RP. A structure-based model for the prediction of protein-RNA binding affinity. RNA (NEW YORK, N.Y.) 2019; 25:1628-1645. [PMID: 31395671 PMCID: PMC6859855 DOI: 10.1261/rna.071779.119] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Accepted: 08/05/2019] [Indexed: 05/28/2023]
Abstract
Protein-RNA recognition is highly affinity-driven and regulates a wide array of cellular functions. In this study, we have curated a binding affinity data set of 40 protein-RNA complexes, for which at least one unbound partner is available in the docking benchmark. The data set covers a wide affinity range of eight orders of magnitude as well as four different structural classes. On average, we find the complexes with single-stranded RNA have the highest affinity, whereas the complexes with the duplex RNA have the lowest. Nevertheless, free energy gain upon binding is the highest for the complexes with ribosomal proteins and the lowest for the complexes with tRNA with an average of -5.7 cal/mol/Å2 in the entire data set. We train regression models to predict the binding affinity from the structural and physicochemical parameters of protein-RNA interfaces. The best fit model with the lowest maximum error is provided with three interface parameters: relative hydrophobicity, conformational change upon binding and relative hydration pattern. This model has been used for predicting the binding affinity on a test data set, generated using mutated structures of yeast aspartyl-tRNA synthetase, for which experimentally determined ΔG values of 40 mutations are available. The predicted ΔGempirical values highly correlate with the experimental observations. The data set provided in this study should be useful for further development of the binding affinity prediction methods. Moreover, the model developed in this study enhances our understanding on the structural basis of protein-RNA binding affinity and provides a platform to engineer protein-RNA interfaces with desired affinity.
Collapse
Affiliation(s)
- Chandran Nithin
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Sunandan Mukherjee
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| |
Collapse
|
31
|
Siebenmorgen T, Zacharias M. Computational prediction of protein–protein binding affinities. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1448] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Till Siebenmorgen
- Physics Department T38 Technical University of Munich Garching Germany
| | - Martin Zacharias
- Physics Department T38 Technical University of Munich Garching Germany
| |
Collapse
|
32
|
Rajagopal N, Nangia S. Obtaining Protein Association Energy Landscape for Integral Membrane Proteins. J Chem Theory Comput 2019; 15:6444-6455. [DOI: 10.1021/acs.jctc.9b00626] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Nandhini Rajagopal
- Department of Biomedical and Chemical Engineering, Syracuse University, 343 Link Hall, Syracuse, New York 13244, United States
| | - Shikha Nangia
- Department of Biomedical and Chemical Engineering, Syracuse University, 343 Link Hall, Syracuse, New York 13244, United States
| |
Collapse
|
33
|
Breberina LM, Zlatović MV, Nikolić MR, Stojanović SĐ. Computational Analysis of Non-covalent Interactions in Phycocyanin Subunit Interfaces. Mol Inform 2019; 38:e1800145. [PMID: 31535472 DOI: 10.1002/minf.201800145] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 08/26/2019] [Indexed: 11/10/2022]
Abstract
Protein-protein interactions are an important phenomenon in biological processes and functions. We used the manually curated non-redundant dataset of 118 phycocyanin interfaces to gain additional insight into this phenomenon using a robust inter-atomic non-covalent interaction analyzing tool PPCheck. Our observations indicate that there is a relatively high composition of hydrophobic residues at the interfaces. Most of the interface residues are clustered at the middle of the range which we call "standard-size" interfaces. Furthermore, the multiple interaction patterns founded in the present study indicate that more than half of the residues involved in these interactions participate in multiple and water-bridged hydrogen bonds. Thus, hydrogen bonds contribute maximally towards the stability of protein-protein complexes. The analysis shows that hydrogen bond energies contribute to about 88 % to the total energy and it also increases with interface size. Van der Waals (vdW) energy contributes to 9.3 %±1.7 % on average in these complexes. Moreover, there is about 1.9 %±1.5 % contribution by electrostatic energy. Nevertheless, the role by vdW and electrostatic energy could not be ignored in interface binding. Results show that the total binding energy is more for large phycocyanin interfaces. The normalized energy per residue was less than -16 kJ mol-1 , while most of them have energy in the range from -6 to -14 kJ mol-1 . The non-covalent interacting residues in these proteins were found to be highly conserved. Obtained results might contribute to the understanding of structural stability of this class of evolutionary essential proteins with increased practical application and future designs of novel protein-bioactive compound interactions.
Collapse
Affiliation(s)
- Luka M Breberina
- University of Belgrade - Faculty of Chemistry, Department of Biochemistry, Belgrade, Serbia
| | - Mario V Zlatović
- University of Belgrade - Faculty of Chemistry, Center for Computational Chemistry and Bioinformatics, Belgrade, Serbia
| | - Milan R Nikolić
- University of Belgrade - Faculty of Chemistry, Department of Biochemistry, Belgrade, Serbia
| | - Srđan Đ Stojanović
- Institute of Chemistry, Technology and Metallurgy (ICTM) - Department of Chemistry, University of Belgrade, Belgrade, Serbia
| |
Collapse
|
34
|
Hadi-Alijanvand H. Soft regions of protein surface are potent for stable dimer formation. J Biomol Struct Dyn 2019; 38:3587-3598. [PMID: 31476974 DOI: 10.1080/07391102.2019.1662328] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
By having knowledge about the characteristics of protein interaction interfaces, we will be able to manipulate protein complexes for therapies. Dimer state is considered as the primary alphabet of the most proteins' quaternary structure. The properties of binding interface between subunits and of noninterface region define the specificity and stability of the intended protein complex. Considering some topological properties and amino acids' affinity for binding in interfaces of protein dimers, we construct the interface-specific recurrence plots. The data obtained from recurrence quantitative analysis, and accessibility-related metrics help us to classify the protein dimers into four distinct classes. Some mechanical properties of binding interfaces are computed for each predefined class of the dimers. The computed mechanical characteristics of binding patch region are compared with those of nonbinding region of proteins. Our observations indicate that the mechanical properties of protein binding sites have a decisive impact on determining the dimer stability. We introduce a new concept in analyzing protein structure by considering mechanical properties of protein structure. We conclude that the interface region between subunits of stable dimers is usually mechanically softer than the interface of unstable protein dimers. AbbreviationsAABaverage affinity for bindingANManisotropic network modelAPCaffinity propagation clusteringASAaccessible surface areaCCDinter residues distanceCSCcomplex stability codeDMdistance matrixΔGdissPISA-computed dissociation free energyGNMGaussian normal mode analysisNMAnormal mode analysisPBPprotein binding patchPISAproteins, interfaces, structures and assembliesrASArelative accessible area in respect to unfolded state of residuesRMrecurrence matrixrPrelative protrusionRPrecurrence plotRQArecurrence quantitative analysisSEMstandard error of meanCommunicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Hamid Hadi-Alijanvand
- Department of Biological Sciences, Institute for Advanced Studies in Basic Sciences (IASBS), Zanjan, Iran
| |
Collapse
|
35
|
Nilofer C, Sukhwal A, Mohanapriya A, Sakharkar MK, Kangueane P. Small protein-protein interfaces rich in electrostatic are often linked to regulatory function. J Biomol Struct Dyn 2019; 38:3260-3279. [PMID: 31495333 DOI: 10.1080/07391102.2019.1657040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Protein-protein interaction (PPI) is critical for several biological functions in living cells through the formation of an interface. Therefore, it is of interest to characterize protein-protein interfaces using an updated non-redundant structural dataset of 2557 homo (identical subunits) and 393 hetero (different subunits) dimer protein complexes determined by X-ray crystallography. We analyzed the interfaces using van der Waals (vdW), hydrogen bonding and electrostatic energies. Results show that on average homo and hetero interfaces are similar. Hence, we further grouped the 2950 interfaces based on percentage vdW to total energies into dominant (≥60%) and sub-dominant (<60%) vdW interfaces. Majority (92%) of interfaces have dominant vdW energy with large interface size (146 ± 87 (homo) and 137 ± 76 (hetero) residues) and interface area (1622 ± 1135 Å2 (homo) and 1579 ± 1060 Å2 (hetero)). However, a proportion (8%) of interfaces have sub-dominant vdW energy with small interface size (85 ± 46 (homo) and 88 ± 36 (hetero) residues) and interface area (823 ± 538 Å2 (homo) and 881 ± 377 Å2 (hetero)). It is found that large interfaces have two-fold more interface area and interface size than small interfaces with increasing hydrogen bonding energy to interface size. However, small interfaces have three-fold more electrostatics energy than large interfaces with increasing electrostatics to interface size. Thus, 8% of complexes having small interfaces with limited interface area and sub-dominant vdW energy are rich in electrostatics. It is interesting to observe that complexes having small interfaces are often associated with regulatory function. Hence, the observed structural features with known molecular function provide insights for the better understanding of PPI.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Christina Nilofer
- Biomedical Informatics (P) Ltd., Pondicherry, India.,School of Biosciences & Technology, VIT University, Vellore, Tamil Nadu, India
| | - Anshul Sukhwal
- National Centre for Biological Sciences (NCBS), Bangalore, India
| | | | | | | |
Collapse
|
36
|
Accurate Classification of Biological and non-Biological Interfaces in Protein Crystal Structures using Subtle Covariation Signals. Sci Rep 2019; 9:12603. [PMID: 31471543 PMCID: PMC6717244 DOI: 10.1038/s41598-019-48913-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 08/14/2019] [Indexed: 11/08/2022] Open
Abstract
Proteins often work as oligomers or multimers in vivo. Therefore, elucidating their oligomeric or multimeric form (quaternary structure) is crucially important to ascertain their function. X-ray crystal structures of numerous proteins have been accumulated, providing information related to their biological units. Extracting information of biological units from protein crystal structures represents a meaningful task for modern biology. Nevertheless, although many methods have been proposed for identifying biological units appearing in protein crystal structures, it is difficult to distinguish biological protein-protein interfaces from crystallographic ones. Therefore, our simple but highly accurate classifier was developed to infer biological units in protein crystal structures using large amounts of protein sequence information and a modern contact prediction method to exploit covariation signals (CSs) in proteins. We demonstrate that our proposed method is promising even for weak signals of biological interfaces. We also discuss the relation between classification accuracy and conservation of biological units, and illustrate how the selection of sequences included in multiple sequence alignments as sources for obtaining CSs affects the results. With increased amounts of sequence data, the proposed method is expected to become increasingly useful.
Collapse
|
37
|
Schrag JD, Picard MÈ, Gaudreault F, Gagnon LP, Baardsnes J, Manenda MS, Sheff J, Deprez C, Baptista C, Hogues H, Kelly JF, Purisima EO, Shi R, Sulea T. Binding symmetry and surface flexibility mediate antibody self-association. MAbs 2019; 11:1300-1318. [PMID: 31318308 PMCID: PMC6748613 DOI: 10.1080/19420862.2019.1632114] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Solution stability is an important factor in the optimization of engineered biotherapeutic candidates such as monoclonal antibodies because of its possible effects on manufacturability, pharmacology, efficacy and safety. A detailed atomic understanding of the mechanisms governing self-association of natively folded protein monomers is required to devise predictive tools to guide screening and re-engineering along the drug development pipeline. We investigated pairs of affinity-matured full-size antibodies and observed drastically different propensities to aggregate from variants differing by a single amino-acid. Biophysical testing showed that antigen-binding fragments (Fabs) from the aggregating antibodies also reversibly associated with equilibrium dissociation constants in the low-micromolar range. Crystal structures (PDB accession codes 6MXR, 6MXS, 6MY4, 6MY5) and bottom-up hydrogen-exchange mass spectrometry revealed that Fab self-association occurs in a symmetric mode that involves the antigen complementarity-determining regions. Subtle local conformational changes incurred upon point mutation of monomeric variants foster formation of complementary polar interactions and hydrophobic contacts to generate a dimeric Fab interface. Testing of popular in silico tools generally indicated low reliabilities for predicting the aggregation propensities observed. A structure-aggregation data set is provided here in order to stimulate further improvements of in silico tools for prediction of native aggregation. Incorporation of intermolecular docking, conformational flexibility, and short-range packing interactions may all be necessary features of the ideal algorithm.
Collapse
Affiliation(s)
- Joseph D Schrag
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Marie-Ève Picard
- Département de Biochimie, de Microbiologie et de Bio-informatique, PROTEO, and Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Pavillon Charles-Eugène-Marchand , Québec City, QC G1V 0A6 , Canada
| | - Francis Gaudreault
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Louis-Patrick Gagnon
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Jason Baardsnes
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Mahder S Manenda
- Département de Biochimie, de Microbiologie et de Bio-informatique, PROTEO, and Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Pavillon Charles-Eugène-Marchand , Québec City, QC G1V 0A6 , Canada
| | - Joey Sheff
- Human Health Therapeutics Research Centre, National Research Council Canada , Ottawa , ON K1A 0R6 , Canada
| | - Christophe Deprez
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Cassio Baptista
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Hervé Hogues
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - John F Kelly
- Human Health Therapeutics Research Centre, National Research Council Canada , Ottawa , ON K1A 0R6 , Canada
| | - Enrico O Purisima
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| | - Rong Shi
- Département de Biochimie, de Microbiologie et de Bio-informatique, PROTEO, and Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Pavillon Charles-Eugène-Marchand , Québec City, QC G1V 0A6 , Canada
| | - Traian Sulea
- Human Health Therapeutics Research Centre, National Research Council Canada , Montreal , QC H4P 2R2 , Canada
| |
Collapse
|
38
|
Fährrolfes R, Bietz S, Flachsenberg F, Meyder A, Nittinger E, Otto T, Volkamer A, Rarey M. ProteinsPlus: a web portal for structure analysis of macromolecules. Nucleic Acids Res 2019; 45:W337-W343. [PMID: 28472372 PMCID: PMC5570178 DOI: 10.1093/nar/gkx333] [Citation(s) in RCA: 135] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Accepted: 04/18/2017] [Indexed: 11/15/2022] Open
Abstract
With currently more than 126 000 publicly available structures and an increasing growth rate, the Protein Data Bank constitutes a rich data source for structure-driven research in fields like drug discovery, crop science and biotechnology in general. Typical workflows in these areas involve manifold computational tools for the analysis and prediction of molecular functions. Here, we present the ProteinsPlus web server that offers a unified easy-to-use interface to a broad range of tools for the early phase of structure-based molecular modeling. This includes solutions for commonly required pre-processing tasks like structure quality assessment (EDIA), hydrogen placement (Protoss) and the search for alternative conformations (SIENA). Beyond that, it also addresses frequent problems as the generation of 2D-interaction diagrams (PoseView), protein-protein interface classification (HyPPI) as well as automatic pocket detection and druggablity assessment (DoGSiteScorer). The unified ProteinsPlus interface covering all featured approaches provides various facilities for intuitive input and result visualization, case-specific parameterization and download options for further processing. Moreover, its generalized workflow allows the user a quick familiarization with the different tools. ProteinsPlus also stores the calculated results temporarily for future request and thus facilitates convenient result communication and re-access. The server is freely available at http://proteins.plus.
Collapse
Affiliation(s)
- Rainer Fährrolfes
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Stefan Bietz
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Florian Flachsenberg
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Agnes Meyder
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Eva Nittinger
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Thomas Otto
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Andrea Volkamer
- Institute of Physiology, Charité-Universitätsmedizin Berlin, Virchowweg 6, 10117 Berlin, Germany
| | - Matthias Rarey
- Universität Hamburg, ZBH-Center for Bioinformatics, Bundesstrasse 43, 20146 Hamburg, Germany
| |
Collapse
|
39
|
Cicaloni V, Trezza A, Pettini F, Spiga O. Applications of in Silico Methods for Design and Development of Drugs Targeting Protein-Protein Interactions. Curr Top Med Chem 2019; 19:534-554. [PMID: 30836920 DOI: 10.2174/1568026619666190304153901] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 01/02/2019] [Accepted: 01/25/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND Identification of Protein-Protein Interactions (PPIs) is a major challenge in modern molecular biology and biochemistry research, due to the unquestionable role of proteins in cells, biological process and pathological states. Over the past decade, the PPIs have evolved from being considered a highly challenging field of research to being investigated and examined as targets for pharmacological intervention. OBJECTIVE Comprehension of protein interactions is crucial to known how proteins come together to build signalling pathways, to carry out their functions, or to cause diseases, when deregulated. Multiplicity and great amount of PPIs structures offer a huge number of new and potential targets for the treatment of different diseases. METHODS Computational techniques are becoming predominant in PPIs studies for their effectiveness, flexibility, accuracy and cost. As a matter of fact, there are effective in silico approaches which are able to identify PPIs and PPI site. Such methods for computational target prediction have been developed through molecular descriptors and data-mining procedures. RESULTS In this review, we present different types of interactions between protein-protein and the application of in silico methods for design and development of drugs targeting PPIs. We described computational approaches for the identification of possible targets on protein surface and to detect of stimulator/ inhibitor molecules. CONCLUSION A deeper study of the most recent bioinformatics methodologies for PPIs studies is vital for a better understanding of protein complexes and for discover new potential PPI modulators in therapeutic intervention.
Collapse
Affiliation(s)
- Vittoria Cicaloni
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy.,Toscana Life Sciences Foundation, via Fiorentina 1, 53100 Siena, Italy
| | - Alfonso Trezza
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy
| | - Francesco Pettini
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy
| | - Ottavia Spiga
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy
| |
Collapse
|
40
|
Pilla SP, Thomas A, Bahadur RP. Dissecting macromolecular recognition sites in ribosome: implication to its self-assembly. RNA Biol 2019; 16:1300-1312. [PMID: 31179876 DOI: 10.1080/15476286.2019.1629767] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Interactions between macromolecules play a crucial role in ribosome assembly that follows a highly coordinated process involving RNA folding and binding of ribosomal proteins (r-proteins). Although extensive studies have been carried out to understand macromolecular interactions in ribosomes, most of them are confined to either large or small ribosomal-subunit of few species. A comparative analysis of macromolecular interactions across different domains is still missing. We have analyzed the structural and physicochemical properties of protein-protein (PP), protein-RNA (PR) and RNA-RNA (RR) interfaces in small and large subunits of ribosomes, as well as in between the two subunits. Additionally, we have also developed Random Forest (RF) classifier to catalog the r-proteins. We find significant differences as well as similarities in macromolecular recognition sites between ribosomal assemblies of prokaryotes and eukaryotes. PR interfaces are substantially larger and have more ionic interactions than PP and RR interfaces in both prokaryotes and eukaryotes. PP, PR and RR interfaces in eukaryotes are well packed compared to those in prokaryotes. However, the packing density between the large and the small subunit interfaces in the entire assembly is strikingly low in both prokaryotes and eukaryotes, indicating the periodic association and dissociation of the two subunits during the translation. The structural and physicochemical properties of PR interfaces are used to predict the r-proteins in the assembly pathway into early, intermediate and late binders using RF classifier with an accuracy of 80%. The results provide new insights into the classification of r-proteins in the assembly pathway.
Collapse
Affiliation(s)
- Smita P Pilla
- a Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Amal Thomas
- a Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur , Kharagpur , India
| | - Ranjit Prasad Bahadur
- a Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur , Kharagpur , India
| |
Collapse
|
41
|
Pilla SP, R B, Bahadur RP. Dissecting protein‐protein interactions in proteasome assembly: Implication to its self‐assembly. J Mol Recognit 2019; 32:e2784. [DOI: 10.1002/jmr.2784] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Revised: 03/07/2019] [Accepted: 03/19/2019] [Indexed: 01/18/2023]
Affiliation(s)
- Smita P. Pilla
- Computational Structural Biology Laboratory, Department of BiotechnologyIndian Institute of Technology Kharagpur Kharagpur India
| | - Babu R
- Computational Structural Biology Laboratory, Department of BiotechnologyIndian Institute of Technology Kharagpur Kharagpur India
| | - Ranjit P. Bahadur
- Computational Structural Biology Laboratory, Department of BiotechnologyIndian Institute of Technology Kharagpur Kharagpur India
| |
Collapse
|
42
|
Principles and characteristics of biological assemblies in experimentally determined protein structures. Curr Opin Struct Biol 2019; 55:34-49. [PMID: 30965224 DOI: 10.1016/j.sbi.2019.03.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/01/2019] [Indexed: 12/27/2022]
Abstract
More than half of all structures in the PDB are assemblies of two or more proteins, including both homooligomers and heterooligomers. Structural information on these assemblies comes from X-ray crystallography, NMR, and cryo-EM spectroscopy. The correct assembly in an X-ray structure is often ambiguous, and computational methods have been developed to identify the most likely biologically relevant assembly based on physical properties of assemblies and sequence conservation in interfaces. Taking advantage of the large number of structures now available, some of the most recent methods have relied on similarity of interfaces and assemblies across structures of homologous proteins.
Collapse
|
43
|
Si Y, Xu D, Bum-Erdene K, Ghozayel MK, Yang B, Clemons PA, Meroueh SO. Chemical Space Overlap with Critical Protein-Protein Interface Residues in Commercial and Specialized Small-Molecule Libraries. ChemMedChem 2019; 14:119-131. [PMID: 30548204 PMCID: PMC7175409 DOI: 10.1002/cmdc.201800537] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Revised: 11/29/2018] [Indexed: 12/14/2022]
Abstract
There is growing interest in the use of structure-based virtual screening to identify small molecules that inhibit challenging protein-protein interactions (PPIs). In this study, we investigated how effectively chemical library members docked at the PPI interface mimic the position of critical side-chain residues known as "hot spots". Three compound collections were considered, a commercially available screening collection (ChemDiv), a collection of diversity-oriented synthesis (DOS) compounds that contains natural-product-like small molecules, and a library constructed using established reactions (the "screenable chemical universe based on intuitive data organization", SCUBIDOO). Three different tight PPIs for which hot-spot residues have been identified were selected for analysis: uPAR⋅uPA, TEAD4⋅Yap1, and CaV α⋅CaV β. Analysis of library physicochemical properties was followed by docking to the PPI receptors. A pharmacophore method was used to measure overlap between small-molecule substituents and hot-spot side chains. Fragment-like conformationally restricted small molecules showed better hot-spot overlap for interfaces with well-defined pockets such as uPAR⋅uPA, whereas better overlap was observed for more complex DOS compounds in interfaces lacking a well-defined binding site such as TEAD4⋅Yap1. Virtual screening of conformationally restricted compounds targeting uPAR⋅uPA and TEAD4⋅Yap1 followed by experimental validation reinforce these findings, as the best hits were fragment-like and had few rotatable bonds for the former, while no hits were identified for the latter. Overall, such studies provide a framework for understanding PPIs in the context of additional chemical matter and new PPI definitions.
Collapse
Affiliation(s)
- Yubing Si
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - David Xu
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
- Department of BioHealth Informatics, Indiana University School of Informatics and Computing, Indianapolis, IN, 46202, USA
| | - Khuchtumur Bum-Erdene
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Mona K Ghozayel
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Baocheng Yang
- Henan Provincial Key Laboratory of Nanocomposites and Applications, Institute of Nanostructured Functional Materials, Huanghe Science and Technology College, Zhengzhou, Henan, 450006, China
| | - Paul A Clemons
- Chemical Biology and Therapeutics Science Program, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Samy O Meroueh
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| |
Collapse
|
44
|
Lee JY, Krieger J, Herguedas B, García-Nafría J, Dutta A, Shaikh SA, Greger IH, Bahar I. Druggability Simulations and X-Ray Crystallography Reveal a Ligand-Binding Site in the GluA3 AMPA Receptor N-Terminal Domain. Structure 2018; 27:241-252.e3. [PMID: 30528594 DOI: 10.1016/j.str.2018.10.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Revised: 07/25/2018] [Accepted: 10/18/2018] [Indexed: 11/19/2022]
Abstract
Ionotropic glutamate receptors (iGluRs) mediate the majority of excitatory neurotransmission in the brain. Their dysfunction is implicated in many neurological disorders, rendering iGluRs potential drug targets. Here, we performed a systematic analysis of the druggability of two major iGluR subfamilies, using molecular dynamics simulations in the presence of drug-like molecules. We demonstrate the applicability of druggability simulations by faithfully identifying known agonist and modulator sites on AMPA receptors (AMPARs) and NMDA receptors. Simulations produced the expected allosteric changes of the AMPAR ligand-binding domain in response to agonist. We also identified a novel ligand-binding site specific to the GluA3 AMPAR N-terminal domain (NTD), resulting from its unique conformational flexibility that we explored further with crystal structures trapped in vastly different states. In addition to providing an in-depth analysis into iGluR NTD dynamics, our approach identifies druggable sites and permits the determination of pharmacophoric features toward novel iGluR modulators.
Collapse
Affiliation(s)
- Ji Young Lee
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Avenue, Suite 3064 BST3, Pittsburgh, PA 15260, USA
| | - James Krieger
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Avenue, Suite 3064 BST3, Pittsburgh, PA 15260, USA
| | - Beatriz Herguedas
- Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | - Javier García-Nafría
- Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | - Anindita Dutta
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Avenue, Suite 3064 BST3, Pittsburgh, PA 15260, USA
| | - Saher A Shaikh
- Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | - Ingo H Greger
- Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Cambridge CB2 0QH, UK.
| | - Ivet Bahar
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Avenue, Suite 3064 BST3, Pittsburgh, PA 15260, USA.
| |
Collapse
|
45
|
Elez K, Bonvin AMJJ, Vangone A. Distinguishing crystallographic from biological interfaces in protein complexes: role of intermolecular contacts and energetics for classification. BMC Bioinformatics 2018; 19:438. [PMID: 30497368 PMCID: PMC6266931 DOI: 10.1186/s12859-018-2414-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Study of macromolecular assemblies is fundamental to understand functions in cells. X-ray crystallography is the most common technique to solve their 3D structure at atomic resolution. In a crystal, however, both biologically-relevant interfaces and non-specific interfaces resulting from crystallographic packing are observed. Due to the complexity of the biological assemblies currently tackled, classifying those interfaces, i.e. distinguishing biological from crystal lattice interfaces, is not trivial and often prone to errors. In this context, analyzing the physico-chemical characteristics of biological/crystal interfaces can help researchers identify possible features that distinguish them and gain a better understanding of the systems. RESULTS In this work, we are providing new insights into the differences between biological and crystallographic complexes by focusing on "pair-properties" of interfaces that have not yet been fully investigated. We investigated properties such intermolecular residue-residue contacts (already successfully applied to the prediction of binding affinities) and interaction energies (electrostatic, Van der Waals and desolvation). By using the XtalMany and BioMany interface datasets, we show that interfacial residue contacts, classified as a function of their physico-chemical properties, can distinguish between biological and crystallographic interfaces. The energetic terms show, on average, higher values for crystal interfaces, reflecting a less stable interface due to crystal packing compared to biological interfaces. By using a variety of machine learning approaches, we trained a new interface classification predictor based on contacts and interaction energetic features. Our predictor reaches an accuracy in classifying biological vs crystal interfaces of 0.92, compared to 0.88 for EPPIC (one of the main state-of-the-art classifiers reporting same performance as PISA). CONCLUSION In this work we have gained insights into the nature of intermolecular contacts and energetics terms distinguishing biological from crystallographic interfaces. Our findings might have a broader applicability in structural biology, for example for the identification of near native poses in docking. We implemented our classification approach into an easy-to-use and fast software, freely available to the scientific community from http://github.com/haddocking/interface-classifier .
Collapse
Affiliation(s)
- Katarina Elez
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands
- Present address: University of Bologna, Via Selmi 3, 40126, Bologna, Italy
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.
| | - Anna Vangone
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.
- present address: Pharma Research and Early Development, Large Molecule Research, Roche Innovation Center Munich, Nonnenwald 2, Penzberg, Germany.
| |
Collapse
|
46
|
Abstract
This paper reviews investigations on protein crystallization. It aims to present a comprehensive rather than complete account of recent studies and efforts to elucidate the most intimate mechanisms of protein crystal nucleation. It is emphasized that both physical and biochemical factors are at play during this process. Recently-discovered molecular scale pathways for protein crystal nucleation are considered first. The bond selection during protein crystal lattice formation, which is a typical biochemically-conditioned peculiarity of the crystallization process, is revisited. Novel approaches allow us to quantitatively describe some protein crystallization cases. Additional light is shed on the protein crystal nucleation in pores and crevices by employing the so-called EBDE method (equilibration between crystal bond and destructive energies). Also, protein crystal nucleation in solution flow is considered.
Collapse
|
47
|
Srivastava A, Nagai T, Srivastava A, Miyashita O, Tama F. Role of Computational Methods in Going beyond X-ray Crystallography to Explore Protein Structure and Dynamics. Int J Mol Sci 2018; 19:E3401. [PMID: 30380757 PMCID: PMC6274748 DOI: 10.3390/ijms19113401] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Revised: 10/20/2018] [Accepted: 10/27/2018] [Indexed: 12/13/2022] Open
Abstract
Protein structural biology came a long way since the determination of the first three-dimensional structure of myoglobin about six decades ago. Across this period, X-ray crystallography was the most important experimental method for gaining atomic-resolution insight into protein structures. However, as the role of dynamics gained importance in the function of proteins, the limitations of X-ray crystallography in not being able to capture dynamics came to the forefront. Computational methods proved to be immensely successful in understanding protein dynamics in solution, and they continue to improve in terms of both the scale and the types of systems that can be studied. In this review, we briefly discuss the limitations of X-ray crystallography in studying protein dynamics, and then provide an overview of different computational methods that are instrumental in understanding the dynamics of proteins and biomacromolecular complexes.
Collapse
Affiliation(s)
- Ashutosh Srivastava
- Institute of Transformative Bio-Molecules (WPI), Nagoya University, Nagoya, Aichi 464-8601, Japan.
| | - Tetsuro Nagai
- Department of Physics, Graduate School of Science, Nagoya University, Nagoya, Aichi 464-8602, Japan.
| | - Arpita Srivastava
- Department of Physics, Graduate School of Science, Nagoya University, Nagoya, Aichi 464-8602, Japan.
| | - Osamu Miyashita
- RIKEN-Center for Computational Science, Kobe, Hyogo 650-0047, Japan.
| | - Florence Tama
- Institute of Transformative Bio-Molecules (WPI), Nagoya University, Nagoya, Aichi 464-8601, Japan.
- Department of Physics, Graduate School of Science, Nagoya University, Nagoya, Aichi 464-8602, Japan.
- RIKEN-Center for Computational Science, Kobe, Hyogo 650-0047, Japan.
| |
Collapse
|
48
|
Lazar T, Guharoy M, Schad E, Tompa P. Unique Physicochemical Patterns of Residues in Protein–Protein Interfaces. J Chem Inf Model 2018; 58:2164-2173. [DOI: 10.1021/acs.jcim.8b00270] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Mainak Guharoy
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Eva Schad
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| | - Peter Tompa
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| |
Collapse
|
49
|
Hu J, Liu HF, Sun J, Wang J, Liu R. Integrating co-evolutionary signals and other properties of residue pairs to distinguish biological interfaces from crystal contacts. Protein Sci 2018; 27:1723-1735. [PMID: 29931702 DOI: 10.1002/pro.3448] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 04/21/2018] [Accepted: 05/16/2018] [Indexed: 12/25/2022]
Abstract
It remains challenging to accurately discriminate between biological and crystal interfaces. Most existing analyses and algorithms focused on the features derived from a single side of the interface. However, less attention has been paid to the properties of residue pairs across protein interfaces. To address this problem, we defined a novel co-evolutionary feature for homodimers through integrating direct coupling analysis and image processing techniques. The residue pairs across biological homodimeric interfaces were significantly enriched in co-evolving residues compared to those across crystal contacts, resulting in a promising classification accuracy with area under the curves (AUCs) of >0.85. Considering the availability of co-evolutionary feature, we also designed other residue pair based features that were useful for both homodimers and heterodimers. The most informative residue pairs were identified to reflect the interaction preferences across protein interfaces. Regarding the other extant properties, we designed the new descriptors at the interface residue level as well as at the pairwise contact level. Extensive validation showed that these single properties can be used to identify biological interfaces with AUCs ranging from 0.60 to 0.88. By integrating co-evolutionary feature with other residue pair based properties, our final prediction model output excellent performance with AUCs of >0.91 on different datasets. Compared to existing methods, our algorithm not only yielded better or comparable results but also provided complementary information. An easy-to-use web server is freely accessible at http://liulab.hzau.edu.cn/RPAIAnalyst.
Collapse
Affiliation(s)
- Jian Hu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China.,College of Biomedical Engineering, South-Central University for Nationalities, Wuhan, 430074, P. R. China
| | - Hui-Fang Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jun Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jia Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Rong Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| |
Collapse
|
50
|
Investigation of protein quaternary structure via stoichiometry and symmetry information. PLoS One 2018; 13:e0197176. [PMID: 29864163 PMCID: PMC5986128 DOI: 10.1371/journal.pone.0197176] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2017] [Accepted: 04/27/2018] [Indexed: 11/30/2022] Open
Abstract
The Protein Data Bank (PDB) is the single worldwide archive of experimentally-determined three-dimensional (3D) structures of proteins and nucleic acids. As of January 2017, the PDB housed more than 125,000 structures and was growing by more than 11,000 structures annually. Since the 3D structure of a protein is vital to understand the mechanisms of biological processes, diseases, and drug design, correct oligomeric assembly information is of critical importance. Unfortunately, the biologically relevant oligomeric form of a 3D structure is not directly obtainable by X-ray crystallography, whilst in solution methods (NMR or single particle EM) it is known from the experiment. Instead, this information may be provided by the PDB Depositor as metadata coming from additional experiments, be inferred by sequence-sequence comparisons with similar proteins of known oligomeric state, or predicted using software, such as PISA (Proteins, Interfaces, Structures and Assemblies) or EPPIC (Evolutionary Protein Protein Interface Classifier). Despite significant efforts by professional PDB Biocurators during data deposition, there remain a number of structures in the archive with incorrect quaternary structure descriptions (or annotations). Further investigation is, therefore, needed to evaluate the correctness of quaternary structure annotations. In this study, we aim to identify the most probable oligomeric states for proteins represented in the PDB. Our approach evaluated the performance of four independent prediction methods, including text mining of primary publications, inference from homologous protein structures, and two computational methods (PISA and EPPIC). Aggregating predictions to give consensus results outperformed all four of the independent prediction methods, yielding 83% correct, 9% wrong, and 8% inconclusive predictions, when tested with a well-curated benchmark dataset. We have developed a freely-available web-based tool to make this approach accessible to researchers and PDB Biocurators (http://quatstruct.rcsb.org/).
Collapse
|