1
|
Gupta S, Sgourakis NG. A structure-guided approach to predict MHC-I restriction of T cell receptors for public antigens. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.04.597418. [PMID: 38895339 PMCID: PMC11185663 DOI: 10.1101/2024.06.04.597418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Peptides presented by class I major histocompatibility complex (MHC-I) proteins provide biomarkers for therapeutic targeting using T cell receptors (TCRs), TCR-mimicking antibodies (TMAs), or other engineered protein binders. Despite the extreme sequence diversity of the Human Leucocyte Antigen (HLA, the human MHC), a given TCR or TMA is restricted to recognize epitopic peptides in the context of a limited set of different HLA allotypes. Here, guided by our analysis of 96 TCR:pHLA complex structures in the Protein Data Bank (PDB), we identify TCR contact residues and classify 148 common HLA allotypes into T-cell cross-reactivity groups (T-CREGs) on the basis of their interaction surface features. Insights from our work have actionable value for resolving MHC-I restriction of TCRs, guiding therapeutic expansion of existing therapies, and informing the selection of peptide targets for forthcoming immunotherapy modalities.
Collapse
|
2
|
Ellaway JIJ, Anyango S, Nair S, Zaki HA, Nadzirin N, Powell HR, Gutmanas A, Varadi M, Velankar S. Identifying protein conformational states in the Protein Data Bank: Toward unlocking the potential of integrative dynamics studies. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2024; 11:034701. [PMID: 38774441 PMCID: PMC11106648 DOI: 10.1063/4.0000251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 05/08/2024] [Indexed: 05/24/2024]
Abstract
Studying protein dynamics and conformational heterogeneity is crucial for understanding biomolecular systems and treating disease. Despite the deposition of over 215 000 macromolecular structures in the Protein Data Bank and the advent of AI-based structure prediction tools such as AlphaFold2, RoseTTAFold, and ESMFold, static representations are typically produced, which fail to fully capture macromolecular motion. Here, we discuss the importance of integrating experimental structures with computational clustering to explore the conformational landscapes that manifest protein function. We describe the method developed by the Protein Data Bank in Europe - Knowledge Base to identify distinct conformational states, demonstrate the resource's primary use cases, through examples, and discuss the need for further efforts to annotate protein conformations with functional information. Such initiatives will be crucial in unlocking the potential of protein dynamics data, expediting drug discovery research, and deepening our understanding of macromolecular mechanisms.
Collapse
Affiliation(s)
- Joseph I. J. Ellaway
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Stephen Anyango
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Sreenath Nair
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Hossam A. Zaki
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island 02903, USA
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Harold R. Powell
- Imperial College London, Department of Life Sciences, London, United Kingdom
| | - Aleksandras Gutmanas
- WaveBreak Therapeutics Ltd., Clarendon House, Clarendon Road, Cambridge, United Kingdom
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| |
Collapse
|
3
|
Custodio JM, Ayres CM, Rosales TJ, Brambley CA, Arbuiso AG, Landau LM, Keller GLJ, Srivastava PK, Baker BM. Structural and physical features that distinguish tumor-controlling from inactive cancer neoepitopes. Proc Natl Acad Sci U S A 2023; 120:e2312057120. [PMID: 38085776 PMCID: PMC10742377 DOI: 10.1073/pnas.2312057120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/23/2023] [Indexed: 12/18/2023] Open
Abstract
Neoepitopes arising from amino acid substitutions due to single nucleotide polymorphisms are targets of T cell immune responses to cancer and are of significant interest in the development of cancer vaccines. However, understanding the characteristics of rare protective neoepitopes that truly control tumor growth has been a challenge, due to their scarcity as well as the challenge of verifying true, neoepitope-dependent tumor control in humans. Taking advantage of recent work in mouse models that circumvented these challenges, here, we compared the structural and physical properties of neoepitopes that range from fully protective to immunologically inactive. As neoepitopes are derived from self-peptides that can induce immune tolerance, we studied not only how the various neoepitopes differ from each other but also from their wild-type counterparts. We identified multiple features associated with protection, including features that describe how neoepitopes differ from self as well as features associated with recognition by diverse T cell receptor repertoires. We demonstrate both the promise and limitations of neoepitope structural analysis and predictive modeling and illustrate important aspects that can be incorporated into neoepitope prediction pipelines.
Collapse
Affiliation(s)
- Jean M. Custodio
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Cory M. Ayres
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Tatiana J. Rosales
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Chad A. Brambley
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Alyssa G. Arbuiso
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Lauren M. Landau
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Grant L. J. Keller
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| | - Pramod K. Srivastava
- Department of Immunology, and Carole and Ray Neag Comprehensive Cancer Center, University of Connecticut School of Medicine, Farmington, CT06030
| | - Brian M. Baker
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN46556
| |
Collapse
|
4
|
Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023; 13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]
Abstract
Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.
Collapse
Affiliation(s)
- Petr Kouba
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Faculty of
Electrical Engineering, Czech Technical
University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
| | - Pavel Kohout
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Faraneh Haddadi
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Anton Bushuiev
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Raman Samusevich
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Jiri Sedlar
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Jiri Damborsky
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Tomas Pluskal
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Josef Sivic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|
5
|
Khan NS, Rahaman MM, Islam S, Zhang S. RNA-NRD: a non-redundant RNA structural dataset for benchmarking and functional analysis. NAR Genom Bioinform 2023; 5:lqad040. [PMID: 37123530 PMCID: PMC10132383 DOI: 10.1093/nargab/lqad040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 03/04/2023] [Accepted: 04/12/2023] [Indexed: 05/02/2023] Open
Abstract
The significance of RNA functions and their role in evolution and disease control have remarkably increased the research scope in the field of RNA science. Though the availability of RNA structure data in PBD has been growing tremendously, maintaining their quality and integrity has become the greater challenge. Since the data available in PDB are results of different independent research, they might contain redundancy. As a result, there remains a possibility of data bias for both protein and RNA chains. Quite a few studies have been conducted to remove the redundancy of protein structures by introducing high-quality representatives. However, the amount of research done to remove the redundancy of RNA structures is still very low. To remove RNA chain redundancy in PDB, we have introduced RNA-NRD, a non-redundant dataset of RNA chains based on sequence and 3D structural similarity. We compared RNA-NRD with the existing non-redundant RNA structure dataset RS-RNA and showed that it has better-formed clusters of redundant RNA chains with lower average RMSD and higher average PSI, thus improving the overall quality of the dataset.
Collapse
Affiliation(s)
- Nabila Shahnaz Khan
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Md Mahfuzur Rahaman
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Shahidul Islam
- School of Computing and Design, California State University, Monterey Bay, Seaside, CA 93955, USA
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
6
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
7
|
Choudhary P, Anyango S, Berrisford J, Tolchard J, Varadi M, Velankar S. Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB data. Sci Data 2023; 10:204. [PMID: 37045837 PMCID: PMC10097656 DOI: 10.1038/s41597-023-02101-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 03/23/2023] [Indexed: 04/14/2023] Open
Abstract
More than 61,000 proteins have up-to-date correspondence between their amino acid sequence (UniProtKB) and their 3D structures (PDB), enabled by the Structure Integration with Function, Taxonomy and Sequences (SIFTS) resource. SIFTS incorporates residue-level annotations from many other biological resources. SIFTS data is available in various formats like XML, CSV and TSV format or also accessible via the PDBe REST API but always maintained separately from the structure data (PDBx/mmCIF file) in the PDB archive. Here, we extended the wwPDB PDBx/mmCIF data dictionary with additional categories to accommodate SIFTS data and added the UniProtKB, Pfam, SCOP2, and CATH residue-level annotations directly into the PDBx/mmCIF files from the PDB archive. With the integrated UniProtKB annotations, these files now provide consistent numbering of residues in different PDB entries allowing easy comparison of structure models. The extended dictionary yields a more consistent, standardised metadata description without altering the core PDB information. This development enables up-to-date cross-reference information at the residue level resulting in better data interoperability, supporting improved data analysis and visualisation.
Collapse
Grants
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- DBI-2019297, PI: S.K. Burley National Science Foundation (NSF)
- DBI-2019297, PI: S.K. Burley National Science Foundation (NSF)
- DBI-2019297, PI: S.K. Burley) National Science Foundation (NSF)
- DBI-2019297, PI: S.K. Burley National Science Foundation (NSF)
- DBI-2019297, PI: S.K. Burley National Science Foundation (NSF)
- DBI-2019297, PI: S.K. Burley NSF | National Science Board (NSB)
Collapse
Affiliation(s)
- Preeti Choudhary
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | - Stephen Anyango
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - John Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- AstraZeneca, Biomedical Campus, 1 Francis Crick Ave, Trumpington, Cambridge, CB2 0AA, UK
| | - James Tolchard
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- Claude Bernard University, Villeurbanne, Lyon, 69100, France
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
8
|
Zea DJ, Teppa E, Marino-Buslje C. Easy Not Easy: Comparative Modeling with High-Sequence Identity Templates. Methods Mol Biol 2023; 2627:83-100. [PMID: 36959443 DOI: 10.1007/978-1-0716-2974-1_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]
Abstract
Homology modeling is the most common technique to build structural models of a target protein based on the structure of proteins with high-sequence identity and available high-resolution structures. This technique is based on the idea that protein structure shows fewer changes than sequence through evolution. While in this scenario single mutations would minimally perturb the structure, experimental evidence shows otherwise: proteins with high conformational diversity impose a limit of the paradigm of comparative modeling as the same protein sequence can adopt dissimilar three-dimensional structures. These cases present challenges for modeling; at first glance, they may seem to be easy cases, but they have a complexity that is not evident at the sequence level. In this chapter, we address the following questions: Why should we care about conformational diversity? How to consider conformational diversity when doing template-based modeling in a practical way?
Collapse
Affiliation(s)
- Diego Javier Zea
- Laboratory of Computational and Quantitative Biology, LCQB, UMR 7238 CNRS, IBPS, Sorbonne Université, Paris, France
| | - Elin Teppa
- Toulouse Biotechnology Institute, TBI, Université de Toulouse, CNRS, INRA, INSA, Toulouse, France
| | | |
Collapse
|
9
|
Mitusińska K, Bzówka M, Magdziarz T, Góra A. Geometry-Based versus Small-Molecule Tracking Method for Tunnel Identification: Benefits and Pitfalls. J Chem Inf Model 2022; 62:6803-6811. [PMID: 36374085 PMCID: PMC9795556 DOI: 10.1021/acs.jcim.2c00985] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Different methods for tunnel identification, geometry-based and small-molecule tracking approaches, were compared to provide their benefits and pitfalls. Results obtained for both crystal structures and molecular dynamics (MD) simulations were analyzed to investigate if a more computationally demanding method would be beneficial. Careful examination of the results is essential for the low-diameter tunnel description, and assessment of the tunnel functionality based only on their geometrical parameters is challenging. We showed that the small-molecule tracking approach can provide a detailed description of the system; however, it can also be the most computationally demanding.
Collapse
|
10
|
Salaikumaran MR, Kasamuthu PS, Aathmanathan VS, Burra VLSP. An in silico approach to study the role of epitope order in the multi-epitope-based peptide (MEBP) vaccine design. Sci Rep 2022; 12:12584. [PMID: 35869117 PMCID: PMC9307121 DOI: 10.1038/s41598-022-16445-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Accepted: 07/11/2022] [Indexed: 11/09/2022] Open
Abstract
AbstractWith different countries facing multiple waves, with some SARS-CoV-2 variants more deadly and virulent, the COVID-19 pandemic is becoming more dangerous by the day and the world is facing an even more dreadful extended pandemic with exponential positive cases and increasing death rates. There is an urgent need for more efficient and faster methods of vaccine development against SARS-CoV-2. Compared to experimental protocols, the opportunities to innovate are very high in immunoinformatics/in silico approaches, especially with the recent adoption of structural bioinformatics in peptide vaccine design. In recent times, multi-epitope-based peptide vaccine candidates (MEBPVCs) have shown extraordinarily high humoral and cellular responses to immunization. Most of the publications claim that respective reported MEBPVC(s) assembled using a set of in silico predicted epitopes, to be the computationally validated potent vaccine candidate(s) ready for experimental validation. However, in this article, for a given set of predicted epitopes, it is shown that the published MEBPVC is one among the many possible variants and there is high likelihood of finding more potent MEBPVCs than the published candidates. To test the same, a methodology is developed where novel MEBP variants are derived by changing the epitope order of the published MEBPVC. Further, to overcome the limitations of current qualitative methods of assessment of MEBPVC, to enable quantitative comparison and ranking for the discovery of more potent MEBPVCs, novel predictors, Percent Epitope Accessibility (PEA), Receptor specific MEBP vaccine potency (RMVP), MEBP vaccine potency (MVP) are introduced. The MEBP variants indeed showed varied MVP scores indicating varied immunogenicity. Further, the MEBP variants with IDs, SPVC_446 and SPVC_537, had the highest MVP scores indicating these variants to be more potent MEBPVCs than the published MEBPVC and hence should be preferred candidates for immediate experimental testing and validation. The method enables quicker selection and high throughput experimental validation of vaccine candidates. This study also opens the opportunity to develop new software tools for designing more potent MEBPVCs in less time.
Collapse
|
11
|
Yabukarski F, Doukov T, Pinney MM, Biel JT, Fraser JS, Herschlag D. Ensemble-function relationships to dissect mechanisms of enzyme catalysis. SCIENCE ADVANCES 2022; 8:eabn7738. [PMID: 36240280 PMCID: PMC9565801 DOI: 10.1126/sciadv.abn7738] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Accepted: 08/30/2022] [Indexed: 05/27/2023]
Abstract
Decades of structure-function studies have established our current extensive understanding of enzymes. However, traditional structural models are snapshots of broader conformational ensembles of interchanging states. We demonstrate the need for conformational ensembles to understand function, using the enzyme ketosteroid isomerase (KSI) as an example. Comparison of prior KSI cryogenic x-ray structures suggested deleterious mutational effects from a misaligned oxyanion hole catalytic residue. However, ensemble information from room-temperature x-ray crystallography, combined with functional studies, excluded this model. Ensemble-function analyses can deconvolute effects from altering the probability of occupying a state (P-effects) and changing the reactivity of each state (k-effects); our ensemble-function analyses revealed functional effects arising from weakened oxyanion hole hydrogen bonding and substrate repositioning within the active site. Ensemble-function studies will have an integral role in understanding enzymes and in meeting the future goals of a predictive understanding of enzyme catalysis and engineering new enzymes.
Collapse
Affiliation(s)
- Filip Yabukarski
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - Tzanko Doukov
- Stanford Synchrotron Radiation Light Source, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Margaux M. Pinney
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - Justin T. Biel
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
- Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA
- Stanford ChEM-H, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
12
|
Bonilla SL, Kieft JS. The promise of cryo-EM to explore RNA structural dynamics. J Mol Biol 2022; 434:167802. [PMID: 36049551 PMCID: PMC10084733 DOI: 10.1016/j.jmb.2022.167802] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 08/24/2022] [Accepted: 08/24/2022] [Indexed: 01/13/2023]
Abstract
Conformational dynamics are essential to macromolecular function. This is certainly true of RNA, whose ability to undergo programmed conformational dynamics is essential to create and regulate complex biological processes. However, methods to easily and simultaneously interrogate both the structure and conformational dynamics of fully functional RNAs in isolation and in complex with proteins have not historically been available. Due to its ability to image and classify single particles, cryogenic electron microscopy (cryo-EM) has the potential to address this gap and may be particularly amenable to exploring structural dynamics within the three-dimensional folds of biologically active RNAs. We discuss the possibilities and current limitations of applying cryo-EM to simultaneously study RNA structure and conformational dynamics, and present one example that illustrates this (as of yet) not fully realized potential.
Collapse
Affiliation(s)
- Steve L Bonilla
- Department of Biochemistry and Molecular Genetics, Aurora, CO 80045, USA. https://twitter.com/Steve_Bonilla
| | - Jeffrey S Kieft
- Department of Biochemistry and Molecular Genetics, Aurora, CO 80045, USA; RNA BioScience Initiative, University of Colorado Anschutz Medical Campus, School of Medicine, Aurora, CO 80045, USA.
| |
Collapse
|
13
|
Schoeman D, Cloete R, Fielding BC. The Flexible, Extended Coil of the PDZ-Binding Motif of the Three Deadly Human Coronavirus E Proteins Plays a Role in Pathogenicity. Viruses 2022; 14:v14081707. [PMID: 36016329 PMCID: PMC9416557 DOI: 10.3390/v14081707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 07/22/2022] [Accepted: 07/29/2022] [Indexed: 02/04/2023] Open
Abstract
The less virulent human (h) coronaviruses (CoVs) 229E, NL63, OC43, and HKU1 cause mild, self-limiting respiratory tract infections, while the more virulent SARS-CoV-1, MERS-CoV, and SARS-CoV-2 have caused severe outbreaks. The CoV envelope (E) protein, an important contributor to the pathogenesis of severe hCoV infections, may provide insight into this disparate severity of the disease. We, therefore, generated full-length E protein models for SARS-CoV-1 and -2, MERS-CoV, HCoV-229E, and HCoV-NL63 and docked C-terminal peptides of each model to the PDZ domain of the human PALS1 protein. The PDZ-binding motif (PBM) of the SARS-CoV-1 and -2 and MERS-CoV models adopted a more flexible, extended coil, while the HCoV-229E and HCoV-NL63 models adopted a less flexible alpha helix. All the E peptides docked to PALS1 occupied the same binding site and the more virulent hCoV E peptides generally interacted more stably with PALS1 than the less virulent ones. We hypothesize that the increased flexibility of the PBM in the more virulent hCoVs facilitates more stable binding to various host proteins, thereby contributing to more severe disease. This is the first paper to model full-length 3D structures for both the more virulent and less virulent hCoV E proteins, providing novel insights for possible drug and/or vaccine development.
Collapse
Affiliation(s)
- Dewald Schoeman
- Molecular Biology and Virology Research Laboratory, Department of Medical Biosciences, University of the Western Cape, Private Bag X17, Bellville, Cape Town 7535, South Africa;
| | - Ruben Cloete
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Private Bag X17, Bellville, Cape Town 7535, South Africa;
| | - Burtram C. Fielding
- Molecular Biology and Virology Research Laboratory, Department of Medical Biosciences, University of the Western Cape, Private Bag X17, Bellville, Cape Town 7535, South Africa;
- Correspondence:
| |
Collapse
|
14
|
Iyer M, Jaroszewski L, Sedova M, Godzik A. What the protein data bank tells us about the evolutionary conservation of protein conformational diversity. Protein Sci 2022; 31:e4325. [PMID: 35762711 PMCID: PMC9207624 DOI: 10.1002/pro.4325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 03/29/2022] [Accepted: 04/06/2022] [Indexed: 11/09/2022]
Abstract
Proteins sample a multitude of different conformations by undergoing small- and large-scale conformational changes that are often intrinsic to their functions. Information about these changes is often captured in the Protein Data Bank by the apparently redundant deposition of independent structural solutions of identical proteins. Here, we mine these data to examine the conservation of large-scale conformational changes between homologous proteins. This is important for both practical reasons, such as predicting alternative conformations of a protein by comparative modeling, and conceptual reasons, such as understanding the extent of conservation of different features in evolution. To study this question, we introduce a novel approach to compare conformational changes between proteins by the comparison of their difference distance maps (DDMs). We found that proteins undergoing similar conformational changes have similar DDMs and that this similarity could be quantified by the correlation between the DDMs. By comparing the DDMs of homologous protein pairs, we found that large-scale conformational changes show a high level of conservation across a broad range of sequence identities. This shows that conformational space is usually conserved between homologs, even relatively distant ones.
Collapse
Affiliation(s)
- Mallika Iyer
- Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, California, USA
| | - Lukasz Jaroszewski
- Biosciences Division, University of California Riverside School of Medicine, Riverside, California, USA
| | - Mayya Sedova
- Biosciences Division, University of California Riverside School of Medicine, Riverside, California, USA
| | - Adam Godzik
- Biosciences Division, University of California Riverside School of Medicine, Riverside, California, USA
| |
Collapse
|
15
|
Prabantu VM, Gadiyaram V, Vishveshwara S, Srinivasan N. Understanding structural variability in proteins using protein structural networks. Curr Res Struct Biol 2022; 4:134-145. [PMID: 35586857 PMCID: PMC9108755 DOI: 10.1016/j.crstbi.2022.04.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 03/01/2022] [Accepted: 04/09/2022] [Indexed: 11/13/2022] Open
Abstract
Proteins perform their function by accessing a suitable conformer from the ensemble of available conformations. The conformational diversity of a chosen protein structure can be obtained by experimental methods under different conditions. A key issue is the accurate comparison of different conformations. A gold standard used for such a comparison is the root mean square deviation (RMSD) between the two structures. While extensive refinements of RMSD evaluation at the backbone level are available, a comprehensive framework including the side chain interaction is not well understood. Here we employ protein structure network (PSN) formalism, with the non-covalent interactions of side chain, explicitly treated. The PSNs thus constructed are compared through graph spectral method, which provides a comparison at the local and at the global structural level. In this work, PSNs of multiple crystal conformers of single-chain, single-domain proteins, are subject to pair-wise analysis to examine the dissimilarity in their network topologies and in order to determine the conformational diversity of their native structures. This information is utilized to classify the structural domains of proteins into different categories. It is observed that proteins typically tend to retain structure and interactions at the backbone level. However, some of them also depict variability in either their overall structure or only in their inter-residue connectivity at the sidechain level, or both. Variability of sub-networks based on solvent accessibility and secondary structure is studied. The types of specific interactions are found to contribute differently to structure variability. An ensemble analysis by computing the mathematical variance of edge-weights across multiple conformers provided information on the contribution to overall variability from each edge of the PSN. Interactions that are highly variable are identified and their impact on structure variability has been discussed with the help of a case study. The classification based on the present side-chain network-based studies provides a framework to correlate the structure-function relationships in protein structures. Monomeric, single domain protein structures can exhibit non-rigid behaviour and be highly variable. The comparison of protein structural networks can better discriminate conformations with similar backbones. Specific interactions between solvent accessible and inaccessible residues are poorly preserved. Network edge-variation offers insights on which interacting residues are likely to influence their dynamics and function. These side-chain network-based studies provide a framework to correlate protein structure-function relationships.
Collapse
|
16
|
Rowaiye AB, Ogugua AJ, Ibeanu G, Bur D, Asala MT, Ogbeide OB, Abraham EO, Usman HB. Identifying potential natural inhibitors of Brucella melitensis Methionyl-tRNA synthetase through an in-silico approach. PLoS Negl Trop Dis 2022; 16:e0009799. [PMID: 35312681 PMCID: PMC8970508 DOI: 10.1371/journal.pntd.0009799] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 03/31/2022] [Accepted: 02/16/2022] [Indexed: 12/12/2022] Open
Abstract
Background Brucellosis is an infectious disease caused by bacteria of the genus Brucella. Although it is the most common zoonosis worldwide, there are increasing reports of drug resistance and cases of relapse after long term treatment with the existing drugs of choice. This study therefore aims at identifying possible natural inhibitors of Brucella melitensis Methionyl-tRNA synthetase through an in-silico approach. Methods Using PyRx 0.8 virtual screening software, the target was docked against a library of natural compounds obtained from edible African plants. The compound, 2-({3-[(3,5-dichlorobenzyl) amino] propyl} amino) quinolin-4(1H)-one (OOU) which is a co-crystallized ligand with the target was used as the reference compound. Screening of the molecular descriptors of the compounds for bioavailability, pharmacokinetic properties, and bioactivity was performed using the SWISSADME, pkCSM, and Molinspiration web servers respectively. The Fpocket and PLIP webservers were used to perform the analyses of the binding pockets and the protein ligand interactions. Analysis of the time-resolved trajectories of the Apo and Holo forms of the target was performed using the Galaxy and MDWeb servers. Results The lead compounds, Strophanthidin and Isopteropodin are present in Corchorus olitorius and Uncaria tomentosa (Cat’s-claw) plants respectively. Isopteropodin had a binding affinity score of -8.9 kcal / ml with the target and had 17 anti-correlating residues in Pocket 1 after molecular dynamics simulation. The complex formed by Isopteropodin and the target had a total RMSD of 4.408 and a total RMSF of 9.8067. However, Strophanthidin formed 3 hydrogen bonds with the target at ILE21, GLY262 and LEU294, and induced a total RMSF of 5.4541 at Pocket 1. Conclusion Overall, Isopteropodin and Strophanthidin were found to be better drug candidates than OOU and they showed potentials to inhibit the Brucella melitensis Methionyl-tRNA synthetase at Pocket 1, hence abilities to treat brucellosis. In-vivo and in-vitro investigations are needed to further evaluate the efficacy and toxicity of the lead compounds. The cure for brucellosis involves a long course of treatment with a combination of antibiotics. However, some of the drugs are not recommended for very young children and pregnant women. Moreover, cases of relapse and resistance to these drugs are reported. With the Brucella Methionyl-tRNA synthetase as a target, molecular docking and virtual screening was used to identify possible drug candidates from a library of 1524 compounds obtained from edible African plants. Two lead compounds, Strophanthidin and Isopteropodin usually present in Corchorus olitorius and Uncaria tomentosa (Cat’s claw) plants showed potentials to inhibit the Brucella melitensis Methionyl-tRNA synthetase. Their bioactivities were also confirmed in their molecular dynamic simulation with the target protein. Consequently, both compounds have potentials for safety and efficacy in the treatment of brucellosis.
Collapse
Affiliation(s)
| | - Akwoba Joseph Ogugua
- Department of Veterinary Public Health and Preventive Medicine, University of Nigeria, Nsukka, Nigeria
- * E-mail:
| | - Gordon Ibeanu
- Department of Pharmaceutical Science, North Carolina Central University, Durham, North Carolina, United States of America
| | - Doofan Bur
- Department of Medical Biotechnology, National Biotechnology Development Agency, Abuja, Nigeria
| | - Mercy Titilayo Asala
- Department of Medical Biotechnology, National Biotechnology Development Agency, Abuja, Nigeria
| | | | | | - Hamzah Bundu Usman
- Department of Plant Science and Biotechnology, Federal University Gusau, Gusau, Nigeria
| |
Collapse
|
17
|
Parisi G, Palopoli N, Tosatto SC, Fornasari MS, Tompa P. "Protein" no longer means what it used to. Curr Res Struct Biol 2021; 3:146-152. [PMID: 34308370 PMCID: PMC8283027 DOI: 10.1016/j.crstbi.2021.06.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 06/18/2021] [Accepted: 06/22/2021] [Indexed: 01/02/2023] Open
Abstract
Every biologist knows that the word protein describes a group of macromolecules essential to sustain life on Earth. As biologists, we are invariably trained under a protein paradigm established since the early twentieth century. However, in recent years, the term protein unveiled itself as an euphemism to describe the overwhelming heterogeneity of these compounds. Most of our current studies are targeted on carefully selected subsets of proteins, but we tend to think and write about these as representative of the whole population. Here we discuss how seeking for universal definitions and general rules in any arbitrarily segmented study would be misleading about the conclusions. Of course, it is not our purpose to discourage the use of the word protein. Instead, we suggest to embrace the extended universe of proteins to reach a deeper understanding of their full potential, realizing that the term encompasses a group of molecules very heterogeneous in terms of size, shape, chemistry and functions, i.e. the term protein no longer means what it used to.
Collapse
Affiliation(s)
- Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Peter Tompa
- VIB-VUB Center for Structural Biology (CSB), Brussels, Belgium
- Structural Biology Brussels (SBB), Vrije Universiteit Brussel (VUB), Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, Hungary
| |
Collapse
|
18
|
Masrati G, Landau M, Ben-Tal N, Lupas A, Kosloff M, Kosinski J. Integrative Structural Biology in the Era of Accurate Structure Prediction. J Mol Biol 2021; 433:167127. [PMID: 34224746 DOI: 10.1016/j.jmb.2021.167127] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/28/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022]
Abstract
Characterizing the three-dimensional structure of macromolecules is central to understanding their function. Traditionally, structures of proteins and their complexes have been determined using experimental techniques such as X-ray crystallography, NMR, or cryo-electron microscopy-applied individually or in an integrative manner. Meanwhile, however, computational methods for protein structure prediction have been improving their accuracy, gradually, then suddenly, with the breakthrough advance by AlphaFold2, whose models of monomeric proteins are often as accurate as experimental structures. This breakthrough foreshadows a new era of computational methods that can build accurate models for most monomeric proteins. Here, we envision how such accurate modeling methods can combine with experimental structural biology techniques, enhancing integrative structural biology. We highlight the challenges that arise when considering multiple structural conformations, protein complexes, and polymorphic assemblies. These challenges will motivate further developments, both in modeling programs and in methods to solve experimental structures, towards better and quicker investigation of structure-function relationships.
Collapse
Affiliation(s)
- Gal Masrati
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Meytal Landau
- Department of Biology, Technion-Israel Institute of Technology, Haifa 3200003, Israel; European Molecular Biology Laboratory (EMBL), Hamburg 22607, Germany
| | - Nir Ben-Tal
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Andrei Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany.
| | - Mickey Kosloff
- Department of Human Biology, Faculty of Natural Sciences, University of Haifa, 199 Aba Khoushy Ave., Mt. Carmel, 3498838 Haifa, Israel.
| | - Jan Kosinski
- European Molecular Biology Laboratory (EMBL), Hamburg 22607, Germany; Centre for Structural Systems Biology (CSSB), Hamburg 22607, Germany; Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
| |
Collapse
|
19
|
Sedova M, Jaroszewski L, Iyer M, Li Z, Godzik A. ModFlex: Towards Function Focused Protein Modeling. J Mol Biol 2021; 433:166828. [PMID: 33972023 DOI: 10.1016/j.jmb.2021.166828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 01/07/2021] [Accepted: 01/09/2021] [Indexed: 11/19/2022]
Abstract
There is a wide, and continuously widening, gap between the number of proteins known only by their amino acid sequence versus those structurally characterized by direct experiment. To close this gap, we mostly rely on homology-based inference and modeling to reason about the structures of the uncharacterized proteins by using structures of homologous proteins as templates. With the rapidly growing size of the Protein Data Bank, there are often multiple choices of templates, including multiple sets of coordinates from the same protein. The substantial conformational differences observed between different experimental structures of the same protein often reflect function related structural flexibility. Thus, depending on the questions being asked, using distant homologs, or coordinate sets with lower resolution but solved in the appropriate functional form, as templates may be more informative. The ModFlex server (https://modflex.org/) addresses this seldom mentioned gap in the standard homology modeling approach by providing the user with an interface with multiple options and tools to select the most relevant template and explore the range of structural diversity in the available templates. ModFlex is closely integrated with a range of other programs and servers developed in our group for the analysis and visualization of protein structural flexibility and divergence.
Collapse
Affiliation(s)
- Mayya Sedova
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Lukasz Jaroszewski
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Mallika Iyer
- Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, United States
| | - Zhanwen Li
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Adam Godzik
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States.
| |
Collapse
|
20
|
Lynch ML, Dudek MF, Bowman SE. A Searchable Database of Crystallization Cocktails in the PDB: Analyzing the Chemical Condition Space. PATTERNS (NEW YORK, N.Y.) 2020; 1:100024. [PMID: 32776019 PMCID: PMC7409820 DOI: 10.1016/j.patter.2020.100024] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 03/22/2020] [Accepted: 03/30/2020] [Indexed: 10/26/2022]
Abstract
Nearly 90% of structural models in the Protein Data Bank (PDB), the central resource worldwide for three-dimensional structural information, are currently derived from macromolecular crystallography (MX). A major bottleneck in determining MX structures is finding conditions in which a biomolecule will crystallize. Here, we present a searchable database of the chemicals associated with successful crystallization experiments from the PDB. We use these data to examine the relationship between protein secondary structure and average molecular weight of polyethylene glycol and to investigate patterns in crystallization conditions. Our analyses reveal striking patterns of both redundancy of chemical compositions in crystallization experiments and extreme sparsity of specific chemical combinations, underscoring the challenges faced in generating predictive models for de novo optimal crystallization experiments.
Collapse
Affiliation(s)
- Miranda L. Lynch
- High-Throughput Crystallization Screening Center, Hauptman-Woodward Medical Research Institute, Buffalo, NY 14203, USA
| | - Max F. Dudek
- University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Sarah E.J. Bowman
- High-Throughput Crystallization Screening Center, Hauptman-Woodward Medical Research Institute, Buffalo, NY 14203, USA
- Department of Biochemistry, Jacobs School of Medicine & Biomedical Sciences at the University at Buffalo, Buffalo, NY 14203, USA
| |
Collapse
|
21
|
Iyer M, Li Z, Jaroszewski L, Sedova M, Godzik A. Difference contact maps: From what to why in the analysis of the conformational flexibility of proteins. PLoS One 2020; 15:e0226702. [PMID: 32163442 PMCID: PMC7067477 DOI: 10.1371/journal.pone.0226702] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 12/03/2019] [Indexed: 02/05/2023] Open
Abstract
Protein structures, usually visualized in various highly idealized forms focusing on the three-dimensional arrangements of secondary structure elements, can also be described as lists of interacting residues or atoms and visualized as two-dimensional distance or contact maps. We show that contact maps provide an ideal tool to describe and analyze differences between structures of proteins in different conformations. Expanding functionality of the PDBFlex server and database developed previously in our group, we describe how analysis of difference contact maps (DCMs) can be used to identify critical interactions stabilizing alternative protein conformations, recognize residues and positions controlling protein functions and build hypotheses as to molecular mechanisms of disease mutations.
Collapse
Affiliation(s)
- Mallika Iyer
- Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, United States of America
| | - Zhanwen Li
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
| | - Lukasz Jaroszewski
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
| | - Mayya Sedova
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
| | - Adam Godzik
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
- * E-mail:
| |
Collapse
|
22
|
Craveur P, Narwani TJ, Rebehmed J, de Brevern AG. Investigation of the impact of PTMs on the protein backbone conformation. Amino Acids 2019; 51:1065-1079. [DOI: 10.1007/s00726-019-02747-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Accepted: 05/18/2019] [Indexed: 12/17/2022]
|
23
|
Porcine Circovirus 2 Uses a Multitude of Weak Binding Sites To Interact with Heparan Sulfate, and the Interactions Do Not Follow the Symmetry of the Capsid. J Virol 2019; 93:JVI.02222-18. [PMID: 30602608 DOI: 10.1128/jvi.02222-18] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 12/12/2018] [Indexed: 12/24/2022] Open
Abstract
Porcine circovirus 2 (PCV2) is the smallest pathogenic virus capable of autonomous replication within its host. Infections result in immunosuppression and subsequent death of the host and are initiated via the attachment of the PCV2 icosahedral capsid to heparan sulfate (HS) and chondroitin sulfate B (CSB) glycosaminoglycans on the cell surface. However, the underlying mechanism of structural recognition remains to be explored. Using heparin, a routinely used analog of heparan sulfate, we demonstrate that increasing lengths of heparin exhibit a greater affinity toward PCV2. Our competition assays indicate that dextran sulfate (8 kDa) has a higher affinity for PCV2 than heparin (12 kDa), chondroitin sulfate B (41 kDa), hyaluronic acid (1.6 MDa), and dextran (6 kDa). This suggests that polymers high in sulfate content are capable of competing with the PCV2-heparan sulfate interaction and, thus, have the potential to inhibit PCV2 infection. Finally, we visualized the interaction between heparin and the PCV2 capsid using cryo-electron microscopy single-particle analysis, symmetry expansion, and focused classification. The image reconstructions provide the first example of an asymmetric distribution of heparin on the surface of an icosahedral virus capsid. We demonstrate that each of the 60 capsid subunits that generate the T=1 capsid can bind heparin via one of five binding sites. However, not all of the binding sites were occupied by heparin, and only one-third to two-thirds of the binding sites were occupied. The binding sites are defined by arginine, lysine, and polar amino acids. Mutating the arginine, lysine, and polar amino acids to alanine diminished the binding capacity of PCV2 to heparin.IMPORTANCE It has been demonstrated that porcine circovirus 2 (PCV2) attaches to cells via heparan sulfate (HS) and chondroitin sulfate B (CSB) glycosaminoglycans; however, the underlying structural mechanism describing the HS/CSB recognition by PCV2 remains to be explored. We used cryo-electron microscopy with single-particle analysis, symmetry expansion, and focused classification to visualize the interaction between the PCV2 capsid and heparin, an analog of heparan sulfate, to better than 3.6-Å resolution. We observed that the interaction between PCV2 and heparin does not adhere to the icosahedral symmetry of the capsid. To the best of our knowledge, this is the first example where the interaction between heparin and an icosahedral capsid does not follow the symmetry elements of the capsid. Our findings also suggest that anionic polymers, such as dextran sulfate, may act to inhibit PCV2 infection.
Collapse
|
24
|
Abstract
The native state of proteins is composed of conformers in dynamical equilibrium. In this chapter, different issues related to conformational diversity are explored using a curated and experimentally based database called CoDNaS (Conformational Diversity in the Native State). This database is a collection of redundant structures for the same sequence. CoDNaS estimates the degree of conformational diversity using different global and local structural similarity measures. It allows the user to explore how structural differences among conformers change as a function of several structural features providing further biological information. This chapter explores the measurement of conformational diversity and its relationship with sequence divergence. Also, it discusses how proteins with high conformational diversity could affect homology modeling techniques.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina.
| |
Collapse
|
25
|
How is structural divergence related to evolutionary information? Mol Phylogenet Evol 2018; 127:859-866. [DOI: 10.1016/j.ympev.2018.06.033] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Revised: 06/01/2018] [Accepted: 06/19/2018] [Indexed: 12/15/2022]
|
26
|
Similarity/dissimilarity analysis of protein structures based on Markov random fields. Comput Biol Chem 2018; 75:45-53. [PMID: 29747075 DOI: 10.1016/j.compbiolchem.2018.04.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2017] [Revised: 03/08/2018] [Accepted: 04/23/2018] [Indexed: 11/21/2022]
Abstract
Protein Structure Similarity plays an important role in study on functional properties of proteins and evolutionary study. Many efficient methods have been proposed to advance protein structural comparison, but there are still some challenges in the contact strength definitions and similarity measures. In this work, we schemed out a new method to analyze the similarity/dissimilarity of the protein structures based on Markov random fields. We evaluated the proposed method with two experiments and compared it with the competing methods The results indicate that the proposed method exhibits a strong ability to detect the similarities/dissimilarities among the conformation of different cyclic peptides and protein structures. We also found that the alpha-C, oxygen O and N allow us to extract more conserved structures of the proteins, and Markov random fields with 2-point cliques (V) and orders 3 and 1 are more efficient in detecting the similarities/dissimilarities among different protein structures. This understanding can be used to design more powerful methods for similarities/dissimilarities analysis of different protein structures.
Collapse
|
27
|
Rueda AJV, Monzon AM, Ardanaz SM, Iglesias LE, Parisi G. Large scale analysis of protein conformational transitions from aqueous to non-aqueous media. BMC Bioinformatics 2018; 19:27. [PMID: 29382320 PMCID: PMC5791380 DOI: 10.1186/s12859-018-2044-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 01/24/2018] [Indexed: 12/13/2022] Open
Abstract
Background Biocatalysis in organic solvents is nowadays a common practice with a large potential in Biotechnology. Several studies report that proteins which are co-crystallized or soaked in organic solvents preserve their fold integrity showing almost identical arrangements when compared to their aqueous forms. However, it is well established that the catalytic activity of proteins in organic solvents is much lower than in water. In order to explain this diminished activity and to further characterize the behaviour of proteins in non-aqueous environments, we performed a large-scale analysis (1737 proteins) of the conformational diversity of proteins crystallized in aqueous and co-crystallized or soaked in non-aqueous media. Results Using proteins’ experimentally determined conformational diversity taken from CoDNaS database, we found that proteins in non-aqueous media display much lower conformational diversity when compared to the corresponding conformers obtained in water. When conformational diversity is compared between conformers obtained in different non-aqueous media, their structural differences are larger and mostly independent of the presence of cognate ligands. We also found that conformers corresponding to non-aqueous media have larger but less flexible cavities, lower number of disordered regions and lower active-site residue mobility. Conclusions Our results show that non-aqueous media conformers have specific structural features and that they do not adopt extreme conformations found in aqueous media. This makes them clearly different from their corresponding aqueous conformers. Electronic supplementary material The online version of this article (10.1186/s12859-018-2044-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana Julia Velez Rueda
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Sebastián M Ardanaz
- Laboratorio de Biocatálisis y Biotransformaciones, Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Luis E Iglesias
- Laboratorio de Biocatálisis y Biotransformaciones, Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina.
| |
Collapse
|
28
|
Gómez Tejeda Zañudo J, Scaltriti M, Albert R. A network modeling approach to elucidate drug resistance mechanisms and predict combinatorial drug treatments in breast cancer. CANCER CONVERGENCE 2017; 1:5. [PMID: 29623959 PMCID: PMC5876695 DOI: 10.1186/s41236-017-0007-6] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2017] [Accepted: 11/30/2017] [Indexed: 02/08/2023] Open
Abstract
Background Mechanistic models of within-cell signal transduction networks can explain how these networks integrate internal and external inputs to give rise to the appropriate cellular response. These models can be fruitfully used in cancer cells, whose aberrant decision-making regarding their survival or death, proliferation or quiescence can be connected to errors in the state of nodes or edges of the signal transduction network. Results Here we present a comprehensive network, and discrete dynamic model, of signal transduction in ER+ breast cancer based on the literature of ER+, HER2+, and PIK3CA-mutant breast cancers. The network model recapitulates known resistance mechanisms to PI3K inhibitors and suggests other possibilities for resistance. The model also reveals known and novel combinatorial interventions that are more effective than PI3K inhibition alone. Conclusions The use of a logic-based, discrete dynamic model enables the identification of results that are mainly due to the organization of the signaling network, and those that also depend on the kinetics of individual events. Network-based models such as this will play an increasing role in the rational design of high-order therapeutic combinations. Electronic supplementary material The online version of this article (10.1186/s41236-017-0007-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jorge Gómez Tejeda Zañudo
- 1Department of Physics, The Pennsylvania State University, University Park, PA 16802-6300 USA.,2Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA 02215 USA.,3Broad Institute of Harvard and Massachusetts Institute of Technology, 7 Cambridge Center, Cambridge, MA 02142 USA
| | - Maurizio Scaltriti
- 4Human Oncology & Pathogenesis Program, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA.,5Department of Pathology, Memorial Sloan Kettering Cancer Center, 1275 York Avenue, New York, NY 10065 USA
| | - Réka Albert
- 1Department of Physics, The Pennsylvania State University, University Park, PA 16802-6300 USA.,6Department of Biology, The Pennsylvania State University, University Park, PA 16802-6300 USA
| |
Collapse
|
29
|
Monzon AM, Zea DJ, Marino-Buslje C, Parisi G. Homology modeling in a dynamical world. Protein Sci 2017; 26:2195-2206. [PMID: 28815769 DOI: 10.1002/pro.3274] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 08/09/2017] [Accepted: 08/09/2017] [Indexed: 12/31/2022]
Abstract
A key concept in template-based modeling (TBM) is the high correlation between sequence and structural divergence, with the practical consequence that homologous proteins that are similar at the sequence level will also be similar at the structural level. However, conformational diversity of the native state will reduce the correlation between structural and sequence divergence, because structural variation can appear without sequence diversity. In this work, we explore the impact that conformational diversity has on the relationship between structural and sequence divergence. We find that the extent of conformational diversity can be as high as the maximum structural divergence among families. Also, as expected, conformational diversity impairs the well-established correlation between sequence and structural divergence, which is nosier than previously suggested. However, we found that this noise can be resolved using a priori information coming from the structure-function relationship. We show that protein families with low conformational diversity show a well-correlated relationship between sequence and structural divergence, which is severely reduced in proteins with larger conformational diversity. This lack of correlation could impair TBM results in highly dynamical proteins. Finally, we also find that the presence of order/disorder can provide useful beforehand information for better TBM performance.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Cristina Marino-Buslje
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| |
Collapse
|
30
|
Putz I, Brock O. Elastic network model of learned maintained contacts to predict protein motion. PLoS One 2017; 12:e0183889. [PMID: 28854238 PMCID: PMC5576689 DOI: 10.1371/journal.pone.0183889] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Accepted: 08/14/2017] [Indexed: 12/21/2022] Open
Abstract
We present a novel elastic network model, lmcENM, to determine protein motion even for localized functional motions that involve substantial changes in the protein's contact topology. Existing elastic network models assume that the contact topology remains unchanged throughout the motion and are thus most appropriate to simulate highly collective function-related movements. lmcENM uses machine learning to differentiate breaking from maintained contacts. We show that lmcENM accurately captures functional transitions unexplained by the classical ENM and three reference ENM variants, while preserving the simplicity of classical ENM. We demonstrate the effectiveness of our approach on a large set of proteins covering different motion types. Our results suggest that accurately predicting a "deformation-invariant" contact topology offers a promising route to increase the general applicability of ENMs. We also find that to correctly predict this contact topology a combination of several features seems to be relevant which may vary slightly depending on the protein. Additionally, we present case studies of two biologically interesting systems, Ferric Citrate membrane transporter FecA and Arachidonate 15-Lipoxygenase.
Collapse
Affiliation(s)
- Ines Putz
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| | - Oliver Brock
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| |
Collapse
|
31
|
Regad L, Chéron JB, Triki D, Senac C, Flatters D, Camproux AC. Exploring the potential of a structural alphabet-based tool for mining multiple target conformations and target flexibility insight. PLoS One 2017; 12:e0182972. [PMID: 28817602 PMCID: PMC5560695 DOI: 10.1371/journal.pone.0182972] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 07/27/2017] [Indexed: 11/18/2022] Open
Abstract
Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to divers sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that SA-conf tool is effective to quantify the structural variability of a MTC set and to localize the structural variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target using to offer valuable insight into target flexibility and interaction mechanisms. The SA-conf executable script, with a set of pre-compiled binaries are available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels.
Collapse
Affiliation(s)
- Leslie Regad
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| | - Jean-Baptiste Chéron
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Institut de Chimie de Nice, UMR-CNRS 7272, Faculté des Sciences, Université de Nice-Sophia Antipolis, Nice, France
| | - Dhoha Triki
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Caroline Senac
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, INSERM, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Delphine Flatters
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Anne-Claude Camproux
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| |
Collapse
|
32
|
Monzon AM, Zea DJ, Fornasari MS, Saldaño TE, Fernandez-Alberti S, Tosatto SCE, Parisi G. Conformational diversity analysis reveals three functional mechanisms in proteins. PLoS Comput Biol 2017; 13:e1005398. [PMID: 28192432 PMCID: PMC5330503 DOI: 10.1371/journal.pcbi.1005398] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2016] [Revised: 02/28/2017] [Accepted: 02/02/2017] [Indexed: 02/02/2023] Open
Abstract
Protein motions are a key feature to understand biological function. Recently, a large-scale analysis of protein conformational diversity showed a positively skewed distribution with a peak at 0.5 Å C-alpha root-mean-square-deviation (RMSD). To understand this distribution in terms of structure-function relationships, we studied a well curated and large dataset of ~5,000 proteins with experimentally determined conformational diversity. We searched for global behaviour patterns studying how structure-based features change among the available conformer population for each protein. This procedure allowed us to describe the RMSD distribution in terms of three main protein classes sharing given properties. The largest of these protein subsets (~60%), which we call "rigid" (average RMSD = 0.83 Å), has no disordered regions, shows low conformational diversity, the largest tunnels and smaller and buried cavities. The two additional subsets contain disordered regions, but with differential sequence composition and behaviour. Partially disordered proteins have on average 67% of their conformers with disordered regions, average RMSD = 1.1 Å, the highest number of hinges and the longest disordered regions. In contrast, malleable proteins have on average only 25% of disordered conformers and average RMSD = 1.3 Å, flexible cavities affected in size by the presence of disordered regions and show the highest diversity of cognate ligands. Proteins in each set are mostly non-homologous to each other, share no given fold class, nor functional similarity but do share features derived from their conformer population. These shared features could represent conformational mechanisms related with biological functions.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Diego Javier Zea
- Bioinformatics Unit, Fundación Instituto Leloir (CONICET), Buenos Aires, Argentina
| | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Tadeo E. Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Sebastian Fernandez-Alberti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| |
Collapse
|
33
|
Sorin EJ, Alvarado W, Cao S, Radcliffe A, La P, An Y. Ensemble Molecular Dynamics of a Protein-Ligand Complex: Residual Inhibitor Entropy Enhances Drug Potency in Butyrylcholinesterase. ACTA ACUST UNITED AC 2017; 6. [PMID: 28944107 DOI: 10.4172/2167-7662.1000145] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Butyrylcholinesterase is a key enzyme that catalyzes the hydrolysis of the neurotransmitter acetylcholine and shows an increased activity in patients suffering from Alzheimer's disease (AD), making this enzyme a primary target in treating AD. Central to this problem, and to similar scenarios involving biomolecular recognition, is our understanding of the nature of the protein-ligand complex. The butyrylcholinesterase enzyme was studied via all-atom, explicit solvent, ensemble molecular dynamics simulations sans inhibitor and in the presence of three dialkyl phenyl phosphate inhibitors of known potency to a cumulative sampling of over 40 μs. Following the relaxation of these ensembles to conformational equilibria, binding modes for each inhibitor were identified. While classical models, which assume significant reduction in protein and ligand conformational entropies, continue to be favored in contemporary studies, our observations contradict those assumptions: bound ligands occupy many conformational states, thereby stabilizing the complex, while also promoting protein flexibility.
Collapse
Affiliation(s)
- Eric J Sorin
- Department of Chemistry and Biochemistry, California State University Long Beach, California, USA
| | - Walter Alvarado
- Department of Physics and Astronomy, California State University Long Beach, California, USA
| | - Samantha Cao
- Department of Chemistry and Biochemistry, California State University Long Beach, California, USA
| | - Amethyst Radcliffe
- Department of Physics and Astronomy, California State University Long Beach, California, USA
| | - Phuc La
- Department of Chemistry and Biochemistry, California State University Long Beach, California, USA
| | - Yi An
- Department of Chemistry and Biochemistry, California State University Long Beach, California, USA
| |
Collapse
|
34
|
Zhou RB, Lu HM, Liu J, Shi JY, Zhu J, Lu QQ, Yin DC. A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive. PLoS One 2016; 11:e0161254. [PMID: 27517583 PMCID: PMC4982684 DOI: 10.1371/journal.pone.0161254] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Accepted: 08/02/2016] [Indexed: 11/18/2022] Open
Abstract
Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, there are literature reported that the recombinant proteins show some differences in structure and function as compared with the native ones. Now there have been more than 100,000 structures (from both recombinant and native sources) publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate if there exist any proteins in the RCSB PDB archive that have identical sequence but have some difference in structures. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated five-hundred-seventeen pairs of native and recombinant proteins that have identical sequence. There were no pairs of proteins that had the same sequence and significantly different structural fold, which support the hypothesis that expression in a heterologous host usually could fold correctly into their native forms.
Collapse
Affiliation(s)
- Ren-Bin Zhou
- Institute for Special Environmental Biophysics, Key Laboratory for Space Bioscience and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, Shaanxi, PR China
| | - Hui-Meng Lu
- Institute for Special Environmental Biophysics, Key Laboratory for Space Bioscience and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, Shaanxi, PR China
| | - Jie Liu
- School of Computer Science and Technology, Xidian University, Xi’an, PR China
| | - Jian-Yu Shi
- Institute for Special Environmental Biophysics, Key Laboratory for Space Bioscience and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, Shaanxi, PR China
| | - Jing Zhu
- Institute for Special Environmental Biophysics, Key Laboratory for Space Bioscience and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, Shaanxi, PR China
| | - Qin-Qin Lu
- Institute for Special Environmental Biophysics, Key Laboratory for Space Bioscience and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, Shaanxi, PR China
| | - Da-Chuan Yin
- Institute for Special Environmental Biophysics, Key Laboratory for Space Bioscience and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, Shaanxi, PR China
- * E-mail:
| |
Collapse
|
35
|
Addressing the Role of Conformational Diversity in Protein Structure Prediction. PLoS One 2016; 11:e0154923. [PMID: 27159429 PMCID: PMC4861349 DOI: 10.1371/journal.pone.0154923] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 04/21/2016] [Indexed: 11/19/2022] Open
Abstract
Computational modeling of tertiary structures has become of standard use to study proteins that lack experimental characterization. Unfortunately, 3D structure prediction methods and model quality assessment programs often overlook that an ensemble of conformers in equilibrium populates the native state of proteins. In this work we collected sets of publicly available protein models and the corresponding target structures experimentally solved and studied how they describe the conformational diversity of the protein. For each protein, we assessed the quality of the models against known conformers by several standard measures and identified those models ranked best. We found that model rankings are defined by both the selected target conformer and the similarity measure used. 70% of the proteins in our datasets show that different models are structurally closest to different conformers of the same protein target. We observed that model building protocols such as template-based or ab initio approaches describe in similar ways the conformational diversity of the protein, although for template-based methods this description may depend on the sequence similarity between target and template sequences. Taken together, our results support the idea that protein structure modeling could help to identify members of the native ensemble, highlight the importance of considering conformational diversity in protein 3D quality evaluations and endorse the study of the variability of the native structure for a meaningful biological analysis.
Collapse
|
36
|
Zea DJ, Monzon AM, Gonzalez C, Fornasari MS, Tosatto SCE, Parisi G. Disorder transitions and conformational diversity cooperatively modulate biological function in proteins. Protein Sci 2016; 25:1138-46. [PMID: 27038125 DOI: 10.1002/pro.2931] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Revised: 03/30/2016] [Accepted: 03/31/2016] [Indexed: 12/20/2022]
Abstract
Structural differences between conformers sustain protein biological function. Here, we studied in a large dataset of 745 intrinsically disordered proteins, how ordered-disordered transitions modulate structural differences between conformers as derived from crystallographic data. We found that almost 50% of the proteins studied show no transitions and have low conformational diversity while the rest show transitions and a higher conformational diversity. In this last subset, 60% of the proteins become more ordered after ligand binding, while 40% more disordered. As protein conformational diversity is inherently connected with protein function our analysis suggests differences in structure-function relationships related to order-disorder transitions.
Collapse
Affiliation(s)
- Diego Javier Zea
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - Alexander Miguel Monzon
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - Claudia Gonzalez
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - María Silvina Fornasari
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - Silvio C E Tosatto
- Biocomputing up, Department of Biomedical Sciences, University of Padova, Italy
| | - Gustavo Parisi
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| |
Collapse
|
37
|
Šink R, Kotnik M, Zega A, Barreteau H, Gobec S, Blanot D, Dessen A, Contreras-Martel C. Crystallographic Study of Peptidoglycan Biosynthesis Enzyme MurD: Domain Movement Revisited. PLoS One 2016; 11:e0152075. [PMID: 27031227 PMCID: PMC4816537 DOI: 10.1371/journal.pone.0152075] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 03/08/2016] [Indexed: 11/30/2022] Open
Abstract
The biosynthetic pathway of peptidoglycan, an essential component of bacterial cell wall, is a well-recognized target for antibiotic development. Peptidoglycan precursors are synthesized in the bacterial cytosol by various enzymes including the ATP-hydrolyzing Mur ligases, which catalyze the stepwise addition of amino acids to a UDP-MurNAc precursor to yield UDP-MurNAc-pentapeptide. MurD catalyzes the addition of D-glutamic acid to UDP-MurNAc-L-Ala in the presence of ATP; structural and biochemical studies have suggested the binding of the substrates with an ordered kinetic mechanism in which ligand binding inevitably closes the active site. In this work, we challenge this assumption by reporting the crystal structures of intermediate forms of MurD either in the absence of ligands or in the presence of small molecules. A detailed analysis provides insight into the events that lead to the closure of MurD and reveals that minor structural modifications contribute to major overall conformation alterations. These novel insights will be instrumental in the development of new potential antibiotics designed to target the peptidoglycan biosynthetic pathway.
Collapse
Affiliation(s)
- Roman Šink
- University of Ljubljana, Faculty of Pharmacy, Aškerčeva 7, Ljubljana, Slovenia
| | - Miha Kotnik
- Lek Pharmaceuticals d. d., Verovškova 57, Ljubljana, Slovenia
| | - Anamarija Zega
- University of Ljubljana, Faculty of Pharmacy, Aškerčeva 7, Ljubljana, Slovenia
| | - Hélène Barreteau
- Laboratoire des Enveloppes Bactériennes et Antibiotiques, Institut de Biologie Intégrative de la Cellule (I2BC), CEA, CNRS, Université Paris-Sud, Gif-sur-Yvette, France
| | - Stanislav Gobec
- University of Ljubljana, Faculty of Pharmacy, Aškerčeva 7, Ljubljana, Slovenia
| | - Didier Blanot
- Laboratoire des Enveloppes Bactériennes et Antibiotiques, Institut de Biologie Intégrative de la Cellule (I2BC), CEA, CNRS, Université Paris-Sud, Gif-sur-Yvette, France
| | - Andréa Dessen
- Univ. Grenoble Alpes, Institut de Biologie Structurale, Grenoble, France
- CNRS, IBS, Grenoble, France
- CEA, IBS, Grenoble, France
- Brazilian National Laboratory for Biosciences (LNBio), CNPEM, Campinas, São Paulo, Brazil
| | - Carlos Contreras-Martel
- Univ. Grenoble Alpes, Institut de Biologie Structurale, Grenoble, France
- CNRS, IBS, Grenoble, France
- CEA, IBS, Grenoble, France
- * E-mail:
| |
Collapse
|
38
|
Monzon AM, Rohr CO, Fornasari MS, Parisi G. CoDNaS 2.0: a comprehensive database of protein conformational diversity in the native state. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw038. [PMID: 27022160 PMCID: PMC4809262 DOI: 10.1093/database/baw038] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2015] [Accepted: 03/02/2016] [Indexed: 01/01/2023]
Abstract
CoDNaS (conformational diversity of the native state) is a protein conformational diversity database. Conformational diversity describes structural differences between conformers that define the native state of proteins. It is a key concept to understand protein function and biological processes related to protein functions. CoDNaS offers a well curated database that is experimentally driven, thoroughly linked, and annotated. CoDNaS facilitates the extraction of key information on small structural differences based on protein movements. CoDNaS enables users to easily relate the degree of conformational diversity with physical, chemical and biological properties derived from experiments on protein structure and biological characteristics. The new version of CoDNaS includes ∼70% of all available protein structures, and new tools have been added that run sequence searches, display structural flexibility profiles and allow users to browse the database for different structural classes. These tools facilitate the exploration of protein conformational diversity and its role in protein function. Database URL:http://ufq.unq.edu.ar/codnas
Collapse
Affiliation(s)
| | - Cristian Oscar Rohr
- Instituto de Ecología Genética y Evolución de Buenos Aires (IEGEBA)-Laboratorio de Genómica Médica y Evolución, Universidad Nacional de Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
| |
Collapse
|
39
|
Hrabe T, Li Z, Sedova M, Rotkiewicz P, Jaroszewski L, Godzik A. PDBFlex: exploring flexibility in protein structures. Nucleic Acids Res 2015; 44:D423-8. [PMID: 26615193 PMCID: PMC4702920 DOI: 10.1093/nar/gkv1316] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 11/10/2015] [Indexed: 12/16/2022] Open
Abstract
The PDBFlex database, available freely and with no login requirements at http://pdbflex.org, provides information on flexibility of protein structures as revealed by the analysis of variations between depositions of different structural models of the same protein in the Protein Data Bank (PDB). PDBFlex collects information on all instances of such depositions, identifying them by a 95% sequence identity threshold, performs analysis of their structural differences and clusters them according to their structural similarities for easy analysis. The PDBFlex contains tools and viewers enabling in-depth examination of structural variability including: 2D-scaling visualization of RMSD distances between structures of the same protein, graphs of average local RMSD in the aligned structures of protein chains, graphical presentation of differences in secondary structure and observed structural disorder (unresolved residues), difference distance maps between all sets of coordinates and 3D views of individual structures and simulated transitions between different conformations, the latter displayed using JSMol visualization software.
Collapse
Affiliation(s)
- Thomas Hrabe
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Zhanwen Li
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Mayya Sedova
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Piotr Rotkiewicz
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Lukasz Jaroszewski
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Adam Godzik
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| |
Collapse
|
40
|
Rackovsky S. Nonlinearities in protein space limit the utility of informatics in protein biophysics. Proteins 2015; 83:1923-8. [PMID: 26315852 DOI: 10.1002/prot.24916] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Revised: 08/12/2015] [Accepted: 08/20/2015] [Indexed: 11/08/2022]
Abstract
We examine the utility of informatic-based methods in computational protein biophysics. To do so, we use newly developed metric functions to define completely independent sequence and structure spaces for a large database of proteins. By investigating the relationship between these spaces, we demonstrate quantitatively the limits of knowledge-based correlation between the sequences and structures of proteins. It is shown that there are well-defined, nonlinear regions of protein space in which dissimilar structures map onto similar sequences (the conformational switch), and dissimilar sequences map onto similar structures (remote homology). These nonlinearities are shown to be quite common-almost half the proteins in our database fall into one or the other of these two regions. They are not anomalies, but rather intrinsic properties of structural encoding in amino acid sequences. It follows that extreme care must be exercised in using bioinformatic data as a basis for computational structure prediction. The implications of these results for protein evolution are examined.
Collapse
Affiliation(s)
- S Rackovsky
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York, 14853.,Department of Pharmacology and Systems Therapeutics, Icahn School of Medicine at Mount Sinai, New York, New York, 10029
| |
Collapse
|
41
|
Parisi G, Zea DJ, Monzon AM, Marino-Buslje C. Conformational diversity and the emergence of sequence signatures during evolution. Curr Opin Struct Biol 2015; 32:58-65. [DOI: 10.1016/j.sbi.2015.02.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 02/02/2015] [Accepted: 02/09/2015] [Indexed: 02/03/2023]
|
42
|
Lamb AL, Kappock TJ, Silvaggi NR. You are lost without a map: Navigating the sea of protein structures. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2014; 1854:258-68. [PMID: 25554228 DOI: 10.1016/j.bbapap.2014.12.021] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 12/22/2014] [Indexed: 11/26/2022]
Abstract
X-ray crystal structures propel biochemistry research like no other experimental method, since they answer many questions directly and inspire new hypotheses. Unfortunately, many users of crystallographic models mistake them for actual experimental data. Crystallographic models are interpretations, several steps removed from the experimental measurements, making it difficult for nonspecialists to assess the quality of the underlying data. Crystallographers mainly rely on "global" measures of data and model quality to build models. Robust validation procedures based on global measures now largely ensure that structures in the Protein Data Bank (PDB) are largely correct. However, global measures do not allow users of crystallographic models to judge the reliability of "local" features in a region of interest. Refinement of a model to fit into an electron density map requires interpretation of the data to produce a single "best" overall model. This process requires inclusion of most probable conformations in areas of poor density. Users who misunderstand this can be misled, especially in regions of the structure that are mobile, including active sites, surface residues, and especially ligands. This article aims to equip users of macromolecular models with tools to critically assess local model quality. Structure users should always check the agreement of the electron density map and the derived model in all areas of interest, even if the global statistics are good. We provide illustrated examples of interpreted electron density as a guide for those unaccustomed to viewing electron density.
Collapse
Affiliation(s)
- Audrey L Lamb
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045, United States.
| | - T Joseph Kappock
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, United States
| | - Nicholas R Silvaggi
- Department of Chemistry and Biochemistry, University of Wisconsin-Milwaukee, Milwaukee, WI 53211, United States.
| |
Collapse
|
43
|
Kurzynski M, Torchala M, Chelminiak P. Output-input ratio in thermally fluctuating biomolecular machines. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:012722. [PMID: 24580272 DOI: 10.1103/physreve.89.012722] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Indexed: 06/03/2023]
Abstract
Biological molecular machines are proteins that operate under isothermal conditions and hence are referred to as free energy transducers. They can be formally considered as enzymes that simultaneously catalyze two chemical reactions: the free energy-donating (input) reaction and the free energy-accepting (output) one. Most if not all biologically active proteins display a slow stochastic dynamics of transitions between a variety of conformational substates composing their native state. This makes the description of the enzymatic reaction kinetics in terms of conventional rate constants insufficient. In the steady state, upon taking advantage of the assumption that each reaction proceeds through a single pair (the gate) of transition conformational substates of the enzyme-substrates complex, the degree of coupling between the output and the input reaction fluxes has been expressed in terms of the mean first-passage times on a conformational transition network between the distinguished substates. The theory is confronted with the results of random-walk simulations on the five-dimensional hypercube. The formal proof is given that, for single input and output gates, the output-input degree of coupling cannot exceed unity. As some experiments suggest such exceeding, looking for the conditions for increasing the degree of coupling value over unity challenges the theory. Performed simulations of random walks on several model networks involving more extended gates indicate that the case of the degree of coupling value higher than 1 is realized in a natural way on critical branching trees extended by long-range shortcuts. Such networks are scale-free and display the property of the small world. For short-range shortcuts, the networks are scale-free and fractal, representing a reasonable model for biomolecular machines displaying tight coupling, i.e., the degree of coupling equal exactly to unity. A hypothesis is stated that the protein conformational transition networks, as just as higher-level biological networks, the protein interaction network, and the metabolic network, have evolved in the process of self-organized criticality.
Collapse
Affiliation(s)
- Michal Kurzynski
- Faculty of Physics, A. Mickiewicz University, Umultowska 85, 61-614 Poznan, Poland
| | - Mieczyslaw Torchala
- Faculty of Physics, A. Mickiewicz University, Umultowska 85, 61-614 Poznan, Poland and BioInfoBank Institute, Limanowskiego 24A, 60-744 Poznan, Poland
| | | |
Collapse
|
44
|
Palopoli N, Lanzarotti E, Parisi G. BeEP Server: Using evolutionary information for quality assessment of protein structure models. Nucleic Acids Res 2013; 41:W398-405. [PMID: 23729471 PMCID: PMC3692104 DOI: 10.1093/nar/gkt453] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The BeEP Server (http://www.embnet.qb.fcen.uba.ar/embnet/beep.php) is an online resource aimed to help in the endgame of protein structure prediction. It is able to rank submitted structural models of a protein through an explicit use of evolutionary information, a criterion differing from structural or energetic considerations commonly used in other assessment programs. The idea behind BeEP (Best Evolutionary Pattern) is to benefit from the substitution pattern derived from structural constraints present in a set of homologous proteins adopting a given protein conformation. The BeEP method uses a model of protein evolution that takes into account the structure of a protein to build site-specific substitution matrices. The suitability of these substitution matrices is assessed through maximum likelihood calculations from which position-specific and global scores can be derived. These scores estimate how well the structural constraints derived from each structural model are represented in a sequence alignment of homologous proteins. Our assessment on a subset of proteins from the Critical Assessment of techniques for protein Structure Prediction (CASP) experiment has shown that BeEP is capable of discriminating the models and selecting one or more native-like structures. Moreover, BeEP is not explicitly parameterized to find structural similarities between models and given targets, potentially helping to explore the conformational ensemble of the native state.
Collapse
Affiliation(s)
- Nicolas Palopoli
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, B1876BXD, Bernal, Buenos Aires, Argentina, Centre for Biological Sciences, University of Southampton, SO17 1BJ, Southampton, UK and Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EHA, Buenos Aires, Argentina
| | - Esteban Lanzarotti
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, B1876BXD, Bernal, Buenos Aires, Argentina, Centre for Biological Sciences, University of Southampton, SO17 1BJ, Southampton, UK and Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EHA, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, B1876BXD, Bernal, Buenos Aires, Argentina, Centre for Biological Sciences, University of Southampton, SO17 1BJ, Southampton, UK and Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EHA, Buenos Aires, Argentina
- *To whom correspondence should be addressed. Tel: +54 011 43657100 (ext. 4135); Fax: +54 011 437657101;
| |
Collapse
|
45
|
Affiliation(s)
- Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa 31905, Israel;
| | - Leonid Pereyaslavets
- Department of Structural Biology, Stanford University, Stanford, California 94305; ,
| | | | - Michael Levitt
- Department of Structural Biology, Stanford University, Stanford, California 94305; ,
| |
Collapse
|
46
|
Stec B. Time passes yet errors remain: comments on the structure of N10-formyltetrahydrofolate synthetase. Protein Sci 2013; 22:671-4. [PMID: 23533144 DOI: 10.1002/pro.2252] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Accepted: 03/13/2013] [Indexed: 11/07/2022]
|
47
|
Kolodny R, Kosloff M. From Protein Structure to Function via Computational Tools and Approaches. Isr J Chem 2013. [DOI: 10.1002/ijch.201200078] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
48
|
Sikosek T, Bornberg-Bauer E, Chan HS. Evolutionary dynamics on protein bi-stability landscapes can potentially resolve adaptive conflicts. PLoS Comput Biol 2012; 8:e1002659. [PMID: 23028272 PMCID: PMC3441461 DOI: 10.1371/journal.pcbi.1002659] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2012] [Accepted: 07/12/2012] [Indexed: 11/18/2022] Open
Abstract
Experimental studies have shown that some proteins exist in two alternative native-state conformations. It has been proposed that such bi-stable proteins can potentially function as evolutionary bridges at the interface between two neutral networks of protein sequences that fold uniquely into the two different native conformations. Under adaptive conflict scenarios, bi-stable proteins may be of particular advantage if they simultaneously provide two beneficial biological functions. However, computational models that simulate protein structure evolution do not yet recognize the importance of bi-stability. Here we use a biophysical model to analyze sequence space to identify bi-stable or multi-stable proteins with two or more equally stable native-state structures. The inclusion of such proteins enhances phenotype connectivity between neutral networks in sequence space. Consideration of the sequence space neighborhood of bridge proteins revealed that bi-stability decreases gradually with each mutation that takes the sequence further away from an exactly bi-stable protein. With relaxed selection pressures, we found that bi-stable proteins in our model are highly successful under simulated adaptive conflict. Inspired by these model predictions, we developed a method to identify real proteins in the PDB with bridge-like properties, and have verified a clear bi-stability gradient for a series of mutants studied by Alexander et al. (Proc Nat Acad Sci USA 2009, 106:21149–21154) that connect two sequences that fold uniquely into two different native structures via a bridge-like intermediate mutant sequence. Based on these findings, new testable predictions for future studies on protein bi-stability and evolution are discussed. Proteins are essential molecules for performing a majority of functions in all biological systems. These functions often depend on the three-dimensional structures of proteins. Here, we investigate a fundamental question in molecular evolution: how can proteins acquire new advantageous structures via mutations while not sacrificing their existing structures that are still needed? Some authors have suggested that the same protein may adopt two or more alternative structures, switch between them and thus perform different functions with each of the alternative structures. Intuitively, such a protein could provide an evolutionary compromise between conflicting demands for existing and new protein structures. Yet no theoretical study has systematically tackled the biophysical basis of such compromises during evolutionary processes. Here we devise a model of evolution that specifically recognizes protein molecules that can exist in several different stable structures. Our model demonstrates that proteins can indeed utilize multiple structures to satisfy conflicting evolutionary requirements. In light of these results, we identify data from known protein structures that are consistent with our predictions and suggest novel directions for future investigation.
Collapse
Affiliation(s)
- Tobias Sikosek
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.
| | | | | |
Collapse
|
49
|
Juritz E, Fornasari MS, Martelli PL, Fariselli P, Casadio R, Parisi G. On the effect of protein conformation diversity in discriminating among neutral and disease related single amino acid substitutions. BMC Genomics 2012; 13 Suppl 4:S5. [PMID: 22759653 PMCID: PMC3303731 DOI: 10.1186/1471-2164-13-s4-s5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Non-synonymous coding SNPs (nsSNPs) that are associated to disease can also be related with alterations in protein stability. Computational methods are available to predict the effect of single amino acid substitutions (SASs) on protein stability based on a single folded structure. However, the native state of a protein is not unique and it is better represented by the ensemble of its conformers in dynamic equilibrium. The maintenance of the ensemble is essential for protein function. In this work we investigated how protein conformational diversity can affect the discrimination of neutral and disease related SASs based on protein stability estimations. For this purpose, we used 119 proteins with 803 associated SASs, 60% of which are disease related. Each protein was associated with its corresponding set of available conformers as found in the Protein Conformational Database (PCDB). Our dataset contains proteins with different extensions of conformational diversity summing up a total number of 1023 conformers. RESULTS The existence of different conformers for a given protein introduces great variability in the estimation of the protein stability (ΔΔG) after a single amino acid substitution (SAS) as computed with FoldX. Indeed, in 35% of our protein set at least one SAS can be described as stabilizing, destabilizing or neutral when a cutoff value of ±2 kcal/mol is adopted for discriminating neutral from perturbing SASs. However, when the ΔΔG variability among conformers is taken into account, the correlation among the perturbation of protein stability and the corresponding disease or neutral phenotype increases as compared with the same analysis on single protein structures. At the conformer level, we also found that the different conformers correlate in a different way to the corresponding phenotype. CONCLUSIONS Our results suggest that the consideration of conformational diversity can improve the discrimination of neutral and disease related protein SASs based on the evaluation of the corresponding Gibbs free energy change.
Collapse
Affiliation(s)
- Ezequiel Juritz
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina
| | | | - Piero Fariselli
- Biocomputing Group, Department of Computer Science, University of Bologna, Italy
| | - Rita Casadio
- Biocomputing Group, Department of Biology, University of Bologna, Italy
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina
| |
Collapse
|
50
|
Abstract
We report an unexpected finding of common structural principles in two unrelated signaling systems: the FAS death domain transformation that initializes the extrinsic apoptotic pathway and signaling by calmodulin bending. The location and design of the hinge is postulated to be a general principle for creating potential signaling event. We suggest that already existing tool can predict the existence of such a hinge and formulate the hypothesis that the internal instabilities designed into the hinge sequences are necessary devices in effective signaling events.
Collapse
Affiliation(s)
- Boguslaw Stec
- Sanford-Burnham Medical Research Institute, 10901 N. Torrey Pines Rd., La Jolla, CA 92037, United States.
| |
Collapse
|