1
|
Cueno ME, Kamio N, Imai K. Avian influenza A H5N1 hemagglutinin protein models have distinct structural patterns re-occurring across the 1959-2023 strains. Biosystems 2024; 246:105347. [PMID: 39349133 DOI: 10.1016/j.biosystems.2024.105347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 09/26/2024] [Accepted: 09/27/2024] [Indexed: 10/02/2024]
Abstract
Influenza A H5N1 hemagglutinin (HA) plays a crucial role in viral pathogenesis and changes in the HA receptor binding domain (RBD) have been attributed to alterations in viral pathogenesis. Mutations often occur within the HA which in-turn results in HA structural changes that consequently contribute to protein evolution. However, the possible occurrence of mutations that results to reversion of the HA protein (going back to an ancestral protein conformation) which in-turn creates distinct HA structural patterns across the 1959-2023 H5N1 viral evolution has never been investigated. Here, we generated and verified the quality of the HA models, identified similar HA structural patterns, and elucidated the possible variations in HA RBD structural dynamics. Our results show that there are 7 distinct structural patterns occurring among the 1959-2023 H5N1 HA models which suggests that reversion of the HA protein putatively occurs during viral evolution. Similarly, we found that the HA RBD structural dynamics vary among the 7 distinct structural patterns possibly affecting viral pathogenesis.
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology and Immunology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan.
| | - Noriaki Kamio
- Department of Microbiology and Immunology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan
| | - Kenichi Imai
- Department of Microbiology and Immunology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan
| |
Collapse
|
2
|
Vallat B, Tauriello G, Bienert S, Haas J, Webb BM, Žídek A, Zheng W, Peisach E, Piehl DW, Anischanka I, Sillitoe I, Tolchard J, Varadi M, Baker D, Orengo C, Zhang Y, Hoch JC, Kurisu G, Patwardhan A, Velankar S, Burley SK, Sali A, Schwede T, Berman HM, Westbrook JD. ModelCIF: An Extension of PDBx/mmCIF Data Representation for Computed Structure Models. J Mol Biol 2023; 435:168021. [PMID: 36828268 PMCID: PMC10293049 DOI: 10.1016/j.jmb.2023.168021] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 02/24/2023]
Abstract
ModelCIF (github.com/ihmwg/ModelCIF) is a data information framework developed for and by computational structural biologists to enable delivery of Findable, Accessible, Interoperable, and Reusable (FAIR) data to users worldwide. ModelCIF describes the specific set of attributes and metadata associated with macromolecular structures modeled by solely computational methods and provides an extensible data representation for deposition, archiving, and public dissemination of predicted three-dimensional (3D) models of macromolecules. It is an extension of the Protein Data Bank Exchange / macromolecular Crystallographic Information Framework (PDBx/mmCIF), which is the global data standard for representing experimentally-determined 3D structures of macromolecules and associated metadata. The PDBx/mmCIF framework and its extensions (e.g., ModelCIF) are managed by the Worldwide Protein Data Bank partnership (wwPDB, wwpdb.org) in collaboration with relevant community stakeholders such as the wwPDB ModelCIF Working Group (wwpdb.org/task/modelcif). This semantically rich and extensible data framework for representing computed structure models (CSMs) accelerates the pace of scientific discovery. Herein, we describe the architecture, contents, and governance of ModelCIF, and tools and processes for maintaining and extending the data standard. Community tools and software libraries that support ModelCIF are also described.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Juergen Haas
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Benjamin M Webb
- Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94157, USA
| | | | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Dennis W Piehl
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Ivan Anischanka
- Department of Biochemistry, and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Ian Sillitoe
- Department of Structural and Molecular Biology, UCL, London, UK
| | - James Tolchard
- AlphaFold Protein Structure Database, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK; Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Mihaly Varadi
- AlphaFold Protein Structure Database, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK; Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - David Baker
- Department of Biochemistry, and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | | | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jeffrey C Hoch
- Biological Magnetic Resonance Data Bank, Department of Molecular Biology and Biophysics, University of Connecticut, Farmington, CT 06030, USA
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Ardan Patwardhan
- Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Sameer Velankar
- AlphaFold Protein Structure Database, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK; Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, CA 92093, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94157, USA. https://twitter.com/salilab_ucsf
| | - Torsten Schwede
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA
| |
Collapse
|
3
|
Cueno ME, Taketsuna K, Saito M, Inoue S, Imai K. Network analysis of the autophagy biochemical network in relation to various autophagy-targeted proteins found among SARS-CoV-2 variants of concern. J Mol Graph Model 2023; 119:108396. [PMID: 36549224 PMCID: PMC9749836 DOI: 10.1016/j.jmgm.2022.108396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 10/28/2022] [Accepted: 12/09/2022] [Indexed: 12/15/2022]
Abstract
Autophagy is an important cellular process that triggers a coordinated action involving multiple individual proteins and protein complexes while SARS-CoV-2 (SARS2) was found to both hinder autophagy to evade host defense and utilize autophagy for viral replication. Interestingly, the possible significant stages of the autophagy biochemical network in relation to the corresponding autophagy-targeted SARS2 proteins from the different variants of concern (VOC) were never established. In this study, we performed the following: autophagy biochemical network design and centrality analyses; generated autophagy-targeted SARS2 protein models; and superimposed protein models for structural comparison. We identified 2 significant biochemical pathways (one starts from the ULK complex and the other starts from the PI3P complex) within the autophagy biochemical network. Similarly, we determined that the autophagy-targeted SARS2 proteins (Nsp15, M, ORF7a, ORF3a, and E) are structurally conserved throughout the different SARS2 VOC suggesting that the function of each protein is preserved during SARS2 evolution. Interestingly, among the autophagy-targeted SARS2 proteins, the M protein coincides with the 2 significant biochemical pathways we identified within the autophagy biochemical network. In this regard, we propose that the SARS2 M protein is the main determinant that would influence autophagy outcome in regard to SARS2 infection.
Collapse
Affiliation(s)
- Marni E. Cueno
- Department of Microbiology and Immunology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan,Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan,Corresponding author. Department of Microbiology and Immunology, Nihon University School of Dentistry, 1-8-13 Kanda-Surugadai, Chiyoda-ku, Tokyo, 101-8310, Japan
| | - Keiichi Taketsuna
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Mitsuki Saito
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Sara Inoue
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Kenichi Imai
- Department of Microbiology and Immunology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan
| |
Collapse
|
4
|
Cueno ME, Imai K. Structural Insights on the SARS-CoV-2 Variants of Concern Spike Glycoprotein: A Computational Study With Possible Clinical Implications. Front Genet 2021; 12:773726. [PMID: 34745235 PMCID: PMC8568765 DOI: 10.3389/fgene.2021.773726] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 10/07/2021] [Indexed: 12/31/2022] Open
Abstract
Coronavirus disease 2019 (COVID-19) pandemic has been attributed to SARS-CoV-2 (SARS2) and, consequently, SARS2 has evolved into multiple SARS2 variants driving subsequent waves of infections. In particular, variants of concern (VOC) were identified to have both increased transmissibility and virulence ascribable to mutational changes occurring within the spike protein resulting to modifications in the protein structural orientation which in-turn may affect viral pathogenesis. However, this was never fully elucidated. Here, we generated spike models of endemic HCoVs (HCoV 229E, HCoV OC43, HCoV NL63, HCoV HKU1, SARS CoV, MERS CoV), original SARS2, and VOC (alpha, beta, gamma, delta). Model quality check, structural superimposition, and structural comparison based on RMSD values, TM scores, and contact mapping were all performed. We found that: 1) structural comparison between the original SARS2 and VOC whole spike protein model have minor structural differences (TM > 0.98); 2) the whole VOC spike models putatively have higher structural similarity (TM > 0.70) to spike models from endemic HCoVs coming from the same phylogenetic cluster; 3) original SARS2 S1-CTD and S1-NTD models are structurally comparable to VOC S1-CTD (TM = 1.0) and S1-NTD (TM > 0.96); and 4) endemic HCoV S1-CTD and S1-NTD models are structurally comparable to VOC S1-CTD (TM > 0.70) and S1-NTD (TM > 0.70) models belonging to the same phylogenetic cluster. Overall, we propose that structural similarities (possibly ascribable to similar conformational epitopes) may help determine immune cross-reactivity, whereas, structural differences (possibly associated with varying conformational epitopes) may lead to viral infection (either reinfection or breakthrough infection).
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| |
Collapse
|
5
|
Cueno ME, Ueno M, Iguchi R, Harada T, Miki Y, Yasumaru K, Kiso N, Wada K, Baba K, Imai K. Insights on the Structural Variations of the Furin-Like Cleavage Site Found Among the December 2019-July 2020 SARS-CoV-2 Spike Glycoprotein: A Computational Study Linking Viral Evolution and Infection. Front Med (Lausanne) 2021; 8:613412. [PMID: 33777970 PMCID: PMC7987684 DOI: 10.3389/fmed.2021.613412] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 02/16/2021] [Indexed: 12/17/2022] Open
Abstract
The SARS-CoV-2 (SARS2) is the cause of the coronavirus disease 2019 (COVID-19) pandemic. One unique structural feature of the SARS2 spike protein is the presence of a furin-like cleavage site (FLC) which is associated with both viral pathogenesis and host tropism. Specifically, SARS2 spike protein binds to the host ACE-2 receptor which in-turn is cleaved by furin proteases at the FLC site, suggesting that SARS2 FLC structural variations may have an impact on viral infectivity. However, this has not yet been fully elucidated. This study designed and analyzed a COVID-19 genomic epidemiology network for December 2019 to July 2020, and subsequently generated and analyzed representative SARS2 spike protein models from significant node clusters within the network. To distinguish possible structural variations, a model quality assessment was performed before further protein model analyses and superimposition of the protein models, particularly in both the receptor-binding domain (RBD) and FLC. Mutant spike models were generated with the unique 681PRRA684 amino acid sequence found within the deleted FLC. We found 9 SARS2 FLC structural patterns that could potentially correspond to nine node clusters encompassing various countries found within the COVID-19 genomic epidemiology network. Similarly, we associated this with the rapid evolution of the SARS2 genome. Furthermore, we observed that either in the presence or absence of the unique 681PRRA684 amino acid sequence no structural changes occurred within the SARS2 RBD, which we believe would mean that the SARS2 FLC has no structural influence on SARS2 RBD and may explain why host tropism was maintained.
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan.,Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan.,Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Miu Ueno
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Rinako Iguchi
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Tsubasa Harada
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Yoshifumi Miki
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Kanae Yasumaru
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Natsumi Kiso
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Kanta Wada
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Koki Baba
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| |
Collapse
|
6
|
Cueno ME, Imai K. Structural Comparison of the SARS CoV 2 Spike Protein Relative to Other Human-Infecting Coronaviruses. Front Med (Lausanne) 2021; 7:594439. [PMID: 33585502 PMCID: PMC7874069 DOI: 10.3389/fmed.2020.594439] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 12/14/2020] [Indexed: 12/19/2022] Open
Abstract
Coronaviruses (CoV) are enveloped positive-stranded RNA viruses and, historically, there are seven known human-infecting CoVs with varying degrees of virulence. CoV attachment to the host is the first step of viral pathogenesis and mainly relies on the spike glycoprotein located on the viral surface. Among the human-infecting CoVs, only the infection of SARS CoV 2 (SARS2) among humans resulted to a pandemic which would suggest that the protein structural conformation of SARS2 spike protein is distinct as compared to other human-infecting CoVs. Surprisingly, the possible differences and similarities in the protein structural conformation between the various human-infecting CoV spike proteins have not been fully elucidated. In this study, we utilized a computational approach to generate models and analyze the seven human-infecting CoV spike proteins, namely: HCoV 229E, HCoV OC43, HCoV NL63, HCoV HKU1, SARS CoV, MERS CoV, and SARS2. Model quality assessment of all CoV models generated, structural superimposition of the whole protein model and selected S1 domains (S1-CTD and S1-NTD), and structural comparison based on RMSD values, Tm scores, and contact mapping were all performed. We found that the structural orientation of S1-CTD is a potential structural feature associated to both the CoV phylogenetic cluster and lineage. Moreover, we observed that spike models in the same phylogenetic cluster or lineage could potentially have similar protein structure. Additionally, we established that there are potentially three distinct S1-CTD orientation (Pattern I, Pattern II, Pattern III) among the human-infecting CoVs. Furthermore, we postulate that human-infecting CoVs in the same phylogenetic cluster may have similar S1-CTD and S1-NTD structural orientation. Taken together, we propose that the SARS2 spike S1-CTD follows a Pattern III orientation which has a higher degree of similarity with SARS1 and some degree of similarity with both OC43 and HKU1 which coincidentally are in the same phylogenetic cluster and lineage, whereas, the SARS2 spike S1-NTD has some degree of similarity among human-infecting CoVs that are either in the same phylogenetic cluster or lineage.
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| |
Collapse
|
7
|
Sali A. From integrative structural biology to cell biology. J Biol Chem 2021; 296:100743. [PMID: 33957123 PMCID: PMC8203844 DOI: 10.1016/j.jbc.2021.100743] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/09/2021] [Accepted: 04/30/2021] [Indexed: 12/16/2022] Open
Abstract
Integrative modeling is an increasingly important tool in structural biology, providing structures by combining data from varied experimental methods and prior information. As a result, molecular architectures of large, heterogeneous, and dynamic systems, such as the ∼52-MDa Nuclear Pore Complex, can be mapped with useful accuracy, precision, and completeness. Key challenges in improving integrative modeling include expanding model representations, increasing the variety of input data and prior information, quantifying a match between input information and a model in a Bayesian fashion, inventing more efficient structural sampling, as well as developing better model validation, analysis, and visualization. In addition, two community-level challenges in integrative modeling are being addressed under the auspices of the Worldwide Protein Data Bank (wwPDB). First, the impact of integrative structures is maximized by PDB-Development, a prototype wwPDB repository for archiving, validating, visualizing, and disseminating integrative structures. Second, the scope of structural biology is expanded by linking the wwPDB resource for integrative structures with archives of data that have not been generally used for structure determination but are increasingly important for computing integrative structures, such as data from various types of mass spectrometry, spectroscopy, optical microscopy, proteomics, and genetics. To address the largest of modeling problems, a type of integrative modeling called metamodeling is being developed; metamodeling combines different types of input models as opposed to different types of data to compute an output model. Collectively, these developments will facilitate the structural biology mindset in cell biology and underpin spatiotemporal mapping of the entire cell.
Collapse
Affiliation(s)
- Andrej Sali
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, the Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California, USA.
| |
Collapse
|
8
|
Saltzberg DJ, Viswanath S, Echeverria I, Chemmama IE, Webb B, Sali A. Using Integrative Modeling Platform to compute, validate, and archive a model of a protein complex structure. Protein Sci 2020; 30:250-261. [PMID: 33166013 DOI: 10.1002/pro.3995] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 11/06/2020] [Accepted: 11/06/2020] [Indexed: 12/18/2022]
Abstract
Biology is advanced by producing structural models of biological systems, such as protein complexes. Some systems are recalcitrant to traditional structure determination methods. In such cases, it may still be possible to produce useful models by integrative structure determination that depends on simultaneous use of multiple types of data. An ensemble of models that are sufficiently consistent with the data is produced by a structural sampling method guided by a data-dependent scoring function. The variation in the ensemble of models quantified the uncertainty of the structure, generally resulting from the uncertainty in the input information and actual structural heterogeneity in the samples used to produce the data. Here, we describe how to generate, assess, and interpret ensembles of integrative structural models using our open source Integrative Modeling Platform program (https://integrativemodeling.org).
Collapse
Affiliation(s)
- Daniel J Saltzberg
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California, San Francisco, California, USA
| | - Shruthi Viswanath
- National Center for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India
| | - Ignacia Echeverria
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California, San Francisco, California, USA.,Department of Cellular and Molecular Pharmacology, University of California, San Francisco, California, USA
| | - Ilan E Chemmama
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California, San Francisco, California, USA
| | - Ben Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California, San Francisco, California, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California, San Francisco, California, USA
| |
Collapse
|
9
|
Toward Increased Reliability, Transparency, and Accessibility in Cross-linking Mass Spectrometry. Structure 2020; 28:1259-1268. [PMID: 33065067 DOI: 10.1016/j.str.2020.09.011] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Revised: 09/02/2020] [Accepted: 09/24/2020] [Indexed: 01/09/2023]
Abstract
Cross-linking mass spectrometry (MS) has substantially matured as a method over the past 2 decades through parallel development in multiple labs, demonstrating its applicability to protein structure determination, conformation analysis, and mapping protein interactions in complex mixtures. Cross-linking MS has become a much-appreciated and routinely applied tool, especially in structural biology. Therefore, it is timely that the community commits to the development of methodological and reporting standards. This white paper builds on an open process comprising a number of events at community conferences since 2015 and identifies aspects of Cross-linking MS for which guidelines should be developed as part of a Cross-linking MS standards initiative.
Collapse
|
10
|
Cueno ME, Iguchi K, Suemitsu K, Hirano M, Hanzawa K, Isoda T, Ueno M, Iguchi R, Otani A, Imai K. Structural insights into the potential changes in receptor binding site found in the 1998-2018 influenza B/Yamagata hemagglutinin: A putative correlation between receptor binding site structural variability and seasonal infection. J Mol Graph Model 2020; 97:107580. [PMID: 32193088 DOI: 10.1016/j.jmgm.2020.107580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 03/04/2020] [Accepted: 03/05/2020] [Indexed: 12/09/2022]
Abstract
Influenza B virus has two distinct lineages (Victoria and Yamagata) and are associated with seasonal influenza epidemics that cause respiratory illness. Influenza B hemagglutinin (HA) is a major surface glycoprotein with the receptor-binding site (RBS) primarily involved in viral pathogenesis. Generally, influenza B exclusively infects the human population which would insinuate that the structural variability of the influenza B HA RBS rarely changes. However, to our knowledge, the potential impact of variations in the influenza B HA RBS structural variability was not fully elucidated. Throughout this study, we generated models from the transitioning (evolving viral lineage) 1998-2018 influenza B/Yamagata HA, verified the quality of each HA model, performed HA RBS structural variability measurements, superimposed varying HA models for comparison, and designed a phylogenetic tree network for further analyses. We found that measurements of the transitioning HA RBS structural variability were generally maintained and, similarly, measurements of the altered (years that differed from the evolving viral lineage, specifically 2003, 2007, 2017) HA RBS structural variability differed from the transitioning HA RBS. Moreover, we observed that the altered HA RBS structural variability favored the formation of a putative Y202-H191 hydrogen bond which we postulate may increase structural stability, thereby, allowing for a winter infection of the virus. Furthermore, we established that changes in HA RBS structural variability does not influence viral evolution, but putatively seasonal infection.
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan; Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan; Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan.
| | - Kanako Iguchi
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Kanta Suemitsu
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Marina Hirano
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Kosei Hanzawa
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Takemasa Isoda
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Miu Ueno
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Rinako Iguchi
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Aoi Otani
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan
| |
Collapse
|
11
|
Berman HM, Adams PD, Bonvin AA, Burley SK, Carragher B, Chiu W, DiMaio F, Ferrin TE, Gabanyi MJ, Goddard TD, Griffin PR, Haas J, Hanke CA, Hoch JC, Hummer G, Kurisu G, Lawson CL, Leitner A, Markley JL, Meiler J, Montelione GT, Phillips GN, Prisner T, Rappsilber J, Schriemer DC, Schwede T, Seidel CAM, Strutzenberg TS, Svergun DI, Tajkhorshid E, Trewhella J, Vallat B, Velankar S, Vuister GW, Webb B, Westbrook JD, White KL, Sali A. Federating Structural Models and Data: Outcomes from A Workshop on Archiving Integrative Structures. Structure 2019; 27:1745-1759. [PMID: 31780431 DOI: 10.1016/j.str.2019.11.002] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 10/31/2019] [Accepted: 11/06/2019] [Indexed: 12/23/2022]
Abstract
Structures of biomolecular systems are increasingly computed by integrative modeling. In this approach, a structural model is constructed by combining information from multiple sources, including varied experimental methods and prior models. In 2019, a Workshop was held as a Biophysical Society Satellite Meeting to assess progress and discuss further requirements for archiving integrative structures. The primary goal of the Workshop was to build consensus for addressing the challenges involved in creating common data standards, building methods for federated data exchange, and developing mechanisms for validating integrative structures. The summary of the Workshop and the recommendations that emerged are presented here.
Collapse
Affiliation(s)
- Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA; Bridge Institute, Michelson Center, University of Southern California, Los Angeles, CA 90089, USA.
| | - Paul D Adams
- Physical Biosciences Division, Lawrence Berkeley Laboratory, Berkeley, CA 94720-8235, USA; Department of Bioengineering, University of California-Berkeley, Berkeley, CA 94720, USA
| | - Alexandre A Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH Utrecht, the Netherlands
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences and San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA; Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08903, USA
| | - Bridget Carragher
- Simons Electron Microscopy Center, New York Structural Biology Center, New York, NY 10027, USA; Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Wah Chiu
- Department of Bioengineering, Department of Microbiology and Immunology, Stanford University, Stanford, CA 94305-5447, USA; SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Thomas E Ferrin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA
| | - Margaret J Gabanyi
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Thomas D Goddard
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA
| | | | - Juergen Haas
- Swiss Institute of Bioinformatics and Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Christian A Hanke
- Molecular Physical Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Jeffrey C Hoch
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, CT 06030, USA
| | - Gerhard Hummer
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438 Frankfurt am Main, Germany; Institute for Biophysics, Goethe University Frankfurt, 60438 Frankfurt am Main, Germany
| | - Genji Kurisu
- Protein Data Bank Japan (PDBj), Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | - Catherine L Lawson
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Alexander Leitner
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093 Zurich, Switzerland
| | - John L Markley
- BioMagResBank (BMRB), Biochemistry Department, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, 465 21st Avenue South, Nashville, TN 37221, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytech Institute, Troy, NY 12180, USA
| | - George N Phillips
- BioSciences at Rice and Department of Chemistry, Rice University, Houston, TX 77251, USA
| | - Thomas Prisner
- Institute of Physical and Theoretical Chemistry and Center of Biomolecular Magnetic Resonance, Goethe University Frankfurt, 60438 Frankfurt am Main, Germany
| | - Juri Rappsilber
- Wellcome Trust Centre for Cell Biology, Edinburgh EH9 3JR, Scotland
| | - David C Schriemer
- Department of Biochemistry & Molecular Biology, Robson DNA Science Centre, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Torsten Schwede
- Swiss Institute of Bioinformatics and Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Claus A M Seidel
- Molecular Physical Chemistry, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | | | - Dmitri I Svergun
- European Molecular Biology Laboratory (EMBL), Hamburg Outstation, Notkestrasse 85, 22607 Hamburg, Germany
| | - Emad Tajkhorshid
- Department of Biochemistry, NIH Center for Macromolecular Modeling and Bioinformatics, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Jill Trewhella
- School of Life and Environmental Sciences, The University of Sydney, Sydney, NSW 2006, Australia; Department of Chemistry, University of Utah, Salt Lake City, UT 84112, USA
| | - Brinda Vallat
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire CB10 1SD, UK
| | - Geerten W Vuister
- Department of Molecular and Cell Biology, Leicester Institute of Structural and Chemical Biology, University of Leicester, Leicester LE1 9HN, UK
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Kate L White
- Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA; Bridge Institute, Michelson Center, University of Southern California, Los Angeles, CA 90089, USA
| | - Andrej Sali
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158, USA; Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; California Institute for Quantitative Biosciences, University of California, San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|
12
|
Ortega DR, Oikonomou CM, Ding HJ, Rees-Lee P, Jensen GJ. ETDB-Caltech: A blockchain-based distributed public database for electron tomography. PLoS One 2019; 14:e0215531. [PMID: 30986271 PMCID: PMC6464211 DOI: 10.1371/journal.pone.0215531] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Accepted: 04/03/2019] [Indexed: 01/12/2023] Open
Abstract
Three-dimensional electron microscopy techniques like electron tomography provide valuable insights into cellular structures, and present significant challenges for data storage and dissemination. Here we explored a novel method to publicly release more than 11,000 such datasets, more than 30 TB in total, collected by our group. Our method, based on a peer-to-peer file sharing network built around a blockchain ledger, offers a distributed solution to data storage. In addition, we offer a user-friendly browser-based interface, https://etdb.caltech.edu, for anyone interested to explore and download our data. We discuss the relative advantages and disadvantages of this system and provide tools for other groups to mine our data and/or use the same approach to share their own imaging datasets.
Collapse
Affiliation(s)
- Davi R. Ortega
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Catherine M. Oikonomou
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - H. Jane Ding
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Prudence Rees-Lee
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | | | - Grant J. Jensen
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
- Howard Hughes Medical Institute, Pasadena, California, United States of America
- * E-mail:
| |
Collapse
|
13
|
Cueno ME, Shiotsu H, Nakano K, Sugiyama E, Kikuta M, Usui R, Oya R, Imai K. Structural significance of residues 158-160 in the H3N2 hemagglutnin globular head: A computational study with implications in viral evolution and infection. J Mol Graph Model 2019; 89:33-40. [PMID: 30849718 DOI: 10.1016/j.jmgm.2019.02.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 01/28/2019] [Accepted: 02/18/2019] [Indexed: 01/14/2023]
Abstract
Influenza A H3N2 has been linked to annual outbreaks within the human population attributable to continuous structural changes. H3N2 HA contains well identified antigenic sites and receptor-binding sites (RBS) that are possibly correlated to viral evolution and infection. However, the structural significance of amino acid residues associated with both viral evolution and infection were not fully demonstrated. Throughout this study, we generated and analyzed H3N2 HA models that represented the clade 3C.2 population (comprised of clades 3C.2, 3C.2a, and 3C.21 from the transitioning 2014-2018 H3N2 strains) and 3C.3a (from the 2016 H3N2 strain). Model quality estimation, structural analyses and superimposition, and network analytics of H3N2 HA1 evolution were performed. We found that the structural properties of residues 158-160 could influence the overall HA backbone. More specifically, amino acid substitutions at residues 159-160 affected the amino acid orientation at residue 158, thereby, causing the overall HA backbone structure to vary. Our results were consistent with 1968-2018 HA1 evolution. Taken together, we propose that our results would highlight the structural significance of residues 158-160 in HA1 for both antigenic drift and RBS.
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan; Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan; Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan.
| | - Hayato Shiotsu
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Karin Nakano
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Emiko Sugiyama
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Mari Kikuta
- Immersion Biology Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Rikuya Usui
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Riku Oya
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo, 178-0063, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, 101-8310, Japan
| |
Collapse
|
14
|
Vallat B, Webb B, Westbrook JD, Sali A, Berman HM. Development of a Prototype System for Archiving Integrative/Hybrid Structure Models of Biological Macromolecules. Structure 2018; 26:894-904.e2. [PMID: 29657133 DOI: 10.1016/j.str.2018.03.011] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 02/16/2018] [Accepted: 03/20/2018] [Indexed: 10/17/2022]
Abstract
Essential processes in biology are carried out by large macromolecular assemblies, whose structures are often difficult to determine by traditional methods. Increasingly, researchers combine measured data and computed information from several complementary methods to obtain "hybrid" or "integrative" structural models of macromolecules and their assemblies. These integrative/hybrid (I/H) models are not archived in the PDB because of the absence of standard data representations and processing mechanisms. Here we present the development of data standards and a prototype system for archiving I/H models. The data standards provide the definitions required for representing I/H models that span multiple spatiotemporal scales and conformational states, as well as spatial restraints derived from different experimental techniques. Based on these data definitions, we have built a prototype system called PDB-Dev, which provides the infrastructure necessary to archive I/H structural models. PDB-Dev is now accepting structures and is open to the community for new submissions.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA 94143, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA 94143, USA
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| |
Collapse
|
15
|
Cueno ME, Suzuki I, Shimotomai S, Yokoyama T, Nagahisa K, Imai K. Structural comparison among the 2013-2017 avian influenza A H5N6 hemagglutinin proteins: A computational study with epidemiological implications. J Mol Graph Model 2017; 79:185-191. [PMID: 29220671 DOI: 10.1016/j.jmgm.2017.11.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 11/25/2017] [Accepted: 11/27/2017] [Indexed: 12/09/2022]
Abstract
Avian influenza viruses easily spread allowing viral re-assortment to simply occur which in-turn increases the potential for a pandemic. A novel 2013 H5N6 influenza strain was detected among the avian population and was reported to continuously evolve, however, this was never structurally demonstrated. Here, we elucidated the putative structural evolution of the novel H5N6 influenza strain. Throughout this study, we analyzed 2013-2017 H5N6 HA protein models. Model quality was first verified before further analyses and structural comparison was made using superimposition. We found that Leu was inserted at position 1291 among the 2013-2015 models while Leu was not inserted among the 2016-2017 models. Moreover, presence of Leu at position 1291 shifts residue E1261 by 159.6° affecting nearby residues which may explain the difference between the 2013-2015 and 2016-2017 HA structural groups. Similarly, we believe that our results would support the hypothesis that the current H5N6 strain is still continuously evolving.
Collapse
Affiliation(s)
- Marni E Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo 101-8310, Japan; Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo 178-0063, Japan.
| | - Izuho Suzuki
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo 178-0063, Japan
| | - Shiori Shimotomai
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo 178-0063, Japan
| | - Takuma Yokoyama
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo 178-0063, Japan
| | - Kai Nagahisa
- Immersion Physics Class, Department of Science, Tokyo Gakugei University International Secondary School, Tokyo 178-0063, Japan
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo 101-8310, Japan
| |
Collapse
|
16
|
Touma C, Adams MN, Ashton NW, Mizzi M, El-Kamand S, Richard DJ, Cubeddu L, Gamsjaeger R. A data-driven structural model of hSSB1 (NABP2/OBFC2B) self-oligomerization. Nucleic Acids Res 2017; 45:8609-8620. [PMID: 28609781 PMCID: PMC5737504 DOI: 10.1093/nar/gkx526] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 06/05/2017] [Indexed: 12/19/2022] Open
Abstract
The maintenance of genome stability depends on the ability of the cell to repair DNA efficiently. Single-stranded DNA binding proteins (SSBs) play an important role in DNA processing events such as replication, recombination and repair. While the role of human single-stranded DNA binding protein 1 (hSSB1/NABP2/OBFC2B) in the repair of double-stranded breaks has been well established, we have recently shown that it is also essential for the base excision repair (BER) pathway following oxidative DNA damage. However, unlike in DSB repair, the formation of stable hSSB1 oligomers under oxidizing conditions is an important prerequisite for its proper function in BER. In this study, we have used solution-state NMR in combination with biophysical and functional experiments to obtain a structural model of hSSB1 self-oligomerization. We reveal that hSSB1 forms a tetramer that is structurally similar to the SSB from Escherichia coli and is stabilized by two cysteines (C81 and C99) as well as a subset of charged and hydrophobic residues. Our structural and functional data also show that hSSB1 oligomerization does not preclude its function in DSB repair, where it can interact with Ints3, a component of the SOSS1 complex, further establishing the versatility that hSSB1 displays in maintaining genome integrity.
Collapse
Affiliation(s)
- Christine Touma
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Mark N Adams
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Nicholas W Ashton
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Michael Mizzi
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Serene El-Kamand
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Derek J Richard
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Liza Cubeddu
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia.,School of Life and Environmental Sciences, University of Sydney, NSW 2006, Australia
| | - Roland Gamsjaeger
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia.,School of Life and Environmental Sciences, University of Sydney, NSW 2006, Australia
| |
Collapse
|
17
|
Bai WL, Wang JJ, Yin RH, Dang YL, Wang ZY, Zhu YB, Cong YY, Deng L, Guo D, Wang SQ, Yang SH, Xue HL. Molecular characterization of HOXC8 gene and methylation status analysis of its exon 1 associated with the length of cashmere fiber in Liaoning cashmere goat. Genetica 2017; 145:115-126. [DOI: 10.1007/s10709-017-9950-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 01/07/2017] [Indexed: 11/29/2022]
|
18
|
Burley SK, Berman HM, Kleywegt GJ, Markley JL, Nakamura H, Velankar S. Protein Data Bank (PDB): The Single Global Macromolecular Structure Archive. Methods Mol Biol 2017; 1607:627-641. [PMID: 28573592 PMCID: PMC5823500 DOI: 10.1007/978-1-4939-7000-1_26] [Citation(s) in RCA: 466] [Impact Index Per Article: 66.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2023]
Abstract
The Protein Data Bank (PDB)--the single global repository of experimentally determined 3D structures of biological macromolecules and their complexes--was established in 1971, becoming the first open-access digital resource in the biological sciences. The PDB archive currently houses ~130,000 entries (May 2017). It is managed by the Worldwide Protein Data Bank organization (wwPDB; wwpdb.org), which includes the RCSB Protein Data Bank (RCSB PDB; rcsb.org), the Protein Data Bank Japan (PDBj; pdbj.org), the Protein Data Bank in Europe (PDBe; pdbe.org), and BioMagResBank (BMRB; www.bmrb.wisc.edu). The four wwPDB partners operate a unified global software system that enforces community-agreed data standards and supports data Deposition, Biocuration, and Validation of ~11,000 new PDB entries annually (deposit.wwpdb.org). The RCSB PDB currently acts as the archive keeper, ensuring disaster recovery of PDB data and coordinating weekly updates. wwPDB partners disseminate the same archival data from multiple FTP sites, while operating complementary websites that provide their own views of PDB data with selected value-added information and links to related data resources. At present, the PDB archives experimental data, associated metadata, and 3D-atomic level structural models derived from three well-established methods: crystallography, nuclear magnetic resonance spectroscopy (NMR), and electron microscopy (3DEM). wwPDB partners are working closely with experts in related experimental areas (small-angle scattering, chemical cross-linking/mass spectrometry, Forster energy resonance transfer or FRET, etc.) to establish a federation of data resources that will support sustainable archiving and validation of 3D structural models and experimental data derived from integrative or hybrid methods.
Collapse
Affiliation(s)
- Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Center for Integrative Proteomics, Research, Institute for Quantitative Biomedicine, and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
- Rutgers Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, NJ, 08903, USA.
- Skaggs School of Pharmacy and Pharmaceutical Sciences and San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, 92093, USA.
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Center for Integrative Proteomics, Research, Institute for Quantitative Biomedicine, and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Gerard J Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, CB10 1SD, UK
| | - John L Markley
- BioMagResBank, Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Haruki Nakamura
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka, 565-0871, Japan
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, CB10 1SD, UK
| |
Collapse
|
19
|
Bienert S, Waterhouse A, de Beer TAP, Tauriello G, Studer G, Bordoli L, Schwede T. The SWISS-MODEL Repository-new features and functionality. Nucleic Acids Res 2016; 45:D313-D319. [PMID: 27899672 PMCID: PMC5210589 DOI: 10.1093/nar/gkw1132] [Citation(s) in RCA: 1047] [Impact Index Per Article: 130.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 10/26/2016] [Accepted: 10/28/2016] [Indexed: 11/23/2022] Open
Abstract
SWISS-MODEL Repository (SMR) is a database of annotated 3D protein structure models generated by the automated SWISS-MODEL homology modeling pipeline. It currently holds >400 000 high quality models covering almost 20% of Swiss-Prot/UniProtKB entries. In this manuscript, we provide an update of features and functionalities which have been implemented recently. We address improvements in target coverage, model quality estimates, functional annotations and improved in-page visualization. We also introduce a new update concept which includes regular updates of an expanded set of core organism models and UniProtKB-based targets, complemented by user-driven on-demand update of individual models. With the new release of the modeling pipeline, SMR has implemented a REST-API and adopted an open licencing model for accessing model coordinates, thus enabling bulk download for groups of targets fostering re-use of models in other contexts. SMR can be accessed at https://swissmodel.expasy.org/repository.
Collapse
Affiliation(s)
- Stefan Bienert
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Andrew Waterhouse
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Tjaart A P de Beer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland .,SIB Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland
| |
Collapse
|
20
|
Grabowski M, Langner KM, Cymborowski M, Porebski PJ, Sroka P, Zheng H, Cooper DR, Zimmerman MD, Elsliger MA, Burley SK, Minor W. A public database of macromolecular diffraction experiments. Acta Crystallogr D Struct Biol 2016; 72:1181-1193. [PMID: 27841751 PMCID: PMC5108346 DOI: 10.1107/s2059798316014716] [Citation(s) in RCA: 96] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Accepted: 09/17/2016] [Indexed: 12/28/2022] Open
Abstract
The low reproducibility of published experimental results in many scientific disciplines has recently garnered negative attention in scientific journals and the general media. Public transparency, including the availability of `raw' experimental data, will help to address growing concerns regarding scientific integrity. Macromolecular X-ray crystallography has led the way in requiring the public dissemination of atomic coordinates and a wealth of experimental data, making the field one of the most reproducible in the biological sciences. However, there remains no mandate for public disclosure of the original diffraction data. The Integrated Resource for Reproducibility in Macromolecular Crystallography (IRRMC) has been developed to archive raw data from diffraction experiments and, equally importantly, to provide related metadata. Currently, the database of our resource contains data from 2920 macromolecular diffraction experiments (5767 data sets), accounting for around 3% of all depositions in the Protein Data Bank (PDB), with their corresponding partially curated metadata. IRRMC utilizes distributed storage implemented using a federated architecture of many independent storage servers, which provides both scalability and sustainability. The resource, which is accessible via the web portal at http://www.proteindiffraction.org, can be searched using various criteria. All data are available for unrestricted access and download. The resource serves as a proof of concept and demonstrates the feasibility of archiving raw diffraction data and associated metadata from X-ray crystallographic studies of biological macromolecules. The goal is to expand this resource and include data sets that failed to yield X-ray structures in order to facilitate collaborative efforts that will improve protein structure-determination methods and to ensure the availability of `orphan' data left behind for various reasons by individual investigators and/or extinct structural genomics projects.
Collapse
Affiliation(s)
- Marek Grabowski
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - Karol M. Langner
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - Marcin Cymborowski
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - Przemyslaw J. Porebski
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
- Jerzy Haber Institute of Catalysis and Surface Chemistry, Polish Academy of Sciences, Niezapominajek 8, 30-239 Cracow, Poland
| | - Piotr Sroka
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - Heping Zheng
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - David R. Cooper
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - Matthew D. Zimmerman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| | - Marc-André Elsliger
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 90237, USA
| | - Stephen K. Burley
- RCSB Protein Data Bank; Center for Integrative Proteomics Research; Institute for Quantitative Biomedicine; Rutgers Cancer Institute of New Jersey; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- San Diego Supercomputer Center and Skaggs School of Pharmacological Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Wladek Minor
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22904, USA
| |
Collapse
|
21
|
Berman HM, Burley SK, Kleywegt GJ, Markley JL, Nakamura H, Velankar S. The archiving and dissemination of biological structure data. Curr Opin Struct Biol 2016; 40:17-22. [PMID: 27450113 PMCID: PMC5161703 DOI: 10.1016/j.sbi.2016.06.018] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Revised: 06/28/2016] [Accepted: 06/30/2016] [Indexed: 01/11/2023]
Abstract
The global Protein Data Bank (PDB) was the first open-access digital archive in biology. The history and evolution of the PDB are described, together with the ways in which molecular structural biology data and information are collected, curated, validated, archived, and disseminated by the members of the Worldwide Protein Data Bank organization (wwPDB; http://wwpdb.org). Particular emphasis is placed on the role of community in establishing the standards and policies by which the PDB archive is managed day-to-day.
Collapse
Affiliation(s)
- Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA.
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, Skaggs School of Pharmacy and Pharmaceutical Sciences and San Diego Supercomputer Center, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Gerard J Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK
| | - John L Markley
- Biological Magnetic Resonance Bank, Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Haruki Nakamura
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK
| |
Collapse
|
22
|
Bai WL, Dang YL, Wang JJ, Yin RH, Wang ZY, Zhu YB, Cong YY, Xue HL, Deng L, Guo D, Wang SQ, Yang SH. Molecular characterization, expression and methylation status analysis of BMP4 gene in skin tissue of Liaoning cashmere goat during hair follicle cycle. Genetica 2016; 144:457-67. [DOI: 10.1007/s10709-016-9914-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 07/07/2016] [Indexed: 12/24/2022]
|
23
|
Touma C, Kariawasam R, Gimenez AX, Bernardo RE, Ashton NW, Adams MN, Paquet N, Croll TI, O'Byrne KJ, Richard DJ, Cubeddu L, Gamsjaeger R. A structural analysis of DNA binding by hSSB1 (NABP2/OBFC2B) in solution. Nucleic Acids Res 2016; 44:7963-73. [PMID: 27387285 PMCID: PMC5027503 DOI: 10.1093/nar/gkw617] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Accepted: 06/28/2016] [Indexed: 02/07/2023] Open
Abstract
Single-stranded DNA binding proteins (SSBs) play an important role in DNA processing events such as replication, recombination and repair. Human single-stranded DNA binding protein 1 (hSSB1/NABP2/OBFC2B) contains a single oligosaccharide/oligonucleotide binding (OB) domain followed by a charged C-terminus and is structurally homologous to the SSB from the hyperthermophilic crenarchaeote Sulfolobus solfataricus. Recent work has revealed that hSSB1 is critical to homologous recombination and numerous other important biological processes such as the regulation of telomeres, the maintenance of DNA replication forks and oxidative damage repair. Since the ability of hSSB1 to directly interact with single-stranded DNA (ssDNA) is paramount for all of these processes, understanding the molecular details of ssDNA recognition is essential. In this study, we have used solution-state nuclear magnetic resonance in combination with biophysical and functional experiments to structurally analyse ssDNA binding by hSSB1. We reveal that ssDNA recognition in solution is modulated by base-stacking of four key aromatic residues within the OB domain. This DNA binding mode differs significantly from the recently determined crystal structure of the SOSS1 complex containing hSSB1 and ssDNA. Our findings elucidate the detailed molecular mechanism in solution of ssDNA binding by hSSB1, a major player in the maintenance of genomic stability.
Collapse
Affiliation(s)
- Christine Touma
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Ruvini Kariawasam
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Adrian X Gimenez
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Ray E Bernardo
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia
| | - Nicholas W Ashton
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Mark N Adams
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Nicolas Paquet
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Tristan I Croll
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Kenneth J O'Byrne
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Derek J Richard
- School of Biomedical Research, Institute of Health and Biomedical Innovation at the Translational Research Institute, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Liza Cubeddu
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia School of Molecular Biosciences, University of Sydney, NSW 2006, Australia
| | - Roland Gamsjaeger
- School of Science and Health, Western Sydney University, Penrith, NSW 2751, Australia School of Molecular Biosciences, University of Sydney, NSW 2006, Australia
| |
Collapse
|
24
|
Abstract
Protein tertiary structure prediction algorithms aim to predict, from amino acid sequence, the tertiary structure of a protein. In silico protein structure prediction methods have become extremely important, as in vitro-based structural elucidation is unable to keep pace with the current growth of sequence databases due to high-throughput next-generation sequencing, which has exacerbated the gaps in our knowledge between sequences and structures.Here we briefly discuss protein tertiary structure prediction, the biennial competition for the Critical Assessment of Techniques for Protein Structure Prediction (CASP) and its role in shaping the field. We also discuss, in detail, our cutting-edge web-server method IntFOLD2-TS for tertiary structure prediction. Furthermore, we provide a step-by-step guide on using the IntFOLD2-TS web server, along with some real world examples, where the IntFOLD server can and has been used to improve protein tertiary structure prediction and aid in functional elucidation.
Collapse
Affiliation(s)
- Daniel Barry Roche
- Institut de Biologie Computationnelle, LIRMM, CNRS, Université de Montpellier, Montpellier, France.
- CEA, DSV, IG, Genoscope, Évry, France.
- CNRS-UMR8030, Évry, France.
- Université d'Évry Val d'Essonne, Évry, France.
- PRES UniverSud Paris, Saint-Aubin, France.
| | | |
Collapse
|
25
|
Sali A, Berman HM, Schwede T, Trewhella J, Kleywegt G, Burley SK, Markley J, Nakamura H, Adams P, Bonvin AMJJ, Chiu W, Peraro MD, Di Maio F, Ferrin TE, Grünewald K, Gutmanas A, Henderson R, Hummer G, Iwasaki K, Johnson G, Lawson CL, Meiler J, Marti-Renom MA, Montelione GT, Nilges M, Nussinov R, Patwardhan A, Rappsilber J, Read RJ, Saibil H, Schröder GF, Schwieters CD, Seidel CAM, Svergun D, Topf M, Ulrich EL, Velankar S, Westbrook JD. Outcome of the First wwPDB Hybrid/Integrative Methods Task Force Workshop. Structure 2015; 23:1156-67. [PMID: 26095030 PMCID: PMC4933300 DOI: 10.1016/j.str.2015.05.013] [Citation(s) in RCA: 141] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2015] [Revised: 05/11/2015] [Accepted: 05/18/2015] [Indexed: 01/20/2023]
Abstract
Structures of biomolecular systems are increasingly computed by integrative modeling that relies on varied types of experimental data and theoretical information. We describe here the proceedings and conclusions from the first wwPDB Hybrid/Integrative Methods Task Force Workshop held at the European Bioinformatics Institute in Hinxton, UK, on October 6 and 7, 2014. At the workshop, experts in various experimental fields of structural biology, experts in integrative modeling and visualization, and experts in data archiving addressed a series of questions central to the future of structural biology. How should integrative models be represented? How should the data and integrative models be validated? What data should be archived? How should the data and models be archived? What information should accompany the publication of integrative models?
Collapse
Affiliation(s)
- Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, Byers Hall Room 503B, University of California, San Francisco, 1700 4(th) Street, San Francisco, CA 94158-2330, USA.
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Torsten Schwede
- Swiss Institute of Bioinformatics Biozentrum, University of Basel, Klingelbergstrasse 50-70, 4056 Basel, Switzerland
| | - Jill Trewhella
- School of Molecular Bioscience, The University of Sydney, NSW 2006, Australia
| | - Gerard Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences and San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA
| | - John Markley
- BioMagResBank, Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706-1544, USA
| | - Haruki Nakamura
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Paul Adams
- Physical Biosciences Division, Lawrence Berkeley Laboratory, Berkeley, CA 94720-8235, USA; Department of Bioengineering, UC Berkeley, Berkeley, CA 94720, USA
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht, 3584 CH, the Netherlands
| | - Wah Chiu
- National Center for Macromolecular Imaging, Baylor College of Medicine, Houston, TX 77030, USA
| | - Matteo Dal Peraro
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Frank Di Maio
- Department of Biochemistry, University of Washington, Seattle, WA 98195-7370, USA
| | - Thomas E Ferrin
- Department of Pharmaceutical Chemistry and Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, University of California, San Francisco, 600 16(th) Street, San Francisco, CA 94158-2517, USA
| | - Kay Grünewald
- Division of Structural Biology, Wellcome Trust Centre of Human Genetics, University of Oxford, OX3 7BN Oxford, UK
| | - Aleksandras Gutmanas
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Richard Henderson
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | - Gerhard Hummer
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, Max-von-Laue Straße 3, 60438 Frankfurt am Main, Germany
| | - Kenji Iwasaki
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Graham Johnson
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, University of California, San Francisco, 600 16(th) Street, San Francisco, CA 94158-2330, USA
| | - Catherine L Lawson
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Jens Meiler
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
| | - Marc A Marti-Renom
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Gene Regulation, Stem Cells and Cancer Program, Center for Genomic Regulation (CRG) and Institució Catalana de Recerca i Estudis Avançats (ICREA), 08028 Barcelona, Spain
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Michael Nilges
- Département de Biologie Structurale et Chimie, Unité de Bioinformatique Structurale, Institut Pasteur, F-75015 Paris, France; Unité Mixte de Recherche 3258, Centre National de la Recherche Scientifique, F-75015 Paris, France
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research Inc., Frederick National Laboratory, National Cancer Institute, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Ardan Patwardhan
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Juri Rappsilber
- Wellcome Trust Centre for Cell Biology, Institute of Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK; Department of Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, 13355 Berlin, Germany
| | - Randy J Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, UK
| | - Helen Saibil
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, Malet Street, London WC1E 7HX, UK
| | - Gunnar F Schröder
- Institute of Complex Systems (ICS-6), Forschungszentrum Jülich, 52425 Jülich, Germany; Physics Department, Heinrich-Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Charles D Schwieters
- Division of Computational Bioscience, Center for Information Technology, National Institutes of Health, Bethesda, MD 20892-0520, USA
| | - Claus A M Seidel
- Chair for Molecular Physical Chemistry, Heinrich-Heine-Universität, Universitätsstraße 1, 40225 Düsseldorf, Germany
| | - Dmitri Svergun
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Maya Topf
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, Malet Street, London WC1E 7HX, UK
| | - Eldon L Ulrich
- BioMagResBank, Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706-1544, USA
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| |
Collapse
|
26
|
Hurt E, Beck M. Towards understanding nuclear pore complex architecture and dynamics in the age of integrative structural analysis. Curr Opin Cell Biol 2015; 34:31-8. [PMID: 25938906 DOI: 10.1016/j.ceb.2015.04.009] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Revised: 03/26/2015] [Accepted: 04/16/2015] [Indexed: 11/29/2022]
Abstract
Determining the functional architecture of the nuclear pore complex, that remains only partially understood, requires bridging across different length scales. Recent technological advances in quantitative and cross-linking mass spectrometry, super-resolution fluorescence microscopy and electron microscopy have enormously accelerated the integration of different types of data into coherent structural models. Moreover, high-resolution structural analysis of nucleoporins and their in vitro reconstitution into complexes is now facilitated by the use of thermostable orthologs. In this review we highlight how the application of such technologies has led to novel insights into nuclear pore architecture and to a paradigm shift. Today nuclear pores are not anymore seen as static facilitators of nucleocytoplasmic transport but ensembles of multiple overlaying functional states that are involved in various cellular processes.
Collapse
Affiliation(s)
- Ed Hurt
- Biochemistry Center of Heidelberg University, INF328, D-69120 Heidelberg, Germany.
| | - Martin Beck
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Meyerhofstrasse 1, D-69117 Heidelberg, Germany.
| |
Collapse
|
27
|
Schwede T. Protein modeling: what happened to the "protein structure gap"? Structure 2014; 21:1531-40. [PMID: 24010712 DOI: 10.1016/j.str.2013.08.007] [Citation(s) in RCA: 83] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2013] [Revised: 08/12/2013] [Accepted: 08/12/2013] [Indexed: 11/27/2022]
Abstract
Computational modeling of three-dimensional macromolecular structures and complexes from their sequence has been a long-standing vision in structural biology. Over the last 2 decades, a paradigm shift has occurred: starting from a large "structure knowledge gap" between the huge number of protein sequences and small number of known structures, today, some form of structural information, either experimental or template-based models, is available for the majority of amino acids encoded by common model organism genomes. With the scientific focus of interest moving toward larger macromolecular complexes and dynamic networks of interactions, the integration of computational modeling methods with low-resolution experimental techniques allows the study of large and complex molecular machines. One of the open challenges for computational modeling and prediction techniques is to convey the underlying assumptions, as well as the expected accuracy and structural variability of a specific model, which is crucial to understanding its limitations.
Collapse
Affiliation(s)
- Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50-70, 4056 Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 50-70, 4056 Basel, Switzerland.
| |
Collapse
|
28
|
Berman HM, Kleywegt GJ, Nakamura H, Markley JL. How community has shaped the Protein Data Bank. Structure 2014; 21:1485-91. [PMID: 24010707 DOI: 10.1016/j.str.2013.07.010] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2013] [Revised: 07/12/2013] [Accepted: 07/17/2013] [Indexed: 11/19/2022]
Abstract
Following several years of community discussion, the Protein Data Bank (PDB) was established in 1971 as a public repository for the coordinates of three-dimensional models of biological macromolecules. Since then, the number, size, and complexity of structural models have continued to grow, reflecting the productivity of structural biology. Managed by the Worldwide PDB organization, the PDB has been able to meet increasing demands for the quantity of structural information and of quality. In addition to providing unrestricted access to structural information, the PDB also works to promote data standards and to raise the profile of structural biology with broader audiences. In this perspective, we describe the history of PDB and the many ways in which the community continues to shape the archive.
Collapse
Affiliation(s)
- Helen M Berman
- RCSB PDB, Center for Integrative Proteomics Research and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ USA 08854.
| | | | | | | |
Collapse
|
29
|
Recognition of Errors in the Refinement and Validation of Three-Dimensional Structures of AC1 Proteins of Begomovirus Strains by Using ProSA-Web. ACTA ACUST UNITED AC 2014. [DOI: 10.1155/2014/752656] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The structural model of begomovirus AC1 protein is useful for understanding biological function at molecular level and docking study. For this study we have used the ProSA program (Protein Structure Analysis) tool to establish the structure prediction and modeling of protein. This tool was used for refinement and validation of experimental protein structures. Potential problems of protein structures based on energy plots are easily seen by ProSA and are displayed in a three-dimensional manner. In the present study we have selected different AC1 proteins of begomovirus strains (YP_003288785, YP_002004579, and YP_003288773) for structural analysis and display of energy plots that highlight potential problems spotted in protein structures. The 3D models of Rep proteins with recognized errors can be effectively used for in silico docking study for development of potential ligand molecules against begomovirus infection.
Collapse
|
30
|
Cueno ME, Imai K, Tamura M, Ochiai K. Structural differences between the avian and human H7N9 hemagglutinin proteins are attributable to modifications in salt bridge formation: a computational study with implications in viral evolution. PLoS One 2013; 8:e76764. [PMID: 24116152 PMCID: PMC3792060 DOI: 10.1371/journal.pone.0076764] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2013] [Accepted: 09/03/2013] [Indexed: 01/06/2023] Open
Abstract
Influenza A hemagglutinin (HA) is a homotrimeric glycoprotein composed of a fibrous globular stem supporting a globular head containing three sialic acid binding sites responsible for infection. The H7N9 strain has consistently infected an avian host, however, the novel 2013 strain is now capable of infecting a human host which would imply that the HA in both strains structurally differ. A better understanding of the structural differences between the avian and human H7N9 strains may shed light into viral evolution and transmissibility. In this study, we elucidated the structural differences between the avian and human H7N9 strains. Throughout the study, we generated HA homology models, verified the quality of each model, superimposed HA homology models to determine structural differences, and, likewise, elucidated the probable cause for these structural differences. We detected two different types of structural differences between the novel H7N9 human and representative avian strains, wherein, one type (Pattern-1) showed three non-overlapping regions while the other type (Pattern-2) showed only one non-overlapping region. In addition, we found that superimposed HA homology models exhibiting Pattern-1 contain three non-overlapping regions designated as: Region-1 (S1571-A1601); Region-3 (R2621-S2651); and Region-4 (S2701-D2811), whereas, superimposed HA homology models showing Pattern-2 only contain one non-overlapping region designated as Region-2 (S1371-S1451). We attributed the two patterns we observed to either the presence of salt bridges involving the E1141 residue or absence of the R1411:D771 salt bridge. Interestingly, comparison between the human H7N7 and H7N9 HA homology models showed high structural similarity. We propose that the putative absence of the R1411:D771 salt bridge coupled with the putative presence of the E1141:R2621 and E1141:K2641 salt bridges found in the 2013 H7N9 HA homology model is associated to human-type receptor binding. This highlights the possible significance of HA salt bridge formation modifications in viral infectivity, immune escape, transmissibility and evolution.
Collapse
Affiliation(s)
- Marni E. Cueno
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
- * E-mail: (KO); (MEC)
| | - Kenichi Imai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| | - Muneaki Tamura
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
| | - Kuniyasu Ochiai
- Department of Microbiology, Nihon University School of Dentistry, Tokyo, Japan
- * E-mail: (KO); (MEC)
| |
Collapse
|
31
|
Bai WL, Yin RH, Yin RL, Wang JJ, Jiang WQ, Luo GB, Zhao ZH. IGF1 mRNA splicing variants in Liaoning cashmere goat: identification, characterization, and transcriptional patterns in skin and visceral organs. Anim Biotechnol 2013; 24:81-93. [PMID: 23534956 DOI: 10.1080/10495398.2012.750245] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Insulin-like growth factor I (IGF1) is a member of the insulin superfamily. It performs important roles in the proliferation and differentiation of skin cell and control of hair cycles and is thought to be a potential candidate gene for goat cashmere traits. In this work, we isolated and characterized three kinds of IGF1 mRNA splicing variants from the liver of Liaoning Cashmere goat, and the expression characterization of the IGF1 mRNA splicing variants were investigated in skin and other tissues of Liaoning cashmere goat. The sequencing results indicated that the classes 1w, 1, and 2 of IGF1 cDNAs in Liaoning cashmere goat, each included an open reading frame encoding the IGF1 precursor protein. The deduced amino acid sequences of the three IGF1 precursor proteins differed only in their NH2-terminal leader peptides. Through removal of the signal peptide and extension peptide, the three IGF1 mRNA splicing variants (classes 1w, 1, and 2) resulted in the same mature IGF1 protein in Liaoning cashmere goat. In skin tissue of Liaoning cashmere goat, class 1 and class 2 were detected in all stages of hair follicle cycling, and they had the highest transcription level at anagen, and then early anagen; whereas at telogen both classes 1 and 2 had the lowest expression in mRNA level, but the class 1 appears to be relatively more abundant than class 2 in skin tissue of Liaoning cashmere goat. However, the class 1w transcript was not detected in the skin tissues. Three classes of IGF1 mRNA were transcribed in a variety of tissues, including heart, brain, spleen, lung, kidney, liver, and skeletal muscle, but class 1 IGF1 mRNA was more abundant than classes 1w and 2 in the investigated tissues.
Collapse
Affiliation(s)
- Wen L Bai
- College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang, China
| | | | | | | | | | | | | |
Collapse
|
32
|
Homology modeling study toward identifying structural properties in the HA2 B-loop that would influence the HA1 receptor-binding site. J Mol Graph Model 2013; 44:161-7. [PMID: 23831996 DOI: 10.1016/j.jmgm.2013.05.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Revised: 05/20/2013] [Accepted: 05/27/2013] [Indexed: 11/20/2022]
Abstract
Influenza hemagglutinin (HA) consists of a fibrous globular stem (HA2) inserted into the viral membrane supporting a globular head (HA1). HA1 receptor-binding has been hypothesized to be structurally correlated to the HA2 B-loop, however, this was never fully understood. Here, we elucidated the structural relationship between the HA2 B-loop and the HA1 receptor-binding site (RBS). Throughout this study, we analyzed 2486 H1N1 HA homology models obtained from human, swine and avian strains during 1976-2012. Quality of all homology models were verified before further analyses. We established that amino acid residue 882 is putatively strain-conserved and differs in the human (K882), swine (H882) and avian (N882) strains. Moreover, we observed that the amino acid at residue 882 and, similarly, its orientation has the potential to influence the HA1 RBS diameter measurements which we hypothesize may consequentially affect influenza H1N1 viral infectivity, immune escape, transmissibility, and evolution.
Collapse
|
33
|
Bourne PE, Beran B, Bi C, Bluhm WF, Dimitropoulos D, Feng Z, Goodsell DS, Prlić A, B. Quinn G, W. Rose P, Westbrook J, Yukich B, Young J, Zardecki C, Berman HM. The evolution of the RCSB Protein Data Bank website. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2011. [DOI: 10.1002/wcms.57] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
| | - Bojan Beran
- University of California, San Diego, CA, USA
| | - Chunxiao Bi
- University of California, San Diego, CA, USA
| | | | | | - Zukang Feng
- Rutgers, The State University of New Jersey, NJ, USA
| | | | | | | | | | | | | | - Jasmine Young
- Rutgers, The State University of New Jersey, NJ, USA
| | | | | |
Collapse
|
34
|
Arnold K, Kiefer F, Kopp J, Battey JND, Podvinec M, Westbrook JD, Berman HM, Bordoli L, Schwede T. The Protein Model Portal. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2009; 10:1-8. [PMID: 19037750 PMCID: PMC2704613 DOI: 10.1007/s10969-008-9048-5] [Citation(s) in RCA: 117] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2008] [Accepted: 11/02/2008] [Indexed: 11/28/2022]
Abstract
Structural Genomics has been successful in determining the structures of many unique proteins in a high throughput manner. Still, the number of known protein sequences is much larger than the number of experimentally solved protein structures. Homology (or comparative) modeling methods make use of experimental protein structures to build models for evolutionary related proteins. Thereby, experimental structure determination efforts and homology modeling complement each other in the exploration of the protein structure space. One of the challenges in using model information effectively has been to access all models available for a specific protein in heterogeneous formats at different sites using various incompatible accession code systems. Often, structure models for hundreds of proteins can be derived from a given experimentally determined structure, using a variety of established methods. This has been done by all of the PSI centers, and by various independent modeling groups. The goal of the Protein Model Portal (PMP) is to provide a single portal which gives access to the various models that can be leveraged from PSI targets and other experimental protein structures. A single interface allows all existing pre-computed models across these various sites to be queried simultaneously, and provides links to interactive services for template selection, target-template alignment, model building, and quality assessment. The current release of the portal consists of 7.6 million model structures provided by different partner resources (CSMP, JCSG, MCSG, NESG, NYSGXRC, JCMM, ModBase, SWISS-MODEL Repository). The PMP is available at http://www.proteinmodelportal.org and from the PSI Structural Genomics Knowledgebase.
Collapse
Affiliation(s)
- Konstantin Arnold
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Florian Kiefer
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Jürgen Kopp
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - James N. D. Battey
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Michael Podvinec
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - John D. Westbrook
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854-8087 USA
| | - Helen M. Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854-8087 USA
| | - Lorenza Bordoli
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics (SIB), Basel, Switzerland
| |
Collapse
|
35
|
Schwede T, Sali A, Honig B, Levitt M, Berman HM, Jones D, Brenner SE, Burley SK, Das R, Dokholyan NV, Dunbrack RL, Fidelis K, Fiser A, Godzik A, Huang YJ, Humblet C, Jacobson MP, Joachimiak A, Krystek SR, Kortemme T, Kryshtafovych A, Montelione GT, Moult J, Murray D, Sanchez R, Sosnick TR, Standley DM, Stouch T, Vajda S, Vasquez M, Westbrook JD, Wilson IA. Outcome of a workshop on applications of protein models in biomedical research. Structure 2009; 17:151-9. [PMID: 19217386 PMCID: PMC2739730 DOI: 10.1016/j.str.2008.12.014] [Citation(s) in RCA: 109] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2008] [Revised: 11/14/2008] [Accepted: 12/16/2008] [Indexed: 02/05/2023]
Abstract
We describe the proceedings and conclusions from the "Workshop on Applications of Protein Models in Biomedical Research" (the Workshop) that was held at the University of California, San Francisco on 11 and 12 July, 2008. At the Workshop, international scientists involved with structure modeling explored (i) how models are currently used in biomedical research, (ii) the requirements and challenges for different applications, and (iii) how the interaction between the computational and experimental research communities could be strengthened to advance the field.
Collapse
Affiliation(s)
- Torsten Schwede
- Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Klingelbergstrasse 50-70, CH-4056 Basel, Switzerland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Nair R, Liu J, Soong TT, Acton TB, Everett JK, Kouranov A, Fiser A, Godzik A, Jaroszewski L, Orengo C, Montelione GT, Rost B. Structural genomics is the largest contributor of novel structural leverage. ACTA ACUST UNITED AC 2009; 10:181-91. [PMID: 19194785 PMCID: PMC2705706 DOI: 10.1007/s10969-008-9055-6] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2008] [Accepted: 12/08/2008] [Indexed: 11/28/2022]
Abstract
The Protein Structural Initiative (PSI) at the US National Institutes of Health (NIH) is funding four large-scale centers for structural genomics (SG). These centers systematically target many large families without structural coverage, as well as very large families with inadequate structural coverage. Here, we report a few simple metrics that demonstrate how successfully these efforts optimize structural coverage: while the PSI-2 (2005-now) contributed more than 8% of all structures deposited into the PDB, it contributed over 20% of all novel structures (i.e. structures for protein sequences with no structural representative in the PDB on the date of deposition). The structural coverage of the protein universe represented by today’s UniProt (v12.8) has increased linearly from 1992 to 2008; structural genomics has contributed significantly to the maintenance of this growth rate. Success in increasing novel leverage (defined in Liu et al. in Nat Biotechnol 25:849–851, 2007) has resulted from systematic targeting of large families. PSI’s per structure contribution to novel leverage was over 4-fold higher than that for non-PSI structural biology efforts during the past 8 years. If the success of the PSI continues, it may just take another ~15 years to cover most sequences in the current UniProt database.
Collapse
Affiliation(s)
- Rajesh Nair
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Berman HM, Westbrook JD, Gabanyi MJ, Tao W, Shah R, Kouranov A, Schwede T, Arnold K, Kiefer F, Bordoli L, Kopp J, Podvinec M, Adams PD, Carter LG, Minor W, Nair R, La Baer J. The protein structure initiative structural genomics knowledgebase. Nucleic Acids Res 2009; 37:D365-8. [PMID: 19010965 PMCID: PMC2686438 DOI: 10.1093/nar/gkn790] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2008] [Accepted: 10/08/2008] [Indexed: 11/13/2022] Open
Abstract
The Protein Structure Initiative Structural Genomics Knowledgebase (PSI SGKB, http://kb.psi-structuralgenomics.org) has been created to turn the products of the PSI structural genomics effort into knowledge that can be used by the biological research community to understand living systems and disease. This resource provides central access to structures in the Protein Data Bank (PDB), along with functional annotations, associated homology models, worldwide protein target tracking information, available protocols and the potential to obtain DNA materials for many of the targets. It also offers the ability to search all of the structural and methodological publications and the innovative technologies that were catalyzed by the PSI's high-throughput research efforts. In collaboration with the Nature Publishing Group, the PSI SGKB provides a research library, editorials about new research advances, news and an events calendar to present a broader view of structural biology and structural genomics. By making these resources freely available, the PSI SGKB serves as a bridge to connect the structural biology and the greater biomedical communities.
Collapse
Affiliation(s)
- Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Data deposition and annotation at the worldwide protein data bank. Mol Biotechnol 2008; 42:1-13. [PMID: 19082769 DOI: 10.1007/s12033-008-9127-7] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 11/06/2008] [Indexed: 10/21/2022]
Abstract
The Protein Data Bank (PDB) is the repository for three-dimensional structures of biological macromolecules, determined by experimental methods. The data in the archive is free and easily available via the Internet from any of the worldwide centers managing this global archive. These data are used by scientists, researchers, bioinformatics specialists, educators, students, and general audiences to understand biological phenomenon at a molecular level. Analysis of this structural data also inspires and facilitates new discoveries in science. This chapter describes the tools and methods currently used for deposition, processing, and release of data in the PDB. References to future enhancements are also included.
Collapse
|
39
|
Dutta S, Burkhardt K, Swaminathan GJ, Kosada T, Henrick K, Nakamura H, Berman HM. Data deposition and annotation at the worldwide protein data bank. Methods Mol Biol 2008; 426:81-101. [PMID: 18542858 DOI: 10.1007/978-1-60327-058-8_5] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
Abstract
The Protein Data Bank (PDB) is the repository for the three-dimensional structures of biological macromolecules, determined by experimental methods. The data in the archive are free and easily available via the Internet from any of the worldwide centers managing this global archive. These data are used by scientists, researchers, bioinformatics specialists, educators, students, and lay audiences to understand biological phenomena at a molecular level. Analysis of these structural data also inspires and facilitates new discoveries in science. This chapter describes the tools and methods currently used for deposition, processing, and release of data in the PDB. References to future enhancements are also included.
Collapse
Affiliation(s)
- Shuchismita Dutta
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | | | | | | |
Collapse
|
40
|
Dutta S, M Berman H, F Bluhm W. Using the tools and resources of the RCSB protein data bank. ACTA ACUST UNITED AC 2008; Chapter 1:1.9.1-1.9.24. [PMID: 18428680 DOI: 10.1002/0471250953.bi0109s20] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The Protein Data Bank (PDB; http://www.pdb.org) is the world-wide repository for three-dimensional structural data determined using various experimental methods. The options and procedures for searching and downloading structural data from the Research Collaboratory for Structural Bioinformatics (RCSB) PDB are described here, along with tools for assessing the quality of structures. Several types of information are associated with each structure deposition, including atomic coordinates of the structure, experimental data used to solve it, sequences of all macromolecules in the structures, details about the structure solution method, images showing different views of the structure, derived geometric data, and a variety of links to other resources. These data and resources may be used for understanding the function and stability of the molecule and for designing biochemical, genetic, or other experiments. They can also be used for molecular modeling and drug design.
Collapse
|
41
|
Affiliation(s)
- Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| |
Collapse
|
42
|
Occhino M, Ghiotto F, Soro S, Mortarino M, Bosi S, Maffei M, Bruno S, Nardini M, Figini M, Tramontano A, Ciccone E. Dissecting the structural determinants of the interaction between the human cytomegalovirus UL18 protein and the CD85j immune receptor. THE JOURNAL OF IMMUNOLOGY 2008; 180:957-68. [PMID: 18178836 DOI: 10.4049/jimmunol.180.2.957] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
UL18 is a glycoprotein encoded by the human cytomegalovirus genome and is thought to play a pivotal role during human cytomegalovirus infection, although its exact function is still a matter of debate. UL18 shares structural similarity with MHC class I and binds the receptor CD85j on immune cells. Besides UL18, CD85j binds MHC class I molecules. The binding properties of CD85j to MHC class I molecules have been thoroughly studied. Conversely, very little information is available on the CD85j/UL18 complex, namely that UL18 binds CD85j through its alpha3 domain with an affinity that is approximately 1000-fold higher than the MHC class I affinity for CD85j. Deeper knowledge of features of the UL18/CD85j complex would help to disclose the function of UL18 when it binds to CD85j. In this study we first demonstrated that the UL18alpha3 domain is not sufficient per se for binding and that beta2-microglobulin is necessary for UL18-CD85j interaction. We then dissected structural determinants of binding UL18 to CD85j. To this end, we constructed a three-dimensional model of the complex. The model was used to design mutants in selected regions of the putative interaction interface, the effects of which were measured on binding. Six regions in both the alpha2 and alpha3 domains and specific amino acids within them were identified that are potentially involved in the UL18-CD85j interaction. The higher affinity of UL18 to CD85j, compared with MHC class I, seems to be due not to additional interaction regions but to an overall better fit of the two molecules.
Collapse
Affiliation(s)
- Marzia Occhino
- Department of Experimental Medicine, University of Genoa, Genoa, Italy
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Smith E. Thermodynamics of natural selection I: Energy flow and the limits on organization. J Theor Biol 2008; 252:185-97. [PMID: 18367210 DOI: 10.1016/j.jtbi.2008.02.010] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2007] [Revised: 02/04/2008] [Accepted: 02/07/2008] [Indexed: 01/22/2023]
Abstract
This is the first of three papers analyzing the representation of information in the biosphere, and the energetic constraints limiting the imposition or maintenance of that information. Biological information is inherently a chemical property, but is equally an aspect of control flow and a result of processes equivalent to computation. The current paper develops the constraints on a theory of biological information capable of incorporating these three characterizations and their quantitative consequences. The paper illustrates the need for a theory linking energy and information by considering the problem of existence and reslience of the biosphere, and presents empirical evidence from growth and development at the organismal level suggesting that the theory developed will capture relevant constraints on real systems. The main result of the paper is that the limits on the minimal energetic cost of information flow will be tractable and universal whereas the assembly of more literal process models into a system-level description often is not. The second paper in the series then goes on to construct reversible models of energy and information flow in chemistry which achieve the idealized limits, and the third paper relates these to fundamental operations of computation.
Collapse
Affiliation(s)
- Eric Smith
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA.
| |
Collapse
|
44
|
Lyman E, Zuckerman DM. On the structural convergence of biomolecular simulations by determination of the effective sample size. J Phys Chem B 2007; 111:12876-82. [PMID: 17935314 PMCID: PMC2538559 DOI: 10.1021/jp073061t] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Although atomistic simulations of proteins and other biological systems are approaching microsecond timescales, the quality of simulation trajectories has remained difficult to assess. Such assessment is critical not only for establishing the relevance of any individual simulation but also in the extremely active field of developing computational methods. Here we map the trajectory assessment problem onto a simple statistical calculation of the "effective sample size", that is, the number of statistically independent configurations. The mapping is achieved by asking the question, "How much time must elapse between snapshots included in a sample for that sample to exhibit the statistical properties expected for independent and identically distributed configurations?" Our method is more general than autocorrelation methods in that it directly probes the configuration-space distribution without requiring a priori definition of configurational substates and without any fitting parameters. We show that the method is equally and directly applicable to toy models, peptides, and a 72-residue protein model. Variants of our approach can readily be applied to a wide range of physical and chemical systems.
Collapse
Affiliation(s)
- Edward Lyman
- Dept. of Computational biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213
| | - Daniel M. Zuckerman
- Dept. of Computational biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213
| |
Collapse
|
45
|
Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res 2007; 35:W407-10. [PMID: 17517781 PMCID: PMC1933241 DOI: 10.1093/nar/gkm290] [Citation(s) in RCA: 3749] [Impact Index Per Article: 220.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
A major problem in structural biology is the recognition of errors in experimental and theoretical models of protein structures. The ProSA program (Protein Structure Analysis) is an established tool which has a large user base and is frequently employed in the refinement and validation of experimental protein structures and in structure prediction and modeling. The analysis of protein structures is generally a difficult and cumbersome exercise. The new service presented here is a straightforward and easy to use extension of the classic ProSA program which exploits the advantages of interactive web-based applications for the display of scores and energy plots that highlight potential problems spotted in protein structures. In particular, the quality scores of a protein are displayed in the context of all known protein structures and problematic parts of a structure are shown and highlighted in a 3D molecule viewer. The service specifically addresses the needs encountered in the validation of protein structures obtained from X-ray analysis, NMR spectroscopy and theoretical calculations. ProSA-web is accessible at https://prosa.services.came.sbg.ac.at
Collapse
Affiliation(s)
| | - Manfred J. Sippl
- *To whom correspondence should be addressed. +43-662-8044-5796 +43-662-8044-176
| |
Collapse
|
46
|
Berman H, Henrick K, Nakamura H, Markley JL. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res 2006; 35:D301-3. [PMID: 17142228 PMCID: PMC1669775 DOI: 10.1093/nar/gkl971] [Citation(s) in RCA: 770] [Impact Index Per Article: 42.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The worldwide Protein Data Bank (wwPDB) is the international collaboration that manages the deposition, processing and distribution of the PDB archive. The online PDB archive is a repository for the coordinates and related information for more than 38 000 structures, including proteins, nucleic acids and large macromolecular complexes that have been determined using X-ray crystallography, NMR and electron microscopy techniques. The founding members of the wwPDB are RCSB PDB (USA), MSD-EBI (Europe) and PDBj (Japan) [H.M. Berman, K. Henrick and H. Nakamura (2003) Nature Struct. Biol., 10, 980]. The BMRB group (USA) joined the wwPDB in 2006. The mission of the wwPDB is to maintain a single archive of macromolecular structural data that are freely and publicly available to the global community. Additionally, the wwPDB provides a variety of services to a broad community of users. The wwPDB website at provides information about services provided by the individual member organizations and about projects undertaken by the wwPDB.
Collapse
Affiliation(s)
- Helen Berman
- RCSB Protein Data Bank, Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 610 Taylor Road, Piscataway, NJ 08854-8087, USA.
| | | | | | | |
Collapse
|