1
|
Henkel A, Oberthür D. A snapshot love story: what serial crystallography has done and will do for us. Acta Crystallogr D Struct Biol 2024; 80:563-579. [PMID: 38984902 PMCID: PMC11301758 DOI: 10.1107/s2059798324005588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 06/11/2024] [Indexed: 07/11/2024] Open
Abstract
Serial crystallography, born from groundbreaking experiments at the Linac Coherent Light Source in 2009, has evolved into a pivotal technique in structural biology. Initially pioneered at X-ray free-electron laser facilities, it has now expanded to synchrotron-radiation facilities globally, with dedicated experimental stations enhancing its accessibility. This review gives an overview of current developments in serial crystallography, emphasizing recent results in time-resolved crystallography, and discussing challenges and shortcomings.
Collapse
Affiliation(s)
- Alessandra Henkel
- Center for Free-Electron Laser Science CFELDeutsches Elektronen-Synchrotron DESYNotkestr. 8522607HamburgGermany
| | - Dominik Oberthür
- Center for Free-Electron Laser Science CFELDeutsches Elektronen-Synchrotron DESYNotkestr. 8522607HamburgGermany
| |
Collapse
|
2
|
Rosenberg AA, Marx A, Bronstein AM. A dataset of alternately located segments in protein crystal structures. Sci Data 2024; 11:783. [PMID: 39019896 PMCID: PMC11255211 DOI: 10.1038/s41597-024-03595-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 07/01/2024] [Indexed: 07/19/2024] Open
Abstract
Protein Data Bank (PDB) files list the relative spatial location of atoms in a protein structure as the final output of the process of fitting and refining to experimentally determined electron density measurements. Where experimental evidence exists for multiple conformations, atoms are modelled in alternate locations. Programs reading PDB files commonly ignore these alternate conformations by default leaving users oblivious to the presence of alternate conformations in the structures they analyze. This has led to underappreciation of their prevalence, under characterisation of their features and limited the accessibility to this high-resolution data representing structural ensembles. We have trawled PDB files to extract structural features of residues with alternately located atoms. The output includes the distance between alternate conformations and identifies the location of these segments within the protein chain and in proximity of all other atoms within a defined radius. This dataset should be of use in efforts to predict multiple structures from a single sequence and support studies investigating protein flexibility and the association with protein function.
Collapse
Affiliation(s)
- Aviv A Rosenberg
- Department of Computer Science, Technion - Israel Institute of Technology, Haifa, Israel
| | - Ailie Marx
- Department of Molecular and Computational Biosciences and Biotechnology, Migal - Galilee Research Institute, Qiryat, Israel.
| | - Alexander M Bronstein
- Department of Computer Science, Technion - Israel Institute of Technology, Haifa, Israel.
| |
Collapse
|
3
|
Wankowicz SA, Fraser JS. Comprehensive encoding of conformational and compositional protein structural ensembles through the mmCIF data structure. IUCRJ 2024; 11:494-501. [PMID: 38958015 PMCID: PMC11220883 DOI: 10.1107/s2052252524005098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 05/29/2024] [Indexed: 07/04/2024]
Abstract
In the folded state, biomolecules exchange between multiple conformational states crucial for their function. However, most structural models derived from experiments and computational predictions only encode a single state. To represent biomolecules accurately, we must move towards modeling and predicting structural ensembles. Information about structural ensembles exists within experimental data from X-ray crystallography and cryo-electron microscopy. Although new tools are available to detect conformational and compositional heterogeneity within these ensembles, the legacy PDB data structure does not robustly encapsulate this complexity. We propose modifications to the macromolecular crystallographic information file (mmCIF) to improve the representation and interrelation of conformational and compositional heterogeneity. These modifications will enable the capture of macromolecular ensembles in a human and machine-interpretable way, potentially catalyzing breakthroughs for ensemble-function predictions, analogous to the achievements of AlphaFold with single-structure prediction.
Collapse
Affiliation(s)
- Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic ScienceUniversity of CaliforniaSan FranciscoCA94117USA
| | - James S. Fraser
- Department of Bioengineering and Therapeutic ScienceUniversity of CaliforniaSan FranciscoCA94117USA
| |
Collapse
|
4
|
Wankowicz SA, Ravikumar A, Sharma S, Riley B, Raju A, Hogan DW, Flowers J, van den Bedem H, Keedy DA, Fraser JS. Automated multiconformer model building for X-ray crystallography and cryo-EM. eLife 2024; 12:RP90606. [PMID: 38904665 PMCID: PMC11192534 DOI: 10.7554/elife.90606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024] Open
Abstract
In their folded state, biomolecules exchange between multiple conformational states that are crucial for their function. Traditional structural biology methods, such as X-ray crystallography and cryogenic electron microscopy (cryo-EM), produce density maps that are ensemble averages, reflecting molecules in various conformations. Yet, most models derived from these maps explicitly represent only a single conformation, overlooking the complexity of biomolecular structures. To accurately reflect the diversity of biomolecular forms, there is a pressing need to shift toward modeling structural ensembles that mirror the experimental data. However, the challenge of distinguishing signal from noise complicates manual efforts to create these models. In response, we introduce the latest enhancements to qFit, an automated computational strategy designed to incorporate protein conformational heterogeneity into models built into density maps. These algorithmic improvements in qFit are substantiated by superior Rfree and geometry metrics across a wide range of proteins. Importantly, unlike more complex multicopy ensemble models, the multiconformer models produced by qFit can be manually modified in most major model building software (e.g., Coot) and fit can be further improved by refinement using standard pipelines (e.g., Phenix, Refmac, Buster). By reducing the barrier of creating multiconformer models, qFit can foster the development of new hypotheses about the relationship between macromolecular conformational dynamics and function.
Collapse
Affiliation(s)
- Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
- Ph.D. Program in Biology, The Graduate Center, City University of New YorkNew YorkUnited States
| | - Blake Riley
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
| | - Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
| | - Daniel W Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Atomwise IncSan FranciscoUnited States
| | - Daniel A Keedy
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
- Department of Chemistry and Biochemistry, City College of New YorkNew YorkUnited States
- Ph.D. Programs in Biochemistry, Biology and Chemistry, The Graduate Center, City University of New YorkNew YorkUnited States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
5
|
Wankowicz SA, Ravikumar A, Sharma S, Riley BT, Raju A, Flowers J, Hogan D, van den Bedem H, Keedy DA, Fraser JS. Uncovering Protein Ensembles: Automated Multiconformer Model Building for X-ray Crystallography and Cryo-EM. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.28.546963. [PMID: 37425870 PMCID: PMC10327213 DOI: 10.1101/2023.06.28.546963] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
In their folded state, biomolecules exchange between multiple conformational states that are crucial for their function. Traditional structural biology methods, such as X-ray crystallography and cryogenic electron microscopy (cryo-EM), produce density maps that are ensemble averages, reflecting molecules in various conformations. Yet, most models derived from these maps explicitly represent only a single conformation, overlooking the complexity of biomolecular structures. To accurately reflect the diversity of biomolecular forms, there is a pressing need to shift towards modeling structural ensembles that mirror the experimental data. However, the challenge of distinguishing signal from noise complicates manual efforts to create these models. In response, we introduce the latest enhancements to qFit, an automated computational strategy designed to incorporate protein conformational heterogeneity into models built into density maps. These algorithmic improvements in qFit are substantiated by superior R f r e e and geometry metrics across a wide range of proteins. Importantly, unlike more complex multicopy ensemble models, the multiconformer models produced by qFit can be manually modified in most major model building software (e.g. Coot) and fit can be further improved by refinement using standard pipelines (e.g. Phenix, Refmac, Buster). By reducing the barrier of creating multiconformer models, qFit can foster the development of new hypotheses about the relationship between macromolecular conformational dynamics and function.
Collapse
Affiliation(s)
- Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Ph.D. Program in Biology, The Graduate Center – City University of New York, New York, NY 10016
| | - Blake T. Riley
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Daniel Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Atomwise, Inc., San Francisco, CA, United States
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031
- Ph.D. Programs in Biochemistry, Biology, and Chemistry, The Graduate Center – City University of New York, New York, NY 10016
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
6
|
Palaniappan C, Rajendran S, Sekar K. Alternate conformations found in protein structures implies biological functions: A case study using cyclophilin A. Curr Res Struct Biol 2024; 7:100145. [PMID: 38690327 PMCID: PMC11059445 DOI: 10.1016/j.crstbi.2024.100145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Revised: 03/16/2024] [Accepted: 04/15/2024] [Indexed: 05/02/2024] Open
Abstract
Protein dynamics linked to numerous biomolecular functions, such as ligand binding, allosteric regulation, and catalysis, must be better understood at the atomic level. Reactive atoms of key residues drive a repertoire of biomolecular functions by flipping between alternate conformations or conformational substates, seldom found in protein structures. Probing such sparsely sampled alternate conformations would provide mechanistic insight into many biological functions. We are therefore interested in evaluating the instance of amino acids adopted alternate conformations, either in backbone or side-chain atoms or in both. Accordingly, over 70000 protein structures appear to contain alternate conformations only 'A' and 'B' for any atom, particularly the instance of amino acids that adopted alternate conformations are more for Arg, Cys, Met, and Ser than others. The resulting protein structure analysis depicts that amino acids with alternate conformations are mainly found in the helical and β-regions and are often seen in high-resolution X-ray crystal structures. Furthermore, a case study on human cyclophilin A (CypA) was performed to explain the pre-existing intrinsic dynamics of catalytically critical residues from the CypA and how such intrinsic dynamics perturbed upon Ser99Thr mutation using molecular dynamics simulations on the ns-μs timescale. Simulation results demonstrated that the Ser99Thr mutation had impaired the alternate conformations or the catalytically productive micro-environment of Phe113, mimicking the experimentally observed perturbation captured by X-ray crystallography. In brief, a deeper comprehension of alternate conformations adopted by the amino acids may shed light on the interplay between protein structure, dynamics, and function.
Collapse
Affiliation(s)
- Chandrasekaran Palaniappan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore, 560012, India
| | - Santhosh Rajendran
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore, 560012, India
| | - Kanagaraj Sekar
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| |
Collapse
|
7
|
Stachowski TR, Fischer M. FLEXR GUI: a graphical user interface for multi-conformer modeling of proteins. J Appl Crystallogr 2024; 57:580-586. [PMID: 38596743 PMCID: PMC11001397 DOI: 10.1107/s1600576724001523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 02/14/2024] [Indexed: 04/11/2024] Open
Abstract
Proteins are well known 'shapeshifters' which change conformation to function. In crystallography, multiple conformational states are often present within the crystal and the resulting electron-density map. Yet, explicitly incorporating alternative states into models to disentangle multi-conformer ensembles is challenging. We previously reported the tool FLEXR, which, within a few minutes, automatically separates conformational signal from noise and builds the corresponding, often missing, structural features into a multi-conformer model. To make the method widely accessible for routine multi-conformer building as part of the computational toolkit for macromolecular crystallography, we present a graphical user interface (GUI) for FLEXR, designed as a plugin for Coot 1. The GUI implementation seamlessly connects FLEXR models with the existing suite of validation and modeling tools available in Coot. We envision that FLEXR will aid crystallographers by increasing access to a multi-conformer modeling method that will ultimately lead to a better representation of protein conformational heterogeneity in the Protein Data Bank. In turn, deeper insights into the protein conformational landscape may inform biology or provide new opportunities for ligand design. The code is open source and freely available on GitHub at https://github.com/TheFischerLab/FLEXR-GUI.
Collapse
Affiliation(s)
- Timothy R. Stachowski
- Department of Chemical Biology and Therapeutics, St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Marcus Fischer
- Department of Chemical Biology and Therapeutics, St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| |
Collapse
|
8
|
Sharma S, Skaist Mehlman T, Sagabala RS, Boivin B, Keedy DA. High-resolution double vision of the allosteric phosphatase PTP1B. Acta Crystallogr F Struct Biol Commun 2024; 80:1-12. [PMID: 38133579 PMCID: PMC10833341 DOI: 10.1107/s2053230x23010749] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 12/14/2023] [Indexed: 12/23/2023] Open
Abstract
Protein tyrosine phosphatase 1B (PTP1B) plays important roles in cellular homeostasis and is a highly validated therapeutic target for multiple human ailments, including diabetes, obesity and breast cancer. However, much remains to be learned about how conformational changes may convey information through the structure of PTP1B to enable allosteric regulation by ligands or functional responses to mutations. High-resolution X-ray crystallography can offer unique windows into protein conformational ensembles, but comparison of even high-resolution structures is often complicated by differences between data sets, including non-isomorphism. Here, the highest resolution crystal structure of apo wild-type (WT) PTP1B to date is presented out of a total of ∼350 PTP1B structures in the PDB. This structure is in a crystal form that is rare for PTP1B, with two unique copies of the protein that exhibit distinct patterns of conformational heterogeneity, allowing a controlled comparison of local disorder across the two chains within the same asymmetric unit. The conformational differences between these chains are interrogated in the apo structure and between several recently reported high-resolution ligand-bound structures. Electron-density maps in a high-resolution structure of a recently reported activating double mutant are also examined, and unmodeled alternate conformations in the mutant structure are discovered that coincide with regions of enhanced conformational heterogeneity in the new WT structure. These results validate the notion that these mutations operate by enhancing local dynamics, and suggest a latent susceptibility to such changes in the WT enzyme. Together, these new data and analysis provide a detailed view of the conformational ensemble of PTP1B and highlight the utility of high-resolution crystallography for elucidating conformational heterogeneity with potential relevance for function.
Collapse
Affiliation(s)
- Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
- PhD Program in Biology, CUNY Graduate Center, New York, NY 10016, USA
| | - Tamar Skaist Mehlman
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
| | - Reddy Sudheer Sagabala
- Department of Nanobioscience, College of Nanoscale Science and Engineering, SUNY Polytechnic Institute, Albany, NY 12203, USA
| | - Benoit Boivin
- Department of Nanobioscience, College of Nanoscale Science and Engineering, SUNY Polytechnic Institute, Albany, NY 12203, USA
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031, USA
- PhD Programs in Biochemistry, Biology and Chemistry, CUNY Graduate Center, New York, NY 10016, USA
| |
Collapse
|