1
|
Tajane SV, Thakur A, Acharya S, Chakrabarti P, Dey S. On the abundance and importance of AXXXA sequence motifs in globular proteins and their involvement in C βC β interaction. J Struct Biol 2024; 216:108129. [PMID: 39343152 DOI: 10.1016/j.jsb.2024.108129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 09/22/2024] [Accepted: 09/25/2024] [Indexed: 10/01/2024]
Abstract
The AXXXA and GXXXG motifs are frequently observed in helices, especially in membrane proteins. The motif GXXXG is known to stabilize helix-helix association in membrane proteins via CαHO bonding. AXXXA sequence motif additionally stabilizes the folded state of proteins. We found 27,000 and 18,000 occurrences of AXXXA and GXXXG motifs in a non-redundant set of 6000 obligate homodimeric (OD) complexes. Interestingly, this is less pronounced in transient homodimers (TD) and heterodimers (HetD). On average each obligate homodimer contains four AXXXA motifs, it is 2 and 3.5 for HetD and TD, respectively. Focusing on the binding surface it is seen that 27 % of the ODs contain at least one AXXXA motif at the interface, whereas it is 17 % and 15 % for HetD and TD respectively. AXXXA predominantly stabilizes the OD quaternary structure via the side chain CβCβ interactions. This interaction is energetically favorable and is found to be a major driving force for OD quaternary structure stability. Cβ-Cβ interactions are observed ∼6 times higher than the known CαHO interaction for helix-helix stabilization. Two additional new interactions of CβO and OO are observed at the AXXXA containing interface regions. The occurrence of the motif gets drastically reduced if any of the terminal Ala residues are replaced by Gly. Our findings show the importance of AXXXA in providing stability to the quaternary structure through specific hydrophobic interactions and the specificity of the Ala residue at motif termini. The knowledge gained can be used for designing synthetic proteins of improved stability and for designing peptide-based therapeutics.
Collapse
Affiliation(s)
- Surbhi Vilas Tajane
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, NH 62, Nagaur Road, Karwar 342030, Rajasthan, India
| | - Abhilasha Thakur
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, NH 62, Nagaur Road, Karwar 342030, Rajasthan, India
| | - Srijita Acharya
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, NH 62, Nagaur Road, Karwar 342030, Rajasthan, India
| | | | - Sucharita Dey
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, NH 62, Nagaur Road, Karwar 342030, Rajasthan, India.
| |
Collapse
|
2
|
Pollet L, Xia Y. Structure-guided Evolutionary Analysis of Interactome Network Rewiring at Single Residue Resolution in Yeasts. J Mol Biol 2024; 436:168641. [PMID: 38844045 DOI: 10.1016/j.jmb.2024.168641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 04/30/2024] [Accepted: 06/01/2024] [Indexed: 06/16/2024]
Abstract
Protein-protein interactions (PPIs) are known to rewire extensively during evolution leading to lineage-specific and species-specific changes in molecular processes. However, the detailed molecular evolutionary mechanisms underlying interactome network rewiring are not well-understood. Here, we combine high-confidence PPI data, high-resolution three-dimensional structures of protein complexes, and homology-based structural annotation transfer to construct structurally-resolved interactome networks for the two yeasts S. cerevisiae and S. pombe. We then classify PPIs according to whether they are preserved or different between the two yeast species and compare site-specific evolutionary rates of interfacial versus non-interfacial residues for these different categories of PPIs. We find that residues in PPI interfaces evolve significantly more slowly than non-interfacial residues when using lineage-specific measures of evolutionary rate, but not when using non-lineage-specific measures. Furthermore, both lineage-specific and non-lineage-specific evolutionary rate measures can distinguish interfacial residues from non-interfacial residues for preserved PPIs between the two yeasts, but only the lineage-specific measure is appropriate for rewired PPIs. Finally, both lineage-specific and non-lineage-specific evolutionary rate measures are appropriate for elucidating structural determinants of protein evolution for residues outside of PPI interfaces. Overall, our results demonstrate that unlike tertiary structures of single proteins, PPIs and PPI interfaces can be highly volatile in their evolution, thus requiring the use of lineage-specific measures when studying their evolution. These results yield insight into the evolutionary design principles of PPIs and the mechanisms by which interactions are preserved or rewired between species, improving our understanding of the molecular evolution of PPIs and PPI interfaces at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
3
|
Sendker FL, Schlotthauer T, Mais CN, Lo YK, Girbig M, Bohn S, Heimerl T, Schindler D, Weinstein A, Metzger BP, Thornton JW, Pillai A, Bange G, Schuller JM, Hochberg GK. Frequent transitions in self-assembly across the evolution of a central metabolic enzyme. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.05.602260. [PMID: 39005358 PMCID: PMC11245102 DOI: 10.1101/2024.07.05.602260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Many enzymes assemble into homomeric protein complexes comprising multiple copies of one protein. Because structural form is usually assumed to follow function in biochemistry, these assemblies are thought to evolve because they provide some functional advantage. In many cases, however, no specific advantage is known and, in some cases, quaternary structure varies among orthologs. This has led to the proposition that self-assembly may instead vary neutrally within protein families. The extent of such variation has been difficult to ascertain because quaternary structure has until recently been difficult to measure on large scales. Here, we employ mass photometry, phylogenetics, and structural biology to interrogate the evolution of homo-oligomeric assembly across the entire phylogeny of prokaryotic citrate synthases - an enzyme with a highly conserved function. We discover a menagerie of different assembly types that come and go over the course of evolution, including cases of parallel evolution and reversions from complex to simple assemblies. Functional experiments in vitro and in vivo indicate that evolutionary transitions between different assemblies do not strongly influence enzyme catalysis. Our work suggests that enzymes can wander relatively freely through a large space of possible assemblies and demonstrates the power of characterizing structure-function relationships across entire phylogenies.
Collapse
Affiliation(s)
- Franziska L. Sendker
- Max-Planck-Institute for Terrestrial Microbiology; Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | - Tabea Schlotthauer
- Max-Planck-Institute for Terrestrial Microbiology; Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | - Christopher-Nils Mais
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
| | - Yat Kei Lo
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
| | - Mathias Girbig
- Max-Planck-Institute for Terrestrial Microbiology; Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | - Stefan Bohn
- Institute of Structural Biology, Helmholtz Center Munich, Ingolstädter Landstraße 1 Neuherberg, Germany
| | - Thomas Heimerl
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
| | - Daniel Schindler
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
- MaxGENESYS Biofoundry, Max-Planck-Institute for Terrestrial Microbiology; Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| | - Arielle Weinstein
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Brain P. Metzger
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Joseph W. Thornton
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Arvind Pillai
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Gert Bange
- Max-Planck-Institute for Terrestrial Microbiology; Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
- Department of Chemistry, Philipps-University Marburg; Hans-Meerwein-Str. 4, 35043 Marburg, Germany
| | - Jan M. Schuller
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
- Department of Chemistry, Philipps-University Marburg; Hans-Meerwein-Str. 4, 35043 Marburg, Germany
| | - Georg K.A. Hochberg
- Max-Planck-Institute for Terrestrial Microbiology; Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-University Marburg; Karl-von-Frisch-Str. 14, 35043 Marburg, Germany
- Department of Chemistry, Philipps-University Marburg; Hans-Meerwein-Str. 4, 35043 Marburg, Germany
| |
Collapse
|
4
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A suite of designed protein cages using machine learning and protein fragment-based protocols. Structure 2024; 32:751-765.e11. [PMID: 38513658 PMCID: PMC11162342 DOI: 10.1016/j.str.2024.02.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/22/2024] [Accepted: 02/23/2024] [Indexed: 03/23/2024]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, but their creation remains challenging. Here, we apply computational approaches to design a suite of tetrahedrally symmetric, self-assembling protein cages. For the generation of docked conformations, we emphasize a protein fragment-based approach, while for sequence design of the de novo interface, a comparison of knowledge-based and machine learning protocols highlights the power and increased experimental success achieved using ProteinMPNN. An analysis of design outcomes provides insights for improving interface design protocols, including prioritizing fragment-based motifs, balancing interface hydrophobicity and polarity, and identifying preferred polar contact patterns. In all, we report five structures for seven protein cages, along with two structures of intermediate assemblies, with the highest resolution reaching 2.0 Å using cryo-EM. This set of designed cages adds substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Michael R Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Mark A Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA; UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA.
| |
Collapse
|
5
|
Cisneros AF, Nielly-Thibault L, Mallik S, Levy ED, Landry CR. Mutational biases favor complexity increases in protein interaction networks after gene duplication. Mol Syst Biol 2024; 20:549-572. [PMID: 38499674 PMCID: PMC11066126 DOI: 10.1038/s44320-024-00030-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 02/27/2024] [Accepted: 02/28/2024] [Indexed: 03/20/2024] Open
Abstract
Biological systems can gain complexity over time. While some of these transitions are likely driven by natural selection, the extent to which they occur without providing an adaptive benefit is unknown. At the molecular level, one example is heteromeric complexes replacing homomeric ones following gene duplication. Here, we build a biophysical model and simulate the evolution of homodimers and heterodimers following gene duplication using distributions of mutational effects inferred from available protein structures. We keep the specific activity of each dimer identical, so their concentrations drift neutrally without new functions. We show that for more than 60% of tested dimer structures, the relative concentration of the heteromer increases over time due to mutational biases that favor the heterodimer. However, allowing mutational effects on synthesis rates and differences in the specific activity of homo- and heterodimers can limit or reverse the observed bias toward heterodimers. Our results show that the accumulation of more complex protein quaternary structures is likely under neutral evolution, and that natural selection would be needed to reverse this tendency.
Collapse
Affiliation(s)
- Angel F Cisneros
- Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada
- Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada
- Department of Chemical and Structural Biology, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Lou Nielly-Thibault
- Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada
- PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada
- Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada
- Département de biologie, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada
| | - Saurav Mallik
- Department of Chemical and Structural Biology, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Chemical and Structural Biology, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Christian R Landry
- Département de biochimie, de microbiologie et de bio-informatique, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada.
- Institut de biologie intégrative et des systèmes, Université Laval, G1V 0A6, Québec, Canada.
- PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Québec, Canada.
- Centre de recherche sur les données massives, Université Laval, G1V 0A6, Québec, Canada.
- Département de biologie, Faculté des sciences et de génie, Université Laval, G1V 0A6, Québec, Canada.
| |
Collapse
|
6
|
Kshirsagar M, Meller A, Humphreys I, Sledzieski S, Xu Y, Dodhia R, Horvitz E, Berger B, Bowman G, Ferres JL, Baker D, Baek M. Rapid and accurate prediction of protein homo-oligomer symmetry with Seq2Symm. RESEARCH SQUARE 2024:rs.3.rs-4215086. [PMID: 38746169 PMCID: PMC11092833 DOI: 10.21203/rs.3.rs-4215086/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The majority of proteins must form higher-order assemblies to perform their biological functions. Despite the importance of protein quaternary structure, there are few machine learning models that can accurately and rapidly predict the symmetry of assemblies involving multiple copies of the same protein chain. Here, we address this gap by training several classes of protein foundation models, including ESM-MSA, ESM2, and RoseTTAFold2, to predict homo-oligomer symmetry. Our best model named Seq2Symm, which utilizes ESM2, outperforms existing template-based and deep learning methods. It achieves an average PR-AUC of 0.48 and 0.44 across homo-oligomer symmetries on two different held-out test sets compared to 0.32 and 0.23 for the template-based method. Because Seq2Symm can rapidly predict homo-oligomer symmetries using a single sequence as input (~ 80,000 proteins/hour), we have applied it to 5 entire proteomes and ~ 3.5 million unlabeled protein sequences to identify patterns in protein assembly complexity across biological kingdoms and species.
Collapse
Affiliation(s)
| | | | | | | | - Yixi Xu
- Microsoft AI for Good research lab
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Schweke H, Pacesa M, Levin T, Goverde CA, Kumar P, Duhoo Y, Dornfeld LJ, Dubreuil B, Georgeon S, Ovchinnikov S, Woolfson DN, Correia BE, Dey S, Levy ED. An atlas of protein homo-oligomerization across domains of life. Cell 2024; 187:999-1010.e15. [PMID: 38325366 DOI: 10.1016/j.cell.2024.01.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 11/03/2023] [Accepted: 01/15/2024] [Indexed: 02/09/2024]
Abstract
Protein structures are essential to understanding cellular processes in molecular detail. While advances in artificial intelligence revealed the tertiary structure of proteins at scale, their quaternary structure remains mostly unknown. We devise a scalable strategy based on AlphaFold2 to predict homo-oligomeric assemblies across four proteomes spanning the tree of life. Our results suggest that approximately 45% of an archaeal proteome and a bacterial proteome and 20% of two eukaryotic proteomes form homomers. Our predictions accurately capture protein homo-oligomerization, recapitulate megadalton complexes, and unveil hundreds of homo-oligomer types, including three confirmed experimentally by structure determination. Integrating these datasets with omics information suggests that a majority of known protein complexes are symmetric. Finally, these datasets provide a structural context for interpreting disease mutations and reveal coiled-coil regions as major enablers of quaternary structure evolution in human. Our strategy is applicable to any organism and provides a comprehensive view of homo-oligomerization in proteomes.
Collapse
Affiliation(s)
- Hugo Schweke
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Martin Pacesa
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tal Levin
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Casper A Goverde
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Prasun Kumar
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Bristol BS8 1TQ, UK; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK
| | - Yoan Duhoo
- Protein Production and Structure Characterization Core Facility (PTPSP), School of Life Sciences, École polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Lars J Dornfeld
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Benjamin Dubreuil
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Sandrine Georgeon
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
| | - Derek N Woolfson
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Bristol BS8 1TQ, UK; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK.
| | - Bruno E Correia
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| | - Sucharita Dey
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, Rajasthan, India.
| | - Emmanuel D Levy
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
8
|
Appasamy SD, Berrisford J, Gaborova R, Nair S, Anyango S, Grudinin S, Deshpande M, Armstrong D, Pidruchna I, Ellaway JIJ, Leines GD, Gupta D, Harrus D, Varadi M, Velankar S. Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data. Sci Data 2023; 10:853. [PMID: 38040737 PMCID: PMC10692154 DOI: 10.1038/s41597-023-02778-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 11/23/2023] [Indexed: 12/03/2023] Open
Abstract
Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names to over 90% of unique assemblies in the PDB and provided persistent identifiers for each assembly. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of macromolecular complexes. Furthermore, the data standardisation improves the PDB's FAIR attributes, fostering more effective basic and translational research and scientific education.
Collapse
Affiliation(s)
- Sri Devan Appasamy
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | - John Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Romana Gaborova
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Sreenath Nair
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stephen Anyango
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sergei Grudinin
- Univ. Grenoble Alpes, CNRS, Grenoble INP, LJK, 38000, Grenoble, France
| | - Mandar Deshpande
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - David Armstrong
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Ivanna Pidruchna
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Joseph I J Ellaway
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Grisell Díaz Leines
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Deepti Gupta
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Deborah Harrus
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
9
|
Avraham O, Tsaban T, Ben-Aharon Z, Tsaban L, Schueler-Furman O. Protein language models can capture protein quaternary state. BMC Bioinformatics 2023; 24:433. [PMID: 37964216 PMCID: PMC10647083 DOI: 10.1186/s12859-023-05549-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 10/27/2023] [Indexed: 11/16/2023] Open
Abstract
BACKGROUND Determining a protein's quaternary state, i.e. the number of monomers in a functional unit, is a critical step in protein characterization. Many proteins form multimers for their activity, and over 50% are estimated to naturally form homomultimers. Experimental quaternary state determination can be challenging and require extensive work. To complement these efforts, a number of computational tools have been developed for quaternary state prediction, often utilizing experimentally validated structural information. Recently, dramatic advances have been made in the field of deep learning for predicting protein structure and other characteristics. Protein language models, such as ESM-2, that apply computational natural-language models to proteins successfully capture secondary structure, protein cell localization and other characteristics, from a single sequence. Here we hypothesize that information about the protein quaternary state may be contained within protein sequences as well, allowing us to benefit from these novel approaches in the context of quaternary state prediction. RESULTS We generated ESM-2 embeddings for a large dataset of proteins with quaternary state labels from the curated QSbio dataset. We trained a model for quaternary state classification and assessed it on a non-overlapping set of distinct folds (ECOD family level). Our model, named QUEEN (QUaternary state prediction using dEEp learNing), performs worse than approaches that include information from solved crystal structures. However, it successfully learned to distinguish multimers from monomers, and predicts the specific quaternary state with moderate success, better than simple sequence similarity-based annotation transfer. Our results demonstrate that complex, quaternary state related information is included in such embeddings. CONCLUSIONS QUEEN is the first to investigate the power of embeddings for the prediction of the quaternary state of proteins. As such, it lays out strengths as well as limitations of a sequence-based protein language model approach, compared to structure-based approaches. Since it does not require any structural information and is fast, we anticipate that it will be of wide use both for in-depth investigation of specific systems, as well as for studies of large sets of protein sequences. A simple colab implementation is available at: https://colab. RESEARCH google.com/github/Furman-Lab/QUEEN/blob/main/QUEEN_prediction_notebook.ipynb .
Collapse
Affiliation(s)
- Orly Avraham
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Tomer Tsaban
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ziv Ben-Aharon
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Linoy Tsaban
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
- The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Faculty of Medicine, Institute for Biomedical Research Israel-Canada, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
10
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A Suite of Designed Protein Cages Using Machine Learning Algorithms and Protein Fragment-Based Protocols. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561468. [PMID: 37873110 PMCID: PMC10592684 DOI: 10.1101/2023.10.09.561468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, while methods for their creation remain challenging and unpredictable. In the present study, we apply new computational approaches to design a suite of new tetrahedrally symmetric, self-assembling protein cages. For the generation of docked poses, we emphasize a protein fragment-based approach, while for de novo interface design, a comparison of computational protocols highlights the power and increased experimental success achieved using the machine learning program ProteinMPNN. In relating information from docking and design, we observe that agreement between fragment-based sequence preferences and ProteinMPNN sequence inference correlates with experimental success. Additional insights for designing polar interactions are highlighted by experimentally testing larger and more polar interfaces. In all, using X-ray crystallography and cryo-EM, we report five structures for seven protein cages, with atomic resolution in the best case reaching 2.0 Å. We also report structures of two incompletely assembled protein cages, providing unique insights into one type of assembly failure. The new set of designed cages and their structures add substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Michael R. Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Mark A. Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Todd O. Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| |
Collapse
|
11
|
Schweke H, Xu Q, Tauriello G, Pantolini L, Schwede T, Cazals F, Lhéritier A, Fernandez-Recio J, Rodríguez-Lumbreras LÁ, Schueler-Furman O, Varga JK, Jiménez-García B, Réau MF, Bonvin A, Savojardo C, Martelli PL, Casadio R, Tubiana J, Wolfson H, Oliva R, Barradas-Bautista D, Ricciardelli T, Cavallo L, Venclovas Č, Olechnovič K, Guerois R, Andreani J, Martin J, Wang X, Kihara D, Marchand A, Correia B, Zou X, Dey S, Dunbrack R, Levy E, Wodak S. Discriminating physiological from non-physiological interfaces in structures of protein complexes: A community-wide study. Proteomics 2023; 23:e2200323. [PMID: 37365936 PMCID: PMC10937251 DOI: 10.1002/pmic.202200323] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 05/11/2023] [Accepted: 05/11/2023] [Indexed: 06/28/2023]
Abstract
Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Julia K. Varga
- Hebrew University of Jerusalem Institute for Medical Research Israel-Canada
| | | | | | | | | | | | | | - Jérôme Tubiana
- Tel Aviv University Blavatnik School of Computer Science
| | - Haim Wolfson
- Tel Aviv University Blavatnik School of Computer Science
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Xiaoqin Zou
- Dalton Cardiovascular Research Center, Institute for Data Science and Informatics, University of Missouri
| | | | | | | | | |
Collapse
|
12
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
13
|
Xu Q, Dunbrack R. The protein common assembly database (ProtCAD)-a comprehensive structural resource of protein complexes. Nucleic Acids Res 2023; 51:D466-D478. [PMID: 36300618 PMCID: PMC9825537 DOI: 10.1093/nar/gkac937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 10/04/2022] [Accepted: 10/11/2022] [Indexed: 01/29/2023] Open
Abstract
Proteins often act through oligomeric interactions with other proteins. X-ray crystallography and cryo-electron microscopy provide detailed information on the structures of biological assemblies, defined as the most likely biologically relevant structures derived from experimental data. In crystal structures, the most relevant assembly may be ambiguously determined, since multiple assemblies observed in the crystal lattice may be plausible. It is estimated that 10-15% of PDB entries may have incorrect or ambiguous assembly annotations. Accurate assemblies are required for understanding functional data and training of deep learning methods for predicting assembly structures. As with any other kind of biological data, replication via multiple independent experiments provides important validation for the determination of biological assembly structures. Here we present the Protein Common Assembly Database (ProtCAD), which presents clusters of protein assembly structures observed in independent structure determinations of homologous proteins in the Protein Data Bank (PDB). ProtCAD is searchable by PDB entry, UniProt identifiers, or Pfam domain designations and provides downloads of coordinate files, PyMol scripts, and publicly available assembly annotations for each cluster of assemblies. About 60% of PDB entries contain assemblies in clusters of at least 2 independent experiments. All clusters and coordinates are available on ProtCAD web site (http://dunbrack2.fccc.edu/protcad).
Collapse
Affiliation(s)
- Qifang Xu
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Roland L Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| |
Collapse
|
14
|
Pillai AS, Hochberg GK, Thornton JW. Simple mechanisms for the evolution of protein complexity. Protein Sci 2022; 31:e4449. [PMID: 36107026 PMCID: PMC9601886 DOI: 10.1002/pro.4449] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/01/2022] [Accepted: 09/10/2022] [Indexed: 01/26/2023]
Abstract
Proteins are tiny models of biological complexity: specific interactions among their many amino acids cause proteins to fold into elaborate structures, assemble with other proteins into higher-order complexes, and change their functions and structures upon binding other molecules. These complex features are classically thought to evolve via long and gradual trajectories driven by persistent natural selection. But a growing body of evidence from biochemistry, protein engineering, and molecular evolution shows that naturally occurring proteins often exist at or near the genetic edge of multimerization, allostery, and even new folds, so just one or a few mutations can trigger acquisition of these properties. These sudden transitions can occur because many of the physical properties that underlie these features are present in simpler proteins as fortuitous by-products of their architecture. Moreover, complex features of proteins can be encoded by huge arrays of sequences, so they are accessible from many different starting points via many possible paths. Because the bridges to these features are both short and numerous, random chance can join selection as a key factor in explaining the evolution of molecular complexity.
Collapse
Affiliation(s)
- Arvind S. Pillai
- Department of Ecology and EvolutionUniversity of ChicagoChicagoIllinoisUSA
- Institute for Protein DesignUniversity of WashingtonSeattleWAUSA
| | - Georg K.A. Hochberg
- Max Planck Institute for Terrestrial MicrobiologyMarburgGermany
- Department of Chemistry, Center for Synthetic MicrobiologyPhilipps University MarburgMarburgGermany
| | - Joseph W. Thornton
- Department of Ecology and EvolutionUniversity of ChicagoChicagoIllinoisUSA
- Departments of Human Genetics and Ecology and EvolutionUniversity of ChicagoChicagoIllinoisUSA
| |
Collapse
|
15
|
Marciano S, Dey D, Listov D, Fleishman SJ, Sonn-Segev A, Mertens H, Busch F, Kim Y, Harvey SR, Wysocki VH, Schreiber G. Protein quaternary structures in solution are a mixture of multiple forms. Chem Sci 2022; 13:11680-11695. [PMID: 36320402 PMCID: PMC9555727 DOI: 10.1039/d2sc02794a] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 09/21/2022] [Indexed: 11/21/2022] Open
Abstract
Over half the proteins in the E. coli cytoplasm form homo or hetero-oligomeric structures. Experimentally determined structures are often considered in determining a protein's oligomeric state, but static structures miss the dynamic equilibrium between different quaternary forms. The problem is exacerbated in homo-oligomers, where the oligomeric states are challenging to characterize. Here, we re-evaluated the oligomeric state of 17 different bacterial proteins across a broad range of protein concentrations and solutions by native mass spectrometry (MS), mass photometry (MP), size exclusion chromatography (SEC), and small-angle X-ray scattering (SAXS), finding that most exhibit several oligomeric states. Surprisingly, some proteins did not show mass-action driven equilibrium between the oligomeric states. For approximately half the proteins, the predicted oligomeric forms described in publicly available databases underestimated the complexity of protein quaternary structures in solution. Conversely, AlphaFold multimer provided an accurate description of the potential multimeric states for most proteins, suggesting that it could help resolve uncertainties on the solution state of many proteins.
Collapse
Affiliation(s)
- Shir Marciano
- Department of Biomolecular Sciences, Weizmann Institute of Science Rehovot Israel
| | - Debabrata Dey
- Department of Biomolecular Sciences, Weizmann Institute of Science Rehovot Israel
| | - Dina Listov
- Department of Biomolecular Sciences, Weizmann Institute of Science Rehovot Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science Rehovot Israel
| | - Adar Sonn-Segev
- Refeyn Ltd 1 Electric Avenue, Ferry Hinksey Road Oxford OX2 0BY UK
| | - Haydyn Mertens
- Hamburg Outstation, European Molecular Biology Laboratory Notkestrasse 85 Hamburg 22607 Germany
| | - Florian Busch
- Department of Chemistry and Biochemistry, Resource for Native Mass Spectrometry Guided Structural Biology, The Ohio State University Columbus OH 43210 USA
| | - Yongseok Kim
- Department of Chemistry and Biochemistry, Resource for Native Mass Spectrometry Guided Structural Biology, The Ohio State University Columbus OH 43210 USA
| | - Sophie R Harvey
- Department of Chemistry and Biochemistry, Resource for Native Mass Spectrometry Guided Structural Biology, The Ohio State University Columbus OH 43210 USA
| | - Vicki H Wysocki
- Department of Chemistry and Biochemistry, Resource for Native Mass Spectrometry Guided Structural Biology, The Ohio State University Columbus OH 43210 USA
| | - Gideon Schreiber
- Department of Biomolecular Sciences, Weizmann Institute of Science Rehovot Israel
| |
Collapse
|
16
|
Iida T, Hamatani S, Takagi Y, Fujiwara K, Tamura M, Tokonami S. Attogram-level light-induced antigen-antibody binding confined in microflow. Commun Biol 2022; 5:1053. [PMID: 36203087 PMCID: PMC9537419 DOI: 10.1038/s42003-022-03946-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 09/02/2022] [Indexed: 11/28/2022] Open
Abstract
The analysis of trace amounts of proteins based on immunoassays and other methods is essential for the early diagnosis of various diseases such as cancer, dementia, and microbial infections. Here, we propose a light-induced acceleration of antigen-antibody reaction of attogram-level proteins at the solid-liquid interface by tuning the laser irradiation area comparable to the microscale confinement geometry for enhancing the collisional probability of target molecules and probe particles with optical force and fluidic pressure. This principle was applied to achieve a 102-fold higher sensitivity and ultrafast specific detection in comparison with conventional protein detection methods (a few hours) by omitting any pretreatment procedures; 47-750 ag of target proteins were detected in 300 nL of sample after 3 minutes of laser irradiation. Our findings can promote the development of proteomics and innovative platforms for high-throughput bio-analyses under the control of a variety of biochemical reactions.
Collapse
Affiliation(s)
- Takuya Iida
- Department of Physical Science, Graduate School of Science, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
- Department of Physics, Graduate School of Science, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
| | - Shota Hamatani
- Department of Physical Science, Graduate School of Science, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Applied Chemistry, Graduate School of Engineering, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Physics, Graduate School of Science, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Applied Chemistry, Graduate School of Engineering, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
| | - Yumiko Takagi
- Department of Physical Science, Graduate School of Science, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Physics, Graduate School of Science, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
| | - Kana Fujiwara
- Department of Physical Science, Graduate School of Science, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Applied Chemistry, Graduate School of Engineering, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Physics, Graduate School of Science, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Department of Applied Chemistry, Graduate School of Engineering, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
| | - Mamoru Tamura
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
- Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama-cho, Toyonaka, Osaka, 560-8531, Japan
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan
| | - Shiho Tokonami
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
- Department of Applied Chemistry, Graduate School of Engineering, Osaka Prefecture University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
- Research Institute for Light-induced Acceleration System (RILACS), Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
- Department of Applied Chemistry, Graduate School of Engineering, Osaka Metropolitan University, 1-2 Gakuencho, Nakaku, Sakai, Osaka, 599-8570, Japan.
| |
Collapse
|
17
|
Gurusinghe SN, Oppenheimer B, Shifman JM. Cold spots are universal in protein-protein interactions. Protein Sci 2022; 31:e4435. [PMID: 36173158 PMCID: PMC9490803 DOI: 10.1002/pro.4435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 07/22/2022] [Accepted: 08/26/2022] [Indexed: 12/02/2022]
Abstract
Proteins interact with each other through binding interfaces that differ greatly in size and physico-chemical properties. Within the binding interface, a few residues called hot spots contribute the majority of the binding free energy and are hence irreplaceable. In contrast, cold spots are occupied by suboptimal amino acids, providing possibility for affinity enhancement through mutations. In this study, we identify cold spots due to cavities and unfavorable charge interactions in multiple protein-protein interactions (PPIs). For our cold spot analysis, we first use a small affinity database of PPIs with known structures and affinities and then expand our search to nearly 4000 homo- and heterodimers in the Protein Data Bank (PDB). We observe that cold spots due to cavities are present in nearly all PPIs unrelated to their binding affinity, while unfavorable charge interactions are relatively rare. We also find that most cold spots are located in the periphery of the binding interface, with high-affinity complexes showing fewer centrally located colds spots than low-affinity complexes. A larger number of cold spots is also found in non-cognate interactions compared to their cognate counterparts. Furthermore, our analysis reveals that cold spots are more frequent in homo-dimeric complexes compared to hetero-complexes, likely due to symmetry constraints imposed on sequences of homodimers. Finally, we find that glycines, glutamates, and arginines are the most frequent amino acids appearing at cold spot positions. Our analysis emphasizes the importance of cold spot positions to protein evolution and facilitates protein engineering studies directed at enhancing binding affinity and specificity in a wide range of applications.
Collapse
Affiliation(s)
- Sagara N.S. Gurusinghe
- Department of Biological ChemistryThe Alexander Silberman Institute of Life Sciences, The Hebrew University of JerusalemJerusalemIsrael
| | - Ben Oppenheimer
- Department of Biological ChemistryThe Alexander Silberman Institute of Life Sciences, The Hebrew University of JerusalemJerusalemIsrael
| | - Julia M. Shifman
- Department of Biological ChemistryThe Alexander Silberman Institute of Life Sciences, The Hebrew University of JerusalemJerusalemIsrael
| |
Collapse
|
18
|
Escobedo N, Tunque Cahui RR, Caruso G, García Ríos E, Hirsh L, Monzon AM, Parisi G, Palopoli N. CoDNaS-Q: a database of conformational diversity of the native state of proteins with quaternary structure. Bioinformatics 2022; 38:4959-4961. [DOI: 10.1093/bioinformatics/btac627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 07/03/2022] [Accepted: 09/15/2022] [Indexed: 11/15/2022] Open
Abstract
Abstract
Summary
A collection of conformers that exist in a dynamical equilibrium defines the native state of a protein. The structural differences between them describe their conformational diversity, a defining characteristic of the protein with an essential role in multiple cellular processes. Since most proteins carry out their functions by assembling into complexes, we have developed CoDNaS-Q, the first online resource to explore conformational diversity in homooligomeric proteins. It features a curated collection of redundant protein structures with known quaternary structure. CoDNaS-Q integrates relevant annotations that allow researchers to identify and explore the extent and possible reasons of conformational diversity in homooligomeric protein complexes.
Availability and Implementation
CoDNaS-Q is freely accessible at http://ufq.unq.edu.ar/codnasq/ or https://codnas-q.bioinformatica.org/home. The data can be retrieved from the website. The source code of the database can be downloaded from https://github.com/SfrRonaldo/codnas-q.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas , Buenos Aires, Argentina
| | | | - Gastón Caruso
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
| | - Emilio García Ríos
- Pontificia Universidad Católica del Perú Departamento de Ingeniería, , Lima, Perú
| | - Layla Hirsh
- Pontificia Universidad Católica del Perú Departamento de Ingeniería, , Lima, Perú
| | | | - Gustavo Parisi
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas , Buenos Aires, Argentina
| | - Nicolas Palopoli
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas , Buenos Aires, Argentina
| |
Collapse
|
19
|
Liu AK, Pereira JH, Kehl AJ, Rosenberg DJ, Orr DJ, Chu SKS, Banda DM, Hammel M, Adams PD, Siegel JB, Shih PM. Structural plasticity enables evolution and innovation of RuBisCO assemblies. SCIENCE ADVANCES 2022; 8:eadc9440. [PMID: 36026446 PMCID: PMC9417184 DOI: 10.1126/sciadv.adc9440] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 07/13/2022] [Indexed: 06/15/2023]
Abstract
Oligomerization is a core structural feature that defines the form and function of many proteins. Most proteins form molecular complexes; however, there remains a dearth of diversity-driven structural studies investigating the evolutionary trajectory of these assemblies. Ribulose-1,5-bisphosphate carboxylase-oxygenase (RuBisCO) is one such enzyme that adopts multiple assemblies, although the origins and distribution of its different oligomeric states remain cryptic. Here, we retrace the evolution of ancestral and extant form II RuBisCOs, revealing a complex and diverse history of oligomerization. We structurally characterize a newly discovered tetrameric RuBisCO, elucidating how solvent-exposed surfaces can readily adopt new interactions to interconvert or give rise to new oligomeric states. We further use these principles to engineer and demonstrate how changes in oligomerization can be mediated by relatively few mutations. Our findings yield insight into how structural plasticity may give rise to new oligomeric states.
Collapse
Affiliation(s)
- Albert K. Liu
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Biochemistry, Molecular, Cellular and Developmental Biology Graduate Group, University of California, Davis, Davis, CA 95616, USA
| | - Jose H. Pereira
- Technology Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Alexander J. Kehl
- Biophysics Graduate Group, University of California, Davis, Davis, CA, USA
| | - Daniel J. Rosenberg
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Graduate Group in Biophysics, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Douglas J. Orr
- Lancaster Environment Centre, Lancaster University, Lancaster LA1 4YQ, UK
| | - Simon K. S. Chu
- Biophysics Graduate Group, University of California, Davis, Davis, CA, USA
| | - Douglas M. Banda
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Michal Hammel
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Paul D. Adams
- Technology Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Justin B. Siegel
- Genome Center, University of California, Davis, Davis, CA 95616, USA
- Chemistry Department, University of California, Davis, Davis, CA 95616, USA
- Department of Biochemistry and Molecular Medicine, University of California, Sacramento, Sacramento, CA 95616, USA
| | - Patrick M. Shih
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608, USA
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
20
|
Mallik S, Tawfik DS, Levy ED. How gene duplication diversifies the landscape of protein oligomeric state and function. Curr Opin Genet Dev 2022; 76:101966. [PMID: 36007298 PMCID: PMC9548406 DOI: 10.1016/j.gde.2022.101966] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 07/01/2022] [Accepted: 07/08/2022] [Indexed: 11/29/2022]
Abstract
Oligomeric proteins are central to cellular life and the duplication and divergence of their genes is a key driver of evolutionary innovations. The duplication of a gene coding for an oligomeric protein has numerous possible outcomes, which motivates questions on the relationship between structural and functional divergence. How do protein oligomeric states diversify after gene duplication? In the simple case of duplication of a homo-oligomeric protein gene, what properties can influence the fate of descendant paralogs toward forming independent homomers or maintaining their interaction as a complex? Furthermore, how are functional innovations associated with the diversification of oligomeric states? Here, we review recent literature and present specific examples in an attempt to illustrate and answer these questions.
Collapse
Affiliation(s)
- Saurav Mallik
- Department of Chemical and Structural Biology, The Weizmann Institute of Science, Rehovot 7610001, Israel; Department of Biomolecular Sciences, The Weizmann Institute of Science, Rehovot 7610001, Israel.
| | - Dan S Tawfik
- Department of Biomolecular Sciences, The Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Emmanuel D Levy
- Department of Chemical and Structural Biology, The Weizmann Institute of Science, Rehovot 7610001, Israel.
| |
Collapse
|
21
|
Pollet L, Lambourne L, Xia Y. Structural Determinants of Yeast Protein-Protein Interaction Interface Evolution at the Residue Level. J Mol Biol 2022; 434:167750. [PMID: 35850298 DOI: 10.1016/j.jmb.2022.167750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 06/09/2022] [Accepted: 07/12/2022] [Indexed: 12/01/2022]
Abstract
Interfaces of contact between proteins play important roles in determining the proper structure and function of protein-protein interactions (PPIs). Therefore, to fully understand PPIs, we need to better understand the evolutionary design principles of PPI interfaces. Previous studies have uncovered that interfacial sites are more evolutionarily conserved than other surface protein sites. Yet, little is known about the nature and relative importance of evolutionary constraints in PPI interfaces. Here, we explore constraints imposed by the structure of the microenvironment surrounding interfacial residues on residue evolutionary rate using a large dataset of over 700 structural models of baker's yeast PPIs. We find that interfacial residues are, on average, systematically more conserved than all other residues with a similar degree of total burial as measured by relative solvent accessibility (RSA). Besides, we find that RSA of the residue when the PPI is formed is a better predictor of interfacial residue evolutionary rate than RSA in the monomer state. Furthermore, we investigate four structure-based measures of residue interfacial involvement, including change in RSA upon binding (ΔRSA), number of residue-residue contacts across the interface, and distance from the center or the periphery of the interface. Integrated modeling for evolutionary rate prediction in interfaces shows that ΔRSA plays a dominant role among the four measures of interfacial involvement, with minor, but independent contributions from other measures. These results yield insight into the evolutionary design of interfaces, improving our understanding of the role that structure plays in the molecular evolution of PPIs at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Luke Lambourne
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA; Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA; Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
22
|
Villegas JA, Levy ED. A unified statistical potential reveals that amino acid stickiness governs nonspecific recruitment of client proteins into condensates. Protein Sci 2022; 31:e4361. [PMID: 35762716 PMCID: PMC9207749 DOI: 10.1002/pro.4361] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 05/06/2022] [Accepted: 05/10/2022] [Indexed: 11/07/2022]
Abstract
Membraneless organelles are cellular compartments that form by liquid-liquid phase separation of one or more components. Other molecules, such as proteins and nucleic acids, will distribute between the cytoplasm and the liquid compartment in accordance with the thermodynamic drive to lower the free energy of the system. The resulting distribution colocalizes molecular species to carry out a diversity of functions. Two factors could drive this partitioning: the difference in solvation between the dilute versus dense phase and intermolecular interactions between the client and scaffold proteins. Here, we develop a set of knowledge-based potentials that allow for the direct comparison between stickiness, which is dominated by desolvation energy, and pairwise residue contact propensity terms. We use these scales to examine experimental data from two systems: protein cargo dissolving within phase-separated droplets made from FG repeat proteins of the nuclear pore complex and client proteins dissolving within phase-separated FUS droplets. These analyses reveal a close agreement between the stickiness of the client proteins and the experimentally determined values of the partition coefficients (R > 0.9), while pairwise residue contact propensities between client and scaffold show weaker correlations. Hence, the stickiness of client proteins is sufficient to explain their differential partitioning within these two phase-separated systems without taking into account the composition of the condensate. This result implies that selective trafficking of client proteins to distinct membraneless organelles requires recognition elements beyond the client sequence composition. STATEMENT: Empirical potentials for amino acid stickiness and pairwise residue contact propensities are derived. These scales are unique in that they enable direct comparison of desolvation versus contact terms. We find that partitioning of a client protein to a condensate is best explained by amino acid stickiness.
Collapse
Affiliation(s)
- José A. Villegas
- Department of Chemical and Structural BiologyWeizmann Institute of ScienceRehovotIsrael
- Present address:
Department of Pharmaceutical SciencesCollege of Pharmacy, University of Illinois ChicagoChicagoIL60612
| | - Emmanuel D. Levy
- Department of Chemical and Structural BiologyWeizmann Institute of ScienceRehovotIsrael
| |
Collapse
|
23
|
ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nat Methods 2022; 19:730-739. [DOI: 10.1038/s41592-022-01490-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2021] [Accepted: 04/12/2022] [Indexed: 11/08/2022]
|
24
|
Wright AJ, Orlic-Milacic M, Rothfels K, Weiser J, Trinh QM, Jassal B, Haw RA, Stein LD. Evaluating the predictive accuracy of curated biological pathways in a public knowledgebase. Database (Oxford) 2022; 2022:6555052. [PMID: 35348650 PMCID: PMC9216552 DOI: 10.1093/database/baac009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 01/04/2022] [Accepted: 02/15/2022] [Indexed: 11/14/2022]
Abstract
Abstract Reactome is a database of human biological pathways manually curated from the primary literature and peer-reviewed by experts. To evaluate the utility of Reactome pathways for predicting functional consequences of genetic perturbations, we compared predictions of perturbation effects based on Reactome pathways against published empirical observations. Ten cancer-relevant Reactome pathways, representing diverse biological processes such as signal transduction, cell division, DNA repair and transcriptional regulation, were selected for testing. For each pathway, root input nodes and key pathway outputs were defined. We then used pathway-diagram-derived logic graphs to predict, either by inspection by biocurators or using a novel algorithm MP-BioPath, the effects of bidirectional perturbations (upregulation/activation or downregulation/inhibition) of single root inputs on the status of key outputs. These predictions were then compared to published empirical tests. In total, 4968 test cases were analyzed across 10 pathways, of which 847 were supported by published empirical findings. Out of the 847 test cases, curators’ predictions agreed with the experimental evidence in 670 and disagreed in 177 cases, resulting in ∼81% overall accuracy. MP-BioPath predictions agreed with experimental evidence for 625 and disagreed for 222 test cases, resulting in ∼75% overall accuracy. The expected accuracy of random guessing was 33%. Per-pathway accuracy did not correlate with the number of pathway edges nor the number of pathway nodes but varied across pathways, ranging from 56% (curator)/44% (MP-BioPath) for ‘Mitotic G1 phase and G1/S transition’ to 100% (curator)/94% (MP-BioPath) for ‘RAF/MAP kinase cascade’. This study highlights the potential of pathway databases such as Reactome in modeling genetic perturbations, promoting standardization of experimental pathway activity readout and supporting hypothesis-driven research by revealing relationships between pathway inputs and outputs that have not yet been directly experimentally tested. Database URL www.reactome.org
Collapse
Affiliation(s)
- Adam J Wright
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Marija Orlic-Milacic
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Karen Rothfels
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Joel Weiser
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Quang M Trinh
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Bijay Jassal
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Robin A Haw
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
| | - Lincoln D Stein
- Adaptive Oncology Program, Ontario Institute for Cancer Research, 661 University Avenue Suite 500, Toronto, ON M5G 0A3, Canada
- Department of Molecular Genetics, University of Toronto, Room 4396, Medical Sciences Building, 1 King’s College Circle, Toronto, ON M5S 1A1, Canada
| |
Collapse
|
25
|
Tupiņa D, Krah A, Marzinek JK, Zuzic L, Moverley AA, Constantinidou C, Bond PJ. Bridging the N-terminal and middle domains in FliG of the flagellar rotor. Curr Res Struct Biol 2022; 4:59-67. [PMID: 35345452 PMCID: PMC8956890 DOI: 10.1016/j.crstbi.2022.02.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 01/28/2022] [Accepted: 02/28/2022] [Indexed: 11/27/2022] Open
Abstract
Flagella are necessary for bacterial movement and contribute to various aspects of virulence. They are complex cylindrical structures built of multiple molecular rings with self-assembly properties. The flagellar rotor is composed of the MS-ring and the C-ring. The FliG protein of the C-ring is central to flagellar assembly and function due to its roles in linking the C-ring with the MS-ring and in torque transmission from stator to rotor. No high-resolution structure of an assembled C-ring has been resolved to date, and the conformation adopted by FliG within the ring is unclear due to variations in available crystallographic data. Here, we use molecular dynamics (MD) simulations to study the conformation and dynamics of FliG in different states of assembly, including both in physiologically relevant and crystallographic lattice environments. We conclude that the linker between the FliG N-terminal and middle domain likely adopts an extended helical conformation in vivo, in contrast with the contracted conformation observed in some previous X-ray studies. We further support our findings with integrative model building of full-length FliG and a FliG ring model that is compatible with cryo-electron tomography (cryo-ET) and electron microscopy (EM) densities of the C-ring. Collectively, our study contributes to a better mechanistic understanding of the flagellar rotor assembly and its function.
Collapse
|
26
|
Tianma Formula Alleviates Dementia via ACER2-Mediated Sphingolipid Signaling Pathway Involving A β. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2022; 2021:6029237. [PMID: 35069753 PMCID: PMC8357478 DOI: 10.1155/2021/6029237] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 07/06/2021] [Indexed: 12/14/2022]
Abstract
Objective To reveal the molecular mechanism of the antagonistic effect of traditional Chinese medicine Tianma formula (TF) on dementia including vascular dementia (VaD) and Alzheimer's disease (AD) and to provide a scientific basis for the study of traditional Chinese medicine for prevention and treatment of dementia. Method The TF was derived from the concerted application of traditional Chinese medicine. We detected the pharmacological effect of TF in VaD rats. The molecular mechanism of TF was examined by APP/PS1 mice in vivo, Caenorhabditis elegans (C. elegans) in vitro, ELISA, pathological assay via HE staining, and transcriptome. Based on RNA-seq analysis in VaD rats, the differentially expressed genes (DEGs) were identified and then verified by quantitative PCR (qPCR) and ELISA. The molecular mechanisms of TF on dementia were further confirmed by network pharmacology and molecular docking finally. Results The Morris water maze showed that TF could improve the cognitive memory function of the VaD rats. The ELISA and histological analysis suggested that TF could protect the hippocampus via reducing tau and IL-6 levels and increasing SYN expression. Meanwhile, it could protect the neurological function by alleviating Aβ deposition in APP/PS1 mice and C. elegans. In the RNA-seq analysis, 3 sphingolipid metabolism pathway-related genes, ADORA3, FCER1G, and ACER2, and another 5 nerve-related genes in 45 key DEGs were identified, so it indicated that the protection mechanism of TF was mainly associated with the sphingolipid metabolism pathway. In the qPCR assay, compared with the model group, the mRNA expressions of the 8 genes mentioned above were upregulated, and these results were consistent with RNA-seq. The protein and mRNA levels of ACER2 were also upregulated. Also, the results of network pharmacology analysis and molecular docking were consistent with those of RNA-seq analysis. Conclusion TF alleviates dementia by reducing Aβ deposition via the ACER2-mediated sphingolipid signaling pathway.
Collapse
|
27
|
Dey S, Prilusky J, Levy ED. QSalignWeb: A Server to Predict and Analyze Protein Quaternary Structure. Front Mol Biosci 2022; 8:787510. [PMID: 35071324 PMCID: PMC8769216 DOI: 10.3389/fmolb.2021.787510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 12/02/2021] [Indexed: 11/16/2022] Open
Abstract
The identification of physiologically relevant quaternary structures (QSs) in crystal lattices is challenging. To predict the physiological relevance of a particular QS, QSalign searches for homologous structures in which subunits interact in the same geometry. This approach proved accurate but was limited to structures already present in the Protein Data Bank (PDB). Here, we introduce a webserver (www.QSalign.org) allowing users to submit homo-oligomeric structures of their choice to the QSalign pipeline. Given a user-uploaded structure, the sequence is extracted and used to search homologs based on sequence similarity and PFAM domain architecture. If structural conservation is detected between a homolog and the user-uploaded QS, physiological relevance is inferred. The web server also generates alternative QSs with PISA and processes them the same way as the query submitted to widen the predictions. The result page also shows representative QSs in the protein family of the query, which is informative if no QS conservation was detected or if the protein appears monomeric. These representative QSs can also serve as a starting point for homology modeling.
Collapse
Affiliation(s)
- Sucharita Dey
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Jaime Prilusky
- Department of Life Sciences and Core Facilities, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D. Levy
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
28
|
Dai Y, Li Y, Wang L, Peng Z, Yang J. On Monomeric and Multimeric Structures-Based Protein-Ligand Interactions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:569-574. [PMID: 32750865 DOI: 10.1109/tcbb.2020.3002776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Many ligands simultaneously interact with multiple protein chains in quaternary structure (QS). However, a significant number of previous studies on template-based modeling of protein-ligand interactions were based on monomeric structure (MS), which may suffer from incomplete binding information. The defects of using MS rather than QS have not been systematically studied before. In this work, based on molecular docking experiments and binding free energy estimations, we performed a large-scale comparison of the protein-ligand interactions in both forms of structures. We found that 1) about 18.6 percent biologically relevant ligands bind multiple chains in QS simultaneously. 2) For more than 95 percent complexes with multiple chains involved in the interactions, the binding free energy is lower for the QS form than the MS form. 3) For over 70 percent complexes with multi-chain binding pockets, docking with QS yields more accurate ligand conformations than with MS. While for about 1.82 percent complexes, accurate docking conformations were obtained by MS. Based on this work, it is encouraged to make use of QS rather than MS in future studies on protein-ligand interactions.
Collapse
|
29
|
Torres PHM, Rossi AD, Blundell TL. ProtCHOIR: a tool for proteome-scale generation of homo-oligomers. Brief Bioinform 2021; 22:bbab182. [PMID: 34015821 PMCID: PMC8574958 DOI: 10.1093/bib/bbab182] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 04/04/2021] [Accepted: 04/20/2021] [Indexed: 01/10/2023] Open
Abstract
The rapid developments in gene sequencing technologies achieved in the recent decades, along with the expansion of knowledge on the three-dimensional structures of proteins, have enabled the construction of proteome-scale databases of protein models such as the Genome3D and ModBase. Nevertheless, although gene products are usually expressed as individual polypeptide chains, most biological processes are associated with either transient or stable oligomerisation. In the PDB databank, for example, ~40% of the deposited structures contain at least one homo-oligomeric interface. Unfortunately, databases of protein models are generally devoid of multimeric structures. To tackle this particular issue, we have developed ProtCHOIR, a tool that is able to generate homo-oligomeric structures in an automated fashion, providing detailed information for the input protein and output complex. ProtCHOIR requires input of either a sequence or a protomeric structure that is queried against a pre-constructed local database of homo-oligomeric structures, then extensively analyzed using well-established tools such as PSI-Blast, MAFFT, PISA and Molprobity. Finally, MODELLER is employed to achieve the construction of the homo-oligomers. The output complex is thoroughly analyzed taking into account its stereochemical quality, interfacial stabilities, hydrophobicity and conservation profile. All these data are then summarized in a user-friendly HTML report that can be saved or printed as a PDF file. The software is easily parallelizable and also outputs a comma-separated file with summary statistics that can straightforwardly be concatenated as a spreadsheet-like document for large-scale data analyses. As a proof-of-concept, we built oligomeric models for the Mabellini Mycobacterium abscessus structural proteome database. ProtCHOIR can be run as a web-service and the code can be obtained free-of-charge at http://lmdm.biof.ufrj.br/protchoir.
Collapse
|
30
|
PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry. Structure 2021; 29:1303-1311.e3. [PMID: 34520740 PMCID: PMC8575123 DOI: 10.1016/j.str.2021.07.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 03/22/2021] [Accepted: 07/23/2021] [Indexed: 11/21/2022]
Abstract
An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies.
Collapse
|
31
|
Gaber A, Pavšič M. Modeling and Structure Determination of Homo-Oligomeric Proteins: An Overview of Challenges and Current Approaches. Int J Mol Sci 2021; 22:9081. [PMID: 34445785 PMCID: PMC8396596 DOI: 10.3390/ijms22169081] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 08/20/2021] [Accepted: 08/20/2021] [Indexed: 12/12/2022] Open
Abstract
Protein homo-oligomerization is a very common phenomenon, and approximately half of proteins form homo-oligomeric assemblies composed of identical subunits. The vast majority of such assemblies possess internal symmetry which can be either exploited to help or poses challenges during structure determination. Moreover, aspects of symmetry are critical in the modeling of protein homo-oligomers either by docking or by homology-based approaches. Here, we first provide a brief overview of the nature of protein homo-oligomerization. Next, we describe how the symmetry of homo-oligomers is addressed by crystallographic and non-crystallographic symmetry operations, and how biologically relevant intermolecular interactions can be deciphered from the ordered array of molecules within protein crystals. Additionally, we describe the most important aspects of protein homo-oligomerization in structure determination by NMR. Finally, we give an overview of approaches aimed at modeling homo-oligomers using computational methods that specifically address their internal symmetry and allow the incorporation of other experimental data as spatial restraints to achieve higher model reliability.
Collapse
|
32
|
Bianchetti L, Sinar D, Depenveiller C, Dejaegere A. Insights into mineralocorticoid receptor homodimerization from a combined molecular modeling and bioinformatics study. Proteins 2021; 89:952-965. [PMID: 33713045 DOI: 10.1002/prot.26073] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 02/25/2021] [Accepted: 03/04/2021] [Indexed: 01/01/2023]
Abstract
In vertebrates, the mineralocorticoid receptor (MR) is a steroid-activated nuclear receptor (NR) that plays essential roles in water-electrolyte balance and blood pressure homeostasis. It belongs to the group of oxo-steroidian NRs, together with the glucocorticoid (GR), progesterone (PR), and androgen (AR) receptors. Classically, these oxo-steroidian NRs homodimerize and bind to specific genomic sequences to activate gene expression. NRs are multi-domain proteins, and dimerization is mediated by both the DNA (DBD) and ligand binding domains (LBDs), with the latter thought to provide the largest dimerization interface. However, at the structural level, the dimerization of oxo-steroidian receptors LBDs has remained largely a matter of debate and, despite their sequence homology, there is currently no consensus on a common homodimer assembly across the four receptors, that is, GR, PR, AR, and MR. Here, we examined all available MR LBD crystals using different computational methods (protein common interface database, proteins, interfaces, structures and assemblies, protein-protein interaction prediction by structural matching, and evolutionary protein-protein interface classifier, and the molecular mechanics Poisson-Boltzmann surface area method). A consensus is reached by all methods and singles out an interface mediated by helices H9, H10 and the C-terminal F domain as having characteristics of a biologically relevant assembly. Interestingly, a similar assembly was previously identified for GRα, MR closest homolog. Alternative architectures that were proposed for GRα were not observed for MR. These data call for further experimental investigations of oxo-steroid dimer architectures.
Collapse
Affiliation(s)
- Laurent Bianchetti
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| | - Deniz Sinar
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| | - Camille Depenveiller
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| | - Annick Dejaegere
- Laboratoire de Chimie Biophysique de la Signalisation de la Transcription, Département de Biologie Structurale Intégrative, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
- Centre National de la Recherche Scientifique UMR7104, Illkirch, France
- Institut National de la Santé et de la Recherche Médicale, Illkirch, France
- Ecole Supérieure de Biotechnologie de Strasbourg, Université de Strasbourg, Illkirch, France
| |
Collapse
|
33
|
Alai S, Gujar N, Joshi M, Gautam M, Gairola S. Pan-India novel coronavirus SARS-CoV-2 genomics and global diversity analysis in spike protein. Heliyon 2021; 7:e06564. [PMID: 33758785 PMCID: PMC7972664 DOI: 10.1016/j.heliyon.2021.e06564] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Revised: 01/30/2021] [Accepted: 03/16/2021] [Indexed: 02/06/2023] Open
Abstract
The mortality rates due to COVID-19 have been found disproportionate globally and are currently being researched. India mortality rate with a population of 1.3 billion people is relatively lowest to other countries with high infection rates. Genetic composition of circulating isolates continues to be a key determinant of virulence and pathogenesis. This study aimed to analyse the extent of divergence between genomes of Indian isolates (n = 2525 as compared to reference Wuhan-1 strain and isolates from countries showing higher fatality rates including France, Italy, Belgium, and the USA. The study also analyses the impact of key mutations on interactions with angiotensin converting enzyme 2 (ACE2) and panel of neutralizing monoclonal antibodies. Using 1,44,605 spike protein sequences, global prevalence of mutations in spike protein was observed. The study suggests that SARS-CoV-2 genomes from India share consensus with global trends with respect to D614G as most prevalent mutational event (81.66% among 2525 Indian isolates). Indian isolates did not reported prevalence of N439K mutation in receptor binding motif (RBM) as compared to global isolates (0.54%). Computational docking and molecular dynamics simulation analysis of N439K mutation with respect to ACE 2 binding and reactivity with RBM targeted antibodies viz., B38, BD23, CB6, P2B-F26 and EY6A suggests that variant have relatively higher affinity with ACE 2 receptor which may support higher infectivity. The study warrants large scale monitoring of Indian isolates as SARS-CoV-2 virus is expected to evolve and mutations may appear in unpredictable way.
Collapse
Affiliation(s)
- Shweta Alai
- Department of Health and Biological Sciences, Symbiosis International University, Pune, Maharashtra, 412115, India
| | - Nidhi Gujar
- Bioinformatics Centre, Savitribai Phule Pune University, Pune, Maharashtra, 411007, India
| | - Manali Joshi
- Bioinformatics Centre, Savitribai Phule Pune University, Pune, Maharashtra, 411007, India
| | - Manish Gautam
- Serum Institute of India Pvt Ltd, Pune, Maharashtra, 411028, India
| | - Sunil Gairola
- Serum Institute of India Pvt Ltd, Pune, Maharashtra, 411028, India
| |
Collapse
|
34
|
Laniado J, Meador K, Yeates TO. A fragment-based protein interface design algorithm for symmetric assemblies. Protein Eng Des Sel 2021; 34:gzab008. [PMID: 33955480 PMCID: PMC8101011 DOI: 10.1093/protein/gzab008] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 03/08/2021] [Indexed: 11/13/2022] Open
Abstract
Theoretical and experimental advances in protein engineering have led to the creation of precisely defined, novel protein assemblies of great size and complexity, with diverse applications. One powerful approach involves designing a new attachment or binding interface between two simpler symmetric oligomeric protein components. The required methods of design, which present both similarities and key differences compared to problems in protein docking, remain challenging and are not yet routine. With the aim of more fully enabling this emerging area of protein material engineering, we developed a computer program, nanohedra, to introduce two key advances. First, we encoded in the program the construction rules (i.e. the search space parameters) that underlie all possible symmetric material constructions. Second, we developed algorithms for rapidly identifying favorable docking/interface arrangements based on tabulations of empirical patterns of known protein fragment-pair associations. As a result, the candidate poses that nanohedra generates for subsequent amino acid interface design appear highly native-like (at the protein backbone level), while simultaneously conforming to the exacting requirements for symmetry-based assembly. A retrospective computational analysis of successful vs failed experimental studies supports the expectation that this should improve the success rate for this challenging area of protein engineering.
Collapse
Affiliation(s)
- Joshua Laniado
- UCLA Molecular Biology Institute, Los Angeles, CA 90095, USA
| | - Kyle Meador
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- UCLA Molecular Biology Institute, Los Angeles, CA 90095, USA
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095, USA
- UCLA DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| |
Collapse
|
35
|
Li X, Kou Z, Wang J. Manipulating Interfaces of Electrocatalysts Down to Atomic Scales: Fundamentals, Strategies, and Electrocatalytic Applications. SMALL METHODS 2021; 5:e2001010. [PMID: 34927897 DOI: 10.1002/smtd.202001010] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 12/03/2020] [Indexed: 05/03/2023]
Abstract
Raising electrocatalysis by rationally devising catalysts plays a core role in almost all renewable energy conversion and storage systems. The principal catalytic properties can be controlled and improved well by manipulation of interfaces, ascribed to the interactions among different components/players at the interfaces. In particular, manipulating interfaces down to atomic scales is becoming increasingly attractive, not only because those atoms at around the interface are the key players during electrocatalysis, but also, understandings on the atomic level electrocatalysis allow one to gain deep insights into the reaction mechanism. With the feature down-sizing to atomic scales, there is a timely need to redefine the interfaces, as some of them have gone beyond the conventionally perceived interfacial concept. In this overview, the key active players participating in the interfacial manipulation of electrocatalysts are examined, from a new angle of "atomic interface," including those individual atoms, defects, and their interactions, together with the essential characterization techniques for them. The specific approaches and pathways to engineer better atomic interfaces are investigated, and thus to enable the unique electrocatalysis for targeted applications. Looking beyond recent progress, the challenges and prospects of the atomic level interfacial engineering are also briefly visited.
Collapse
Affiliation(s)
- Xin Li
- Department of Materials Science and Engineering, National University of Singapore, Singapore, 117574, Singapore
| | - Zongkui Kou
- Department of Materials Science and Engineering, National University of Singapore, Singapore, 117574, Singapore
| | - John Wang
- Department of Materials Science and Engineering, National University of Singapore, Singapore, 117574, Singapore
| |
Collapse
|
36
|
A complete rule set for designing symmetry combination materials from protein molecules. Proc Natl Acad Sci U S A 2020; 117:31817-31823. [PMID: 33239442 DOI: 10.1073/pnas.2015183117] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Diverse efforts in protein engineering are beginning to produce novel kinds of symmetric self-assembling architectures, from protein cages to extended two-dimensional (2D) and three-dimensional (3D) crystalline arrays. Partial theoretical frameworks for creating symmetric protein materials have been introduced, but no complete system has been articulated. Only a minute fraction of the possible design space has been explored experimentally, in part because that space has not yet been described in theory. Here, in the form of a multiplication table, we lay out a complete rule set for materials that can be created by combining two chiral oligomeric components (e.g., proteins) in precise configurations. A unified system is described for parameterizing and searching the construction space for all such symmetry-combination materials (SCMs). In total, 124 distinct types of SCMs are identified, and then proven by computational construction. Mathematical properties, such as minimal ring or circuit size, are established for each case, enabling strategic predictions about potentially favorable design targets. The study lays out the theoretical landscape and detailed computational prescriptions for a rapidly growing area of protein-based nanotechnology, with numerous underlying connections to mathematical networks and chemical materials such as metal organic frameworks.
Collapse
|
37
|
Andreani J, Quignot C, Guerois R. Structural prediction of protein interactions and docking using conservation and coevolution. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1470] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Jessica Andreani
- Université Paris‐Saclay CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC) Gif‐sur‐Yvette France
| | - Chloé Quignot
- Université Paris‐Saclay CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC) Gif‐sur‐Yvette France
| | - Raphael Guerois
- Université Paris‐Saclay CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC) Gif‐sur‐Yvette France
| |
Collapse
|
38
|
Biological vs. Crystallographic Protein Interfaces: An Overview of Computational Approaches for Their Classification. CRYSTALS 2020. [DOI: 10.3390/cryst10020114] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Complexes between proteins are at the basis of almost every process in cells. Their study, from a structural perspective, has a pivotal role in understanding biological functions and, importantly, in drug development. X-ray crystallography represents the broadest source for the experimental structural characterization of protein-protein complexes. Correctly identifying the biologically relevant interface from the crystallographic ones is, however, not trivial and can be prone to errors. Over the past two decades, computational methodologies have been developed to study the differences of those interfaces and automatically classify them as biological or crystallographic. Overall, protein-protein interfaces show differences in terms of composition, energetics and evolutionary conservation between biological and crystallographic ones. Based on those observations, a number of computational methods have been developed for this classification problem, which can be grouped into three main categories: Energy-, empirical knowledge- and machine learning-based approaches. In this review, we give a comprehensive overview of the training datasets and methods so far implemented, providing useful links and a brief description of each method.
Collapse
|
39
|
Byska J, Jurcik A, Furmanova K, Kozlikova B, Palecek JJ. Visual Analysis of Protein-Protein Interaction Docking Models Using COZOID Tool. Methods Mol Biol 2020; 2074:81-94. [PMID: 31583632 DOI: 10.1007/978-1-4939-9873-9_7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Networks of protein-protein interactions (PPI) constitute either stable or transient complexes in every cell. Most of the cellular complexes keep their function, and therefore stay similar, during evolution. The evolutionary constraints preserve most cellular functions via preservation of protein structures and interactions. The evolutionary conservation information is utilized in template-based approaches, like protein structure modeling or docking. Here we use the combination of the template-free docking method with conservation-based selection of the best docking model using our newly developed COZOID tool.We describe a step-by-step protocol for visual selection of docking models, based on their similarity to the original protein complex structure. Using the COZOID tool, we first analyze contact zones of the original complex structure and select contact amino acids for docking restraints. Then we model and dock the homologous proteins. Finally, we utilize different analytical modes of our COZOID tool to select the docking models most similar to the original complex structure.
Collapse
Affiliation(s)
- Jan Byska
- Department of Informatics, University of Bergen, Bergen, Norway
- Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Adam Jurcik
- Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | | | | | - Jan J Palecek
- Faculty of Science, National Centre for Biomolecular Research, Masaryk University, Brno, Czech Republic.
- Mendel Centre for Plant Genomics and Proteomics, Central European Institute of Technology, Masaryk University, Brno, Czech Republic.
| |
Collapse
|
40
|
Marchant A, Cisneros AF, Dubé AK, Gagnon-Arsenault I, Ascencio D, Jain H, Aubé S, Eberlein C, Evans-Yamamoto D, Yachie N, Landry CR. The role of structural pleiotropy and regulatory evolution in the retention of heteromers of paralogs. eLife 2019; 8:46754. [PMID: 31454312 PMCID: PMC6711710 DOI: 10.7554/elife.46754] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Accepted: 08/11/2019] [Indexed: 01/07/2023] Open
Abstract
Gene duplication is a driver of the evolution of new functions. The duplication of genes encoding homomeric proteins leads to the formation of homomers and heteromers of paralogs, creating new complexes after a single duplication event. The loss of these heteromers may be required for the two paralogs to evolve independent functions. Using yeast as a model, we find that heteromerization is frequent among duplicated homomers and correlates with functional similarity between paralogs. Using in silico evolution, we show that for homomers and heteromers sharing binding interfaces, mutations in one paralog can have structural pleiotropic effects on both interactions, resulting in highly correlated responses of the complexes to selection. Therefore, heteromerization could be preserved indirectly due to selection for the maintenance of homomers, thus slowing down functional divergence between paralogs. We suggest that paralogs can overcome the obstacle of structural pleiotropy by regulatory evolution at the transcriptional and post-translational levels.
Collapse
Affiliation(s)
- Axelle Marchant
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Angel F Cisneros
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada
| | - Alexandre K Dubé
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Isabelle Gagnon-Arsenault
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Diana Ascencio
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Honey Jain
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Department of Biological Sciences, Birla Institute of Technology and Sciences, Pilani, India
| | - Simon Aubé
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada
| | - Chris Eberlein
- PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| | - Daniel Evans-Yamamoto
- Research Center for Advanced Science and Technology, University of Tokyo, Tokyo, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan.,Graduate School of Media and Governance, Keio University, Fujisawa, Japan
| | - Nozomu Yachie
- Research Center for Advanced Science and Technology, University of Tokyo, Tokyo, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan.,Graduate School of Media and Governance, Keio University, Fujisawa, Japan.,Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
| | - Christian R Landry
- Département de biochimie, de microbiologie et de bio-informatique, Université Laval, Québec, Canada.,PROTEO, le réseau québécois de recherche sur la fonction, la structure et l'ingénierie des protéines, Université Laval, Québec, Canada.,Centre de Recherche en Données Massives (CRDM), Université Laval, Québec, Canada.,Département de biologie, Université Laval, Québec, Canada
| |
Collapse
|
41
|
Developments in integrative modeling with dynamical interfaces. Curr Opin Struct Biol 2019; 56:11-17. [DOI: 10.1016/j.sbi.2018.10.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2018] [Revised: 10/26/2018] [Accepted: 10/27/2018] [Indexed: 11/19/2022]
|
42
|
Geometric description of self-interaction potential in symmetric protein complexes. Sci Data 2019; 6:64. [PMID: 31101822 PMCID: PMC6525250 DOI: 10.1038/s41597-019-0058-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 03/29/2019] [Indexed: 01/23/2023] Open
Abstract
Proteins can self-associate with copies of themselves to form symmetric complexes called homomers. Homomers are widespread in all kingdoms of life and allow for unique geometric and functional properties, as reflected in viral capsids or allostery. Once a protein forms a homomer, however, its internal symmetry can compound the effect of point mutations and trigger uncontrolled self-assembly into high-order structures. We identified mutation hot spots for supramolecular assembly, which are predictable by geometry. Here, we present a dataset of descriptors that characterize these hot spot positions both geometrically and chemically, as well as computer scripts allowing the calculation and visualization of these properties for homomers of choice. Since the biological relevance of homomers is not readily available from their X-ray crystallographic structure, we also provide reliability estimates obtained by methods we recently developed. These data have implications in the study of disease-causing mutations, protein evolution and can be exploited in the design of biomaterials. Design Type(s) | protein interaction analysis objective • protein structure prediction objective • modeling and simulation objective | Measurement Type(s) | protein complex | Technology Type(s) | computational modeling technique | Factor Type(s) | Filtering • source | Sample Characteristic(s) | laboratory environment |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Collapse
|
43
|
Cannon KA, Ochoa JM, Yeates TO. High-symmetry protein assemblies: patterns and emerging applications. Curr Opin Struct Biol 2019; 55:77-84. [PMID: 31005680 DOI: 10.1016/j.sbi.2019.03.008] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2018] [Accepted: 03/06/2019] [Indexed: 12/28/2022]
Abstract
The accelerated elucidation of three-dimensional structures of protein complexes, both natural and designed, is providing new examples of large supramolecular assemblies with intriguing shapes. Those with high symmetry - based on the geometries of the Platonic solids - are particularly notable as their innately closed forms create interior spaces with varying degrees of enclosure. We survey known protein assemblies of this type and discuss their geometric features. The results bear on issues of protein function and evolution, while also guiding novel bioengineering applications. Recent successes using high-symmetry protein assemblies for applications in interior encapsulation and exterior display are highlighted.
Collapse
Affiliation(s)
- Kevin A Cannon
- UCLA Department of Chemistry and Biochemistry, United States; UCLA-DOE Institute for Genomics and Proteomics, United States
| | - Jessica M Ochoa
- UCLA Department of Chemistry and Biochemistry, United States; UCLA Molecular Biology Institute, United States
| | - Todd O Yeates
- UCLA Department of Chemistry and Biochemistry, United States; UCLA-DOE Institute for Genomics and Proteomics, United States; UCLA Molecular Biology Institute, United States.
| |
Collapse
|
44
|
Garton M, MacKinnon SS, Malevanets A, Wodak SJ. Interplay of self-association and conformational flexibility in regulating protein function. Philos Trans R Soc Lond B Biol Sci 2019; 373:rstb.2017.0190. [PMID: 29735742 DOI: 10.1098/rstb.2017.0190] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/14/2018] [Indexed: 12/18/2022] Open
Abstract
Many functional roles have been attributed to homodimers, the most common mode of protein self-association, notably in the regulation of enzymes, ion channels, transporters and transcription factors. Here we review findings that offer new insights into the different roles conformational flexibility plays in regulating homodimer function. Intertwined homodimers of two-domain proteins and their related family members display significant conformational flexibility, which translates into concerted motion between structural domains. This flexibility enables the corresponding proteins to regulate function across family members by modulating the spatial positions of key recognition surfaces of individual domains, to either maintain subunit interfaces, alter them or break them altogether, leading to a variety of functional consequences. Many proteins may exist as monomers but carry out their biological function as homodimers or higher-order oligomers. We present early evidence that in such systems homodimer formation primes the protein for its functional role. It does so by inducing elevated mobility in protein regions corresponding to the binding epitopes of functionally important ligands. In some systems this process acts as an allosteric response elicited by the self-association reaction itself. Our analysis furthermore suggests that the induced extra mobility likely facilitates ligand binding through the mechanism of conformational selection.This article is part of a discussion meeting issue 'Allostery and molecular machines'.
Collapse
Affiliation(s)
- Michael Garton
- Department of Molecular Genetics, University of Toronto, The Donnelly Centre, 160 College Street, Toronto, Ontario M5S 3E1, Canada
| | - Stephen S MacKinnon
- Cyclica Inc., 18 King Street East, Suite 810, Toronto, Ontario M5C 1C4, Canada
| | - Anatoly Malevanets
- Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada
| | - Shoshana J Wodak
- VIB Structural Biology research Centre, VUB, Building E Pleinlaan 2, 1050 Brussels, Belgium
| |
Collapse
|
45
|
Principles and characteristics of biological assemblies in experimentally determined protein structures. Curr Opin Struct Biol 2019; 55:34-49. [PMID: 30965224 DOI: 10.1016/j.sbi.2019.03.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/01/2019] [Indexed: 12/27/2022]
Abstract
More than half of all structures in the PDB are assemblies of two or more proteins, including both homooligomers and heterooligomers. Structural information on these assemblies comes from X-ray crystallography, NMR, and cryo-EM spectroscopy. The correct assembly in an X-ray structure is often ambiguous, and computational methods have been developed to identify the most likely biologically relevant assembly based on physical properties of assemblies and sequence conservation in interfaces. Taking advantage of the large number of structures now available, some of the most recent methods have relied on similarity of interfaces and assemblies across structures of homologous proteins.
Collapse
|
46
|
Macossay-Castillo M, Marvelli G, Guharoy M, Jain A, Kihara D, Tompa P, Wodak SJ. The Balancing Act of Intrinsically Disordered Proteins: Enabling Functional Diversity while Minimizing Promiscuity. J Mol Biol 2019; 431:1650-1670. [PMID: 30878482 DOI: 10.1016/j.jmb.2019.03.008] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Revised: 02/25/2019] [Accepted: 03/03/2019] [Indexed: 10/27/2022]
Abstract
Intrinsically disordered proteins (IDPs) or regions (IDRs) perform diverse cellular functions, but are also prone to forming promiscuous and potentially deleterious interactions. We investigate the extent to which the properties of, and content in, IDRs have adapted to enable functional diversity while limiting interference from promiscuous interactions in the crowded cellular environment. Information on protein sequences, their predicted intrinsic disorder, and 3D structure contents is related to data on protein cellular concentrations, gene co-expression, and protein-protein interactions in the well-studied yeast Saccharomyces cerevisiae. Results reveal that both the protein IDR content and the frequency of "sticky" amino acids in IDRs (those more frequently involved in protein interfaces) decrease with increasing protein cellular concentration. This implies that the IDR content and the amino acid composition of IDRs experience negative selection as the protein concentration increases. In the S. cerevisiae protein-protein interaction network, the higher a protein's IDR content, the more frequently it interacts with IDR-containing partners, and the more functionally diverse the partners are. Employing a clustering analysis of Gene Ontology terms, we newly identify ~600 putative multifunctional proteins in S. cerevisiae. Strikingly, these proteins are enriched in IDRs and contribute significantly to all the observed trends. In particular, IDRs of multi-functional proteins feature more sticky amino acids than IDRs of their non-multifunctional counterparts, or the surfaces of structured yeast proteins. This property likely affords sufficient binding affinity for the functional interactions, commonly mediated by short IDR segments, thereby counterbalancing the loss in overall IDR conformational entropy upon binding.
Collapse
Affiliation(s)
- Mauricio Macossay-Castillo
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Brussels, Department of Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Giulio Marvelli
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Brussels, Department of Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Mainak Guharoy
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Brussels, Department of Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Aashish Jain
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA; Department of Biological Sciences, Purdue University, Hockmeyer Structural Biology Building, 249 S. Martin Jischke Dr West Lafayette, IN 47907, USA
| | - Peter Tompa
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Brussels, Department of Bioengineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| | - Shoshana J Wodak
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium.
| |
Collapse
|
47
|
Investigation of quaternary structure of aggregating 3-ketosteroid dehydrogenase from Sterolibacterium denitrificans: In the pursuit of consensus of various biophysical techniques. Biochim Biophys Acta Gen Subj 2019; 1863:1027-1039. [PMID: 30876874 DOI: 10.1016/j.bbagen.2019.03.009] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 02/15/2019] [Accepted: 03/10/2019] [Indexed: 11/22/2022]
Abstract
In this work we analyzed the quaternary structure of FAD-dependent 3-ketosteroid dehydrogenase (AcmB) from Sterolibacterium denitrificans, the protein that in solution forms massive aggregates (>600 kDa). Using size-excursion chromatography (SEC), dynamic light scattering (DLS), native-PAGE and atomic force microscopy (AFM) we studied the nature of enzyme aggregation. Partial protein de-aggregation was facilitated by the presence of non-ionic detergent such as Tween 20 or by a high degree of protein dilution but not by addition of a reducing agent or an increase of ionic strength. De-aggregating influence of Tween 20 had no impact on either enzyme's specific activity or FAD reconstitution to recombinant AcmB. The joint experimental (DLS, isoelectric focusing) and theoretical investigations demonstrated gradual shift of enzyme's isoelectric point upon aggregation from 8.6 for a monomeric form to even 5.0. The AFM imaging on mica or highly oriented pyrolytic graphite (HOPG) surface enabled observation of individual protein monomers deposited from a highly diluted solution (0.2 μg/ml). Such approach revealed that native AcmB can indeed be monomeric. AFM imaging supported by theoretical random sequential adsorption (RSA) kinetics allowed estimation of distribution enzyme forms in the bulk solution: 5%, monomer, 11.4% dimer and 12% trimer. Finally, based on results of AFM as well as analysis of the surface of AcmB homology models we have observed that aggregation is most probably initiated by hydrophobic forces and then assisted by electrostatic attraction between negatively charged aggregates and positively charged monomers.
Collapse
|
48
|
The peroxisomal zebrafish SCP2-thiolase (type-1) is a weak transient dimer as revealed by crystal structures and native mass spectrometry. Biochem J 2019; 476:307-332. [PMID: 30573650 DOI: 10.1042/bcj20180788] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 12/12/2018] [Accepted: 12/20/2018] [Indexed: 12/19/2022]
Abstract
The SCP2 (sterol carrier protein 2)-thiolase (type-1) functions in the vertebrate peroxisomal, bile acid synthesis pathway, converting 24-keto-THC-CoA and CoA into choloyl-CoA and propionyl-CoA. This conversion concerns the β-oxidation chain shortening of the steroid fatty acyl-moiety of 24-keto-THC-CoA. This class of dimeric thiolases has previously been poorly characterized. High-resolution crystal structures of the zebrafish SCP2-thiolase (type-1) now reveal an open catalytic site, shaped by residues of both subunits. The structure of its non-dimerized monomeric form has also been captured in the obtained crystals. Four loops at the dimer interface adopt very different conformations in the monomeric form. These loops also shape the active site and their structural changes explain why a competent active site is not present in the monomeric form. Native mass spectrometry studies confirm that the zebrafish SCP2-thiolase (type-1) as well as its human homolog are weak transient dimers in solution. The crystallographic binding studies reveal the mode of binding of CoA and octanoyl-CoA in the active site, highlighting the conserved geometry of the nucleophilic cysteine, the catalytic acid/base cysteine and the two oxyanion holes. The dimer interface of SCP2-thiolase (type-1) is equally extensive as in other thiolase dimers; however, it is more polar than any of the corresponding interfaces, which correlates with the notion that the enzyme forms a weak transient dimer. The structure comparison of the monomeric and dimeric forms suggests functional relevance of this property. These comparisons provide also insights into the structural rearrangements that occur when the folded inactive monomers assemble into the mature dimer.
Collapse
|
49
|
Avraham O, Bayer EA, Livnah O. Crystal structure of afifavidin reveals common features of molecular assemblage in the bacterial dimeric avidins. FEBS J 2018; 285:4617-4630. [DOI: 10.1111/febs.14685] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Revised: 10/10/2018] [Accepted: 10/24/2018] [Indexed: 01/05/2023]
Affiliation(s)
- Orly Avraham
- Department of Biological Chemistry The Alexander Silverman Institute of Life Sciences The Wolfson Centre for Applied Structural Biology The Hebrew University of Jerusalem Israel
| | - Edward A. Bayer
- Department of Biological Chemistry The Weizmann Institute of Science Rehovot Israel
| | - Oded Livnah
- Department of Biological Chemistry The Alexander Silverman Institute of Life Sciences The Wolfson Centre for Applied Structural Biology The Hebrew University of Jerusalem Israel
| |
Collapse
|
50
|
Hu J, Liu HF, Sun J, Wang J, Liu R. Integrating co-evolutionary signals and other properties of residue pairs to distinguish biological interfaces from crystal contacts. Protein Sci 2018; 27:1723-1735. [PMID: 29931702 DOI: 10.1002/pro.3448] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 04/21/2018] [Accepted: 05/16/2018] [Indexed: 12/25/2022]
Abstract
It remains challenging to accurately discriminate between biological and crystal interfaces. Most existing analyses and algorithms focused on the features derived from a single side of the interface. However, less attention has been paid to the properties of residue pairs across protein interfaces. To address this problem, we defined a novel co-evolutionary feature for homodimers through integrating direct coupling analysis and image processing techniques. The residue pairs across biological homodimeric interfaces were significantly enriched in co-evolving residues compared to those across crystal contacts, resulting in a promising classification accuracy with area under the curves (AUCs) of >0.85. Considering the availability of co-evolutionary feature, we also designed other residue pair based features that were useful for both homodimers and heterodimers. The most informative residue pairs were identified to reflect the interaction preferences across protein interfaces. Regarding the other extant properties, we designed the new descriptors at the interface residue level as well as at the pairwise contact level. Extensive validation showed that these single properties can be used to identify biological interfaces with AUCs ranging from 0.60 to 0.88. By integrating co-evolutionary feature with other residue pair based properties, our final prediction model output excellent performance with AUCs of >0.91 on different datasets. Compared to existing methods, our algorithm not only yielded better or comparable results but also provided complementary information. An easy-to-use web server is freely accessible at http://liulab.hzau.edu.cn/RPAIAnalyst.
Collapse
Affiliation(s)
- Jian Hu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China.,College of Biomedical Engineering, South-Central University for Nationalities, Wuhan, 430074, P. R. China
| | - Hui-Fang Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jun Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Jia Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Rong Liu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| |
Collapse
|