1
Schweke H, Pacesa M, Levin T, Goverde CA, Kumar P, Duhoo Y, Dornfeld LJ, Dubreuil B, Georgeon S, Ovchinnikov S, Woolfson DN, Correia BE, Dey S, Levy ED. An atlas of protein homo-oligomerization across domains of life. Cell 2024; 187:999-1010.e15. [PMID: 38325366 DOI: 10.1016/j.cell.2024.01.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Revised: 11/03/2023] [Accepted: 01/15/2024] [Indexed: 02/09/2024]
Abstract
Protein structures are essential to understanding cellular processes in molecular detail. While advances in artificial intelligence revealed the tertiary structure of proteins at scale, their quaternary structure remains mostly unknown. We devise a scalable strategy based on AlphaFold2 to predict homo-oligomeric assemblies across four proteomes spanning the tree of life. Our results suggest that approximately 45% of an archaeal proteome and a bacterial proteome and 20% of two eukaryotic proteomes form homomers. Our predictions accurately capture protein homo-oligomerization, recapitulate megadalton complexes, and unveil hundreds of homo-oligomer types, including three confirmed experimentally by structure determination. Integrating these datasets with omics information suggests that a majority of known protein complexes are symmetric. Finally, these datasets provide a structural context for interpreting disease mutations and reveal coiled-coil regions as major enablers of quaternary structure evolution in human. Our strategy is applicable to any organism and provides a comprehensive view of homo-oligomerization in proteomes.
Affiliation(s)
- Hugo Schweke
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- Martin Pacesa
  - Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Tal Levin
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- Casper A Goverde
  - Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Prasun Kumar
  - School of Chemistry, University of Bristol, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Bristol BS8 1TQ, UK; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK
- Yoan Duhoo
  - Protein Production and Structure Characterization Core Facility (PTPSP), School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Lars J Dornfeld
  - Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Benjamin Dubreuil
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- Sandrine Georgeon
  - Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Sergey Ovchinnikov
  - John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
- Derek N Woolfson
  - School of Chemistry, University of Bristol, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Bristol BS8 1TQ, UK; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK
- Bruno E Correia
  - Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Sucharita Dey
  - Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, Rajasthan, India
- Emmanuel D Levy
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
2
Kim H, Yang I, Lim SI. Streamlined construction of robust heteroprotein complexes by self-induced in-cell disulfide pairing. Int J Biol Macromol 2024; 254:127965. [PMID: 37944724 DOI: 10.1016/j.ijbiomac.2023.127965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/11/2023] [Revised: 11/04/2023] [Accepted: 11/06/2023] [Indexed: 11/12/2023]
Abstract
Biomolecules and their functional subdomains are essential building blocks in the creation of multifunctional nanocomplexes. Methyl-binding domain protein 2 (MBD2) and p66α stand out as small α-helical motifs with an ability to self-assemble into a heterodimeric coiled-coil, making them promising building units. Yet, their practical use is hindered by rapid dissociation upon dilution. In this study, novel fusion tags, MBD2 and p66α variants, were developed to covalently link during co-expression in E. coli SHuffle. Through strategic placement of cysteine at each α-helix's terminus, intracellular crosslinking occurred with high specificity and yield, facilitated by preserved α-helical interactions. This instant disulfide bonding in the oxidative cytoplasm of E. coli SHuffle efficiently overcame the need for inefficient in vitro oxidation and protein extraction prone to creating non-specific adducts and suboptimal bioprocesses. In contrast to their wild-type counterparts, the GFP-mCherry protein complex cross-linked by the fusion tags maintained the heterodimeric state even under extensive dilution. The fusion tags, when combined with the E. coli SHuffle system, allowed for the streamlined co-expression of a stable protein complex through self-induced intracellular cysteine coupling. The approach demonstrated herein holds great promise for producing multifunctional and robust heteroprotein complexes.
Affiliation(s)
- Hyunji Kim
  - Department of Chemical Engineering, Pukyong National University, Yongso-ro 45, Nam-gu, Busan, Republic of Korea
- Iji Yang
  - Department of Chemical Engineering, Pukyong National University, Yongso-ro 45, Nam-gu, Busan, Republic of Korea
- Sung In Lim
  - Department of Chemical Engineering, Pukyong National University, Yongso-ro 45, Nam-gu, Busan, Republic of Korea
3
Kumar P, Petrenas R, Dawson WM, Schweke H, Levy ED, Woolfson DN. CC+: A searchable database of validated coiled coils in PDB structures and AlphaFold2 models. Protein Sci 2023; 32:e4789. [PMID: 37768271 PMCID: PMC10588367 DOI: 10.1002/pro.4789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Revised: 09/10/2023] [Accepted: 09/23/2023] [Indexed: 09/29/2023]
Abstract
α-Helical coiled coils are common tertiary and quaternary elements of protein structure. In coiled coils, two or more α helices wrap around each other to form bundles. This apparently simple structural motif can generate many architectures and topologies. Coiled coil-forming sequences can be predicted from heptad repeats of hydrophobic and polar residues, hpphppp, although this is not always reliable. Alternatively, coiled-coil structures can be identified using the program SOCKET, which finds knobs-into-holes (KIH) packing between side chains of neighboring helices. SOCKET also classifies coiled-coil architecture and topology, thus allowing sequence-to-structure relationships to be garnered. In 2009, we used SOCKET to create a relational database of coiled-coil structures, CC+, from the RCSB Protein Data Bank (PDB). Here, we report an update of CC+ following an update of SOCKET (to Socket2) and the recent explosion of structural data and the success of AlphaFold2 in predicting protein structures from genome sequences. With the most-stringent SOCKET parameters, CC+ contains ≈12,000 coiled-coil assemblies from experimentally determined structures, and ≈120,000 potential coiled-coil structures within single-chain models predicted by AlphaFold2 across 48 proteomes. CC+ allows these and other less-stringently defined coiled coils to be searched at various levels of structure, sequence, and side-chain interactions. The identified coiled coils can be viewed directly from CC+ using the Socket2 application, and their associated data can be downloaded for further analyses. CC+ is available freely at http://coiledcoils.chm.bris.ac.uk/CCPlus/Home.html. It will be updated automatically. We envisage that CC+ could be used to understand coiled-coil assemblies and their sequence-to-structure relationships, and to aid protein design and engineering.
Affiliation(s)
- Prasun Kumar
  - School of Chemistry, University of Bristol, Bristol, UK
- Hugo Schweke
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- Emmanuel D. Levy
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- Derek N. Woolfson
  - School of Chemistry, University of Bristol, Bristol, UK
  - School of Biochemistry, University of Bristol, Medical Sciences Building, University Walk, Bristol, UK
  - Bristol BioDesign Institute, School of Chemistry, University of Bristol, Bristol, UK
4
Kümpel C, Grein F, Dahl C. Fluorescence Microscopy Study of the Intracellular Sulfur Globule Protein SgpD in the Purple Sulfur Bacterium Allochromatium vinosum. Microorganisms 2023; 11:1792. [PMID: 37512964 PMCID: PMC10386293 DOI: 10.3390/microorganisms11071792] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/19/2023] [Revised: 07/01/2023] [Accepted: 07/08/2023] [Indexed: 07/30/2023]
Abstract
When oxidizing reduced sulfur compounds, the phototrophic sulfur bacterium Allochromatium vinosum forms spectacular sulfur globules as obligatory intracellular, but extracytoplasmic, intermediates. The globule envelope consists of three extremely hydrophobic proteins: SgpA and SgpB, which are very similar and can functionally replace each other, and SgpC, which is involved in the expansion of the sulfur globules. The presence of a fourth protein, SgpD, was suggested by comparative transcriptomics and proteomics of purified sulfur globules. Here, we investigated the in vivo function of SgpD by coupling its carboxy-terminus to mCherry. This fluorescent protein requires oxygen for chromophore maturation, but we were able to use it in anaerobically growing A. vinosum provided the cells were exposed to oxygen for one hour prior to imaging. While mCherry lacking a signal peptide resulted in low fluorescence evenly distributed throughout the cell, fusion with SgpD carrying its original Sec-dependent signal peptide targeted mCherry to the periplasm and co-localized it exactly with the highly light-refractive sulfur deposits seen in sulfide-fed A. vinosum cells. Insertional inactivation of the sgpD gene showed that the protein is not essential for the formation and degradation of sulfur globules.
Affiliation(s)
- Carolin Kümpel
  - Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Meckenheimer Allee 168, D-53115 Bonn, Germany
- Fabian Grein
  - Institut für Pharmazeutische Mikrobiologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Meckenheimer Allee 16, D-53115 Bonn, Germany
- Christiane Dahl
  - Institut für Mikrobiologie & Biotechnologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Meckenheimer Allee 168, D-53115 Bonn, Germany
5
Regulation of Polyhomeotic Condensates by Intrinsically Disordered Sequences That Affect Chromatin Binding. Epigenomes 2022; 6:epigenomes6040040. [PMID: 36412795 PMCID: PMC9680516 DOI: 10.3390/epigenomes6040040] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 10/03/2022] [Revised: 10/30/2022] [Accepted: 11/01/2022] [Indexed: 11/06/2022]
Abstract
The Polycomb group (PcG) complex PRC1 localizes in the nucleus in condensed structures called Polycomb bodies. The PRC1 subunit Polyhomeotic (Ph) contains an oligomerizing sterile alpha motif (SAM) that is implicated in both PcG body formation and chromatin organization in Drosophila and mammalian cells. A truncated version of Ph containing the SAM (mini-Ph) forms phase-separated condensates with DNA or chromatin in vitro, suggesting that PcG bodies may form through SAM-driven phase separation. In cells, Ph forms multiple small condensates, while mini-Ph typically forms a single large nuclear condensate. We therefore hypothesized that sequences outside of mini-Ph, which are predicted to be intrinsically disordered, are required for proper condensate formation. We identified three distinct low-complexity regions in Ph based on sequence composition. We systematically tested the role of each of these sequences in Ph condensates using live imaging of transfected Drosophila S2 cells. Each sequence uniquely affected Ph SAM-dependent condensate size, number, and morphology, but the most dramatic effects occurred when the central, glutamine-rich intrinsically disordered region (IDR) was removed, which resulted in large Ph condensates. Like mini-Ph condensates, condensates lacking the glutamine-rich IDR excluded chromatin. Chromatin fractionation experiments indicated that the removal of the glutamine-rich IDR reduced chromatin binding and that the removal of either of the other IDRs increased chromatin binding. Our data suggest that all three IDRs, and functional interactions among them, regulate Ph condensate size and number. Our results can be explained by a model in which tight chromatin binding by Ph IDRs antagonizes Ph SAM-driven phase separation. Our observations highlight the complexity of regulation of biological condensates housed in single proteins.
6
Chen X, Liu Y, Yin S, Zang J, Zhang T, Lv C, Zhao G. Construction of Sol-Gel Phase-Reversible Hydrogels with Tunable Properties with Native Nanofibrous Protein as Building Blocks. ACS Applied Materials & Interfaces 2022; 14:44125-44135. [PMID: 36162135 DOI: 10.1021/acsami.2c11765] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 06/16/2023]
Abstract
Reversible sol-gel transforming behaviors combined with tunable mechanical properties are vital demands for developing biomaterials. However, it remains challenging to correlate these properties with hydrogels constructed from denatured protein as building blocks. Herein, taking advantage of the naturally high-affinity coordination environments consisting of i, i + 4 His-Glu motifs offered by paramyosin, a ubiquitous nanofibrous protein, we found that Zn2+ rather than Ca2+ or Mg2+ has the ability to trigger the self-assembly of native abalone paramyosin (AbPM) into protein hydrogels under benign conditions, while the addition of EDTA converts the hydrogels back into protein monomers, indicative of a reversible process. Using this sol-gel reversible property, the AbPM gels can serve as a vehicle to encapsulate bioactive molecules such as curcumin, thereby protecting them from degradation by thermal and photo treatment. Notably, based on the highly conserved structure of native AbPM, the mechanical properties and biological activity of the fabricated AbPM hydrogels can be fine-tuned by their noncovalent interaction with small molecules. All these findings raise the possibility that native paramyosin can be explored as a new class of protein hydrogels which exhibit favorable properties that traditional hydrogels constructed from denatured protein building blocks do not have.
Affiliation(s)
- Xuemin Chen
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
- Yu Liu
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
- Shuhua Yin
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
- Jiachen Zang
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
- Tuo Zhang
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
- Chenyan Lv
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
- Guanghua Zhao
  - College of Food Science & Nutritional Engineering, China Agricultural University, Key Laboratory of Functional Dairy, Ministry of Education, Beijing 100083, China
7
Bioinformatic Analysis Predicts a Novel Genetic Module Related to Triple Gene and Binary Movement Blocks of Plant Viruses: Tetra-Cistron Movement Block. Biomolecules 2022; 12:biom12070861. [PMID: 35883420 PMCID: PMC9313169 DOI: 10.3390/biom12070861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/27/2022] [Revised: 06/14/2022] [Accepted: 06/17/2022] [Indexed: 11/16/2022]
Abstract
Previous studies have shown that the RNA genomes of some plant viruses encode two related genetic modules required for virus movement over the host body, containing two or three genes and named the binary movement block (BMB) and triple gene block (TGB), respectively. In this paper, we predict a novel putative-related movement gene module, called the tetra-cistron movement block (TCMB), in the virus-like transcriptome assemblies of the moss Dicranum scoparium and the Antarctic flowering plant Colobanthus quitensis. These TCMBs are encoded by smaller RNA components of putative two-component viruses related to plant benyviruses. Similar to the RNA2 of benyviruses, TCMB-containing RNAs have the 5′-terminal coat protein gene and include the RNA helicase gene which is followed by two small overlapping cistrons encoding hydrophobic proteins with a distant sequence similarity to the TGB2 and TGB3 proteins. Unlike TGB, TCMB also includes a fourth 5′-terminal gene preceding the helicase gene and coding for a protein showing a similarity to the double-stranded RNA-binding proteins of the DSRM AtDRB-like superfamily. Additionally, based on phylogenetic analysis, we suggest the involvement of replicative beny-like helicases in the evolution of the BMB and TCMB movement genetic modules.
8
Wang B, Wang M, Zhang H, Xu J, Hou J, Zhu Y. Canine Adenovirus 1 Isolation Bioinformatics Analysis of the Fiber. Front Cell Infect Microbiol 2022; 12:879360. [PMID: 35770071 PMCID: PMC9235841 DOI: 10.3389/fcimb.2022.879360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/19/2022] [Accepted: 04/25/2022] [Indexed: 11/13/2022]
Abstract
Canine adenovirus type 1 (CAdV-1) is a double-stranded DNA virus and the causative agent of fox encephalitis. The Fiber protein is one of the structural proteins of CAdV-1 and mediates virion binding to the coxsackievirus and adenovirus receptor on host cells. The suspected virus was cultured in MDCK cells and identified through its cytopathic effects, sequencing, and electron microscopy. The Fiber protein was analyzed using online bioinformatics servers. The CAdV-1-JL2021 strain was isolated successfully and was most similar to the CAdV-1 strain circulating in Italy. Negative selection and recombination were found in CAdV-1-JL2021 and CAdV-2-AC_000020.1. Its subcellular localization was the host cell membrane. The CAdV-1-JL2021 Fiber (ON164651) had 6 glycosylation sites and 107 phosphorylation sites and functioned in adhesion receptor-mediated virion attachment to the host cell, the same as the CAdV-2-AC_000020.1 Fiber. The Fiber tertiary structures of CAdV-1-JL2021 and CAdV-2-AC_000020.1 differed, but they shared the same coxsackievirus and adenovirus receptor. “VATTSPTLTFAYPLIKNNNH” was predicted to be a potential CAdV-1 B cell linear epitope. The MHC-I binding peptide “KLGVKPTTY” was present in both the CAdV-1-JL2021 and CAdV-2-AC_000020.1 Fiber proteins and is useful for designing a canine adenovirus vaccine.
Affiliation(s)
- Ben Wang
  - Animal Science and Technology College, Jilin Agriculture Science and Technology University, Jilin, China
- Minchun Wang
  - Institute of Special Animal and Plant Sciences, Chinese Academy of Agricultural Sciences, Changchun, China
- Hongling Zhang
  - Animal Science and Technology College, Jilin Agriculture Science and Technology University, Jilin, China
- Jinfeng Xu
  - Institute of Special Animal and Plant Sciences, Chinese Academy of Agricultural Sciences, Changchun, China
- Jinyu Hou
  - Institute of Special Animal and Plant Sciences, Chinese Academy of Agricultural Sciences, Changchun, China
  - College of Veterinary Medicine, Jilin Agricultural University, Changchun, China
- Yanzhu Zhu
  - Institute of Special Animal and Plant Sciences, Chinese Academy of Agricultural Sciences, Changchun, China
  - College of Veterinary Medicine, Jilin Agricultural University, Changchun, China
  - *Correspondence: Yanzhu Zhu,