1
|
Zhang Y, Xie G, Lee JE, Zandian M, Sudarshan D, Estavoyer B, Benz C, Viita T, Asgaritarghi G, Lachance C, Messmer C, Simonetti L, Sinha VK, Lambert JP, Chen YW, Wang SP, Ivarsson Y, Affar EB, Côté J, Ge K, Kutateladze TG. ASXLs binding to the PHD2/3 fingers of MLL4 provides a mechanism for the recruitment of BAP1 to active enhancers. Nat Commun 2024; 15:4883. [PMID: 38849395 PMCID: PMC11161652 DOI: 10.1038/s41467-024-49391-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Accepted: 05/31/2024] [Indexed: 06/09/2024] Open
Abstract
The human methyltransferase and transcriptional coactivator MLL4 and its paralog MLL3 are frequently mutated in cancer. MLL4 and MLL3 monomethylate histone H3K4 and contain a set of uncharacterized PHD fingers. Here, we report a novel function of the PHD2 and PHD3 (PHD2/3) fingers of MLL4 and MLL3 that bind to ASXL2, a component of the Polycomb repressive H2AK119 deubiquitinase (PR-DUB) complex. The structure of MLL4 PHD2/3 in complex with the MLL-binding helix (MBH) of ASXL2 and mutational analyses reveal the molecular mechanism which is conserved in homologous ASXL1 and ASXL3. The native interaction of the Trithorax MLL3/4 complexes with the PR-DUB complex in vivo depends solely on MBH of ASXL1/2, coupling the two histone modifying activities. ChIP-seq analysis in embryonic stem cells demonstrates that MBH of ASXL1/2 is required for the deubiquitinase BAP1 recruitment to MLL4-bound active enhancers. Our findings suggest an ASXL1/2-dependent functional link between the MLL3/4 and PR-DUB complexes.
Collapse
Affiliation(s)
- Yi Zhang
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO, 80045, USA
- Department of Biochemistry, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - Guojia Xie
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD, 20892, USA
| | - Ji-Eun Lee
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD, 20892, USA
| | - Mohamad Zandian
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | - Deepthi Sudarshan
- St-Patrick Research Group in Basic Oncology, Oncology Division of CHU de Québec-Université Laval Research, Laval University Cancer Research Center, Quebec City, QC, G1R 3S3, Canada
| | - Benjamin Estavoyer
- Maisonneuve-Rosemont Hospital Research Center, Montréal, QC, H1T 2M4, Canada
| | - Caroline Benz
- Department of Chemistry, BMC, Uppsala University, Uppsala, 75237, Sweden
| | - Tiina Viita
- St-Patrick Research Group in Basic Oncology, Oncology Division of CHU de Québec-Université Laval Research, Laval University Cancer Research Center, Quebec City, QC, G1R 3S3, Canada
| | - Golareh Asgaritarghi
- St-Patrick Research Group in Basic Oncology, Oncology Division of CHU de Québec-Université Laval Research, Laval University Cancer Research Center, Quebec City, QC, G1R 3S3, Canada
| | - Catherine Lachance
- St-Patrick Research Group in Basic Oncology, Oncology Division of CHU de Québec-Université Laval Research, Laval University Cancer Research Center, Quebec City, QC, G1R 3S3, Canada
| | - Clémence Messmer
- Maisonneuve-Rosemont Hospital Research Center, Montréal, QC, H1T 2M4, Canada
| | - Leandro Simonetti
- Department of Chemistry, BMC, Uppsala University, Uppsala, 75237, Sweden
| | - Vikrant Kumar Sinha
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | - Jean-Philippe Lambert
- St-Patrick Research Group in Basic Oncology, Oncology Division of CHU de Québec-Université Laval Research, Laval University Cancer Research Center, Quebec City, QC, G1R 3S3, Canada
| | - Yu-Wen Chen
- Institute of Biomedical Sciences, Academia Sinica, Taipei, 11529, Taiwan, ROC
| | - Shu-Ping Wang
- Institute of Biomedical Sciences, Academia Sinica, Taipei, 11529, Taiwan, ROC
| | - Ylva Ivarsson
- Department of Chemistry, BMC, Uppsala University, Uppsala, 75237, Sweden
| | - El Bachir Affar
- Maisonneuve-Rosemont Hospital Research Center, Montréal, QC, H1T 2M4, Canada
- Department of Medicine, University of Montréal, Montréal, QC, H3C 3J7, Canada
| | - Jacques Côté
- St-Patrick Research Group in Basic Oncology, Oncology Division of CHU de Québec-Université Laval Research, Laval University Cancer Research Center, Quebec City, QC, G1R 3S3, Canada.
| | - Kai Ge
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD, 20892, USA.
| | - Tatiana G Kutateladze
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO, 80045, USA.
| |
Collapse
|
2
|
Becht DC, Mohid SA, Lee JE, Zandian M, Benz C, Biswas S, Sinha VK, Ivarsson Y, Ge K, Zhang Y, Kutateladze TG. MLL4 binds TET3. Structure 2024; 32:706-714.e3. [PMID: 38579707 PMCID: PMC11162309 DOI: 10.1016/j.str.2024.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 02/01/2024] [Accepted: 03/11/2024] [Indexed: 04/07/2024]
Abstract
Human mixed lineage leukemia 4 (MLL4), also known as KMT2D, regulates cell type specific transcriptional programs through enhancer activation. Along with the catalytic methyltransferase domain, MLL4 contains seven less characterized plant homeodomain (PHD) fingers. Here, we report that the sixth PHD finger of MLL4 (MLL4PHD6) binds to the hydrophobic motif of ten-eleven translocation 3 (TET3), a dioxygenase that converts methylated cytosine into oxidized derivatives. The solution NMR structure of the TET3-MLL4PHD6 complex and binding assays show that, like histone H4 tail, TET3 occupies the hydrophobic site of MLL4PHD6, and that this interaction is conserved in the seventh PHD finger of homologous MLL3 (MLL3PHD7). Analysis of genomic localization of endogenous MLL4 and ectopically expressed TET3 in mouse embryonic stem cells reveals a high degree overlap on active enhancers and suggests a potential functional relationship of MLL4 and TET3.
Collapse
Affiliation(s)
- Dustin C Becht
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Sk Abdul Mohid
- Department of Biochemistry, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Ji-Eun Lee
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD 20892, USA
| | - Mohamad Zandian
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Caroline Benz
- Department of Chemistry - BMC, Uppsala University, 751 23 Uppsala, Sweden
| | - Soumi Biswas
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Vikrant Kumar Sinha
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Ylva Ivarsson
- Department of Chemistry - BMC, Uppsala University, 751 23 Uppsala, Sweden
| | - Kai Ge
- Laboratory of Endocrinology and Receptor Biology, National Institute of Diabetes and Digestive and Kidney Diseases, NIH, Bethesda, MD 20892, USA
| | - Yi Zhang
- Department of Biochemistry, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA.
| | - Tatiana G Kutateladze
- Department of Pharmacology, University of Colorado School of Medicine, Aurora, CO 80045, USA.
| |
Collapse
|
3
|
Kumar M, Michael S, Alvarado-Valverde J, Zeke A, Lazar T, Glavina J, Nagy-Kanta E, Donagh J, Kalman Z, Pascarelli S, Palopoli N, Dobson L, Suarez C, Van Roey K, Krystkowiak I, Griffin J, Nagpal A, Bhardwaj R, Diella F, Mészáros B, Dean K, Davey N, Pancsa R, Chemes L, Gibson T. ELM-the Eukaryotic Linear Motif resource-2024 update. Nucleic Acids Res 2024; 52:D442-D455. [PMID: 37962385 PMCID: PMC10767929 DOI: 10.1093/nar/gkad1058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/22/2023] [Accepted: 10/24/2023] [Indexed: 11/15/2023] Open
Abstract
Short Linear Motifs (SLiMs) are the smallest structural and functional components of modular eukaryotic proteins. They are also the most abundant, especially when considering post-translational modifications. As well as being found throughout the cell as part of regulatory processes, SLiMs are extensively mimicked by intracellular pathogens. At the heart of the Eukaryotic Linear Motif (ELM) Resource is a representative (not comprehensive) database. The ELM entries are created by a growing community of skilled annotators and provide an introduction to linear motif functionality for biomedical researchers. The 2024 ELM update includes 346 novel motif instances in areas ranging from innate immunity to both protein and RNA degradation systems. In total, 39 classes of newly annotated motifs have been added, and another 17 existing entries have been updated in the database. The 2024 ELM release now includes 356 motif classes incorporating 4283 individual motif instances manually curated from 4274 scientific publications and including >700 links to experimentally determined 3D structures. In a recent development, the InterPro protein module resource now also includes ELM data. ELM is available at: http://elm.eu.org.
Collapse
Affiliation(s)
- Manjeet Kumar
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Sushama Michael
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Jesús Alvarado-Valverde
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Germany
| | - András Zeke
- Institute of Enzymology, HUN-REN Research Centre for Natural Sciences, Budapest 1117, Hungary
| | - Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Juliana Glavina
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), CP 1650, Buenos Aires, Argentina
- Escuela de Bio y Nanotecnologías (EByN), Universidad Nacional de San Martín, Av. 25 de Mayo y Francia, CP1650 San Martín, Buenos Aires, Argentina
| | - Eszter Nagy-Kanta
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Práter u. 50/A, Budapest 1083, Hungary
| | - Juan Mac Donagh
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Bernal, Buenos Aires, Argentina
| | - Zsofia E Kalman
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Práter u. 50/A, Budapest 1083, Hungary
| | - Stefano Pascarelli
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Bernal, Buenos Aires, Argentina
| | - László Dobson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Department of Bioinformatics, Semmelweis University, Tűzoltó u. 7, Budapest 1094, Hungary
| | - Carmen Florencia Suarez
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), CP 1650, Buenos Aires, Argentina
- Escuela de Bio y Nanotecnologías (EByN), Universidad Nacional de San Martín, Av. 25 de Mayo y Francia, CP1650 San Martín, Buenos Aires, Argentina
| | - Kim Van Roey
- Health Services Research, Sciensano, Brussels, Belgium
| | - Izabella Krystkowiak
- Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Rd, Chelsea, London SW3 6JB, UK
| | - Juan Esteban Griffin
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Bernal, Buenos Aires, Argentina
| | - Anurag Nagpal
- Department of Biological Sciences, BITS Pilani, K. K. Birla Goa campus, Zuarinagar, Goa 403726, India
| | - Rajesh Bhardwaj
- Inselspital, University of Bern, Freiburgstrasse 15, CH-3010 Bern, Switzerland
| | - Francesca Diella
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Bálint Mészáros
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105, USA
| | - Kellie Dean
- School of Biochemistry and Cell Biology, 3.91 Western Gateway Building, University College Cork, Cork, Ireland
| | - Norman E Davey
- Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Rd, Chelsea, London SW3 6JB, UK
| | - Rita Pancsa
- Institute of Enzymology, HUN-REN Research Centre for Natural Sciences, Budapest 1117, Hungary
| | - Lucía B Chemes
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), CP 1650, Buenos Aires, Argentina
- Escuela de Bio y Nanotecnologías (EByN), Universidad Nacional de San Martín, Av. 25 de Mayo y Francia, CP1650 San Martín, Buenos Aires, Argentina
| | - Toby J Gibson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| |
Collapse
|
4
|
Mihalič F, Benz C, Kassa E, Lindqvist R, Simonetti L, Inturi R, Aronsson H, Andersson E, Chi CN, Davey NE, Överby AK, Jemth P, Ivarsson Y. Identification of motif-based interactions between SARS-CoV-2 protein domains and human peptide ligands pinpoint antiviral targets. Nat Commun 2023; 14:5636. [PMID: 37704626 PMCID: PMC10499821 DOI: 10.1038/s41467-023-41312-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 08/30/2023] [Indexed: 09/15/2023] Open
Abstract
The virus life cycle depends on host-virus protein-protein interactions, which often involve a disordered protein region binding to a folded protein domain. Here, we used proteomic peptide phage display (ProP-PD) to identify peptides from the intrinsically disordered regions of the human proteome that bind to folded protein domains encoded by the SARS-CoV-2 genome. Eleven folded domains of SARS-CoV-2 proteins were found to bind 281 peptides from human proteins, and affinities of 31 interactions involving eight SARS-CoV-2 protein domains were determined (KD ∼ 7-300 μM). Key specificity residues of the peptides were established for six of the interactions. Two of the peptides, binding Nsp9 and Nsp16, respectively, inhibited viral replication. Our findings demonstrate how high-throughput peptide binding screens simultaneously identify potential host-virus interactions and peptides with antiviral properties. Furthermore, the high number of low-affinity interactions suggest that overexpression of viral proteins during infection may perturb multiple cellular pathways.
Collapse
Affiliation(s)
- Filip Mihalič
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, Husargatan 3, 751 23, Uppsala, Sweden
| | - Caroline Benz
- Department of Chemistry - BMC, Uppsala University, Box 576, Husargatan 3, 751 23, Uppsala, Sweden
| | - Eszter Kassa
- Department of Chemistry - BMC, Uppsala University, Box 576, Husargatan 3, 751 23, Uppsala, Sweden
| | - Richard Lindqvist
- Department of Clinical Microbiology, Umeå University, 90185, Umeå, Sweden
- Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå University, 90187, Umeå, Sweden
| | - Leandro Simonetti
- Department of Chemistry - BMC, Uppsala University, Box 576, Husargatan 3, 751 23, Uppsala, Sweden
| | - Raviteja Inturi
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, Husargatan 3, 751 23, Uppsala, Sweden
| | - Hanna Aronsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, Husargatan 3, 751 23, Uppsala, Sweden
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, Husargatan 3, 751 23, Uppsala, Sweden
| | - Celestine N Chi
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, Husargatan 3, 751 23, Uppsala, Sweden
| | - Norman E Davey
- Division of Cancer Biology, The Institute of Cancer Research, 237 Fulham Road, London, SW3 6JB, UK
| | - Anna K Överby
- Department of Clinical Microbiology, Umeå University, 90185, Umeå, Sweden
- Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå University, 90187, Umeå, Sweden
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, Husargatan 3, 751 23, Uppsala, Sweden.
| | - Ylva Ivarsson
- Department of Chemistry - BMC, Uppsala University, Box 576, Husargatan 3, 751 23, Uppsala, Sweden.
| |
Collapse
|
5
|
Kliche J, Garvanska DH, Simonetti L, Badgujar D, Dobritzsch D, Nilsson J, Davey NE, Ivarsson Y. Large-scale phosphomimetic screening identifies phospho-modulated motif-based protein interactions. Mol Syst Biol 2023; 19:e11164. [PMID: 37219487 PMCID: PMC10333884 DOI: 10.15252/msb.202211164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 05/02/2023] [Accepted: 05/03/2023] [Indexed: 05/24/2023] Open
Abstract
Phosphorylation is a ubiquitous post-translation modification that regulates protein function by promoting, inhibiting or modulating protein-protein interactions. Hundreds of thousands of phosphosites have been identified but the vast majority have not been functionally characterised and it remains a challenge to decipher phosphorylation events modulating interactions. We generated a phosphomimetic proteomic peptide-phage display library to screen for phosphosites that modulate short linear motif-based interactions. The peptidome covers ~13,500 phospho-serine/threonine sites found in the intrinsically disordered regions of the human proteome. Each phosphosite is represented as wild-type and phosphomimetic variant. We screened 71 protein domains to identify 248 phosphosites that modulate motif-mediated interactions. Affinity measurements confirmed the phospho-modulation of 14 out of 18 tested interactions. We performed a detailed follow-up on a phospho-dependent interaction between clathrin and the mitotic spindle protein hepatoma-upregulated protein (HURP), demonstrating the essentiality of the phospho-dependency to the mitotic function of HURP. Structural characterisation of the clathrin-HURP complex elucidated the molecular basis for the phospho-dependency. Our work showcases the power of phosphomimetic ProP-PD to discover novel phospho-modulated interactions required for cellular function.
Collapse
Affiliation(s)
- Johanna Kliche
- Department of Chemistry, BMCUppsala UniversityUppsalaSweden
| | - Dimitriya Hristoforova Garvanska
- Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein ResearchUniversity of CopenhagenCopenhagenDenmark
| | | | - Dilip Badgujar
- Department of Chemistry, BMCUppsala UniversityUppsalaSweden
| | | | - Jakob Nilsson
- Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein ResearchUniversity of CopenhagenCopenhagenDenmark
| | - Norman E Davey
- Division of Cancer BiologyThe Institute of Cancer ResearchLondonUK
| | - Ylva Ivarsson
- Department of Chemistry, BMCUppsala UniversityUppsalaSweden
| |
Collapse
|
6
|
Aubel M, Eicholt L, Bornberg-Bauer E. Assessing structure and disorder prediction tools for de novo emerged proteins in the age of machine learning. F1000Res 2023; 12:347. [PMID: 37113259 PMCID: PMC10126731 DOI: 10.12688/f1000research.130443.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/17/2023] [Indexed: 03/31/2023] Open
Abstract
Background: De novo protein coding genes emerge from scratch in the non-coding regions of the genome and have, per definition, no homology to other genes. Therefore, their encoded de novo proteins belong to the so-called "dark protein space". So far, only four de novo protein structures have been experimentally approximated. Low homology, presumed high disorder and limited structures result in low confidence structural predictions for de novo proteins in most cases. Here, we look at the most widely used structure and disorder predictors and assess their applicability for de novo emerged proteins. Since AlphaFold2 is based on the generation of multiple sequence alignments and was trained on solved structures of largely conserved and globular proteins, its performance on de novo proteins remains unknown. More recently, natural language models of proteins have been used for alignment-free structure predictions, potentially making them more suitable for de novo proteins than AlphaFold2. Methods: We applied different disorder predictors (IUPred3 short/long, flDPnn) and structure predictors, AlphaFold2 on the one hand and language-based models (Omegafold, ESMfold, RGN2) on the other hand, to four de novo proteins with experimental evidence on structure. We compared the resulting predictions between the different predictors as well as to the existing experimental evidence. Results: Results from IUPred, the most widely used disorder predictor, depend heavily on the choice of parameters and differ significantly from flDPnn which has been found to outperform most other predictors in a comparative assessment study recently. Similarly, different structure predictors yielded varying results and confidence scores for de novo proteins. Conclusions: We suggest that, while in some cases protein language model based approaches might be more accurate than AlphaFold2, the structure prediction of de novo emerged proteins remains a difficult task for any predictor, be it disorder or structure.
Collapse
Affiliation(s)
- Margaux Aubel
- Institute for Evolution and Bidiversity, University of Muenster, Muenster, 48149, Germany
| | - Lars Eicholt
- Institute for Evolution and Bidiversity, University of Muenster, Muenster, 48149, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Bidiversity, University of Muenster, Muenster, 48149, Germany
- Department Protein Evolution, Max Planck-Institute for Biology, Tuebingen, 72076, Germany
| |
Collapse
|
7
|
Reciprocal regulatory balance within the CLEC16A-RNF41 mitophagy complex depends on an intrinsically disordered protein region. J Biol Chem 2023; 299:103057. [PMID: 36822331 PMCID: PMC10066562 DOI: 10.1016/j.jbc.2023.103057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 01/26/2023] [Indexed: 02/23/2023] Open
Abstract
CLEC16A is an E3 ubiquitin ligase that regulates mitochondrial quality control through mitophagy and is associated with over 20 human diseases. CLEC16A forms a complex with another E3 ligase, RNF41, and a ubiquitin-specific peptidase, USP8; however, regions that regulate CLEC16A activity or the assembly of the tripartite mitophagy regulatory complex are unknown. Here, we report that CLEC16A contains an internal intrinsically disordered protein region (IDPR) that is crucial for CLEC16A function and turnover. IDPRs lack a fixed secondary structure and possess emerging, yet still equivocal roles in protein stability, interactions, and enzymatic activity. We find that the internal IDPR of CLEC16A is crucial for its degradation. CLEC16A turnover was promoted by RNF41, which binds and acts upon the internal IDPR to destabilize CLEC16A. Loss of this internal IDPR also destabilized the ubiquitin-dependent tripartite CLEC16A-RNF41-USP8 complex. Finally, the presence of an internal IDPR within CLEC16A was confirmed using NMR and circular dichroism spectroscopy. Together, our studies reveal that an IDPR is essential to control the reciprocal regulatory balance between CLEC16A and RNF41, which could be targeted to improve mitochondrial health in disease.
Collapse
|
8
|
Eicholt LA, Aubel M, Berk K, Bornberg‐Bauer E, Lange A. Heterologous expression of naturally evolved putative
de novo
proteins with chaperones. Protein Sci 2022; 31:e4371. [PMID: 35900020 PMCID: PMC9278007 DOI: 10.1002/pro.4371] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 05/03/2022] [Accepted: 05/14/2022] [Indexed: 11/23/2022]
Abstract
Over the past decade, evidence has accumulated that new protein‐coding genes can emerge de novo from previously non‐coding DNA. Most studies have focused on large scale computational predictions of de novo protein‐coding genes across a wide range of organisms. In contrast, experimental data concerning the folding and function of de novo proteins are scarce. This might be due to difficulties in handling de novo proteins in vitro, as most are short and predicted to be disordered. Here, we propose a guideline for the effective expression of eukaryotic de novo proteins in Escherichia coli. We used 11 sequences from Drosophila melanogaster and 10 from Homo sapiens, that are predicted de novo proteins from former studies, for heterologous expression. The candidate de novo proteins have varying secondary structure and disorder content. Using multiple combinations of purification tags, E. coli expression strains, and chaperone systems, we were able to increase the number of solubly expressed putative de novo proteins from 30% to 62%. Our findings indicate that the best combination for expressing putative de novo proteins in E. coli is a GST‐tag with T7 Express cells and co‐expressed chaperones. We found that, overall, proteins with higher predicted disorder were easier to express.
Collapse
Affiliation(s)
- Lars A. Eicholt
- Institute for Evolution and Biodiversity University of Muenster Münster Germany
| | - Margaux Aubel
- Institute for Evolution and Biodiversity University of Muenster Münster Germany
| | - Katrin Berk
- Institute for Evolution and Biodiversity University of Muenster Münster Germany
| | - Erich Bornberg‐Bauer
- Institute for Evolution and Biodiversity University of Muenster Münster Germany
- Max Planck‐Institute for Biology Tuebingen Tübingen Germany
| | - Andreas Lange
- Institute for Evolution and Biodiversity University of Muenster Münster Germany
| |
Collapse
|
9
|
Kumar M, Michael S, Alvarado-Valverde J, Mészáros B, Sámano‐Sánchez H, Zeke A, Dobson L, Lazar T, Örd M, Nagpal A, Farahi N, Käser M, Kraleti R, Davey N, Pancsa R, Chemes L, Gibson T. The Eukaryotic Linear Motif resource: 2022 release. Nucleic Acids Res 2022; 50:D497-D508. [PMID: 34718738 PMCID: PMC8728146 DOI: 10.1093/nar/gkab975] [Citation(s) in RCA: 110] [Impact Index Per Article: 55.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2021] [Accepted: 10/27/2021] [Indexed: 02/03/2023] Open
Abstract
Almost twenty years after its initial release, the Eukaryotic Linear Motif (ELM) resource remains an invaluable source of information for the study of motif-mediated protein-protein interactions. ELM provides a comprehensive, regularly updated and well-organised repository of manually curated, experimentally validated short linear motifs (SLiMs). An increasing number of SLiM-mediated interactions are discovered each year and keeping the resource up-to-date continues to be a great challenge. In the current update, 30 novel motif classes have been added and five existing classes have undergone major revisions. The update includes 411 new motif instances mostly focused on cell-cycle regulation, control of the actin cytoskeleton, membrane remodelling and vesicle trafficking pathways, liquid-liquid phase separation and integrin signalling. Many of the newly annotated motif-mediated interactions are targets of pathogenic motif mimicry by viral, bacterial or eukaryotic pathogens, providing invaluable insights into the molecular mechanisms underlying infectious diseases. The current ELM release includes 317 motif classes incorporating 3934 individual motif instances manually curated from 3867 scientific publications. ELM is available at: http://elm.eu.org.
Collapse
Affiliation(s)
- Manjeet Kumar
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Sushama Michael
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Jesús Alvarado-Valverde
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences
| | - Bálint Mészáros
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| | - Hugo Sámano‐Sánchez
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Zhejiang University School of Medicine, International Campus, Zhejiang University, Haining, China
- Biomedical Sciences, Edinburgh Medical School, The University of Edinburgh, Edinburgh, EH8 9JZ, UK
| | - András Zeke
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest 1117, Hungary
| | - Laszlo Dobson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest 1117, Hungary
| | - Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Mihkel Örd
- Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Rd, Chelsea, London SW3 6JB, UK
| | - Anurag Nagpal
- Department of Biological Sciences, BITS Pilani, K. K. Birla Goa campus, Zuarinagar, Goa 403726, India
| | - Nazanin Farahi
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Melanie Käser
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Institute of Pharmacy and Molecular Biotechnology (IPMB), Heidelberg University, Heidelberg, Germany
| | - Ramya Kraleti
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Justus Liebig University Giessen, Ludwigstraße 23, 35390 Gießen, Germany
| | - Norman E Davey
- Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Rd, Chelsea, London SW3 6JB, UK
| | - Rita Pancsa
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest 1117, Hungary
| | - Lucía B Chemes
- Instituto de Investigaciones Biotecnológicas “Dr. Rodolfo A. Ugalde”, IIB-UNSAM, IIBIO-CONICET, Universidad Nacional de San Martín, Av. 25 de Mayo y Francia, CP1650 San Martín, Buenos Aires, Argentina
| | - Toby J Gibson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg 69117, Germany
| |
Collapse
|
10
|
Benz C, Ali M, Krystkowiak I, Simonetti L, Sayadi A, Mihalic F, Kliche J, Andersson E, Jemth P, Davey NE, Ivarsson Y. Proteome-scale mapping of binding sites in the unstructured regions of the human proteome. Mol Syst Biol 2022; 18:e10584. [PMID: 35044719 PMCID: PMC8769072 DOI: 10.15252/msb.202110584] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Revised: 12/21/2021] [Accepted: 12/22/2021] [Indexed: 12/18/2022] Open
Abstract
Specific protein-protein interactions are central to all processes that underlie cell physiology. Numerous studies have together identified hundreds of thousands of human protein-protein interactions. However, many interactions remain to be discovered, and low affinity, conditional, and cell type-specific interactions are likely to be disproportionately underrepresented. Here, we describe an optimized proteomic peptide-phage display library that tiles all disordered regions of the human proteome and allows the screening of ~ 1,000,000 overlapping peptides in a single binding assay. We define guidelines for processing, filtering, and ranking the results and provide PepTools, a toolkit to annotate the identified hits. We uncovered >2,000 interaction pairs for 35 known short linear motif (SLiM)-binding domains and confirmed the quality of the produced data by complementary biophysical or cell-based assays. Finally, we show how the amino acid resolution-binding site information can be used to pinpoint functionally important disease mutations and phosphorylation events in intrinsically disordered regions of the proteome. The optimized human disorderome library paired with PepTools represents a powerful pipeline for unbiased proteome-wide discovery of SLiM-based interactions.
Collapse
Affiliation(s)
- Caroline Benz
- Department of Chemistry ‐ BMCUppsala UniversityUppsalaSweden
| | - Muhammad Ali
- Department of Chemistry ‐ BMCUppsala UniversityUppsalaSweden
| | | | | | - Ahmed Sayadi
- Department of Chemistry ‐ BMCUppsala UniversityUppsalaSweden
| | - Filip Mihalic
- Department of Medical Biochemistry and MicrobiologyUppsala UniversityUppsalaSweden
| | - Johanna Kliche
- Department of Chemistry ‐ BMCUppsala UniversityUppsalaSweden
| | - Eva Andersson
- Department of Medical Biochemistry and MicrobiologyUppsala UniversityUppsalaSweden
| | - Per Jemth
- Department of Medical Biochemistry and MicrobiologyUppsala UniversityUppsalaSweden
| | - Norman E Davey
- Division of Cancer BiologyThe Institute of Cancer ResearchLondonUK
| | - Ylva Ivarsson
- Department of Chemistry ‐ BMCUppsala UniversityUppsalaSweden
| |
Collapse
|
11
|
Structural Insights into Protein Regulation by Phosphorylation and Substrate Recognition of Protein Kinases/Phosphatases. Life (Basel) 2021; 11:life11090957. [PMID: 34575106 PMCID: PMC8467178 DOI: 10.3390/life11090957] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 09/05/2021] [Accepted: 09/10/2021] [Indexed: 12/30/2022] Open
Abstract
Protein phosphorylation is one of the most widely observed and important post-translational modification (PTM) processes. Protein phosphorylation is regulated by protein kinases, each of which covalently attaches a phosphate group to an amino acid side chain on a serine (Ser), threonine (Thr), or tyrosine (Tyr) residue of a protein, and by protein phosphatases, each of which, conversely, removes a phosphate group from a phosphoprotein. These reversible enzyme activities provide a regulatory mechanism by activating or deactivating many diverse functions of proteins in various cellular processes. In this review, their structures and substrate recognition are described and summarized, focusing on Ser/Thr protein kinases and protein Ser/Thr phosphatases, and the regulation of protein structures by phosphorylation. The studies reviewed here and the resulting information could contribute to further structural, biochemical, and combined studies on the mechanisms of protein phosphorylation and to drug discovery approaches targeting protein kinases or protein phosphatases.
Collapse
|
12
|
Malliavin TE. Tandem domain structure determination based on a systematic enumeration of conformations. Sci Rep 2021; 11:16925. [PMID: 34413388 PMCID: PMC8376923 DOI: 10.1038/s41598-021-96370-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 08/04/2021] [Indexed: 12/03/2022] Open
Abstract
Protein structure determination is undergoing a change of perspective due to the larger importance taken in biology by the disordered regions of biomolecules. In such cases, the convergence criterion is more difficult to set up and the size of the conformational space is a obstacle to exhaustive exploration. A pipeline is proposed here to exhaustively sample protein conformations using backbone angle limits obtained by nuclear magnetic resonance (NMR), and then to determine the populations of conformations. The pipeline is applied to a tandem domain of the protein whirlin. An original approach, derived from a reformulation of the Distance Geometry Problem is used to enumerate the conformations of the linker connecting the two domains. Specifically designed procedure then permit to assemble the domains to the linker conformations and to optimize the tandem domain conformations with respect to two sets of NMR measurements: residual dipolar couplings and paramagnetic resonance enhancements. The relative populations of optimized conformations are finally determined by fitting small angle X-ray scattering (SAXS) data. The most populated conformation of the tandem domain is a semi-closed one, fully closed and more extended conformations being in minority, in agreement with previous observations. The SAXS and NMR data show different influences on the determination of populations.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, Institut Pasteur, UMR 3528, CNRS, Paris, France.
- Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, USR 3756, CNRS, Paris, France.
| |
Collapse
|
13
|
Identification of PDZ Interactions by Proteomic Peptide Phage Display. Methods Mol Biol 2021. [PMID: 34014515 DOI: 10.1007/978-1-0716-1166-1_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/26/2023]
Abstract
PSD95-Disc large-Zonula occludens (PDZ) domains are among the most abundant modular domains in the human proteome. They typically bind short carboxy-terminal sequence motifs of their ligand proteins, which may be transmembrane proteins such as ion channels and GPCRs, as well as soluble proteins. The identity of the endogenous ligands of many PDZ domains remains unclear despite more than two decades of PDZ research. Combinatorial peptide phage display and bioinformatics predictions have contributed to shed light on PDZ-mediated interactions. However, the efficiency of these methods for the identification of interactions of potential biological relevance is hampered by different biases. Proteomic peptide-phage display (ProP-PD) was developed to overcome these limitations. Here we describe a ProP-PD protocol for the identification of C-terminal PDZ domain ligands. The method efficiently identifies peptide ligands within a proteome of interest, and pinpoint targets of potential biological relevance.
Collapse
|