1
|
McIvor JAP, Larsen DS, Mercadante D. Charge Relaying within a Phospho-Motif Rescue Binding Competency of a Disordered Transcription Factor. J Chem Inf Model 2024; 64:6041-6052. [PMID: 39074869 DOI: 10.1021/acs.jcim.4c00286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/31/2024]
Abstract
Structural disorder in proteins is central to cellular signaling, where conformational plasticity equips molecules to promiscuously interact with different partners. By engaging with multiple binding partners via the rearrangement of its three helices, the nuclear coactivator binding domain (NCBD) of the CBP/p300 transcription factor is a paradigmatic example of promiscuity. Recently, molecular simulations and experiments revealed that, through the establishment of long-range electrostatic interactions, intended as salt-bridges formed between the post-translationally inserted phosphate and positively charged residues in helix H3 of NCBD, phosphorylation triggers NCBD compaction, lowering its affinity for binding partners. By means of extensive molecular simulations, we here investigated the effect of short-range electrostatics on the conformational ensemble of NCBD, by monitoring the interactions between a phosphorylated serine and conserved positively charged residues within the NCBD phospho-motif. We found that empowering proximal electrostatic interactions, as opposed to long-range electrostatics, can reshape the NCBD ensemble rescuing the binding competency of phosphorylated NCBD. Given the conservation of positive charges in phospho-motifs, proximal electrostatic interactions might dampen the effects of phosphorylation and act as a relay to regulate phosphorylated intrinsically disordered proteins, ultimately tuning the binding affinity for different cellular partners.
Collapse
Affiliation(s)
- Jordan A P McIvor
- School of Chemical Sciences, The University of Auckland, 23 Symonds Street, Auckland 1010, New Zealand
| | - Danaé S Larsen
- School of Chemical Sciences, The University of Auckland, 23 Symonds Street, Auckland 1010, New Zealand
| | - Davide Mercadante
- School of Chemical Sciences, The University of Auckland, 23 Symonds Street, Auckland 1010, New Zealand
| |
Collapse
|
2
|
Kodavati M, Maloji Rao VH, Provasek VE, Hegde ML. Regulation of DNA damage response by RNA/DNA-binding proteins: Implications for neurological disorders and aging. Ageing Res Rev 2024; 100:102413. [PMID: 39032612 DOI: 10.1016/j.arr.2024.102413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 07/05/2024] [Indexed: 07/23/2024]
Abstract
RNA-binding proteins (RBPs) are evolutionarily conserved across most forms of life, with an estimated 1500 RBPs in humans. Traditionally associated with post-transcriptional gene regulation, RBPs contribute to nearly every known aspect of RNA biology, including RNA splicing, transport, and decay. In recent years, an increasing subset of RBPs have been recognized for their DNA binding properties and involvement in DNA transactions. We refer to these RBPs with well-characterized DNA binding activity as RNA/DNA binding proteins (RDBPs), many of which are linked to neurological diseases. RDBPs are associated with both nuclear and mitochondrial DNA repair. Furthermore, the presence of intrinsically disordered domains in RDBPs appears to be critical for regulating their diverse interactions and plays a key role in controlling protein aggregation, which is implicated in neurodegeneration. In this review, we discuss the emerging roles of common RDBPs from the heterogeneous nuclear ribonucleoprotein (hnRNP) family, such as TAR DNA binding protein-43 (TDP43) and fused in sarcoma (FUS) in controlling DNA damage response (DDR). We also explore the implications of RDBP pathology in aging and neurodegenerative diseases and provide a prospective on the therapeutic potential of targeting RDBP pathology mediated DDR defects for motor neuron diseases and aging.
Collapse
Affiliation(s)
- Manohar Kodavati
- Department of Neurosurgery, Center for Neuroregeneration, Houston Methodist Research Institute, Houston, TX 77047, USA.
| | - Vikas H Maloji Rao
- Department of Neurosurgery, Center for Neuroregeneration, Houston Methodist Research Institute, Houston, TX 77047, USA
| | - Vincent E Provasek
- Department of Neurosurgery, Center for Neuroregeneration, Houston Methodist Research Institute, Houston, TX 77047, USA; School of Medicine, Texas A&M University, College Station, TX 77843, USA
| | - Muralidhar L Hegde
- Department of Neurosurgery, Center for Neuroregeneration, Houston Methodist Research Institute, Houston, TX 77047, USA; School of Medicine, Texas A&M University, College Station, TX 77843, USA; Department of Neurosurgery, Weill Medical College, New York, NY 10065, USA.
| |
Collapse
|
3
|
Jankowski MS, Griffith D, Shastry DG, Pelham JF, Ginell GM, Thomas J, Karande P, Holehouse AS, Hurley JM. Disordered clock protein interactions and charge blocks turn an hourglass into a persistent circadian oscillator. Nat Commun 2024; 15:3523. [PMID: 38664421 PMCID: PMC11045787 DOI: 10.1038/s41467-024-47761-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 04/11/2024] [Indexed: 04/28/2024] Open
Abstract
Organismal physiology is widely regulated by the molecular circadian clock, a feedback loop composed of protein complexes whose members are enriched in intrinsically disordered regions. These regions can mediate protein-protein interactions via SLiMs, but the contribution of these disordered regions to clock protein interactions had not been elucidated. To determine the functionality of these disordered regions, we applied a synthetic peptide microarray approach to the disordered clock protein FRQ in Neurospora crassa. We identified residues required for FRQ's interaction with its partner protein FRH, the mutation of which demonstrated FRH is necessary for persistent clock oscillations but not repression of transcriptional activity. Additionally, the microarray demonstrated an enrichment of FRH binding to FRQ peptides with a net positive charge. We found that positively charged residues occurred in significant "blocks" within the amino acid sequence of FRQ and that ablation of one of these blocks affected both core clock timing and physiological clock output. Finally, we found positive charge clusters were a commonly shared molecular feature in repressive circadian clock proteins. Overall, our study suggests a mechanistic purpose for positive charge blocks and yielded insights into repressive arm protein roles in clock function.
Collapse
Affiliation(s)
- Meaghan S Jankowski
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Daniel Griffith
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Divya G Shastry
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Jacqueline F Pelham
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Garrett M Ginell
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Joshua Thomas
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Pankaj Karande
- Department of Chemical and Biological Engineering, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
- Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Jennifer M Hurley
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
- Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| |
Collapse
|
4
|
Alderson TR, Pritišanac I, Kolarić Đ, Moses AM, Forman-Kay JD. Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2. Proc Natl Acad Sci U S A 2023; 120:e2304302120. [PMID: 37878721 PMCID: PMC10622901 DOI: 10.1073/pnas.2304302120] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 08/30/2023] [Indexed: 10/27/2023] Open
Abstract
The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs.
Collapse
Affiliation(s)
- T. Reid Alderson
- Department of Biochemistry, University of Toronto, Toronto, ONM5S 1A8, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ONM5S 1A8, Canada
| | - Iva Pritišanac
- Department of Cell and Systems Biology, University of Toronto, Toronto, ONM5S 35G, Canada
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, ONM5G 0A4, Canada
- Department of Molecular Biology and Biochemistry, Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Medical University of Graz, Graz8010, Austria
| | - Đesika Kolarić
- Department of Molecular Biology and Biochemistry, Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Medical University of Graz, Graz8010, Austria
| | - Alan M. Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, ONM5S 35G, Canada
| | - Julie D. Forman-Kay
- Department of Biochemistry, University of Toronto, Toronto, ONM5S 1A8, Canada
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, ONM5G 0A4, Canada
| |
Collapse
|
5
|
Bhopatkar AA, Kayed R. Flanking regions, amyloid cores, and polymorphism: the potential interplay underlying structural diversity. J Biol Chem 2023; 299:105122. [PMID: 37536631 PMCID: PMC10482755 DOI: 10.1016/j.jbc.2023.105122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/10/2023] [Accepted: 07/28/2023] [Indexed: 08/05/2023] Open
Abstract
The β-sheet-rich amyloid core is the defining feature of protein aggregates associated with neurodegenerative disorders. Recent investigations have revealed that there exist multiple examples of the same protein, with the same sequence, forming a variety of amyloid cores with distinct structural characteristics. These structural variants, termed as polymorphs, are hypothesized to influence the pathological profile and the progression of different neurodegenerative diseases, giving rise to unique phenotypic differences. Thus, identifying the origin and properties of these structural variants remain a focus of studies, as a preliminary step in the development of therapeutic strategies. Here, we review the potential role of the flanking regions of amyloid cores in inducing polymorphism. These regions, adjacent to the amyloid cores, show a preponderance for being structurally disordered, imbuing them with functional promiscuity. The dynamic nature of the flanking regions can then manifest in the form of conformational polymorphism of the aggregates. We take a closer look at the sequences flanking the amyloid cores, followed by a review of the polymorphic aggregates of the well-characterized proteins amyloid-β, α-synuclein, Tau, and TDP-43. We also consider different factors that can potentially influence aggregate structure and how these regions can be viewed as novel targets for therapeutic strategies by utilizing their unique structural properties.
Collapse
Affiliation(s)
- Anukool A Bhopatkar
- Mitchell Center for Neurodegenerative Diseases, University of Texas Medical Branch, Galveston, Texas, USA; Departments of Neurology, Neuroscience and Cell Biology, University of Texas Medical Branch, Galveston, Texas, USA
| | - Rakez Kayed
- Mitchell Center for Neurodegenerative Diseases, University of Texas Medical Branch, Galveston, Texas, USA; Departments of Neurology, Neuroscience and Cell Biology, University of Texas Medical Branch, Galveston, Texas, USA.
| |
Collapse
|
6
|
Tahti EF, Blount JM, Jackson SN, Gao M, Gill NP, Smith SN, Pederson NJ, Rumph SN, Struyvenberg SA, Mackley IGP, Madden DR, Amacher JF. Additive energetic contributions of multiple peptide positions determine the relative promiscuity of viral and human sequences for PDZ domain targets. Protein Sci 2023; 32:e4611. [PMID: 36851847 PMCID: PMC10022582 DOI: 10.1002/pro.4611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 02/13/2023] [Accepted: 02/23/2023] [Indexed: 03/01/2023]
Abstract
Protein-protein interactions that involve recognition of short peptides are critical in cellular processes. Protein-peptide interaction surface areas are relatively small and shallow, and there are often overlapping specificities in families of peptide-binding domains. Therefore, dissecting selectivity determinants can be challenging. PDZ domains are a family of peptide-binding domains located in several intracellular signaling and trafficking pathways. These domains are also directly targeted by pathogens, and a hallmark of many oncogenic viral proteins is a PDZ-binding motif. However, amidst sequences that target PDZ domains, there is a wide spectrum in relative promiscuity. For example, the viral HPV16 E6 oncoprotein recognizes over double the number of PDZ domain-containing proteins as the cystic fibrosis transmembrane conductance regulator (CFTR) in the cell, despite similar PDZ targeting-sequences and identical motif residues. Here, we determine binding affinities for PDZ domains known to bind either HPV16 E6 alone or both CFTR and HPV16 E6, using peptides matching WT and hybrid sequences. We also use energy minimization to model PDZ-peptide complexes and use sequence analyses to investigate this difference. We find that while the majority of single mutations had marginal effects on overall affinity, the additive effect on the free energy of binding accurately describes the selectivity observed. Taken together, our results describe how complex and differing PDZ interactomes can be programmed in the cell.
Collapse
Affiliation(s)
- Elise F. Tahti
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Jadon M. Blount
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Sophie N. Jackson
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Melody Gao
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Nicholas P. Gill
- Department of BiochemistryGeisel School of Medicine at DartmouthHanoverNew HampshireUSA
| | - Sarah N. Smith
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Nick J. Pederson
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | | | | | - Iain G. P. Mackley
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Dean R. Madden
- Department of BiochemistryGeisel School of Medicine at DartmouthHanoverNew HampshireUSA
| | - Jeanine F. Amacher
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| |
Collapse
|
7
|
Comparison of Biomolecular Condensate Localization and Protein Phase Separation Predictors. Biomolecules 2023; 13:biom13030527. [PMID: 36979462 PMCID: PMC10046894 DOI: 10.3390/biom13030527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 03/07/2023] [Accepted: 03/10/2023] [Indexed: 03/17/2023] Open
Abstract
Research in the field of biochemistry and cellular biology has entered a new phase due to the discovery of phase separation driving the formation of biomolecular condensates, or membraneless organelles, in cells. The implications of this novel principle of cellular organization are vast and can be applied at multiple scales, spawning exciting research questions in numerous directions. Of fundamental importance are the molecular mechanisms that underly biomolecular condensate formation within cells and whether insights gained into these mechanisms provide a gateway for accurate predictions of protein phase behavior. Within the last six years, a significant number of predictors for protein phase separation and condensate localization have emerged. Herein, we compare a collection of state-of-the-art predictors on different tasks related to protein phase behavior. We show that the tested methods achieve high AUCs in the identification of biomolecular condensate drivers and scaffolds, as well as in the identification of proteins able to phase separate in vitro. However, our benchmark tests reveal that their performance is poorer when used to predict protein segments that are involved in phase separation or to classify amino acid substitutions as phase-separation-promoting or -inhibiting mutations. Our results suggest that the phenomenological approach used by most predictors is insufficient to fully grasp the complexity of the phenomenon within biological contexts and make reliable predictions related to protein phase behavior at the residue level.
Collapse
|
8
|
Functional characterization and comparative analysis of gene repression-mediating domains interacting with yeast pleiotropic corepressors Sin3, Cyc8 and Tup1. Curr Genet 2023; 69:127-139. [PMID: 36854981 PMCID: PMC10163088 DOI: 10.1007/s00294-023-01262-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/09/2023] [Accepted: 02/12/2023] [Indexed: 03/02/2023]
Abstract
Transcriptional corepressors Sin3, Cyc8 and Tup1 are important for downregulation of gene expression by recruiting various histone deacetylases once they gain access to defined genomic locations by interaction with pathway-specific repressor proteins. In this work we systematically investigated whether 17 yeast repressor proteins (Cti6, Dal80, Fkh1, Gal80, Mig1, Mot3, Nrg1, Opi1, Rdr1, Rox1, Sko1, Ume6, Ure2, Xbp1, Yhp1, Yox1 and Whi5) representing several unrelated regulatory pathways are able to bind to Sin3, Cyc8 and Tup1. Our results show that paired amphipathic helices 1 and 2 (PAH1 and PAH2) of Sin3 are functionally redundant for some regulatory pathways. WD40 domains of Tup1 proved to be sufficient for interaction with repressor proteins. Using length variants of selected repressors, we mapped corepressor interaction domains (CIDs) in vitro and assayed gene repression in vivo. Systematic comparison of CID minimal sequences allowed us to define several related positional patterns of hydrophobic amino acids some of which could be confirmed as functionally supported by site-directed mutagenesis. Although structural predictions indicated that certain CIDs may be α-helical, most repression domains appear to be randomly structured and must be considered as intrinsically disordered regions (IDR) adopting a defined conformation only by interaction with a corepressor.
Collapse
|
9
|
Computational Analysis of the Ligand-Binding Sites of the Molecular Chaperone OppA from Yersinia pseudotuberculosis. Int J Mol Sci 2023; 24:ijms24044023. [PMID: 36835435 PMCID: PMC9967938 DOI: 10.3390/ijms24044023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 01/20/2023] [Accepted: 01/23/2023] [Indexed: 02/19/2023] Open
Abstract
The function of chaperones is to correct or degrade misfolded proteins inside the cell. Classic molecular chaperones such as GroEL and DnaK have not been found in the periplasm of Yersinia pseudotuberculosis. Some periplasmic substrate-binding proteins could be bifunctional, such as OppA. Using bioinformatic tools, we try to elucidate the nature of the interactions between OppA and ligands from four proteins with different oligomeric states. Using the crystal structure of the proteins Mal12 alpha-glucosidase from Saccharomyces cerevisiae S288C, LDH rabbit muscle lactate dehydrogenase, EcoRI endonuclease from Escherichia coli and THG Geotrichum candidum lipase, a hundred models were obtained in total, including five different ligands from each enzyme with five conformations of each ligand. The best values for Mal12 stem from ligands 4 and 5, with conformation 5 for both; for LDH, ligands 1 and 4, with conformations 2 and 4, respectively; for EcoRI, ligands 3 and 5, with conformation 1 for both; and for THG, ligands 2 and 3, with conformation 1 for both. The interactions were analyzed with LigProt, and the length of the hydrogen bridges has an average of 2.8 to 3.0 Å. The interaction within the OppA pocket is energetically favored due to the formation of hydrogen bonds both of OppA and of the selected enzymes. The Asp 419 residue is important in these junctions.
Collapse
|
10
|
Tahti EF, Blount JM, Jackson SN, Gao M, Gill NP, Smith SN, Pederson NJ, Rumph SN, Struyvenberg SA, Mackley IGP, Madden DR, Amacher JF. Additive energetic contributions of multiple peptide positions determine the relative promiscuity of viral and human sequences for PDZ domain targets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2022.12.31.522388. [PMID: 36711692 PMCID: PMC9881875 DOI: 10.1101/2022.12.31.522388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Protein-protein interactions that include recognition of short sequences of amino acids, or peptides, are critical in cellular processes. Protein-peptide interaction surface areas are relatively small and shallow, and there are often overlapping specificities in families of peptide-binding domains. Therefore, dissecting selectivity determinants can be challenging. PDZ domains are an example of a peptide-binding domain located in several intracellular signaling and trafficking pathways, which form interactions critical for the regulation of receptor endocytic trafficking, tight junction formation, organization of supramolecular complexes in neurons, and other biological systems. These domains are also directly targeted by pathogens, and a hallmark of many oncogenic viral proteins is a PDZ-binding motif. However, amidst sequences that target PDZ domains, there is a wide spectrum in relative promiscuity. For example, the viral HPV16 E6 oncoprotein recognizes over double the number of PDZ domain-containing proteins as the cystic fibrosis transmembrane conductance regulator (CFTR) in the cell, despite similar PDZ targeting-sequences and identical motif residues. Here, we determine binding affinities for PDZ domains known to bind either HPV16 E6 alone or both CFTR and HPV16 E6, using peptides matching WT and hybrid sequences. We also use energy minimization to model PDZ-peptide complexes and use sequence analyses to investigate this difference. We find that while the majority of single mutations had a marginal effect on overall affinity, the additive effect on the free energy of binding accurately describes the selectivity observed. Taken together, our results describe how complex and differing PDZ interactomes can be programmed in the cell.
Collapse
Affiliation(s)
- Elise F. Tahti
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Jadon M. Blount
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Sophie N. Jackson
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Melody Gao
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Nicholas P. Gill
- Department of Biochemistry, Geisel School of Medicine at Dartmouth, Hanover, NH, USA
| | - Sarah N. Smith
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Nick J. Pederson
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Simone N. Rumph
- Department of Biochemistry, Bowdoin College, Brunswick, ME, USA
| | | | - Iain G. P. Mackley
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| | - Dean R. Madden
- Department of Biochemistry, Geisel School of Medicine at Dartmouth, Hanover, NH, USA
| | - Jeanine F. Amacher
- Department of Chemistry, Western Washington University, Bellingham, WA, USA
| |
Collapse
|
11
|
Patil A. Enrichment patterns of intrinsic disorder in proteins. Biophys Rev 2022; 14:1487-1493. [PMID: 36659984 PMCID: PMC9842814 DOI: 10.1007/s12551-022-01016-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 11/07/2022] [Indexed: 11/21/2022] Open
Abstract
Intrinsically disordered regions in proteins have been shown to be important in protein function. However, not all proteins contain the same amount of intrinsic disorder. The variation in the levels of intrinsic disorder in different types of proteins has been extensively studied over the last two decades. It is now known that the levels of intrinsic disorder vary in proteins across organisms, functions, diseases, and cellular locations. This review consolidates the known trends in the abundance of intrinsic disorder identified in groups of proteins across varying conditions and functions. It also presents new data towards the understanding of intrinsic disorder in cell type-specific proteins. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-022-01016-7.
Collapse
Affiliation(s)
- Ashwini Patil
- Combinatics Inc., 2-2-6 Sugano, Ichikawa-Shi, Chiba, 272-0824 Japan
| |
Collapse
|
12
|
Holguin-Cruz JA, Foster LJ, Gsponer J. Where protein structure and cell diversity meet. Trends Cell Biol 2022; 32:996-1007. [PMID: 35537902 DOI: 10.1016/j.tcb.2022.04.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 04/08/2022] [Accepted: 04/12/2022] [Indexed: 01/21/2023]
Abstract
Protein-protein interaction networks - interactomes - are charted with the hope to understand how phenotypes emerge and how they are altered in disease states. Early efforts to map interactomes have focused on the assembly of context agnostic, reference networks. However, recent studies have mapped interactomes across different cell lines and tissues, finding highly variable interactomes due to the rewiring of protein-protein interactions in different contexts. Increasing evidence points to significant links between protein structure and interactome diversity seen across cell types and tissues. We discuss how recent findings support the key role of alternative splicing and phosphorylation, two well-established regulators of protein structural and functional diversity, in defining cell type- and tissue-specific interactomes. Moreover, we show that intrinsically disordered protein regions are most favorably equipped to support interactome rewiring by acting as hubs of protein structure and function regulation.
Collapse
Affiliation(s)
- Jorge A Holguin-Cruz
- Michael Smith Laboratories, Department of Biochemistry and Molecular Biology, The University of British Columbia, Vancouver, Canada
| | - Leonard J Foster
- Michael Smith Laboratories, Department of Biochemistry and Molecular Biology, The University of British Columbia, Vancouver, Canada
| | - Jörg Gsponer
- Michael Smith Laboratories, Department of Biochemistry and Molecular Biology, The University of British Columbia, Vancouver, Canada.
| |
Collapse
|
13
|
Samulevich ML, Shamilov R, Aneskievich BJ. Thermostable Proteins from HaCaT Keratinocytes Identify a Wide Breadth of Intrinsically Disordered Proteins and Candidates for Liquid-Liquid Phase Separation. Int J Mol Sci 2022; 23:ijms232214323. [PMID: 36430801 PMCID: PMC9692912 DOI: 10.3390/ijms232214323] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 11/08/2022] [Accepted: 11/16/2022] [Indexed: 11/22/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) move through an ensemble of conformations which allows multitudinous roles within a cell. Keratinocytes, the predominant cell type in mammalian epidermis, have had only a few individual proteins assessed for intrinsic disorder and its possible contribution to liquid-liquid phase separation (LLPS), especially in regard to what functions or structures these proteins provide. We took a holistic approach to keratinocyte IDPs starting with enrichment via the isolation of thermostable proteins. The keratinocyte protein involucrin, known for its resistance to heat denaturation, served as a marker. It and other thermostable proteins were identified by liquid chromatography tandem mass spectrometry and subjected to extensive bioinformatic analysis covering gene ontology, intrinsic disorder, and potential for LLPS. Numerous proteins unique to keratinocytes and other proteins with shared expression in multiple cell types were identified to have IDP traits (e.g., compositional bias, nucleic acid binding, and repeat motifs). Among keratinocyte-specific proteins, many that co-assemble with involucrin into the cell-specific structure known as the cornified envelope scored highly for intrinsic disorder and potential for LLPS. This suggests intrinsic disorder and LLPS are previously unrecognized traits for assembly of the cornified envelope, echoing the contribution of intrinsic disorder and LLPS to more widely encountered features such as stress granules and PML bodies.
Collapse
Affiliation(s)
- Michael L. Samulevich
- Graduate Program in Pharmacology & Toxicology, Department of Pharmaceutical Sciences, University of Connecticut, 69 North Eagleville Road, Storrs, CT 06292-3092, USA
| | - Rambon Shamilov
- Graduate Program in Pharmacology & Toxicology, Department of Pharmaceutical Sciences, University of Connecticut, 69 North Eagleville Road, Storrs, CT 06292-3092, USA
| | - Brian J. Aneskievich
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Connecticut, 69 North Eagleville Road, Storrs, CT 06269-3092, USA
- Correspondence: ; Tel.: +1-860-486-3053; Fax: +1-860-486-5792
| |
Collapse
|
14
|
Soleymani F, Paquet E, Viktor H, Michalowski W, Spinello D. Protein-protein interaction prediction with deep learning: A comprehensive review. Comput Struct Biotechnol J 2022; 20:5316-5341. [PMID: 36212542 PMCID: PMC9520216 DOI: 10.1016/j.csbj.2022.08.070] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/29/2022] [Accepted: 08/30/2022] [Indexed: 11/15/2022] Open
Abstract
Most proteins perform their biological function by interacting with themselves or other molecules. Thus, one may obtain biological insights into protein functions, disease prevalence, and therapy development by identifying protein-protein interactions (PPI). However, finding the interacting and non-interacting protein pairs through experimental approaches is labour-intensive and time-consuming, owing to the variety of proteins. Hence, protein-protein interaction and protein-ligand binding problems have drawn attention in the fields of bioinformatics and computer-aided drug discovery. Deep learning methods paved the way for scientists to predict the 3-D structure of proteins from genomes, predict the functions and attributes of a protein, and modify and design new proteins to provide desired functions. This review focuses on recent deep learning methods applied to problems including predicting protein functions, protein-protein interaction and their sites, protein-ligand binding, and protein design.
Collapse
Affiliation(s)
- Farzan Soleymani
- Department of Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
| | - Eric Paquet
- National Research Council, 1200 Montreal Road, Ottawa, ON K1A 0R6, Canada
| | - Herna Viktor
- School of Electrical Engineering and Computer Science, University of Ottawa, ON, Canada
| | | | - Davide Spinello
- Department of Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
15
|
Choi J, Kim R, Koh J. Quantitative Frameworks for Multivalent Macromolecular Interactions in Biological Linear Lattice Systems. Mol Cells 2022; 45:444-453. [PMID: 35754369 PMCID: PMC9260134 DOI: 10.14348/molcells.2022.0035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 03/27/2022] [Accepted: 03/28/2022] [Indexed: 11/29/2022] Open
Abstract
Multivalent macromolecular interactions underlie dynamic regulation of diverse biological processes in ever-changing cellular states. These interactions often involve binding of multiple proteins to a linear lattice including intrinsically disordered proteins and the chromosomal DNA with many repeating recognition motifs. Quantitative understanding of such multivalent interactions on a linear lattice is crucial for exploring their unique regulatory potentials in the cellular processes. In this review, the distinctive molecular features of the linear lattice system are first discussed with a particular focus on the overlapping nature of potential protein binding sites within a lattice. Then, we introduce two general quantitative frameworks, combinatorial and conditional probability models, dealing with the overlap problem and relating the binding parameters to the experimentally measurable properties of the linear lattice-protein interactions. To this end, we present two specific examples where the quantitative models have been applied and further extended to provide biological insights into specific cellular processes. In the first case, the conditional probability model was extended to highlight the significant impact of nonspecific binding of transcription factors to the chromosomal DNA on gene-specific transcriptional activities. The second case presents the recently developed combinatorial models to unravel the complex organization of target protein binding sites within an intrinsically disordered region (IDR) of a nucleoporin. In particular, these models have suggested a unique function of IDRs as a molecular switch coupling distinct cellular processes. The quantitative models reviewed here are envisioned to further advance for dissection and functional studies of more complex systems including phase-separated biomolecular condensates.
Collapse
Affiliation(s)
- Jaejun Choi
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Ryeonghyeon Kim
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Junseock Koh
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| |
Collapse
|
16
|
Quaglia F, Hatos A, Salladini E, Piovesan D, Tosatto SCE. Exploring Manually Curated Annotations of Intrinsically Disordered Proteins with DisProt. Curr Protoc 2022; 2:e484. [PMID: 35789137 DOI: 10.1002/cpz1.484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
DisProt is the major repository of manually curated data for intrinsically disordered proteins collected from the literature. Although lacking a stable three-dimensional structure under physiological conditions, intrinsically disordered proteins carry out a plethora of biological functions, some of them directly arising from their flexible nature. A growing number of scientific studies have been published during the last few decades to shed light on their unstructured state, their binding modes, and their functions. DisProt makes use of a team of expert biocurators to provide up-to-date annotations of intrinsically disordered proteins from the literature, making them available to the scientific community. Here we present a comprehensive description on how to use DisProt in different contexts and provide a detailed explanation of how to explore and interpret manually curated annotations of intrinsically disordered proteins. We describe how to search DisProt annotations, both using the web interface and the API for programmatic access. Finally, we explain how to visualize and interpret a DisProt entry, the SARS-CoV-2 Nucleoprotein, characterized by the presence of unstructured N-terminal and C-terminal regions and a flexible linker. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Performing a search in DisProt Support Protocol 1: Downloading options Support Protocol 2: Programmatic access with DisProt REST API Basic Protocol 2: Exploring the DisProt Ontology page Basic Protocol 3: Visualizing and interpreting DisProt entries-the SARS-CoV-2 Nucleoprotein use case.
Collapse
Affiliation(s)
- Federica Quaglia
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), Bari, Italy
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - András Hatos
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Edoardo Salladini
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | |
Collapse
|
17
|
Garg A, Dabburu GR, Singhal N, Kumar M. Investigating the disordered regions (MoRFs, SLiMs and LCRs) and functions of mimicry proteins/peptides in silico. PLoS One 2022; 17:e0265657. [PMID: 35421114 PMCID: PMC9009644 DOI: 10.1371/journal.pone.0265657] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 03/04/2022] [Indexed: 11/24/2022] Open
Abstract
Microbial mimicry of the host proteins/peptides can elicit host auto-reactive T- or B-cells resulting in autoimmune disease(s). Since intrinsically disordered protein regions (IDPRs) are involved in several host cell signaling and PPI networks, molecular mimicry of the IDPRs can help the pathogens in substituting their own proteins in the host cell-signaling and PPI networks and, ultimately hijacking the host cellular machinery. Thus, the present study was conducted to discern the structural disorder and intrinsically disordered protein regions (IDPRs) like, molecular recognition features (MoRFs), short linear motifs (SLiMs), and low complexity regions (LCRs) in the experimentally verified mimicry proteins and peptides (mimitopes) of bacteria, viruses and host. Also, functional characteristics of the mimicry proteins were studied in silico. Our results indicated that 78% of the bacterial host mimicry proteins and 45% of the bacterial host mimitopes were moderately/highly disordered while, 73% of the viral host mimicry proteins and 31% of the viral host mimitopes were moderately/highly disordered. Among the pathogens, 27% of the bacterial mimicry proteins and 13% of the bacterial mimitopes were moderately/highly disordered while, 53% of the viral mimicry proteins and 21% of the viral mimitopes were moderately/highly disordered. Though IDPR were frequent in host, bacterial and viral mimicry proteins, only a few mimitopes overlapped with the IDPRs like, MoRFs, SLiMs and LCRs. This suggests that most of the microbes cannot use molecular mimicry to modulate the host PPIs and hijack the host cell machinery. Functional analyses indicated that most of the pathogens exhibited mimicry with the host proteins involved in ion binding and signaling pathways. This is the first report on the disordered regions and functional aspects of experimentally proven host and microbial mimicry proteins.
Collapse
Affiliation(s)
- Anjali Garg
- Department of Biophysics, University of Delhi South Campus, New Delhi, India
| | - Govinda Rao Dabburu
- Department of Biophysics, University of Delhi South Campus, New Delhi, India
| | - Neelja Singhal
- Department of Biophysics, University of Delhi South Campus, New Delhi, India
- * E-mail: (MK); (NS)
| | - Manish Kumar
- Department of Biophysics, University of Delhi South Campus, New Delhi, India
- * E-mail: (MK); (NS)
| |
Collapse
|
18
|
Ghadie MA, Xia Y. Are transient protein-protein interactions more dispensable? PLoS Comput Biol 2022; 18:e1010013. [PMID: 35404956 PMCID: PMC9000134 DOI: 10.1371/journal.pcbi.1010013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2021] [Accepted: 03/11/2022] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions (PPIs) are key drivers of cell function and evolution. While it is widely assumed that most permanent PPIs are important for cellular function, it remains unclear whether transient PPIs are equally important. Here, we estimate and compare dispensable content among transient PPIs and permanent PPIs in human. Starting with a human reference interactome mapped by experiments, we construct a human structural interactome by building three-dimensional structural models for PPIs, and then distinguish transient PPIs from permanent PPIs using several structural and biophysical properties. We map common mutations from healthy individuals and disease-causing mutations onto the structural interactome, and perform structure-based calculations of the probabilities for common mutations (assumed to be neutral) and disease mutations (assumed to be mildly deleterious) to disrupt transient PPIs and permanent PPIs. Using Bayes' theorem we estimate that a similarly small fraction (<~20%) of both transient and permanent PPIs are completely dispensable, i.e., effectively neutral upon disruption. Hence, transient and permanent interactions are subject to similarly strong selective constraints in the human interactome.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Canada
| |
Collapse
|
19
|
Kulkarni P, Bhattacharya S, Achuthan S, Behal A, Jolly MK, Kotnala S, Mohanty A, Rangarajan G, Salgia R, Uversky V. Intrinsically Disordered Proteins: Critical Components of the Wetware. Chem Rev 2022; 122:6614-6633. [PMID: 35170314 PMCID: PMC9250291 DOI: 10.1021/acs.chemrev.1c00848] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Despite the wealth of knowledge gained about intrinsically disordered proteins (IDPs) since their discovery, there are several aspects that remain unexplored and, hence, poorly understood. A living cell is a complex adaptive system that can be described as a wetware─a metaphor used to describe the cell as a computer comprising both hardware and software and attuned to logic gates─capable of "making" decisions. In this focused Review, we discuss how IDPs, as critical components of the wetware, influence cell-fate decisions by wiring protein interaction networks to keep them minimally frustrated. Because IDPs lie between order and chaos, we explore the possibility that they can be modeled as attractors. Further, we discuss how the conformational dynamics of IDPs manifests itself as conformational noise, which can potentially amplify transcriptional noise to stochastically switch cellular phenotypes. Finally, we explore the potential role of IDPs in prebiotic evolution, in forming proteinaceous membrane-less organelles, in the origin of multicellularity, and in protein conformation-based transgenerational inheritance of acquired characteristics. Together, these ideas provide a new conceptual framework to discern how IDPs may perform critical biological functions despite their lack of structure.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Supriyo Bhattacharya
- Integrative Genomics Core, City of Hope National Medical Center, Duarte, CA, USA
| | - Srisairam Achuthan
- Division of Research Informatics, Center for Informatics, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Amita Behal
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Sourabh Kotnala
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
- Center for Neuroscience, Indian Institute of Science, Bangalore 560012, India
| | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Vladimir Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Institutskiy pereulok, 9, Dolgoprudny, Moscow region 141700, Russia
| |
Collapse
|
20
|
Sučec I, Bersch B, Schanda P. How do Chaperones Bind (Partly) Unfolded Client Proteins? Front Mol Biosci 2021; 8:762005. [PMID: 34760928 PMCID: PMC8573040 DOI: 10.3389/fmolb.2021.762005] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 10/06/2021] [Indexed: 01/03/2023] Open
Abstract
Molecular chaperones are central to cellular protein homeostasis. Dynamic disorder is a key feature of the complexes of molecular chaperones and their client proteins, and it facilitates the client release towards a folded state or the handover to downstream components. The dynamic nature also implies that a given chaperone can interact with many different client proteins, based on physico-chemical sequence properties rather than on structural complementarity of their (folded) 3D structure. Yet, the balance between this promiscuity and some degree of client specificity is poorly understood. Here, we review recent atomic-level descriptions of chaperones with client proteins, including chaperones in complex with intrinsically disordered proteins, with membrane-protein precursors, or partially folded client proteins. We focus hereby on chaperone-client interactions that are independent of ATP. The picture emerging from these studies highlights the importance of dynamics in these complexes, whereby several interaction types, not only hydrophobic ones, contribute to the complex formation. We discuss these features of chaperone-client complexes and possible factors that may contribute to this balance of promiscuity and specificity.
Collapse
Affiliation(s)
- Iva Sučec
- CEA, CNRS, Institut de Biologie Structurale (IBS), Univ. Grenoble Alpes, Grenoble, France
| | - Beate Bersch
- CEA, CNRS, Institut de Biologie Structurale (IBS), Univ. Grenoble Alpes, Grenoble, France
| | - Paul Schanda
- CEA, CNRS, Institut de Biologie Structurale (IBS), Univ. Grenoble Alpes, Grenoble, France.,Institute of Science and Technology Austria, Klosterneuburg, Austria
| |
Collapse
|
21
|
Chio US, Liu Y, Chung S, Shim WJ, Chandrasekar S, Weiss S, Shan SO. Subunit cooperation in the Get1/2 receptor promotes tail-anchored membrane protein insertion. J Cell Biol 2021; 220:212681. [PMID: 34614151 PMCID: PMC8530227 DOI: 10.1083/jcb.202103079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 08/03/2021] [Accepted: 08/19/2021] [Indexed: 11/29/2022] Open
Abstract
The guided entry of tail-anchored protein (GET) pathway, in which the Get3 ATPase delivers an essential class of tail-anchored membrane proteins (TAs) to the Get1/2 receptor at the endoplasmic reticulum, provides a conserved mechanism for TA biogenesis in eukaryotic cells. The membrane-associated events of this pathway remain poorly understood. Here we show that complex assembly between the cytosolic domains (CDs) of Get1 and Get2 strongly enhances the affinity of the individual subunits for Get3•TA, thus enabling efficient capture of the targeting complex. In addition to the known role of Get1CD in remodeling Get3 conformation, two molecular recognition features (MoRFs) in Get2CD induce Get3 opening, and both subunits are required for optimal TA release from Get3. Mutation of the MoRFs attenuates TA insertion into the ER in vivo. Our results demonstrate extensive cooperation between the Get1/2 receptor subunits in the capture and remodeling of the targeting complex, and emphasize the role of MoRFs in receptor function during membrane protein biogenesis.
Collapse
Affiliation(s)
- Un Seng Chio
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA
| | - Yumeng Liu
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA
| | - SangYoon Chung
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA
| | - Woo Jun Shim
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA
| | - Sowmya Chandrasekar
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA
| | - Shimon Weiss
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA.,Department of Physics, Institute for Nanotechnology and Advanced Materials, Bar-Ilan University, Ramat-Gan, Israel
| | - Shu-Ou Shan
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA
| |
Collapse
|
22
|
Morris OM, Torpey JH, Isaacson RL. Intrinsically disordered proteins: modes of binding with emphasis on disordered domains. Open Biol 2021; 11:210222. [PMID: 34610267 PMCID: PMC8492171 DOI: 10.1098/rsob.210222] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Our notions of protein function have long been determined by the protein structure-function paradigm. However, the idea that protein function is dictated by a prerequisite complementarity of shapes at the binding interface is becoming increasingly challenged. Interactions involving intrinsically disordered proteins (IDPs) have indicated a significant degree of disorder present in the bound state, ranging from static disorder to complete disorder, termed 'random fuzziness'. This review assesses the anatomy of an IDP and relates how its intrinsic properties permit promiscuity and allow for the various modes of interaction. Furthermore, a mechanistic overview of the types of disordered domains is detailed, while also relating to a recent example and the kinetic and thermodynamic principles governing its formation.
Collapse
Affiliation(s)
- Owen Michael Morris
- Department of Chemistry, Faculty of Natural, Mathematical and Engineering Sciences, King's College London, Britannia House, 7 Trinity Street, London SE1 1DB, UK
| | - James Hilary Torpey
- Department of Chemistry, Faculty of Natural, Mathematical and Engineering Sciences, King's College London, Britannia House, 7 Trinity Street, London SE1 1DB, UK
| | - Rivka Leah Isaacson
- Department of Chemistry, Faculty of Natural, Mathematical and Engineering Sciences, King's College London, Britannia House, 7 Trinity Street, London SE1 1DB, UK
| |
Collapse
|
23
|
Nassar R, Dignon GL, Razban RM, Dill KA. The Protein Folding Problem: The Role of Theory. J Mol Biol 2021; 433:167126. [PMID: 34224747 PMCID: PMC8547331 DOI: 10.1016/j.jmb.2021.167126] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 06/21/2021] [Accepted: 06/26/2021] [Indexed: 10/20/2022]
Abstract
The protein folding problem was first articulated as question of how order arose from disorder in proteins: How did the various native structures of proteins arise from interatomic driving forces encoded within their amino acid sequences, and how did they fold so fast? These matters have now been largely resolved by theory and statistical mechanics combined with experiments. There are general principles. Chain randomness is overcome by solvation-based codes. And in the needle-in-a-haystack metaphor, native states are found efficiently because protein haystacks (conformational ensembles) are funnel-shaped. Order-disorder theory has now grown to encompass a large swath of protein physical science across biology.
Collapse
Affiliation(s)
- Roy Nassar
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA; Department of Chemistry, Stony Brook University, Stony Brook, NY, USA
| | - Gregory L Dignon
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| | - Rostam M Razban
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| | - Ken A Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA; Department of Chemistry, Stony Brook University, Stony Brook, NY, USA; Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY, USA.
| |
Collapse
|
24
|
Malliavin TE. Tandem domain structure determination based on a systematic enumeration of conformations. Sci Rep 2021; 11:16925. [PMID: 34413388 PMCID: PMC8376923 DOI: 10.1038/s41598-021-96370-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 08/04/2021] [Indexed: 12/03/2022] Open
Abstract
Protein structure determination is undergoing a change of perspective due to the larger importance taken in biology by the disordered regions of biomolecules. In such cases, the convergence criterion is more difficult to set up and the size of the conformational space is a obstacle to exhaustive exploration. A pipeline is proposed here to exhaustively sample protein conformations using backbone angle limits obtained by nuclear magnetic resonance (NMR), and then to determine the populations of conformations. The pipeline is applied to a tandem domain of the protein whirlin. An original approach, derived from a reformulation of the Distance Geometry Problem is used to enumerate the conformations of the linker connecting the two domains. Specifically designed procedure then permit to assemble the domains to the linker conformations and to optimize the tandem domain conformations with respect to two sets of NMR measurements: residual dipolar couplings and paramagnetic resonance enhancements. The relative populations of optimized conformations are finally determined by fitting small angle X-ray scattering (SAXS) data. The most populated conformation of the tandem domain is a semi-closed one, fully closed and more extended conformations being in minority, in agreement with previous observations. The SAXS and NMR data show different influences on the determination of populations.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, Institut Pasteur, UMR 3528, CNRS, Paris, France.
- Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, USR 3756, CNRS, Paris, France.
| |
Collapse
|
25
|
He H, Zhou Y, Chi Y, He J. Prediction of MoRFs based on sequence properties and convolutional neural networks. BioData Min 2021; 14:39. [PMID: 34391457 PMCID: PMC8364704 DOI: 10.1186/s13040-021-00275-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 08/08/2021] [Indexed: 12/02/2022] Open
Abstract
Background Intrinsically disordered proteins possess flexible 3-D structures, which makes them play an important role in a variety of biological functions. Molecular recognition features (MoRFs) act as an important type of functional regions, which are located within longer intrinsically disordered regions and undergo disorder-to-order transitions upon binding their interaction partners. Results We develop a method, MoRFCNN, to predict MoRFs based on sequence properties and convolutional neural networks (CNNs). The sequence properties contain structural and physicochemical properties which are used to describe the differences between MoRFs and non-MoRFs. Especially, to highlight the correlation between the target residue and adjacent residues, three windows are selected to preprocess the selected properties. After that, these calculated properties are combined into the feature matrix to predict MoRFs through the constructed CNN. Comparing with other existing methods, MoRFCNN obtains better performance. Conclusions MoRFCNN is a new individual MoRFs prediction method which just uses protein sequence properties without evolutionary information. The simulation results show that MoRFCNN is effective and competitive.
Collapse
Affiliation(s)
- Hao He
- School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, China
| | - Yatong Zhou
- School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, China.
| | - Yue Chi
- School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, China
| | - Jingfei He
- School of Electronic and Information Engineering, Hebei University of Technology, Tianjin, China
| |
Collapse
|
26
|
Oldfield CJ, Peng Z, Kurgan L. Disordered RNA-Binding Region Prediction with DisoRDPbind. Methods Mol Biol 2021; 2106:225-239. [PMID: 31889261 DOI: 10.1007/978-1-0716-0231-7_14] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
RNA chaperone activity is one of the many functions of intrinsically disordered regions (IDRs). IDRs function without the prerequisite of a stable structure. Instead, their functions arise from structural ensembles. A common theme in IDR function is molecular recognition; IDRs mediate interactions with other proteins, RNA, and DNA. Many computational methods are available to predict IDRs from protein sequence, but relatively few are available for predicting IDR functions. Available methods primarily focus on protein-protein interactions. DisoRDPbind was developed to predict several protein functions including interactions with RNA. This method is available as a user-friendly web interface, located at http://biomine.cs.vcu.edu/servers/DisoRDPbind/ . The development and architecture of DisoRDPbind is briefly presented, and its accuracy relative to other RNA-binding residue predictors is discussed. We explain usage of the web interface in detail and provide an example of prediction results and interpretation. While DisoRDPbind does not identify RNA chaperones directly, we provide a case study of an RNA chaperone, HCV core protein, as an example of the method's utility in the study of RNA chaperones.
Collapse
Affiliation(s)
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, People's Republic of China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
27
|
On the specificity of protein-protein interactions in the context of disorder. Biochem J 2021; 478:2035-2050. [PMID: 34101805 PMCID: PMC8203207 DOI: 10.1042/bcj20200828] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 05/14/2021] [Accepted: 05/17/2021] [Indexed: 02/07/2023]
Abstract
With the increased focus on intrinsically disordered proteins (IDPs) and their large interactomes, the question about their specificity — or more so on their multispecificity — arise. Here we recapitulate how specificity and multispecificity are quantified and address through examples if IDPs in this respect differ from globular proteins. The conclusion is that quantitatively, globular proteins and IDPs are similar when it comes to specificity. However, compared with globular proteins, IDPs have larger interactome sizes, a phenomenon that is further enabled by their flexibility, repetitive binding motifs and propensity to adapt to different binding partners. For IDPs, this adaptability, interactome size and a higher degree of multivalency opens for new interaction mechanisms such as facilitated exchange through trimer formation and ultra-sensitivity via threshold effects and ensemble redistribution. IDPs and their interactions, thus, do not compromise the definition of specificity. Instead, it is the sheer size of their interactomes that complicates its calculation. More importantly, it is this size that challenges how we conceptually envision, interpret and speak about their specificity.
Collapse
|
28
|
Schreiber KJ, Lewis JD. Identification of a Putative DNA-Binding Protein in Arabidopsis That Acts as a Susceptibility Hub and Interacts With Multiple Pseudomonas syringae Effectors. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2021; 34:410-425. [PMID: 33373263 DOI: 10.1094/mpmi-10-20-0291-r] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Phytopathogens use secreted effector proteins to suppress host immunity and promote pathogen virulence, and there is increasing evidence that the host-pathogen interactome comprises a complex network. To identify novel interactors of the Pseudomonas syringae effector HopZ1a, we performed a yeast two-hybrid screen that identified a previously uncharacterized Arabidopsis protein that we designate HopZ1a interactor 1 (ZIN1). Additional analyses in yeast and in planta revealed that ZIN1 also interacts with several other P. syringae effectors. We show that an Arabidopsis loss-of-function zin1 mutant is less susceptible to infection by certain strains of P. syringae, while overexpression of ZIN1 results in enhanced susceptibility. Functionally, ZIN1 exhibits topoisomerase-like activity in vitro. Transcriptional profiling of wild-type and zin1 Arabidopsis plants inoculated with P. syringae indicated that while ZIN1 regulates a wide range of pathogen-responsive biological processes, the list of genes more highly expressed in zin1 versus wild-type plants is particularly enriched for ribosomal protein genes. Altogether, these data illuminate ZIN1 as a potential susceptibility hub that interacts with multiple effectors to influence the outcome of plant-microbe interactions.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
Affiliation(s)
- Karl J Schreiber
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA 94720-3102, U.S.A
| | - Jennifer D Lewis
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA 94720-3102, U.S.A
- Plant Gene Expression Center, United States Department of Agriculture, Albany, CA 94710-1105, U.S.A
| |
Collapse
|
29
|
Schreiber KJ, Hassan JA, Lewis JD. Arabidopsis Abscisic Acid Repressor 1 is a susceptibility hub that interacts with multiple Pseudomonas syringae effectors. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 105:1274-1292. [PMID: 33289145 DOI: 10.1111/tpj.15110] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 11/20/2020] [Accepted: 11/23/2020] [Indexed: 06/12/2023]
Abstract
Pathogens secrete effector proteins into host cells to suppress host immunity and promote pathogen virulence, although many features at the molecular interface of host-pathogen interactions remain to be characterized. In a yeast two-hybrid assay, we found that the Pseudomonas syringae effector HopZ1a interacts with the Arabidopsis transcriptional regulator Abscisic Acid Repressor 1 (ABR1). Further analysis revealed that ABR1 interacts with multiple P. syringae effectors, suggesting that it may be targeted as a susceptibility hub. Indeed, loss-of-function abr1 mutants exhibit reduced susceptibility to a number of P. syringae strains. The ABR1 protein comprises a conserved APETALA2 (AP2) domain flanked by long regions of predicted structural disorder. We verified the DNA-binding activity of the AP2 domain and demonstrated that the disordered domains act redundantly to enhance DNA binding and to facilitate transcriptional activation by ABR1. Finally, we compared gene expression profiles from wild-type and abr1 plants following inoculation with P. syringae, which suggested that the reduced susceptibility of abr1 mutants is due to the loss of a virulence target rather than an enhanced immune response. These data highlight ABR1 as a functionally important component at the host-pathogen interface.
Collapse
Affiliation(s)
- Karl J Schreiber
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Jana A Hassan
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Jennifer D Lewis
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
- United States Department of Agriculture, Plant Gene Expression Center, Albany, CA, USA
| |
Collapse
|
30
|
Bugge K, Staby L, Salladini E, Falbe-Hansen RG, Kragelund BB, Skriver K. αα-Hub domains and intrinsically disordered proteins: A decisive combo. J Biol Chem 2021; 296:100226. [PMID: 33361159 PMCID: PMC7948954 DOI: 10.1074/jbc.rev120.012928] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 12/22/2020] [Accepted: 12/22/2020] [Indexed: 01/02/2023] Open
Abstract
Hub proteins are central nodes in protein-protein interaction networks with critical importance to all living organisms. Recently, a new group of folded hub domains, the αα-hubs, was defined based on a shared αα-hairpin supersecondary structural foundation. The members PAH, RST, TAFH, NCBD, and HHD are found in large proteins such as Sin3, RCD1, TAF4, CBP, and harmonin, which organize disordered transcriptional regulators and membrane scaffolds in interactomes of importance to human diseases and plant quality. In this review, studies of structures, functions, and complexes across the αα-hubs are described and compared to provide a unified description of the group. This analysis expands the associated molecular concepts of "one domain-one binding site", motif-based ligand binding, and coupled folding and binding of intrinsically disordered ligands to additional concepts of importance to signal fidelity. These include context, motif reversibility, multivalency, complex heterogeneity, synergistic αα-hub:ligand folding, accessory binding sites, and supramodules. We propose that these multifaceted protein-protein interaction properties are made possible by the characteristics of the αα-hub fold, including supersite properties, dynamics, variable topologies, accessory helices, and malleability and abetted by adaptability of the disordered ligands. Critically, these features provide additional filters for specificity. With the presentations of new concepts, this review opens for new research questions addressing properties across the group, which are driven from concepts discovered in studies of the individual members. Combined, the members of the αα-hubs are ideal models for deconvoluting signal fidelity maintained by folded hubs and their interactions with intrinsically disordered ligands.
Collapse
Affiliation(s)
- Katrine Bugge
- REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Lasse Staby
- REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Edoardo Salladini
- REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Rasmus G Falbe-Hansen
- REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Birthe B Kragelund
- REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Karen Skriver
- REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
31
|
Jamasb AR, Day B, Cangea C, Liò P, Blundell TL. Deep Learning for Protein-Protein Interaction Site Prediction. Methods Mol Biol 2021; 2361:263-288. [PMID: 34236667 DOI: 10.1007/978-1-0716-1641-3_16] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Protein-protein interactions (PPIs) are central to cellular functions. Experimental methods for predicting PPIs are well developed but are time and resource expensive and suffer from high false-positive error rates at scale. Computational prediction of PPIs is highly desirable for a mechanistic understanding of cellular processes and offers the potential to identify highly selective drug targets. In this chapter, details of developing a deep learning approach to predicting which residues in a protein are involved in forming a PPI-a task known as PPI site prediction-are outlined. The key decisions to be made in defining a supervised machine learning project in this domain are here highlighted. Alternative training regimes for deep learning models to address shortcomings in existing approaches and provide starting points for further research are discussed. This chapter is written to serve as a companion to developing deep learning approaches to protein-protein interaction site prediction, and an introduction to developing geometric deep learning projects operating on protein structure graphs.
Collapse
Affiliation(s)
- Arian R Jamasb
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK.,Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Ben Day
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| | - Cătălina Cangea
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| | - Pietro Liò
- Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
32
|
Seoane B, Carbone A. The complexity of protein interactions unravelled from structural disorder. PLoS Comput Biol 2021; 17:e1008546. [PMID: 33417598 PMCID: PMC7846008 DOI: 10.1371/journal.pcbi.1008546] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 01/29/2021] [Accepted: 11/18/2020] [Indexed: 11/19/2022] Open
Abstract
The importance of unstructured biology has quickly grown during the last decades accompanying the explosion of the number of experimentally resolved protein structures. The idea that structural disorder might be a novel mechanism of protein interaction is widespread in the literature, although the number of statistically significant structural studies supporting this idea is surprisingly low. At variance with previous works, our conclusions rely exclusively on a large-scale analysis of all the 134337 X-ray crystallographic structures of the Protein Data Bank averaged over clusters of almost identical protein sequences. In this work, we explore the complexity of the organisation of all the interaction interfaces observed when a protein lies in alternative complexes, showing that interfaces progressively add up in a hierarchical way, which is reflected in a logarithmic law for the size of the union of the interface regions on the number of distinct interfaces. We further investigate the connection of this complexity with different measures of structural disorder: the standard missing residues and a new definition, called "soft disorder", that covers all the flexible and structurally amorphous residues of a protein. We show evidences that both the interaction interfaces and the soft disordered regions tend to involve roughly the same amino-acids of the protein, and preliminary results suggesting that soft disorder spots those surface regions where new interfaces are progressively accommodated by complex formation. In fact, our results suggest that structurally disordered regions not only carry crucial information about the location of alternative interfaces within complexes, but also about the order of the assembly. We verify these hypotheses in several examples, such as the DNA binding domains of P53 and P73, the C3 exoenzyme, and two known biological orders of assembly. We finally compare our measures of structural disorder with several disorder bioinformatics predictors, showing that these latter are optimised to predict the residues that are missing in all the alternative structures of a protein and they are not able to catch the progressive evolution of the disordered regions upon complex formation. Yet, the predicted residues, when not missing, tend to be characterised as soft disordered regions.
Collapse
Affiliation(s)
- Beatriz Seoane
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR 7238, Paris, France
- Sorbonne Université, Institut des Sciences du Calcul et des Données, Paris, France
- Departamento de Física Teórica, Universidad Complutense, Madrid, Spain
| | - Alessandra Carbone
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative - UMR 7238, Paris, France
| |
Collapse
|
33
|
Bridoux L, Zarrineh P, Mallen J, Phuycharoen M, Latorre V, Ladam F, Losa M, Baker SM, Sagerstrom C, Mace KA, Rattray M, Bobola N. HOX paralogs selectively convert binding of ubiquitous transcription factors into tissue-specific patterns of enhancer activation. PLoS Genet 2020; 16:e1009162. [PMID: 33315856 PMCID: PMC7769617 DOI: 10.1371/journal.pgen.1009162] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 12/28/2020] [Accepted: 09/28/2020] [Indexed: 11/18/2022] Open
Abstract
Gene expression programs determine cell fate in embryonic development and their dysregulation results in disease. Transcription factors (TFs) control gene expression by binding to enhancers, but how TFs select and activate their target enhancers is still unclear. HOX TFs share conserved homeodomains with highly similar sequence recognition properties, yet they impart the identity of different animal body parts. To understand how HOX TFs control their specific transcriptional programs in vivo, we compared HOXA2 and HOXA3 binding profiles in the mouse embryo. HOXA2 and HOXA3 directly cooperate with TALE TFs and selectively target different subsets of a broad TALE chromatin platform. Binding of HOX and tissue-specific TFs convert low affinity TALE binding into high confidence, tissue-specific binding events, which bear the mark of active enhancers. We propose that HOX paralogs, alone and in combination with tissue-specific TFs, generate tissue-specific transcriptional outputs by modulating the activity of TALE TFs at selected enhancers.
Collapse
Affiliation(s)
- Laure Bridoux
- School of Medical Sciences, University of Manchester, Manchester, United Kingdom
| | - Peyman Zarrineh
- School of Health Sciences, University of Manchester, Manchester, United Kingdom
| | - Joshua Mallen
- School of Medical Sciences, University of Manchester, Manchester, United Kingdom
| | - Mike Phuycharoen
- Department of Computer Science, University of Manchester, Manchester, United Kingdom
| | - Victor Latorre
- School of Medical Sciences, University of Manchester, Manchester, United Kingdom
| | - Frank Ladam
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusets, United States of America
| | - Marta Losa
- School of Medical Sciences, University of Manchester, Manchester, United Kingdom
| | - Syed Murtuza Baker
- School of Health Sciences, University of Manchester, Manchester, United Kingdom
- School of Biological Sciences, University of Manchester, Manchester, United Kingdom
| | - Charles Sagerstrom
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusets, United States of America
| | - Kimberly A. Mace
- School of Biological Sciences, University of Manchester, Manchester, United Kingdom
| | - Magnus Rattray
- School of Health Sciences, University of Manchester, Manchester, United Kingdom
| | - Nicoletta Bobola
- School of Medical Sciences, University of Manchester, Manchester, United Kingdom
- * E-mail:
| |
Collapse
|
34
|
Hong S, Choi S, Kim R, Koh J. Mechanisms of Macromolecular Interactions Mediated by Protein Intrinsic Disorder. Mol Cells 2020; 43:899-908. [PMID: 33243935 PMCID: PMC7700844 DOI: 10.14348/molcells.2020.0186] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 11/13/2020] [Accepted: 11/17/2020] [Indexed: 12/29/2022] Open
Abstract
Intrinsically disordered proteins or regions (IDPs or IDRs) are widespread in the eukaryotic proteome. Although lacking stable three-dimensional structures in the free forms, IDRs perform critical functions in various cellular processes. Accordingly, mutations and altered expression of IDRs are associated with many pathological conditions. Hence, it is of great importance to understand at the molecular level how IDRs interact with their binding partners. In particular, discovering the unique interaction features of IDRs originating from their dynamic nature may reveal uncharted regulatory mechanisms of specific biological processes. Here we discuss the mechanisms of the macromolecular interactions mediated by IDRs and present the relevant cellular processes including transcription, cell cycle progression, signaling, and nucleocytoplasmic transport. Of special interest is the multivalent binding nature of IDRs driving assembly of multicomponent macromolecular complexes. Integrating the previous theoretical and experimental investigations, we suggest that such IDR-driven multiprotein complexes can function as versatile allosteric switches to process diverse cellular signals. Finally, we discuss the future challenges and potential medical applications of the IDR research.
Collapse
Affiliation(s)
- Sunghyun Hong
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Sangmin Choi
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Ryeonghyeon Kim
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| | - Junseock Koh
- School of Biological Sciences, Seoul National University, Seoul 08826, Korea
| |
Collapse
|
35
|
Chong S, Mir M. Towards Decoding the Sequence-Based Grammar Governing the Functions of Intrinsically Disordered Protein Regions. J Mol Biol 2020; 433:166724. [PMID: 33248138 DOI: 10.1016/j.jmb.2020.11.023] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 11/14/2020] [Accepted: 11/19/2020] [Indexed: 01/03/2023]
Abstract
A substantial portion of the proteome consists of intrinsically disordered regions (IDRs) that do not fold into well-defined 3D structures yet perform numerous biological functions and are associated with a broad range of diseases. It has been a long-standing enigma how different IDRs successfully execute their specific functions. Further putting a spotlight on IDRs are recent discoveries of functionally relevant biomolecular assemblies, which in some cases form through liquid-liquid phase separation. At the molecular level, the formation of biomolecular assemblies is largely driven by weak, multivalent, but selective IDR-IDR interactions. Emerging experimental and computational studies suggest that the primary amino acid sequences of IDRs encode a variety of their interaction behaviors. In this review, we focus on findings and insights that connect sequence-derived features of IDRs to their conformations, propensities to form biomolecular assemblies, selectivity of interaction partners, functions in the context of physiology and disease, and regulation of function. We also discuss directions of future research to facilitate establishing a comprehensive sequence-function paradigm that will eventually allow prediction of selective interactions and specificity of function mediated by IDRs.
Collapse
Affiliation(s)
- Shasha Chong
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA 94720, United States; The Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA 94720, United States.
| | - Mustafa Mir
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA 94720, United States
| |
Collapse
|
36
|
Nordyke CT, Ahmed YM, Puterbaugh RZ, Bowman GR, Varga K. Intrinsically Disordered Bacterial Polar Organizing Protein Z, PopZ, Interacts with Protein Binding Partners Through an N-terminal Molecular Recognition Feature. J Mol Biol 2020; 432:6092-6107. [PMID: 33058876 DOI: 10.1016/j.jmb.2020.09.020] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 09/18/2020] [Accepted: 09/25/2020] [Indexed: 11/15/2022]
Abstract
The polar organizing protein Z (PopZ) is necessary for the formation of three-dimensional microdomains at the cell poles in Caulobacter crescentus, where it functions as a hub protein that recruits multiple regulatory proteins from the cytoplasm. Although a large portion of the protein is predicted to be natively unstructured, in reconstituted systems PopZ can self-assemble into a macromolecular scaffold that directly binds to at least ten different proteins. Here we report the solution NMR structure of PopZΔ134-177, a truncated form of PopZ that does not self-assemble but retains the ability to interact with heterologous proteins. We show that the unbound form of PopZΔ134-177 is unstructured in solution, with the exception of a small amphipathic α-helix in residues M10-I17, which is included within a highly conserved region near the N-terminal. In applying NMR techniques to map the interactions between PopZΔ134-177 and one of its binding partners, RcdA, we find evidence that the α-helix and adjoining amino acids extending to position E23 serve as the core of the binding motif. Consistent with this, a point mutation at position I17 severely compromises binding. Our results show that a partially structured Molecular Recognition Feature (MoRF) within an intrinsically disordered domain of PopZ contributes to the assembly of polar microdomains, revealing a structural basis for complex network assembly in Alphaproteobacteria that is analogous to those formed by intrinsically disordered hub proteins in other kingdoms.
Collapse
Affiliation(s)
- Christopher T Nordyke
- Department of Molecular, Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824, United States
| | - Yasin M Ahmed
- Department of Molecular Biology, University of Wyoming, Laramie, WY 82071, United States
| | - Ryan Z Puterbaugh
- Department of Molecular, Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824, United States
| | - Grant R Bowman
- Department of Molecular Biology, University of Wyoming, Laramie, WY 82071, United States.
| | - Krisztina Varga
- Department of Molecular, Cellular and Biomedical Sciences, University of New Hampshire, Durham, NH 03824, United States.
| |
Collapse
|
37
|
The Anti-Inflammatory Protein TNIP1 Is Intrinsically Disordered with Structural Flexibility Contributed by Its AHD1-UBAN Domain. Biomolecules 2020; 10:biom10111531. [PMID: 33182596 PMCID: PMC7697625 DOI: 10.3390/biom10111531] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Revised: 11/04/2020] [Accepted: 11/05/2020] [Indexed: 01/02/2023] Open
Abstract
TNFAIP3 interacting protein 1 (TNIP1) interacts with numerous non-related cellular, viral, and bacterial proteins. TNIP1 is also linked with multiple chronic inflammatory disorders on the gene and protein levels, through numerous single-nucleotide polymorphisms and reduced protein amounts. Despite the importance of TNIP1 function, there is limited investigation as to how its conformation may impact its apparent multiple roles. Hub proteins like TNIP1 are often intrinsically disordered proteins. Our initial in silico assessments suggested TNIP1 is natively unstructured, featuring numerous potentials intrinsically disordered regions, including the ABIN homology domain 1-ubiquitin binding domain in ABIN proteins and NEMO (AHD1-UBAN) domain associated with its anti-inflammatory function. Using multiple biophysical approaches, we demonstrate the structural flexibility of full-length TNIP1 and the AHD1-UBAN domain. We present evidence the AHD1-UBAN domain exists primarily as a pre-molten globule with limited secondary structure in solution. Data presented here suggest the previously described coiled-coil conformation of the crystallized UBAN-only region may represent just one of possibly multiple states for the AHD1-UBAN domain in solution. These data also characterize the AHD1-UBAN domain in solution as mostly monomeric with potential to undergo oligomerization under specific environmental conditions (e.g., binding partner availability, pH-dependence). This proposed intrinsic disorder across TNIP1 and within the AHD1-UBAN region is likely to impact TNIP1 function and interaction with its multiple partners.
Collapse
|
38
|
Monette A, Mouland AJ. Zinc and Copper Ions Differentially Regulate Prion-Like Phase Separation Dynamics of Pan-Virus Nucleocapsid Biomolecular Condensates. Viruses 2020; 12:E1179. [PMID: 33081049 PMCID: PMC7589941 DOI: 10.3390/v12101179] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 10/05/2020] [Accepted: 10/12/2020] [Indexed: 02/08/2023] Open
Abstract
Liquid-liquid phase separation (LLPS) is a rapidly growing research focus due to numerous demonstrations that many cellular proteins phase-separate to form biomolecular condensates (BMCs) that nucleate membraneless organelles (MLOs). A growing repertoire of mechanisms supporting BMC formation, composition, dynamics, and functions are becoming elucidated. BMCs are now appreciated as required for several steps of gene regulation, while their deregulation promotes pathological aggregates, such as stress granules (SGs) and insoluble irreversible plaques that are hallmarks of neurodegenerative diseases. Treatment of BMC-related diseases will greatly benefit from identification of therapeutics preventing pathological aggregates while sparing BMCs required for cellular functions. Numerous viruses that block SG assembly also utilize or engineer BMCs for their replication. While BMC formation first depends on prion-like disordered protein domains (PrLDs), metal ion-controlled RNA-binding domains (RBDs) also orchestrate their formation. Virus replication and viral genomic RNA (vRNA) packaging dynamics involving nucleocapsid (NC) proteins and their orthologs rely on Zinc (Zn) availability, while virus morphology and infectivity are negatively influenced by excess Copper (Cu). While virus infections modify physiological metal homeostasis towards an increased copper to zinc ratio (Cu/Zn), how and why they do this remains elusive. Following our recent finding that pan-retroviruses employ Zn for NC-mediated LLPS for virus assembly, we present a pan-virus bioinformatics and literature meta-analysis study identifying metal-based mechanisms linking virus-induced BMCs to neurodegenerative disease processes. We discover that conserved degree and placement of PrLDs juxtaposing metal-regulated RBDs are associated with disease-causing prion-like proteins and are common features of viral proteins responsible for virus capsid assembly and structure. Virus infections both modulate gene expression of metalloproteins and interfere with metal homeostasis, representing an additional virus strategy impeding physiological and cellular antiviral responses. Our analyses reveal that metal-coordinated virus NC protein PrLDs initiate LLPS that nucleate pan-virus assembly and contribute to their persistence as cell-free infectious aerosol droplets. Virus aerosol droplets and insoluble neurological disease aggregates should be eliminated by physiological or environmental metals that outcompete PrLD-bound metals. While environmental metals can control virus spreading via aerosol droplets, therapeutic interference with metals or metalloproteins represent additional attractive avenues against pan-virus infection and virus-exacerbated neurological diseases.
Collapse
Affiliation(s)
- Anne Monette
- Lady Davis Institute at the Jewish General Hospital, Montréal, QC H3T 1E2, Canada
| | - Andrew J. Mouland
- Lady Davis Institute at the Jewish General Hospital, Montréal, QC H3T 1E2, Canada
- Department of Medicine, McGill University, Montréal, QC H4A 3J1, Canada
| |
Collapse
|
39
|
Quaglia F, Hatos A, Piovesan D, Tosatto SCE. Exploring Manually Curated Annotations of Intrinsically Disordered Proteins with DisProt. ACTA ACUST UNITED AC 2020; 72:e107. [PMID: 33017101 DOI: 10.1002/cpbi.107] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
DisProt is the major repository of manually curated data for intrinsically disordered proteins collected from the literature. Although lacking a stable tertiary structure under physiological conditions, intrinsically disordered proteins carry out a plethora of biological functions, some of them directly arising from their flexible nature. A growing number of scientific studies have been published during the last few decades in an effort to shed light on their unstructured state, their binding modes, and their functions. DisProt makes use of a team of expert biocurators to provide up-to-date annotations of intrinsically disordered proteins from the literature, making them available to the scientific community. Here we present a comprehensive description on how to use DisProt in different contexts and provide a detailed explanation of how to explore and interpret manually curated annotations of intrinsically disordered proteins. We describe how to search DisProt annotations, using both the web interface and the API for programmatic access. Finally, we explain how to visualize and interpret a DisProt entry, p53, a widely studied protein characterized by the presence of unstructured N-terminal and C-terminal regions. © 2020 Wiley Periodicals LLC. Basic Protocol 1: Performing a search in DisProt Support Protocol 1: Downloading options Support Protocol 2: Programmatic access with DisProt REST API Basic Protocol 2: Visualizing and interpreting DisProt entries: the p53 use case Basic Protocol 3: Providing feedback and submitting new intrinsic disorder-related data.
Collapse
Affiliation(s)
- Federica Quaglia
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - András Hatos
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | |
Collapse
|
40
|
ODiNPred: comprehensive prediction of protein order and disorder. Sci Rep 2020; 10:14780. [PMID: 32901090 PMCID: PMC7479119 DOI: 10.1038/s41598-020-71716-1] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2020] [Accepted: 08/10/2020] [Indexed: 12/13/2022] Open
Abstract
Structural disorder is widespread in eukaryotic proteins and is vital for their function in diverse biological processes. It is therefore highly desirable to be able to predict the degree of order and disorder from amino acid sequence. It is, however, notoriously difficult to predict the degree of local flexibility within structured domains and the presence and nuances of localized rigidity within intrinsically disordered regions. To identify such instances, we used the CheZOD database, which encompasses accurate, balanced, and continuous-valued quantification of protein (dis)order at amino acid resolution based on NMR chemical shifts. To computationally forecast the spectrum of protein disorder in the most comprehensive manner possible, we constructed the sequence-based protein order/disorder predictor ODiNPred, trained on an expanded version of CheZOD. ODiNPred applies a deep neural network comprising 157 unique sequence features to 1325 protein sequences together with the experimental NMR chemical shift data. Cross-validation for 117 protein sequences shows that ODiNPred better predicts the continuous variation in order along the protein sequence, suggesting that contemporary predictors are limited by the quality of training data. The inclusion of evolutionary features reduces the performance gap between ODiNPred and its peers, but analysis shows that it retains greater accuracy for the more challenging prediction of intermediate disorder.
Collapse
|
41
|
The RING Domain of RING Finger 12 Efficiently Builds Degradative Ubiquitin Chains. J Mol Biol 2020; 432:3790-3801. [PMID: 32416094 DOI: 10.1016/j.jmb.2020.05.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 04/30/2020] [Accepted: 05/02/2020] [Indexed: 12/18/2022]
Abstract
RNF12 is a widely expressed ubiquitin E3 ligase that is required for X-chromosome inactivation, regulation of LIM-domain containing transcription factors, and TGF-β signaling. A RING domain at the C terminus of RNF12 is important for its E3 ligase activity, and mutations in the RING domain are associated with X-linked intellectual disability. Here we have characterized ubiquitin transfer by RNF12, and show that the RING domain can bind to, and is active with, ubiquitin conjugating enzymes (E2s) that produce degradative ubiquitin chains. We report the crystal structures of RNF12 in complex with two of these E2 enzymes, as well as with an E2~Ub conjugate in a closed conformation. These structures form a basis for understanding the deleterious effect of a number of disease causing mutations. Comparison of the RNF12 structure with other monomeric RINGs suggests that a loop prior to the core RING domain has a conserved and essential role in stabilization of the active conformation of the bound E2~Ub conjugate. Together these findings provide a framework for better understanding substrate ubiquitylation by RNF12 and the impact of disease causing mutations.
Collapse
|
42
|
Abstract
RNA localization is a key biological strategy for organizing the cytoplasm and generating both cellular and developmental polarity. During RNA localization, RNAs are targeted asymmetrically to specific subcellular destinations, resulting in spatially and temporally restricted gene expression through local protein synthesis. First discovered in oocytes and embryos, RNA localization is now recognized as a significant regulatory strategy for diverse RNAs, both coding and non-coding, in a wide range of cell types. Yet, the highly polarized cytoplasm of the oocyte remains a leading model to understand not only the principles and mechanisms underlying RNA localization, but also links to the formation of biomolecular condensates through phase separation. Here, we discuss both RNA localization and biomolecular condensates in oocytes with a particular focus on the oocyte of the frog, Xenopus laevis.
Collapse
Affiliation(s)
- Sarah E Cabral
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI, United States
| | - Kimberly L Mowry
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI, United States.
| |
Collapse
|
43
|
Amacher JF, Brooks L, Hampton TH, Madden DR. Specificity in PDZ-peptide interaction networks: Computational analysis and review. JOURNAL OF STRUCTURAL BIOLOGY-X 2020; 4:100022. [PMID: 32289118 PMCID: PMC7138185 DOI: 10.1016/j.yjsbx.2020.100022] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Revised: 02/26/2020] [Accepted: 02/29/2020] [Indexed: 01/03/2023]
Abstract
Globular PDZ domains typically serve as protein-protein interaction modules that regulate a wide variety of cellular functions via recognition of short linear motifs (SLiMs). Often, PDZ mediated-interactions are essential components of macromolecular complexes, and disruption affects the entire scaffold. Due to their roles as linchpins in trafficking and signaling pathways, PDZ domains are attractive targets: both for controlling viral pathogens, which bind PDZ domains and hijack cellular machinery, as well as for developing therapies to combat human disease. However, successful therapeutic interventions that avoid off-target effects are a challenge, because each PDZ domain interacts with a number of cellular targets, and specific binding preferences can be difficult to decipher. Over twenty-five years of research has produced a wealth of data on the stereochemical preferences of individual PDZ proteins and their binding partners. Currently the field lacks a central repository for this information. Here, we provide this important resource and provide a manually curated, comprehensive list of the 271 human PDZ domains. We use individual domain, as well as recent genomic and proteomic, data in order to gain a holistic view of PDZ domains and interaction networks, arguing this knowledge is critical to optimize targeting selectivity and to benefit human health.
Collapse
Affiliation(s)
- Jeanine F Amacher
- Department of Biochemistry and Cell Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA.,Department of Chemistry, Western Washington University, Bellingham, WA 98225, USA
| | - Lionel Brooks
- Department of Biology, Western Washington University, Bellingham, WA 98225, USA
| | - Thomas H Hampton
- Department of Microbiology and Immunology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
| | - Dean R Madden
- Department of Biochemistry and Cell Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA
| |
Collapse
|
44
|
Spadotto V, Giambruno R, Massignani E, Mihailovich M, Maniaci M, Patuzzo F, Ghini F, Nicassio F, Bonaldi T. PRMT1-mediated methylation of the microprocessor-associated proteins regulates microRNA biogenesis. Nucleic Acids Res 2020; 48:96-115. [PMID: 31777917 PMCID: PMC6943135 DOI: 10.1093/nar/gkz1051] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 10/04/2019] [Accepted: 11/22/2019] [Indexed: 12/17/2022] Open
Abstract
MicroRNA (miRNA) biogenesis is a tightly controlled multi-step process operated in the nucleus by the activity of the Microprocessor and its associated proteins. Through high resolution mass spectrometry (MS)- proteomics we discovered that this complex is extensively methylated, with 84 methylated sites associated to 19 out of its 24 subunits. The majority of the modifications occurs on arginine (R) residues (61), leading to 81 methylation events, while 30 lysine (K)-methylation events occurs on 23 sites of the complex. Interestingly, both depletion and pharmacological inhibition of the Type-I Protein Arginine Methyltransferases (PRMTs) lead to a widespread change in the methylation state of the complex and induce global decrease of miRNA expression, as a consequence of the impairment of the pri-to-pre-miRNA processing step. In particular, we show that the reduced methylation of the Microprocessor subunit ILF3 is linked to its diminished binding to the pri-miRNAs miR-15a/16, miR-17-92, miR-301a and miR-331. Our study uncovers a previously uncharacterized role of R-methylation in the regulation of miRNA biogenesis in mammalian cells.
Collapse
Affiliation(s)
- Valeria Spadotto
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Roberto Giambruno
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Enrico Massignani
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Marija Mihailovich
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Marianna Maniaci
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Francesca Patuzzo
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| | - Francesco Ghini
- Center for Genomic Science of IIT@SEMM, Istituto Italiano di Tecnologia, Milan, Italy
| | - Francesco Nicassio
- Center for Genomic Science of IIT@SEMM, Istituto Italiano di Tecnologia, Milan, Italy
| | - Tiziana Bonaldi
- Department of Experimental Oncology, IEO, European Institute of Oncology IRCCS, Milan, Italy
| |
Collapse
|
45
|
Oldfield CJ, Fan X, Wang C, Dunker AK, Kurgan L. Computational Prediction of Intrinsic Disorder in Protein Sequences with the disCoP Meta-predictor. Methods Mol Biol 2020; 2141:21-35. [PMID: 32696351 DOI: 10.1007/978-1-0716-0524-0_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Intrinsically disordered proteins are either entirely disordered or contain disordered regions in their native state. These proteins and regions function without the prerequisite of a stable structure and were found to be abundant across all kingdoms of life. Experimental annotation of disorder lags behind the rapidly growing number of sequenced proteins, motivating the development of computational methods that predict disorder in protein sequences. DisCoP is a user-friendly webserver that provides accurate sequence-based prediction of protein disorder. It relies on meta-architecture in which the outputs generated by multiple disorder predictors are combined together to improve predictive performance. The architecture of disCoP is presented, and its accuracy relative to several other disorder predictors is briefly discussed. We describe usage of the web interface and explain how to access and read results generated by this computational tool. We also provide an example of prediction results and interpretation. The disCoP's webserver is publicly available at http://biomine.cs.vcu.edu/servers/disCoP/ .
Collapse
Affiliation(s)
| | - Xiao Fan
- Department of Pediatrics, Columbia University, New York, NY, USA
| | - Chen Wang
- Department of Medicine, Columbia University, New York, NY, USA
| | - A Keith Dunker
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
46
|
Ruiz-Ortiz I, De Sancho D. Competitive binding of HIF-1α and CITED2 to the TAZ1 domain of CBP from molecular simulations. Phys Chem Chem Phys 2020; 22:8118-8127. [DOI: 10.1039/d0cp00328j] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Many intrinsically disordered proteins (IDPs) are involved in complex signalling networks inside the cell.
Collapse
Affiliation(s)
- Irene Ruiz-Ortiz
- Donostia International Physics Center
- Donostia-San Sebastián
- Spain
| | - David De Sancho
- Donostia International Physics Center
- Donostia-San Sebastián
- Spain
- University of the Basque Country
- Faculty of Chemistry
| |
Collapse
|
47
|
Dubreuil B, Matalon O, Levy ED. Protein Abundance Biases the Amino Acid Composition of Disordered Regions to Minimize Non-functional Interactions. J Mol Biol 2019; 431:4978-4992. [PMID: 31442477 PMCID: PMC6941228 DOI: 10.1016/j.jmb.2019.08.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 08/07/2019] [Accepted: 08/10/2019] [Indexed: 02/07/2023]
Abstract
In eukaryotes, disordered regions cover up to 50% of proteomes and mediate fundamental cellular processes. In contrast to globular domains, where about half of the amino acids are buried in the protein interior, disordered regions show higher solvent accessibility, which makes them prone to engage in non-functional interactions. Such interactions are exacerbated by the law of mass action, prompting the question of how they are minimized in abundant proteins. We find that interaction propensity or "stickiness" of disordered regions negatively correlates with their cellular abundance, both in yeast and human. Strikingly, considering yeast proteins where a large fraction of the sequence is disordered, the correlation between stickiness and abundance reaches R=-0.55. Beyond this global amino-acid composition bias, we identify three rules by which amino-acid composition of disordered regions adjusts with high abundance. First, lysines are preferred over arginines, consistent with the latter amino acid being stickier than the former. Second, compensatory effects exist, whereby a sticky region can be tolerated if it is compensated by a distal non-sticky region. Third, such compensation requires a lower average stickiness at the same abundance when compared to a scenario where stickiness is homogeneous throughout the sequence. We validate these rules experimentally, employing them as different strategies to rescue an otherwise sticky protein fragment from aggregation. Our results highlight that non-functional interactions represent a significant constraint in cellular systems and reveal simple rules by which protein sequences adapt to that constraint. Data from this work are deposited in Figshare, at https://doi.org/10.6084/m9.figshare.8068937.v3.
Collapse
Affiliation(s)
- Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Or Matalon
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel.
| |
Collapse
|
48
|
Prestel A, Wichmann N, Martins JM, Marabini R, Kassem N, Broendum SS, Otterlei M, Nielsen O, Willemoës M, Ploug M, Boomsma W, Kragelund BB. The PCNA interaction motifs revisited: thinking outside the PIP-box. Cell Mol Life Sci 2019; 76:4923-4943. [PMID: 31134302 PMCID: PMC6881253 DOI: 10.1007/s00018-019-03150-0] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Revised: 04/16/2019] [Accepted: 05/13/2019] [Indexed: 02/08/2023]
Abstract
Proliferating cell nuclear antigen (PCNA) is a cellular hub in DNA metabolism and a potential drug target. Its binding partners carry a short linear motif (SLiM) known as the PCNA-interacting protein-box (PIP-box), but sequence-divergent motifs have been reported to bind to the same binding pocket. To investigate how PCNA accommodates motif diversity, we assembled a set of 77 experimentally confirmed PCNA-binding proteins and analyzed features underlying their binding affinity. Combining NMR spectroscopy, affinity measurements and computational analyses, we corroborate that most PCNA-binding motifs reside in intrinsically disordered regions, that structure preformation is unrelated to affinity, and that the sequence-patterns that encode binding affinity extend substantially beyond the boundaries of the PIP-box. Our systematic multidisciplinary approach expands current views on PCNA interactions and reveals that the PIP-box affinity can be modulated over four orders of magnitude by positive charges in the flanking regions. Including the flanking regions as part of the motif is expected to have broad implications, particularly for interpretation of disease-causing mutations and drug-design, targeting DNA-replication and -repair.
Collapse
Affiliation(s)
- Andreas Prestel
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Nanna Wichmann
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Joao M Martins
- Department of Computer Science, University of Copenhagen, Universitetsparken 1, 2100, Copenhagen Ø, Denmark
| | - Riccardo Marabini
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Noah Kassem
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Sebastian S Broendum
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
- Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Victoria, 3800, Australia
| | - Marit Otterlei
- Department of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, NTNU Norwegian University of Science and Technology, 7491, Trondheim, Norway
| | - Olaf Nielsen
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Martin Willemoës
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Michael Ploug
- Finsen Laboratory, Rigshospitalet, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
- Finsen Laboratory, Biotechnology Research Innovation Centre, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark
| | - Wouter Boomsma
- Department of Computer Science, University of Copenhagen, Universitetsparken 1, 2100, Copenhagen Ø, Denmark.
| | - Birthe B Kragelund
- Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200, Copenhagen N, Denmark.
| |
Collapse
|
49
|
He H, Zhao J, Sun G. Computational prediction of MoRFs based on protein sequences and minimax probability machine. BMC Bioinformatics 2019; 20:529. [PMID: 31660849 PMCID: PMC6819637 DOI: 10.1186/s12859-019-3111-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2018] [Accepted: 09/20/2019] [Indexed: 11/25/2022] Open
Abstract
Background Molecular recognition features (MoRFs) are one important type of disordered segments that can promote specific protein-protein interactions. They are located within longer intrinsically disordered regions (IDRs), and undergo disorder-to-order transitions upon binding to their interaction partners. The functional importance of MoRFs and the limitation of experimental identification make it necessary to predict MoRFs accurately with computational methods. Results In this study, a new sequence-based method, named as MoRFMPM, is proposed for predicting MoRFs. MoRFMPM uses minimax probability machine (MPM) to predict MoRFs based on 16 features and 3 different windows, which neither relying on other predictors nor calculating the properties of the surrounding regions of MoRFs separately. Comparing with ANCHOR, MoRFpred and MoRFCHiBi on the same test sets, MoRFMPM not only obtains higher AUC, but also obtains higher TPR at low FPR. Conclusions The features used in MoRFMPM can effectively predict MoRFs, especially after preprocessing. Besides, MoRFMPM uses a linear classification algorithm and does not rely on results of other predictors which makes it accessible and repeatable.
Collapse
Affiliation(s)
- Hao He
- College of Electronic Information and Optical Engineering, Nankai University, Tianjin, China
| | - Jiaxiang Zhao
- College of Electronic Information and Optical Engineering, Nankai University, Tianjin, China.
| | - Guiling Sun
- College of Electronic Information and Optical Engineering, Nankai University, Tianjin, China
| |
Collapse
|
50
|
Malliavin TE, Mucherino A, Lavor C, Liberti L. Systematic Exploration of Protein Conformational Space Using a Distance Geometry Approach. J Chem Inf Model 2019; 59:4486-4503. [PMID: 31442036 DOI: 10.1021/acs.jcim.9b00215] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The optimization approaches classically used during the determination of protein structure encounter various difficulties, especially when the size of the conformational space is large. Indeed, in such a case, algorithmic convergence criteria are more difficult to set up. Moreover, the size of the search space makes it difficult to achieve a complete exploration. The interval branch-and-prune (iBP) approach, based on the reformulation of the distance geometry problem (DGP) provides a theoretical frame for the generation of protein conformations, by systematically sampling the conformational space. When an appropriate subset of interatomic distances is known exactly, this worst-case exponential-time algorithm is provably complete and fixed-parameter tractable. These guarantees, however, immediately disappear as distance measurement errors are introduced. Here we propose an improvement of this approach: threading-augmented interval branch-and-prune (TAiBP), where the combinatorial explosion of the original iBP approach arising from its exponential complexity is alleviated by partitioning the input instances into consecutive peptide fragments and by using self-organizing maps (SOMs) to obtain clusters of similar solutions. A validation of the TAiBP approach is presented here on a set of proteins of various sizes and structures. The calculation inputs are a uniform covalent geometry extracted from force field covalent terms, the backbone dihedral angles with error intervals, and a few long-range distances. For most of the proteins smaller than 50 residues and interval widths of 20°, the TAiBP approach yielded solutions with RMSD values smaller than 3 Å with respect to the initial protein conformation. The efficiency of the TAiBP approach for proteins larger than 50 residues will require the use of nonuniform covalent geometry and may have benefits from the recent development of residue-specific force-fields.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, UMR 3528, CNRS, and Departement de Bioinformatique, Biostatistique et Biologie Intégrative, USR 3756, CNRS , Institut Pasteur , 75015 Paris , France
| | | | - Carlile Lavor
- Applied Math Department , IMECC-University of Campinas , Campinas , SP 13083-970 , Brazil
| | - Leo Liberti
- LIX CNRS, Ecole Polytechnique , Institut Polytechnique de Paris , Route de Saclay , 91128 Palaiseau , France
| |
Collapse
|