1
Erkine AM, Oliveira MA, Class CA. The Enigma of Transcriptional Activation Domains. J Mol Biol 2024; 436:168766. [PMID: 39214280 DOI: 10.1016/j.jmb.2024.168766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2024] [Revised: 08/22/2024] [Accepted: 08/23/2024] [Indexed: 09/04/2024]
Abstract
Activation domains (ADs) of eukaryotic gene activators have remained enigmatic for decades as short, extremely variable sequences that are often intrinsically disordered in structure and interact with an uncertain number of targets. The general absence of specificity increasingly complicates the utilization of the widely accepted mechanism of AD function by recruitment of coactivators. The long-standing enigma at the heart of molecular biology demands a fundamental rethinking of established concepts. Here, we review the experimental evidence supporting a novel mechanistic model of gene activation, based on ADs functioning via surfactant-like near-stochastic interactions with gene promoter nucleosomes. This new model is consistent with recent information-rich experimental data obtained using high-throughput synthetic biology and bioinformatics analysis methods, including machine learning. We clarify why the conventional biochemical principle of specificity for sequence, structures, and interactions fails to explain activation domain function. This perspective provides connections to the liquid-liquid phase separation model, signifies near-stochastic interactions as fundamental for the biochemical function, and can be generalized to other cellular functions.
2
Bonchuk AN, Georgiev PG. C2H2 proteins: Evolutionary aspects of domain architecture and diversification. Bioessays 2024; 46:e2400052. [PMID: 38873893 DOI: 10.1002/bies.202400052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/11/2024] [Revised: 05/24/2024] [Accepted: 05/27/2024] [Indexed: 06/15/2024]
Abstract
The largest group of transcription factors in higher eukaryotes are C2H2 proteins, which contain C2H2-type zinc finger domains that specifically bind to DNA. Few well-studied C2H2 proteins, however, demonstrate their key role in the control of gene expression and chromosome architecture. Here we review the features of the domain architecture of C2H2 proteins and the likely origin of C2H2 zinc fingers. A comprehensive investigation of proteomes for the presence of proteins with multiple clustered C2H2 domains has revealed a key difference between groups of organisms. Unlike plants, transcription factors in metazoans contain clusters of C2H2 domains typically separated by a linker with the TGEKP consensus sequence. The average size of C2H2 clusters varies substantially, even between genomes of higher metazoans, and with a tendency to increase in combination with SCAN, and especially KRAB domains, reflecting the increasing complexity of gene regulatory networks.
Affiliation(s)
- Artem N Bonchuk
- Department of the Control of Genetic Processes, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
- Pavel G Georgiev
- Department of the Control of Genetic Processes, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
3
Toya H, Okamatsu-Ogura Y, Yokoi S, Kurihara M, Mito M, Iwasaki S, Hirose T, Nakagawa S. The essential role of architectural noncoding RNA Neat1 in cold-induced beige adipocyte differentiation in mice. RNA 2024; 30:1011-1024. [PMID: 38692841 PMCID: PMC11251523 DOI: 10.1261/rna.079972.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/30/2024] [Accepted: 04/08/2024] [Indexed: 05/03/2024]
Abstract
Neat1 is an architectural RNA that provides the structural basis for nuclear bodies known as paraspeckles. Although the assembly processes by which Neat1 organizes paraspeckle components are well-documented, the physiological functions of Neat1 are not yet fully understood. This is partly because Neat1 knockout (KO) mice, lacking paraspeckles, do not exhibit overt phenotypes under normal laboratory conditions. During our search for conditions that elicit clear phenotypes in Neat1 KO mice, we discovered that the differentiation of beige adipocytes (inducible thermogenic cells that emerge upon cold exposure) is severely impaired in these mutant mice. Neat1_2, the architectural isoform of Neat1, is transiently upregulated during the early stages of beige adipocyte differentiation, coinciding with increased paraspeckle formation. Genes with altered expression during beige adipocyte differentiation typically cluster at specific chromosomal locations, some of which move closer to paraspeckles upon cold exposure. These observations suggest that paraspeckles might coordinate the regulation of these gene clusters by controlling the activity of certain transcriptional condensates that coregulate multiple genes. We propose that our findings highlight a potential role for Neat1 and paraspeckles in modulating chromosomal organization and gene expression, potentially crucial processes for the differentiation of beige adipocytes.
Affiliation(s)
- Hikaru Toya
- RNA Biology Laboratory, Faculty of Pharmaceutical Sciences, Hokkaido University, Sapporo 060-0812, Japan
- Yuko Okamatsu-Ogura
- Laboratory of Biochemistry, Faculty of Veterinary Medicine, Hokkaido University, Sapporo 060-0818, Japan
- Saori Yokoi
- RNA Biology Laboratory, Faculty of Pharmaceutical Sciences, Hokkaido University, Sapporo 060-0812, Japan
- Misuzu Kurihara
- RNA Biology Laboratory, Faculty of Pharmaceutical Sciences, Hokkaido University, Sapporo 060-0812, Japan
- Mari Mito
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Saitama 351-0198, Japan
- Shintaro Iwasaki
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Saitama 351-0198, Japan
- Tetsuro Hirose
- RNA Biofunction Laboratory, Graduate School of Frontier Biosciences, Osaka University, Suita 565-0871, Japan
- Shinichi Nakagawa
- RNA Biology Laboratory, Faculty of Pharmaceutical Sciences, Hokkaido University, Sapporo 060-0812, Japan
4
Fonda BD, Murray DT. The Potent PHL4 Transcription Factor Effector Domain Contains Significant Disorder. bioRxiv 2024:2024.06.27.601048. [PMID: 39005418 PMCID: PMC11244893 DOI: 10.1101/2024.06.27.601048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/16/2024]
Abstract
The phosphate-starvation response transcription-factor protein family is essential to the plant response to low levels of phosphate. Proteins in this transcription factor (TF) family act by altering various gene expression levels, such as increasing levels of the acid phosphatase proteins which catalyze the conversion of inorganic phosphates to bio-available compounds. There are few structural characterizations of proteins in this TF family, none of which address the potent TF activation domains. The phosphate-starvation response-like protein-4 (PHL4) protein from this family has garnered interest due to the unusually high TF activation activity of the N-terminal domain. Here, we demonstrate using solution nuclear magnetic resonance (NMR) measurements that the PHL4 N-terminal activating TF effector domain is mainly an intrinsically disordered domain of over 200 residues, and that the C-terminal region of PHL4 is also disordered. Additionally, we present evidence from size-exclusion chromatography, diffusion NMR measurements, and a cross-linking assay suggesting full-length PHL4 forms a tetrameric assembly. Together, the data indicate the N- and C-terminal disordered domains in PHL4 flank a central folded region that likely forms the ordered oligomer of PHL4. This work provides a foundation for future studies detailing how the conformations and molecular motions of PHL4 change as it acts as a potent activator of gene expression in phosphate metabolism. Such a detailed mechanistic understanding of TF function will benefit genetic engineering efforts that take advantage of this activity to boost transcriptional activation of genes across different organisms.
Affiliation(s)
- Blake D. Fonda
- Department of Chemistry, University of California, Davis, California, 95616, United States of America
- Dylan T. Murray
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, 06926, United States of America
5
Saibo NV, Maiti S, Boral S, Banerjee P, Kushwaha T, Inampudi KK, Goswami R, De S. The intrinsically disordered transactivation region of HOXA9 regulates its function by auto-inhibition of its DNA-binding activity. Int J Biol Macromol 2024; 273:132704. [PMID: 38825283 DOI: 10.1016/j.ijbiomac.2024.132704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2023] [Revised: 02/19/2024] [Accepted: 05/26/2024] [Indexed: 06/04/2024]
Abstract
The HOXA9 transcription factor is expressed in hematopoietic stem cells and is involved in the regulation of their differentiation and maturation to various blood cells. HOXA9 is linked to various leukemias and is a marker for poor prognosis of acute myeloid leukemia (AML). This protein has a conserved DNA-binding homeodomain and a transactivation domain. We show that this N-terminal transactivation domain is intrinsically disordered and inhibits DNA-binding by the homeodomain. Using NMR spectroscopy and molecular dynamics simulation, we show that the hexapeptide 197AANWLH202 in the disordered region transiently occludes the DNA-binding interface. The hexapeptide also forms a rigid segment, as determined by NMR dynamics, in an otherwise flexible disordered region. Interestingly, this hexapeptide is known to mediate the interaction of HOXA9 and its TALE partner proteins, such as PBX1, and help in cooperative DNA binding. Mutation of tryptophan to alanine in the hexapeptide abrogates the DNA-binding auto-inhibition. We propose that the disordered transactivation region plays a dual role in the regulation of HOXA9 function. In the absence of TALE partners, it inhibits DNA binding, and in the presence of TALE partners it interacts with the TALE protein and facilitates the cooperative DNA binding by the HOX-TALE complex.
Affiliation(s)
- Nikita V Saibo
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
- Snigdha Maiti
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
- Soumendu Boral
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
- Puja Banerjee
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
- Tushar Kushwaha
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
- Krishna K Inampudi
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
- Ritobrata Goswami
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
- Soumya De
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India.
6
Valyaeva AA, Sheval EV. Nonspecific Interactions in Transcription Regulation and Organization of Transcriptional Condensates. Biochemistry (Mosc) 2024; 89:688-700. [PMID: 38831505 DOI: 10.1134/s0006297924040084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/10/2023] [Revised: 11/19/2023] [Accepted: 11/20/2023] [Indexed: 06/05/2024]
Abstract
Eukaryotic cells are characterized by a high degree of compartmentalization of their internal contents, which ensures precise and controlled regulation of intracellular processes. During many processes, including different stages of transcription, dynamic membraneless compartments termed biomolecular condensates are formed. Transcription condensates contain various transcription factors and RNA polymerase and are formed by high- and low-specificity interactions between the proteins, DNA, and nearby RNA. This review discusses recent data demonstrating the important role of nonspecific multivalent protein-protein and RNA-protein interactions in the organization and regulation of transcription.
Affiliation(s)
- Anna A Valyaeva
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119991, Russia.
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
- Department of Cell Biology and Histology, Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
- Eugene V Sheval
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
- Department of Cell Biology and Histology, Faculty of Biology, Lomonosov Moscow State University, Moscow, 119991, Russia
7
Vieira MFM, Hernandez G, Zhong Q, Arbesú M, Veloso T, Gomes T, Martins ML, Monteiro H, Frazão C, Frankel G, Zanzoni A, Cordeiro TN. The pathogen-encoded signalling receptor Tir exploits host-like intrinsic disorder for infection. Commun Biol 2024; 7:179. [PMID: 38351154 PMCID: PMC10864410 DOI: 10.1038/s42003-024-05856-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2022] [Accepted: 01/26/2024] [Indexed: 02/16/2024] Open
Abstract
The translocated intimin receptor (Tir) is an essential type III secretion system (T3SS) effector of attaching and effacing pathogens contributing to the global foodborne disease burden. Tir acts as a cell-surface receptor in host cells, rewiring intracellular processes by targeting multiple host proteins. We investigated the molecular basis for Tir's binding diversity in signalling, finding that Tir is a disordered protein with host-like binding motifs. Unexpectedly, so are several other T3SS effectors. By an integrative approach, we reveal that Tir dimerises via an antiparallel OB-fold within a highly disordered N-terminal cytosolic domain. Also, it has a long disordered C-terminal cytosolic domain partially structured at host-like motifs that bind lipids. Membrane affinity depends on lipid composition and phosphorylation, highlighting a previously unrecognised host interaction impacting Tir-induced actin polymerisation and cell death. Furthermore, multi-site tyrosine phosphorylation enables Tir to engage host SH2 domains in a multivalent fuzzy complex, consistent with Tir's scaffolding role and binding promiscuity. Our findings provide insights into the intracellular Tir domains, highlighting the ability of T3SS effectors to exploit host-like protein disorder as a strategy for host evasion.
Affiliation(s)
- Marta F M Vieira
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Guillem Hernandez
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Qiyun Zhong
- Department of Life Sciences, Imperial College London, South Kensington Campus, London, UK
- Miguel Arbesú
- Department of NMR-supported Structural Biology, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin, Germany
- InstaDeep Ltd, 5 Merchant Square, London, UK
- Tiago Veloso
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Tiago Gomes
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Maria L Martins
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Hugo Monteiro
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Carlos Frazão
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal
- Gad Frankel
- Department of Life Sciences, Imperial College London, South Kensington Campus, London, UK
- Andreas Zanzoni
- Aix-Marseille Université, Inserm, TAGC, UMR_S1090, Marseille, France
- Tiago N Cordeiro
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, Oeiras, Portugal.
8
Hannon CE, Eisen MB. Intrinsic protein disorder is insufficient to drive subnuclear clustering in embryonic transcription factors. eLife 2024; 12:RP88221. [PMID: 38275292 PMCID: PMC10945700 DOI: 10.7554/elife.88221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/27/2024] Open
Abstract
Modern microscopy has revealed that core nuclear functions, including transcription, replication, and heterochromatin formation, occur in spatially restricted clusters. Previous work from our lab has shown that subnuclear high-concentration clusters of transcription factors may play a role in regulating RNA synthesis in the early Drosophila embryo. A nearly ubiquitous feature of eukaryotic transcription factors is that they contain intrinsically disordered regions (IDRs) that often arise from low complexity amino acid sequences within the protein. It has been proposed that IDRs within transcription factors drive co-localization of transcriptional machinery and target genes into high-concentration clusters within nuclei. Here, we test that hypothesis directly, by conducting a broad survey of the subnuclear localization of IDRs derived from transcription factors. Using a novel algorithm to identify IDRs in the Drosophila proteome, we generated a library of IDRs from transcription factors expressed in the early Drosophila embryo. We used this library to perform a high-throughput imaging screen in Drosophila Schneider-2 (S2) cells. We found that while subnuclear clustering does not occur when the majority of IDRs are expressed alone, it is frequently seen in full-length transcription factors. These results are consistent in live Drosophila embryos, suggesting that IDRs are insufficient to drive the subnuclear clustering behavior of transcription factors. Furthermore, the clustering of transcription factors in living embryos was unaffected by the deletion of IDR sequences. Our results demonstrate that IDRs are unlikely to be the primary molecular drivers of the clustering observed during transcription, suggesting a more complex and nuanced role for these disordered protein sequences.
Affiliation(s)
- Colleen E Hannon
- Howard Hughes Medical Institute, University of California, Berkeley, United States
- Michael B Eisen
- Howard Hughes Medical Institute, University of California, Berkeley, United States
9
Zhang J, Basu S, Kurgan L. HybridDBRpred: improved sequence-based prediction of DNA-binding amino acids using annotations from structured complexes and disordered proteins. Nucleic Acids Res 2024; 52:e10. [PMID: 38048333 PMCID: PMC10810184 DOI: 10.1093/nar/gkad1131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/29/2023] [Accepted: 11/10/2023] [Indexed: 12/06/2023] Open
Abstract
Current predictors of DNA-binding residues (DBRs) from protein sequences belong to two distinct groups, those trained on binding annotations extracted from structured protein-DNA complexes (structure-trained) vs. intrinsically disordered proteins (disorder-trained). We complete the first empirical analysis of predictive performance across the structure- and disorder-annotated proteins for a representative collection of ten predictors. The majority of the structure-trained tools perform well on the structure-annotated proteins while doing relatively poorly on the disorder-annotated proteins, and vice versa. Several methods make accurate predictions for the structure-annotated proteins or the disorder-annotated proteins, but none performs highly accurately for both annotation types. Moreover, most predictors make excessive cross-predictions for the disorder-annotated proteins, where residues that interact with non-DNA ligand types are predicted as DBRs. Motivated by these results, we design, validate and deploy an innovative meta-model, hybridDBRpred, that uses a deep transformer network to combine predictions generated by the three best current predictors. HybridDBRpred provides accurate predictions and low levels of cross-predictions across the two annotation types, and is statistically more accurate than each of the ten tools and baseline meta-predictors that rely on averaging and logistic regression. We deploy hybridDBRpred as a convenient web server at http://biomine.cs.vcu.edu/servers/hybridDBRpred/ and provide the corresponding source code at https://github.com/jianzhang-xynu/hybridDBRpred.
Affiliation(s)
- Jian Zhang
- School of Computer and Information Technology, Xinyang Normal University, Xinyang 464000, PR China
- Sushmita Basu
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
- Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
10
Theisen FF, Prestel A, Elkjær S, Leurs YHA, Morffy N, Strader LC, O'Shea C, Teilum K, Kragelund BB, Skriver K. Molecular switching in transcription through splicing and proline-isomerization regulates stress responses in plants. Nat Commun 2024; 15:592. [PMID: 38238333 PMCID: PMC10796322 DOI: 10.1038/s41467-024-44859-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 06/26/2023] [Accepted: 01/09/2024] [Indexed: 01/22/2024] Open
Abstract
The Arabidopsis thaliana DREB2A transcription factor interacts with the negative regulator RCD1 and the ACID domain of subunit 25 of the transcriptional co-regulator mediator (Med25) to integrate stress signals for gene expression, with elusive molecular interplay. Using biophysical and structural analyses together with high-throughput screening, we reveal a bivalent binding switch in DREB2A containing an ACID-binding motif (ABS) and the known RCD1-binding motif (RIM). The RIM is lacking in a stress-induced DREB2A splice variant with retained transcriptional activity. ABS and RIM bind to separate sites on Med25-ACID, and NMR analyses show a structurally heterogeneous complex deriving from a DREB2A-ABS proline residue populating cis- and trans-isomers with remote impact on the RIM. The cis-isomer stabilizes an α-helix, while the trans-isomer may introduce energetic frustration facilitating rapid exchange between activators and repressors. Thus, DREB2A uses a post-transcriptionally and post-translationally modulated switch for transcriptional regulation.
Affiliation(s)
- Frederik Friis Theisen
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Andreas Prestel
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Steffie Elkjær
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Yannick H A Leurs
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Charlotte O'Shea
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Kaare Teilum
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Birthe B Kragelund
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Karen Skriver
- The REPIN and The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
11
Salomone J, Farrow E, Gebelein B. Homeodomain complex formation and biomolecular condensates in Hox gene regulation. Semin Cell Dev Biol 2024; 152-153:93-100. [PMID: 36517343 PMCID: PMC10258226 DOI: 10.1016/j.semcdb.2022.11.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/30/2022] [Revised: 10/21/2022] [Accepted: 11/30/2022] [Indexed: 12/15/2022]
Abstract
Hox genes are a family of homeodomain transcription factors that regulate specialized morphological structures along the anterior-posterior axis of metazoans. Over the past few decades, researchers have focused on defining how Hox factors with similar in vitro DNA binding activities achieve sufficient target specificity to regulate distinct cell fates in vivo. In this review, we highlight how protein interactions with other transcription factors, many of which are also homeodomain proteins, result in the formation of transcription factor complexes with enhanced DNA binding specificity. These findings suggest that Hox-regulated enhancers utilize distinct combinations of homeodomain binding sites, many of which are low-affinity, to recruit specific Hox complexes. However, low-affinity sites can only yield reproducible responses with high transcription factor concentrations. To overcome this limitation, recent studies revealed how transcription factors, including Hox factors, use intrinsically disordered domains (IDRs) to form biomolecular condensates that increase protein concentrations. Moreover, Hox factors with altered IDRs have been associated with altered transcriptional activity and human disease states, demonstrating the importance of IDRs in mediating essential Hox output. Collectively, these studies highlight how Hox factors use their DNA binding domains, protein-protein interaction domains, and IDRs to form specific transcription factor complexes that yield accurate gene expression.
Affiliation(s)
- Joseph Salomone
- Graduate Program in Molecular and Developmental Biology, Cincinnati Children's Hospital Research Foundation, Cincinnati, OH 45229, USA; Medical-Scientist Training Program, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Edward Farrow
- Graduate Program in Molecular and Developmental Biology, Cincinnati Children's Hospital Research Foundation, Cincinnati, OH 45229, USA; Medical-Scientist Training Program, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Brian Gebelein
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Ave, MLC 7007, Cincinnati, OH 45229, USA; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA.
12
Kind L, Driver M, Raasakka A, Onck PR, Njølstad PR, Arnesen T, Kursula P. Structural properties of the HNF-1A transactivation domain. Front Mol Biosci 2023; 10:1249939. [PMID: 37908230 PMCID: PMC10613711 DOI: 10.3389/fmolb.2023.1249939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2023] [Accepted: 09/26/2023] [Indexed: 11/02/2023] Open
Abstract
Hepatocyte nuclear factor 1α (HNF-1A) is a transcription factor with important gene regulatory roles in pancreatic β-cells. HNF1A gene variants are associated with a monogenic form of diabetes (HNF1A-MODY) or an increased risk for type 2 diabetes. While several pancreatic target genes of HNF-1A have been described, a lack of knowledge regarding the structure-function relationships in HNF-1A prohibits a detailed understanding of HNF-1A-mediated gene transcription, which is important for precision medicine and improved patient care. Therefore, we aimed to characterize the understudied transactivation domain (TAD) of HNF-1A in vitro. We present a bioinformatic approach to dissect the TAD sequence, analyzing protein structure, sequence composition, sequence conservation, and the existence of protein interaction motifs. Moreover, we developed the first protocol for the recombinant expression and purification of the HNF-1A TAD. Small-angle X-ray scattering and synchrotron radiation circular dichroism suggested a disordered conformation for the TAD. Furthermore, we present functional data on HNF-1A undergoing liquid-liquid phase separation, which is in line with in silico predictions and may be of biological relevance for gene transcriptional processes in pancreatic β-cells.
Affiliation(s)
- Laura Kind
- Department of Biomedicine, University of Bergen, Bergen, Norway
- Mark Driver
- Zernike Institute for Advanced Materials, University of Groningen, Groningen, Netherlands
- Arne Raasakka
- Department of Biomedicine, University of Bergen, Bergen, Norway
- Patrick R. Onck
- Zernike Institute for Advanced Materials, University of Groningen, Groningen, Netherlands
- Pål Rasmus Njølstad
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen, Norway
- Section of Endocrinology and Metabolism, Children and Youth Clinic, Haukeland University Hospital, Bergen, Norway
- Thomas Arnesen
- Department of Biomedicine, University of Bergen, Bergen, Norway
- Department of Surgery, Haukeland University Hospital, Bergen, Norway
- Petri Kursula
- Department of Biomedicine, University of Bergen, Bergen, Norway
- Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
13
Jiang Q, Miao J, Wu F, Li F, Zhang J, Jing X, Cai S, Ma X, Wang X, Zhao L, Huang C. RNF6 promotes gastric cancer progression by regulating CCNA1/CREBBP transcription. Cell Cycle 2023; 22:2018-2037. [PMID: 37904524 PMCID: PMC10761170 DOI: 10.1080/15384101.2023.2275899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/04/2023] [Accepted: 10/21/2023] [Indexed: 11/01/2023] Open
Abstract
Ring finger protein 6 (RNF6) is a member of the E3 ubiquitin ligase family. Previous studies have reported the involvement of RNF6 as a ubiquitin ligase in the progression of gastric cancer (GC). However, this study found that RNF6 has a clear localization in the nucleus of GC, indicating a role other than ubiquitin ligase. Further chromatin immunoprecipitation sequencing (ChIP-seq) analysis revealed that RNF6 has DNA binding and transcriptional regulatory effects and is involved in important pathways such as tumor cell cycle and apoptosis. Cyclin A1 (CCNA1) and CREB binding protein (CREBBP) are downstream targets for RNF6 transcription regulation in GC. RNF6 binds to the promoter region of CCNA1/CREBBP and is actively regulating their expression in GC cells. Silencing CCNA1/CREBBP partially reversed the promoting effect of RNF6 overexpression on the biological function of GC cells. Our study suggests that RNF6 promotes the progression of GC by regulating CCNA1/CREBBP transcription.
Affiliation(s)
- Qiuyu Jiang
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Jiyu Miao
- Department of Hematology, The Second Affiliated Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Fei Wu
- Department of Oncology, The Second Affiliated Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Fang Li
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Jinyuan Zhang
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Xintao Jing
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Shuang Cai
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Xiaoping Ma
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Xiaofei Wang
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Lingyu Zhao
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| | - Chen Huang
- Department of Cell Biology and Genetics/Key Laboratory of Environment and Genes Related to Diseases, School of Basic Medical Sciences, Xi’an Jiaotong University Health Science Center, Xi’an, Shaanxi, China
| |
|
14
|
Zhao B, Ghadermarzi S, Kurgan L. Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins. Comput Struct Biotechnol J 2023; 21:3248-3258. [PMID: 38213902 PMCID: PMC10782001 DOI: 10.1016/j.csbj.2023.06.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/01/2023] [Indexed: 01/13/2024] Open
Abstract
We expand studies of AlphaFold2 (AF2) in the context of intrinsic disorder prediction by comparing it against a broad selection of 20 accurate, popular and recently released disorder predictors. We use a 25% larger benchmark dataset with 646 proteins and cover protein-level predictions of disorder content and fully disordered proteins. AF2-based disorder predictions secure a relatively high Area Under receiver operating characteristic Curve (AUC) of 0.77 and are statistically outperformed by several modern disorder predictors that secure AUCs around 0.8 with a median runtime of about 20 s compared to 1200 s for AF2. Moreover, AF2 provides modestly accurate predictions of fully disordered proteins (F1 = 0.59 vs. 0.91 for the best disorder predictor) and disorder content (mean absolute error of 0.21 vs. 0.15). AF2 also generates statistically more accurate disorder predictions for about 20% of proteins that have relatively short sequences and a few disordered regions that tend to be located at the sequence termini, and which lack disordered protein-binding regions. Interestingly, AF2 and the most accurate disorder predictors rely on deep neural networks, suggesting that these models are useful for protein structure and disorder predictions.
Affiliation(s)
- Bi Zhao
- Genomics program, College of Public Health, University of South Florida, Tampa, FL, United States
| | - Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
|
15
|
Abstract
There are over 100 computational predictors of intrinsic disorder. These methods predict amino acid-level propensities for disorder directly from protein sequences. The propensities can be used to annotate putative disordered residues and regions. This unit provides a practical and holistic introduction to sequence-based intrinsic disorder prediction. We define intrinsic disorder, explain the format of computational prediction of disorder, and identify and describe several accurate predictors. We also introduce recently released databases of intrinsic disorder predictions and use an illustrative example to provide insights into how predictions should be interpreted and combined. Lastly, we summarize key experimental methods that can be used to validate computational predictions. © 2023 Wiley Periodicals LLC.
Affiliation(s)
- Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, Florida
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia
| |
|
16
|
Mattick JS, Amaral PP, Carninci P, Carpenter S, Chang HY, Chen LL, Chen R, Dean C, Dinger ME, Fitzgerald KA, Gingeras TR, Guttman M, Hirose T, Huarte M, Johnson R, Kanduri C, Kapranov P, Lawrence JB, Lee JT, Mendell JT, Mercer TR, Moore KJ, Nakagawa S, Rinn JL, Spector DL, Ulitsky I, Wan Y, Wilusz JE, Wu M. Long non-coding RNAs: definitions, functions, challenges and recommendations. Nat Rev Mol Cell Biol 2023; 24:430-447. [PMID: 36596869 PMCID: PMC10213152 DOI: 10.1038/s41580-022-00566-8] [Citation(s) in RCA: 548] [Impact Index Per Article: 548.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/16/2022] [Indexed: 01/05/2023]
Abstract
Genes specifying long non-coding RNAs (lncRNAs) occupy a large fraction of the genomes of complex organisms. The term 'lncRNAs' encompasses RNA polymerase I (Pol I), Pol II and Pol III transcribed RNAs, and RNAs from processed introns. The various functions of lncRNAs and their many isoforms and interleaved relationships with other genes make lncRNA classification and annotation difficult. Most lncRNAs evolve more rapidly than protein-coding sequences, are cell type specific and regulate many aspects of cell differentiation and development and other physiological processes. Many lncRNAs associate with chromatin-modifying complexes, are transcribed from enhancers and nucleate phase separation of nuclear condensates and domains, indicating an intimate link between lncRNA expression and the spatial control of gene expression during development. lncRNAs also have important roles in the cytoplasm and beyond, including in the regulation of translation, metabolism and signalling. lncRNAs often have a modular structure and are rich in repeats, which are increasingly being shown to be relevant to their function. In this Consensus Statement, we address the definition and nomenclature of lncRNAs and their conservation, expression, phenotypic visibility, structure and functions. We also discuss research challenges and provide recommendations to advance the understanding of the roles of lncRNAs in development, cell biology and disease.
Affiliation(s)
- John S Mattick
- School of Biotechnology and Biomolecular Sciences, UNSW, Sydney, NSW, Australia.
- UNSW RNA Institute, UNSW, Sydney, NSW, Australia.
| | - Paulo P Amaral
- INSPER Institute of Education and Research, São Paulo, Brazil
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Human Technopole, Milan, Italy
| | - Susan Carpenter
- Department of Molecular, Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Howard Y Chang
- Center for Personal Dynamic Regulomes, Stanford University School of Medicine, Stanford, CA, USA
- Department of Dermatology, Stanford, CA, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Ling-Ling Chen
- CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Runsheng Chen
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Caroline Dean
- John Innes Centre, Norwich Research Park, Norwich, UK
| | - Marcel E Dinger
- School of Biotechnology and Biomolecular Sciences, UNSW, Sydney, NSW, Australia
- UNSW RNA Institute, UNSW, Sydney, NSW, Australia
| | - Katherine A Fitzgerald
- Division of Innate Immunity, Department of Medicine, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | | | - Mitchell Guttman
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - Tetsuro Hirose
- Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan
| | - Maite Huarte
- Department of Gene Therapy and Regulation of Gene Expression, Center for Applied Medical Research, University of Navarra, Pamplona, Spain
- Institute of Health Research of Navarra, Pamplona, Spain
| | - Rory Johnson
- School of Biology and Environmental Science, University College Dublin, Dublin, Ireland
- Conway Institute for Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland
| | - Chandrasekhar Kanduri
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, Xiamen, China
| | - Jeanne B Lawrence
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Jeannie T Lee
- Department of Molecular Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Joshua T Mendell
- Howard Hughes Medical Institute, UT Southwestern Medical Center, Dallas, TX, USA
- Department of Molecular Biology, UT Southwestern Medical Center, Dallas, TX, USA
| | - Timothy R Mercer
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
| | - Kathryn J Moore
- Department of Medicine, New York University Grossman School of Medicine, New York, NY, USA
| | - Shinichi Nakagawa
- RNA Biology Laboratory, Faculty of Pharmaceutical Sciences, Hokkaido University, Sapporo, Japan
| | - John L Rinn
- Department of Biochemistry, University of Colorado Boulder, Boulder, CO, USA
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, USA
- Howard Hughes Medical Institute, University of Colorado Boulder, Boulder, CO, USA
| | - David L Spector
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Igor Ulitsky
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot, Israel
| | - Yue Wan
- Laboratory of RNA Genomics and Structure, Genome Institute of Singapore, A*STAR, Singapore, Singapore
- Department of Biochemistry, National University of Singapore, Singapore, Singapore
| | - Jeremy E Wilusz
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Therapeutic Innovation Center, Baylor College of Medicine, Houston, TX, USA
| | - Mian Wu
- Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China
| |
|
17
|
Basu S, Gsponer J, Kurgan L. DEPICTER2: a comprehensive webserver for intrinsic disorder and disorder function prediction. Nucleic Acids Res 2023:7151337. [PMID: 37140058 DOI: 10.1093/nar/gkad330] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 04/12/2023] [Accepted: 04/18/2023] [Indexed: 05/05/2023] Open
Abstract
Intrinsic disorder in proteins is relatively abundant in nature and essential for a broad spectrum of cellular functions. While disorder can be accurately predicted from protein sequences, as was empirically demonstrated in recent community-organized assessments, it is rather challenging to collect and compile a comprehensive prediction that covers multiple disorder functions. To this end, we introduce the DEPICTER2 (DisorderEd PredictIon CenTER) webserver that offers convenient access to a curated collection of fast and accurate disorder and disorder function predictors. This server includes a state-of-the-art disorder predictor, flDPnn, and five modern methods that cover all currently predictable disorder functions: disordered linkers and protein, peptide, DNA, RNA and lipid binding. DEPICTER2 allows selection of any combination of the six methods, supports batch predictions of up to 25 proteins per request and provides interactive visualization of the resulting predictions. The webserver is freely available at http://biomine.cs.vcu.edu/servers/DEPICTER2/.
Affiliation(s)
- Sushmita Basu
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Jörg Gsponer
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
|
18
|
Joosten J, van Sluijs B, Vree Egberts W, Emmaneel M, W T C Jansen P, Vermeulen M, Boelens W, Bonger KM, Spruijt E. Dynamics and composition of small heat shock protein condensates and aggregates. J Mol Biol 2023; 435:168139. [PMID: 37146746 DOI: 10.1016/j.jmb.2023.168139] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 04/26/2023] [Accepted: 04/27/2023] [Indexed: 05/07/2023]
Abstract
Small heat shock proteins (sHSPs) are essential ATP-independent chaperones that protect the cellular proteome. These proteins assemble into polydisperse oligomeric structures, the composition of which dramatically affects their chaperone activity. The biomolecular consequences of variations in sHSP ratios, especially inside living cells, remain elusive. Here, we study the consequences of altering the relative expression levels of HspB2 and HspB3 in HEK293T cells. These chaperones are partners in a hetero-oligomeric complex, and genetic mutations that abolish their mutual interaction are associated with myopathic disorders. HspB2 displays three distinct phenotypes when co-expressed with HspB3 at varying ratios. Expression of HspB2 alone led to the formation of liquid nuclear condensates, while shifting the stoichiometry towards HspB3 resulted in the formation of large solid-like aggregates. Only cells co-expressing HspB2 with a limited amount of HspB3 formed fully soluble complexes that were distributed homogeneously throughout the nucleus. Strikingly, both condensates and aggregates were reversible, as shifting the HspB2:HspB3 balance in situ resulted in dissolution of these structures. To uncover the molecular composition of HspB2 condensates and aggregates, we used APEX-mediated proximity labelling. Most proteins interacted transiently with the condensates and were neither enriched nor depleted in these cells. In contrast, we found that HspB2:HspB3 aggregates sequestered several disordered proteins and autophagy factors, suggesting that the cell is actively attempting to clear these aggregates. This study presents a striking example of how changes in the relative expression levels of interacting proteins affect their phase behavior. Our approach could be applied to study the role of protein stoichiometry and the influence of client binding on phase behavior in other biomolecular condensates and aggregates.
Affiliation(s)
- Joep Joosten
- Biomolecular Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands; Physical Organic Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands; Synthetic Organic Chemistry, Radboud University Institute for Molecules and Materials, the Netherlands.
| | - Bob van Sluijs
- Physical Organic Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands
| | - Wilma Vree Egberts
- Biomolecular Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands
| | - Martin Emmaneel
- Biomolecular Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands
| | - Pascal W T C Jansen
- Molecular Biology, Radboud University Institute for Molecular Life Sciences, Nijmegen, the Netherlands
| | - Michiel Vermeulen
- Molecular Biology, Radboud University Institute for Molecular Life Sciences, Nijmegen, the Netherlands
| | - Wilbert Boelens
- Biomolecular Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands
| | - Kimberly M Bonger
- Synthetic Organic Chemistry, Radboud University Institute for Molecules and Materials, the Netherlands
| | - Evan Spruijt
- Physical Organic Chemistry, Radboud University Institute for Molecules and Materials, Nijmegen, the Netherlands
| |
|
19
|
Chuang CK, Chen SF, Su YH, Chen WH, Lin WM, Wang IC, Shyue SK. The Role of SCL Isoforms in Embryonic Hematopoiesis. Int J Mol Sci 2023; 24:ijms24076427. [PMID: 37047400 PMCID: PMC10094407 DOI: 10.3390/ijms24076427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 03/16/2023] [Accepted: 03/26/2023] [Indexed: 04/01/2023] Open
Abstract
Three waves of hematopoiesis occur in the mouse embryo. Primitive hematopoiesis appears as blood islands in the extraembryonic yolk sac at E7.5. Extraembryonic pro-definitive hematopoiesis begins in late E8, and embryonic definitive hematopoiesis begins at E10.5, as indicated by the emergence of hemogenic endothelial cells (HECs) on the inner wall of the extraembryonic arteries and the embryonic aorta. To study the roles of SCL protein isoforms in murine hematopoiesis, the SCL-large (SCL-L) isoform was selectively disrupted while the SCL-small (SCL-S) isoform was left intact. SCL-S was specifically expressed in the HECs, whereas SCL-L was detected only in the dispersed cells after budding from HECs. The SCLΔ/Δ homozygous mutant embryos survived only to E10.5, with normal extraembryonic vessels and red blood cells. In wild-type mouse embryos, a layer of neatly aligned CD34+ and CD43+ cells appeared on the endothelial wall of the aorta of the E10.5 fetus. However, the cells at the same site expressed CD31 rather than CD34 and/or CD43 in the E10.5 SCLΔ/Δ embryo, indicating that only the endothelial lineage had developed. These results reveal that SCL-S is sufficient to sustain primitive hematopoiesis and that SCL-L is necessary to launch definitive hematopoiesis.
|
20
|
Computational prediction of disordered binding regions. Comput Struct Biotechnol J 2023; 21:1487-1497. [PMID: 36851914 PMCID: PMC9957716 DOI: 10.1016/j.csbj.2023.02.018] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 02/08/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open
Abstract
One of the key features of intrinsically disordered regions (IDRs) is their ability to interact with a broad range of partner molecules. Multiple types of interacting IDRs have been identified, including molecular recognition fragments (MoRFs), short linear sequence motifs (SLiMs), and protein-, nucleic acid- and lipid-binding regions. Prediction of binding IDRs in protein sequences has gained momentum in recent years. We survey 38 predictors of binding IDRs that target interactions with a diverse set of partners, such as peptides, proteins, RNA, DNA and lipids. We offer a historical perspective and highlight key events that fueled efforts to develop these methods. These tools rely on a diverse range of predictive architectures that include scoring functions, regular expressions, traditional and deep machine learning and meta-models. Recent efforts focus on the development of deep neural network-based architectures and extending coverage to RNA, DNA and lipid-binding IDRs. We analyze the availability of these methods and show that providing implementations and webservers results in much higher rates of citations/use. We also make several recommendations to take advantage of modern deep network architectures, develop tools that bundle predictions of multiple and different types of binding IDRs, and work on algorithms that model structures of the resulting complexes.
|
21
|
The biophysics of disordered proteins from the point of view of single-molecule fluorescence spectroscopy. Essays Biochem 2022; 66:875-890. [PMID: 36416865 PMCID: PMC9760427 DOI: 10.1042/ebc20220065] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 10/10/2022] [Accepted: 10/11/2022] [Indexed: 11/24/2022]
Abstract
Intrinsically disordered proteins (IDPs) and regions (IDRs) have emerged as key players across many biological functions and diseases. Unlike structured proteins, disordered proteins lack stable structure and are particularly sensitive to changes in the surrounding environment. Investigation of disordered ensembles requires new approaches and concepts for quantifying conformations, dynamics, and interactions. Here, we provide a short description of the fundamental biophysical properties of disordered proteins as understood through the lens of single-molecule fluorescence observations. Single-molecule Förster resonance energy transfer (FRET) and fluorescence correlation spectroscopy (FCS) provide an extensive and versatile toolbox for quantifying the characteristics of conformational distributions and the dynamics of disordered proteins across many different solution conditions, both in vitro and in living cells.
|
22
|
Usher ET, Showalter SA. Biophysical insights into glucose-dependent transcriptional regulation by PDX1. J Biol Chem 2022; 298:102623. [PMID: 36272648 PMCID: PMC9691942 DOI: 10.1016/j.jbc.2022.102623] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 11/22/2022] Open
Abstract
The pancreatic and duodenal homeobox 1 (PDX1) is a central regulator of glucose-dependent transcription of insulin in pancreatic β cells. PDX1 transcription factor activity is integral to the development and sustained health of the pancreas; accordingly, deciphering the complex network of cellular cues that lead to PDX1 activation or inactivation is an important step toward understanding the etiopathologies of pancreatic diseases and the development of novel therapeutics. Despite nearly 3 decades of research into PDX1 control of Insulin expression, the molecular mechanisms that dictate the function of PDX1 in response to glucose are still elusive. The transcriptional activation functions of PDX1 are regulated, in part, by its two intrinsically disordered regions, which pose a barrier to its structural and biophysical characterization. Indeed, many studies of PDX1 interactions, clinical mutations, and posttranslational modifications lack molecular level detail. Emerging methods for the quantitative study of intrinsically disordered regions and refined models for transactivation now enable us to validate and interrogate the biochemical and biophysical features of PDX1 that dictate its function. The goal of this review is to summarize existing PDX1 studies and, further, to generate a comprehensive resource for future studies of transcriptional control via PDX1.
Affiliation(s)
- Emery T Usher
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Scott A Showalter
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, USA; Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania, USA.
| |
|
23
|
Clanor PB, Buchholz CN, Hayes JE, Friedman MA, White AM, Enke RA, Berndsen CE. Structural and functional analysis of the human cone‐rod homeobox transcription factor. Proteins 2022; 90:1584-1593. [PMID: 35255174 PMCID: PMC9271546 DOI: 10.1002/prot.26332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 02/28/2022] [Accepted: 03/02/2022] [Indexed: 11/30/2022]
Abstract
The cone‐rod homeobox (CRX) protein is a critical K50 homeodomain transcription factor responsible for the differentiation and maintenance of photoreceptor neurons in the vertebrate retina. Mutant alleles in the human gene encoding CRX result in a variety of distinct blinding retinopathies, including retinitis pigmentosa, cone‐rod dystrophy, and Leber congenital amaurosis. Despite the success of using in vitro biochemistry, animal models, and genomics approaches to study this clinically relevant transcription factor over the past 25 years since its initial characterization, there are no high‐resolution structures in the published literature for the CRX protein. In this study, we use bioinformatic approaches and small‐angle X‐ray scattering (SAXS) structural analysis to further understand the biochemical complexity of the human CRX homeodomain (CRX‐HD). We find that the CRX‐HD is a compact, globular monomer in solution that can specifically bind functional cis‐regulatory elements encoded upstream of retina‐specific genes. This study presents the first structural analysis of CRX, paving the way for a new approach to studying the biochemistry of this protein and its disease‐causing mutant protein variants.
Affiliation(s)
| | - Christine N. Buchholz
- Department of Chemistry and Biochemistry James Madison University Harrisonburg Virginia USA
| | - Jonathan E. Hayes
- Department of Chemistry and Biochemistry James Madison University Harrisonburg Virginia USA
| | | | - Andrew M. White
- Department of Chemistry and Biochemistry James Madison University Harrisonburg Virginia USA
| | - Ray A. Enke
- Department of Biology James Madison University Harrisonburg Virginia USA
- Center for Genome and Metagenome Studies James Madison University Harrisonburg Virginia USA
| | - Christopher E. Berndsen
- Department of Chemistry and Biochemistry James Madison University Harrisonburg Virginia USA
- Center for Genome and Metagenome Studies James Madison University Harrisonburg Virginia USA
| |
|
24
|
Friis Theisen F, Salladini E, Davidsen R, Jo Rasmussen C, Staby L, Kragelund BB, Skriver K. αα-hub coregulator structure and flexibility determine transcription factor binding and selection in regulatory interactomes. J Biol Chem 2022; 298:101963. [PMID: 35452682 PMCID: PMC9127584 DOI: 10.1016/j.jbc.2022.101963] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 04/13/2022] [Accepted: 04/15/2022] [Indexed: 11/23/2022] Open
Abstract
Formation of transcription factor (TF)-coregulator complexes is a key step in transcriptional regulation, with coregulators having essential functions as hub nodes in molecular networks. How specificity and selectivity are maintained in these nodes remains an open question. In this work, we addressed specificity in transcriptional networks using complexes formed between TFs and αα-hubs, which are defined by a common αα-hairpin secondary structure motif, as a model. Using NMR spectroscopy and binding thermodynamics, we analyzed the structure, dynamics, stability, and ligand-binding properties of the Arabidopsis thaliana RST domains from TAF4 and known binding partner RCD1, and the TAFH domain from human TAF4, allowing comparison across species, functions, and architectural contexts. While these αα-hubs shared the αα-hairpin motif, they differed in length and orientation of accessory helices as well as in their thermodynamic profiles of ligand binding. Whereas biologically relevant RCD1-ligand pairs displayed high affinity driven by enthalpy, TAF4-ligand interactions were entropy driven and exhibited less binding-induced structuring. We additionally identified a thermal unfolding state with a structured core for all three domains, although the temperature sensitivity differed. Thermal stability studies suggested that initial unfolding of the RCD1-RST domain localized around helix 1, lending this region structural malleability, while effects in TAF4-RST were more stochastic, suggesting variability in structural adaptability upon binding. Collectively, our results support a model in which hub structure, flexibility, and binding thermodynamics contribute to αα-hub-TF binding specificity, a finding of general relevance to the understanding of coregulator-ligand interactions and interactome sizes.
Affiliation(s)
- Frederik Friis Theisen
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Edoardo Salladini
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Rikke Davidsen
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Christina Jo Rasmussen
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Lasse Staby
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Birthe B Kragelund
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Karen Skriver
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
25
|
Zeng X, Ruff KM, Pappu RV. Competing interactions give rise to two-state behavior and switch-like transitions in charge-rich intrinsically disordered proteins. Proc Natl Acad Sci U S A 2022; 119:e2200559119. [PMID: 35512095 PMCID: PMC9171777 DOI: 10.1073/pnas.2200559119] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Received: 01/11/2022] [Accepted: 04/12/2022] [Indexed: 11/18/2022] Open
Abstract
The most commonly occurring intrinsically disordered proteins (IDPs) are polyampholytes, which are defined by the duality of low net charge per residue and high fractions of charged residues. Recent experiments have uncovered nuances regarding sequence–ensemble relationships of model polyampholytic IDPs. These include differences in conformational preferences for sequences with lysine vs. arginine and the suggestion that well-mixed sequences form a range of conformations, including globules, conformations with ensemble averages that are reminiscent of ideal chains, or self-avoiding walks. Here, we explain these observations by analyzing results from atomistic simulations. We find that polyampholytic IDPs generally sample two distinct stable states, namely, globules and self-avoiding walks. Globules are favored by electrostatic attractions between oppositely charged residues, whereas self-avoiding walks are favored by favorable free energies of hydration of charged residues. We find sequence-specific temperatures of bistability at which globules and self-avoiding walks can coexist. At these temperatures, ensemble averages over coexisting states give rise to statistics that resemble ideal chains without there being an actual counterbalancing of intrachain and chain-solvent interactions. At equivalent temperatures, arginine-rich sequences tilt the preference toward globular conformations whereas lysine-rich sequences tilt the preference toward self-avoiding walks. We also identify differences between aspartate- and glutamate-containing sequences, whereby the shorter aspartate side chain engenders preferences for metastable, necklace-like conformations. Finally, although segregation of oppositely charged residues within the linear sequence maintains the overall two-state behavior, compact states are highly favored by such systems.
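The two quantities that define a polyampholyte in the abstract above, the fraction of charged residues (FCR) and the net charge per residue (NCPR), can be computed directly from a sequence. The following is a minimal sketch, not the authors' analysis code: the function name `charge_fractions` and the example sequence are hypothetical, and it assumes that Lys and Arg each carry +1, Asp and Glu each carry -1, and histidine and the chain termini are neutral.

```python
def charge_fractions(seq: str) -> tuple[float, float]:
    """Return (FCR, NCPR) for a one-letter amino acid sequence.

    Assumption: K and R count as +1, D and E as -1; histidine and the
    chain termini are treated as neutral.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    positive = sum(seq.count(aa) for aa in "KR")
    negative = sum(seq.count(aa) for aa in "DE")
    n = len(seq)
    fcr = (positive + negative) / n   # fraction of charged residues
    ncpr = (positive - negative) / n  # net charge per residue (signed)
    return fcr, ncpr


if __name__ == "__main__":
    # Hypothetical well-mixed polyampholyte: high FCR, zero net charge.
    print(charge_fractions("EKEKEKEKGG"))
```

A sequence with a high FCR and an NCPR near zero matches the polyampholyte description in the abstract; the sketch says nothing about conformational preferences, which the authors obtained from atomistic simulations.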
Affiliation(s)
- Xiangze Zeng: Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130; Center for Science & Engineering of Living Systems, Washington University in St. Louis, St. Louis, MO 63130
- Kiersten M. Ruff: Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130; Center for Science & Engineering of Living Systems, Washington University in St. Louis, St. Louis, MO 63130
- Rohit V. Pappu: Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130; Center for Science & Engineering of Living Systems, Washington University in St. Louis, St. Louis, MO 63130
|
26
|
Disordered regions flanking the binding interface modulate affinity between CBP and NCOA. J Mol Biol 2022; 434:167643. [DOI: 10.1016/j.jmb.2022.167643] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 04/08/2022] [Revised: 05/13/2022] [Accepted: 05/16/2022] [Indexed: 01/01/2023]
|
27
|
Feric M, Misteli T. Function moves biomolecular condensates in phase space. Bioessays 2022; 44:e2200001. [PMID: 35243657 PMCID: PMC9277701 DOI: 10.1002/bies.202200001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2022] [Revised: 02/19/2022] [Accepted: 02/22/2022] [Indexed: 11/08/2022]
Abstract
Phase separation underlies the formation of biomolecular condensates. We hypothesize the cellular processes that occur within condensates shape their structural features. We use the example of transcription to discuss structure-function relationships in condensates. Various types of transcriptional condensates have been reported across the evolutionary spectrum in the cell nucleus as well as in mitochondrial and bacterial nucleoids. In vitro and in vivo observations suggest that transcriptional activity of condensates influences their supramolecular structure, which in turn affects their function. Condensate organization thus becomes driven by differences in miscibility among the DNA and proteins of the transcription machinery and the RNA transcripts they generate. These considerations are in line with the notion that cellular processes shape the structural properties of condensates, leading to a dynamic, mutual interplay between structure and function in the cell.
Affiliation(s)
- Marina Feric: National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
- Tom Misteli: National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
|
28
|
Cancer: More than a geneticist’s Pandora’s box. J Biosci 2022. [DOI: 10.1007/s12038-022-00254-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]
|
29
|
Kind L, Raasakka A, Molnes J, Aukrust I, Bjørkhaug L, Njølstad PR, Kursula P, Arnesen T. Structural and biophysical characterization of transcription factor HNF-1A as a tool to study MODY3 diabetes variants. J Biol Chem 2022; 298:101803. [PMID: 35257744 PMCID: PMC8988010 DOI: 10.1016/j.jbc.2022.101803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/20/2021] [Revised: 02/15/2022] [Accepted: 02/20/2022] [Indexed: 11/05/2022] Open
Abstract
Hepatocyte nuclear factor 1A (HNF-1A) is a transcription factor expressed in several embryonic and adult tissues, modulating the expression of numerous target genes. Pathogenic variants in the HNF1A gene are known to cause maturity-onset diabetes of the young 3 (MODY3 or HNF1A MODY), a disease characterized by dominant inheritance, age of onset before 25 to 35 years of age, and pancreatic β-cell dysfunction. A precise diagnosis can alter management of this disease, as insulin can be exchanged with sulfonylurea tablets and genetic counseling differs from polygenic forms of diabetes. Therefore, more knowledge on the mechanisms of HNF-1A function and the level of pathogenicity of the numerous HNF1A variants is required for precise diagnostics. Here, we structurally and biophysically characterized an HNF-1A protein containing both the DNA-binding domain and the dimerization domain, and determined the folding and DNA-binding capacity of two established MODY3 HNF-1A variant proteins (P112L, R263C) and one variant of unknown significance (N266S). All three variants showed reduced functionality compared to the WT protein. Furthermore, while the R263C and N266S variants displayed reduced binding to an HNF-1A target promoter, we found the P112L variant was unstable in vitro and in cells. Our results support and mechanistically explain disease causality for these investigated variants and present a novel approach for the dissection of structurally unstable and DNA-binding defective variants. This study indicates that structural and biochemical investigation of HNF-1A is a valuable tool in reliable variant classification needed for precision diabetes diagnostics and management.
Affiliation(s)
- Laura Kind: Department of Biomedicine, University of Bergen, Bergen, Norway
- Arne Raasakka: Department of Biomedicine, University of Bergen, Bergen, Norway
- Janne Molnes: Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway; Department of Medical Genetics, Haukeland University Hospital, Bergen, Norway
- Ingvild Aukrust: Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway; Department of Medical Genetics, Haukeland University Hospital, Bergen, Norway
- Lise Bjørkhaug: Department of Safety, Chemistry, and Biomedical Laboratory Sciences, Western Norway University of Applied Sciences, Bergen, Norway
- Pål Rasmus Njølstad: Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway; Section of Endocrinology and Metabolism, Children and Youth Clinic, Haukeland University Hospital, Bergen, Norway
- Petri Kursula: Department of Biomedicine, University of Bergen, Bergen, Norway; Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
- Thomas Arnesen: Department of Biomedicine, University of Bergen, Bergen, Norway; Department of Biological Sciences, University of Bergen, Bergen, Norway; Department of Surgery, Haukeland University Hospital, Bergen, Norway
|
30
|
Kulkarni P, Bhattacharya S, Achuthan S, Behal A, Jolly MK, Kotnala S, Mohanty A, Rangarajan G, Salgia R, Uversky V. Intrinsically Disordered Proteins: Critical Components of the Wetware. Chem Rev 2022; 122:6614-6633. [PMID: 35170314 PMCID: PMC9250291 DOI: 10.1021/acs.chemrev.1c00848] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Indexed: 12/29/2022]
Abstract
Despite the wealth of knowledge gained about intrinsically disordered proteins (IDPs) since their discovery, there are several aspects that remain unexplored and, hence, poorly understood. A living cell is a complex adaptive system that can be described as a wetware (a metaphor used to describe the cell as a computer comprising both hardware and software and attuned to logic gates) capable of "making" decisions. In this focused Review, we discuss how IDPs, as critical components of the wetware, influence cell-fate decisions by wiring protein interaction networks to keep them minimally frustrated. Because IDPs lie between order and chaos, we explore the possibility that they can be modeled as attractors. Further, we discuss how the conformational dynamics of IDPs manifests itself as conformational noise, which can potentially amplify transcriptional noise to stochastically switch cellular phenotypes. Finally, we explore the potential role of IDPs in prebiotic evolution, in forming proteinaceous membrane-less organelles, in the origin of multicellularity, and in protein conformation-based transgenerational inheritance of acquired characteristics. Together, these ideas provide a new conceptual framework to discern how IDPs may perform critical biological functions despite their lack of structure.
Affiliation(s)
- Prakash Kulkarni: Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
- Supriyo Bhattacharya: Integrative Genomics Core, City of Hope National Medical Center, Duarte, CA, USA
- Srisairam Achuthan: Division of Research Informatics, Center for Informatics, City of Hope National Medical Center, Duarte, CA 91010, USA
- Amita Behal: Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
- Mohit Kumar Jolly: Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
- Sourabh Kotnala: Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
- Atish Mohanty: Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
- Govindan Rangarajan: Department of Mathematics, Indian Institute of Science, Bangalore 560012, India; Center for Neuroscience, Indian Institute of Science, Bangalore 560012, India
- Ravi Salgia: Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
- Vladimir Uversky: Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA; Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Institutskiy pereulok, 9, Dolgoprudny, Moscow region 141700, Russia
|
31
|
Zhao B, Kurgan L. Deep learning in prediction of intrinsic disorder in proteins. Comput Struct Biotechnol J 2022; 20:1286-1294. [PMID: 35356546 PMCID: PMC8927795 DOI: 10.1016/j.csbj.2022.03.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Received: 01/10/2022] [Revised: 03/04/2022] [Accepted: 03/04/2022] [Indexed: 12/12/2022] Open
Abstract
Intrinsic disorder prediction is an active area that has developed over 100 predictors. We identify and investigate a recent trend towards the development of deep neural network (DNN)-based methods. The first DNN-based method was released in 2013, and since 2019 deep learners account for the majority of new disorder predictors. We find that the 13 currently available DNN-based predictors are diverse in their topologies, the sizes of their networks, and the inputs that they utilize. We empirically show that the deep learners are statistically more accurate than other types of disorder predictors using the blind test dataset from the recent community assessment of intrinsic disorder predictions (CAID). We also identify several well-rounded DNN-based predictors that are accurate, fast and/or conveniently available. The popularity, favorable predictive performance and architectural flexibility suggest that deep networks are likely to fuel the development of future disorder predictors. Novel hybrid designs of deep networks could be used to adequately accommodate the diversity of types and flavors of intrinsic disorder. We also discuss the scarcity of DNN-based methods for the prediction of disordered binding regions and the need to develop more accurate methods for this prediction.
Affiliation(s)
- Bi Zhao: Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
- Lukasz Kurgan: Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
|
32
|
Kurgan L. Resources for computational prediction of intrinsic disorder in proteins. Methods 2022; 204:132-141. [DOI: 10.1016/j.ymeth.2022.03.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 01/28/2022] [Revised: 03/25/2022] [Accepted: 03/29/2022] [Indexed: 12/26/2022] Open
|
33
|
Soto L, Li Z, Santoso CS, Berenson A, Ho I, Shen VX, Yuan S, Bass JIF. Compendium of human transcription factor effector domains. Mol Cell 2022; 82:514-526. [PMID: 34863368 PMCID: PMC8818021 DOI: 10.1016/j.molcel.2021.11.007] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Received: 08/09/2021] [Revised: 10/16/2021] [Accepted: 11/03/2021] [Indexed: 02/08/2023]
Abstract
Transcription factors (TFs) regulate gene expression by binding to DNA sequences and modulating transcriptional activity through their effector domains. Despite the central role of effector domains in TF function, there is a current lack of a comprehensive resource and characterization of effector domains. Here, we provide a catalog of 924 effector domains across 594 human TFs. Using this catalog, we characterized the amino acid composition of effector domains, their conservation across species and across the human population, and their roles in human diseases. Furthermore, we provide a classification system for effector domains that constitutes a valuable resource and a blueprint for future experimental studies of TF effector domain function.
Affiliation(s)
- Luis Soto: Escuela Profesional de Genética y Biotecnología, Facultad de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Lima 15081, Perú
- Zhaorong Li: Bioinformatics Program, Boston University, Boston MA 02215
- Clarissa S Santoso: Biology Department, Boston University, Boston MA 02215; Molecular Biology, Cellular Biology and Biochemistry Program, Boston University, Boston MA 02215
- Anna Berenson: Biology Department, Boston University, Boston MA 02215; Molecular Biology, Cellular Biology and Biochemistry Program, Boston University, Boston MA 02215
- Isabella Ho: Biology Department, Boston University, Boston MA 02215
- Vivian X Shen: Biology Department, Boston University, Boston MA 02215
- Samson Yuan: Biology Department, Boston University, Boston MA 02215
- Juan I Fuxman Bass: Bioinformatics Program, Boston University, Boston MA 02215; Biology Department, Boston University, Boston MA 02215; Molecular Biology, Cellular Biology and Biochemistry Program, Boston University, Boston MA 02215
|
34
|
Gao M, Li P, Su Z, Huang Y. Topological frustration leading to backtracking in a coupled folding-binding process. Phys Chem Chem Phys 2022; 24:2630-2637. [PMID: 35029261 DOI: 10.1039/d1cp04927e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/21/2022]
Abstract
Intrinsically disordered proteins (IDPs) are abundant in all species. Their discovery challenges the traditional "sequence-structure-function" paradigm of protein science because IDPs play important roles in various biological processes without preformed folded structures. Bioinformatic analysis reveals that the intrinsic conformational disorder of IDPs, as well as their conformational transition upon binding to their targets, is encoded by their amino acid sequences. The rRNase domain of colicin E3 and the immunity protein Im3 are a pair of proteins involved in bacterial survival. While the N-terminal segment and the central segment of E3 make comparable intermolecular contacts with Im3 in the bound state, binding of E3 with Im3 is dominantly triggered by the central segment of E3. In this work, to further investigate the binding mechanism of disordered E3 with Im3, we performed systematic free energy and transition path analysis through coarse-grained molecular dynamics simulations. We observed backtracking of the N-terminal segment of E3 in the binding process, whose occurrence depends on salt concentration. Conformational analysis revealed that initial binding of the N-terminal segment of E3 to Im3 usually leads to misorientation of a central hairpin of E3 on Im3, which generates topological frustration and results in backtracking of the N-terminal segment. Our results not only provide deeper mechanistic insights into the coupled folding-binding process of the E3/Im3 complex, but also suggest that topological frustration could be present in the coupled folding-binding process of IDPs and play an important role in regulating the binding transition pathways.
Affiliation(s)
- Meng Gao: Key Laboratory of Industrial Fermentation (Ministry of Education), Hubei University of Technology, Wuhan 430068, China; Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, China; National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Department of Biological Engineering, Hubei University of Technology, Wuhan 430068, China
- Ping Li: Key Laboratory of Industrial Fermentation (Ministry of Education), Hubei University of Technology, Wuhan 430068, China; Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, China; National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Department of Biological Engineering, Hubei University of Technology, Wuhan 430068, China
- Zhengding Su: Key Laboratory of Industrial Fermentation (Ministry of Education), Hubei University of Technology, Wuhan 430068, China; Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, China; National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Department of Biological Engineering, Hubei University of Technology, Wuhan 430068, China
- Yongqi Huang: Key Laboratory of Industrial Fermentation (Ministry of Education), Hubei University of Technology, Wuhan 430068, China; Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, China; National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Department of Biological Engineering, Hubei University of Technology, Wuhan 430068, China
|
35
|
Lorenz P, Steinbeck F, Krause L, Thiesen HJ. The KRAB Domain of ZNF10 Guides the Identification of Specific Amino Acids That Transform the Ancestral KRAB-A-Related Domain Present in Human PRDM9 into a Canonical Modern KRAB-A Domain. Int J Mol Sci 2022; 23:1072. [PMID: 35162997 PMCID: PMC8835667 DOI: 10.3390/ijms23031072] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 12/17/2021] [Revised: 01/11/2022] [Accepted: 01/13/2022] [Indexed: 12/14/2022] Open
Abstract
Krüppel-associated box (KRAB) zinc finger proteins are a large class of tetrapod transcription factors that usually exert transcriptional repression through recruitment of TRIM28/KAP1. The evolutionary root of modern KRAB domains (mKRAB) can be traced back to an ancestral motif (aKRAB) that occurs even in invertebrates. Here, we first stratified three subgroups of aKRAB sequences from the animal kingdom (PRDM9, SSX and coelacanth KZNF families) and defined ancestral subdomains for KRAB-A and KRAB-B. Using human ZNF10 mKRAB-AB as blueprints for function, we then identified the necessary amino acid changes that transform the inactive aKRAB-A of human PRDM9 into an mKRAB domain capable of mediating silencing and complexing TRIM28/KAP1 in human cells when employed as a hybrid with ZNF10-B. Full gain of function required replacement of residues KR by the conserved motif MLE (positions A32-A34), which inserted an additional residue, and exchange of A9/S for F, A20/M for L, and A27/R for V. AlphaFold2 modelling documented an evolutionarily conserved L-shaped body of two α-helices in all KRAB domains. It is transformed into a characteristic spatial arrangement typical for mKRAB-AB upon the amino acid replacements and in conjunction with a third helix supplied by mKRAB-B. Side-chains pointing outward from the core KRAB 3D structure may reveal a protein-protein interaction code enabling graded binding of TRIM28 to different KRAB domains. Our data provide basic insights into structure-function relationships and emulate transitions of KRAB during evolution.
Affiliation(s)
- Peter Lorenz: Rostock University Medical Center, Institute of Immunology, Schillingallee 70, 18057 Rostock, Germany
- Felix Steinbeck: Rostock University Medical Center, Institute of Immunology, Schillingallee 70, 18057 Rostock, Germany
- Ludwig Krause: Rostock University Medical Center, Institute of Immunology, Schillingallee 70, 18057 Rostock, Germany
- Hans-Jürgen Thiesen: Rostock University Medical Center, Institute of Immunology, Schillingallee 70, 18057 Rostock, Germany; Gesellschaft für Individualisierte Medizin (IndyMed) mbH, 17, 18055 Rostock, Germany
|
36
|
Bernardini A, Lorenzo M, Chaves-Sanjuan A, Swuec P, Pigni M, Saad D, Konarev PV, Graewert MA, Valentini E, Svergun DI, Nardini M, Mantovani R, Gnesutta N. The USR domain of USF1 mediates NF-Y interactions and cooperative DNA binding. Int J Biol Macromol 2021; 193:401-413. [PMID: 34673109 DOI: 10.1016/j.ijbiomac.2021.10.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/12/2021] [Revised: 10/07/2021] [Accepted: 10/08/2021] [Indexed: 10/20/2022]
Abstract
The trimeric CCAAT-binding NF-Y is a "pioneer" transcription factor (TF) known to cooperate with neighboring TFs to regulate gene expression. Genome-wide analyses detected a precise stereo-alignment (10/12 bp) of CCAAT with E-box elements and corresponding colocalization of NF-Y with basic-Helix-Loop-Helix (bHLH) TFs. We dissected here NF-Y interactions with USF1 and MAX. USF1, but not MAX, cooperates in DNA binding with NF-Y. NF-Y and USF1 synergize to activate target promoters. Reconstruction of complexes by structural means shows independent DNA binding of MAX, whereas USF1 has extended contacts with NF-Y, involving the USR, a USF-specific amino acid sequence stretch required for trans-activation. The USR is an intrinsically disordered domain and adopts different conformations based on E-box-CCAAT distances. Deletion of the USR abolishes cooperative DNA binding with NF-Y. Our data indicate that the functionality of certain unstructured domains involves adapting to small variations in the stereo-alignment of multimeric TF sites.
Affiliation(s)
- Andrea Bernardini: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Mariangela Lorenzo: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Paolo Swuec: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Matteo Pigni: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Dana Saad: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Petr V Konarev: A.V. Shubnikov Institute of Crystallography, Federal Scientific Research Centre "Crystallography and Photonics" of Russian Academy of Science, Moscow 119333, Russian Federation
- Erica Valentini: European Molecular Biology Laboratory, Hamburg Unit, Hamburg 22607, Germany
- Dmitri I Svergun: European Molecular Biology Laboratory, Hamburg Unit, Hamburg 22607, Germany
- Marco Nardini: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Roberto Mantovani: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
- Nerina Gnesutta: Dipartimento di Bioscienze, Università degli Studi di Milano, Milano 20133, Italy
|
37
|
Flanking Disorder of the Folded αα-Hub Domain from Radical Induced Cell Death1 Affects Transcription Factor Binding by Ensemble Redistribution. J Mol Biol 2021; 433:167320. [PMID: 34687712 DOI: 10.1016/j.jmb.2021.167320] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/18/2021] [Revised: 09/28/2021] [Accepted: 10/13/2021] [Indexed: 11/22/2022]
Abstract
Protein intrinsic disorder is essential for organization of transcription regulatory interactomes. In these interactomes, the majority of transcription factors as well as their interaction partners have co-existing order and disorder. Yet, little attention has been paid to their interplay. Here, we investigate how order is affected by flanking disorder in the folded αα-hub domain RST from Radical-Induced Cell Death1 (RCD1), central in a large interactome of transcription factors. We show that the intrinsically disordered C-terminal tail of RCD1-RST shifts its conformational ensemble towards a pseudo-bound state through weak interactions with the ligand-binding pocket. An unfolded excited state is also accessible on the ms timescale independent of surrounding disordered regions, but its population is lowered by 50% in their presence. Flanking disorder additionally lowers transcription factor binding-affinity without affecting the dissociation rate constant, in accordance with similar bound-states assessed by NMR. The extensive dynamics of the RCD1-RST domain, modulated by flanking disorder, is suggestive of its adaptation to many different transcription factor ligands. The study illustrates how disordered flanking regions can tune fold and function through ensemble redistribution and is of relevance to modular proteins in general, many of which play key roles in regulation of genes.
|
38
|
Sequence, structural and functional conservation among the human and fission yeast ELL and EAF transcription elongation factors. Mol Biol Rep 2021; 49:1303-1320. [PMID: 34807377 DOI: 10.1007/s11033-021-06958-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/09/2021] [Accepted: 11/11/2021] [Indexed: 10/19/2022]
Abstract
BACKGROUND: Transcription elongation is a dynamic and tightly regulated step of gene expression in eukaryotic cells. Eleven nineteen Lysine rich Leukemia (ELL) and ELL Associated Factors (EAF) family of conserved proteins are required for efficient RNA polymerase II-mediated transcription elongation. Orthologs of these proteins have been identified in different organisms, including fission yeast and humans. METHODS AND RESULTS: In the present study, we have examined the sequence, structural and functional conservation between the fission yeast and human ELL and EAF orthologs. Our computational analysis revealed that these proteins share some sequence characteristics, and were predominantly disordered in both organisms. Our functional complementation assays revealed that both human ELL and EAF proteins could complement the lack of ell1+ or eaf1+ in Schizosaccharomyces pombe, respectively. Furthermore, our domain mapping experiments demonstrated that both the amino and carboxyl terminal domains of human EAF proteins could functionally complement the S. pombe eaf1 deletion phenotypes. However, only the carboxyl-terminus domain of human ELL was able to partially rescue the phenotypes associated with lack of ell1+ in S. pombe. CONCLUSIONS: Collectively, our work adds ELL-EAF to the increasing list of human-yeast complementation gene pairs, wherein the simpler fission yeast can be used to further enhance our understanding of the role of these proteins in transcription elongation and human disease.
|
39
|
Abstract
To predict transcription, one needs a mechanistic understanding of how the numerous required transcription factors (TFs) explore the nuclear space to find their target genes, assemble, cooperate, and compete with one another. Advances in fluorescence microscopy have made it possible to visualize real-time TF dynamics in living cells, leading to two intriguing observations: first, most TFs contact chromatin only transiently; and second, TFs can assemble into clusters through their intrinsically disordered regions. These findings suggest that highly dynamic events and spatially structured nuclear microenvironments might play key roles in transcription regulation that are not yet fully understood. The emerging model is that while some promoters directly convert TF-binding events into on/off cycles of transcription, many others apply complex regulatory layers that ultimately lead to diverse phenotypic outputs. Cracking this kinetic code is an ongoing and challenging task that is made possible by combining innovative imaging approaches with biophysical models.
Affiliation(s)
- Feiyue Lu
- Institute for Systems Genetics and Cell Biology Department, NYU School of Medicine, New York, New York 10016, USA
| | - Timothée Lionnet
- Institute for Systems Genetics and Cell Biology Department, NYU School of Medicine, New York, New York 10016, USA
| |
|
40
|
Broyles BK, Gutierrez AT, Maris TP, Coil DA, Wagner TM, Wang X, Kihara D, Class CA, Erkine AM. Activation of gene expression by detergent-like protein domains. iScience 2021; 24:103017. [PMID: 34522860 PMCID: PMC8426559 DOI: 10.1016/j.isci.2021.103017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 05/11/2021] [Revised: 07/08/2021] [Accepted: 08/18/2021] [Indexed: 11/24/2022] Open
Abstract
The mechanisms by which transcriptional activation domains (tADs) initiate eukaryotic gene expression have been an enigma for decades because most tADs lack specificity in sequence, structure, and interactions with targets. Machine learning analysis of data sets of tAD sequences generated in vivo elucidated several functionality rules: the functional tAD sequences should (i) be devoid of or depleted with basic amino acid residues, (ii) be enriched with aromatic and acidic residues, (iii) be with aromatic residues localized mostly near the terminus of the sequence, and acidic residues localized more internally within a span of 20-30 amino acids, (iv) be with both aromatic and acidic residues preferably spread out in the sequence and not clustered, and (v) not be separated by occasional basic residues. These and other more subtle rules are not absolute, reflecting absence of a tAD consensus sequence, enormous variability, and consistent with surfactant-like tAD biochemical properties. The findings are compatible with the paradigm-shifting nucleosome detergent mechanism of gene expression activation, contributing to the development of the liquid-liquid phase separation model and the biochemistry of near-stochastic functional allosteric interactions.
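The compositional rules listed in this abstract can be illustrated with a minimal Python sketch. It is not the authors' machine learning model: the function `tad_features`, the `TadFeatures` record, the residue classes (K/R as basic, D/E as acidic, F/W/Y as aromatic), and the example sequence are all assumptions made here for illustration. The sketch only reports how basic, acidic, and aromatic residues are represented in a sequence and where the aromatic and acidic residues sit on average.

```python
from dataclasses import dataclass

# Residue classes are assumptions for illustration; the abstract does not list them.
BASIC = set("KR")
ACIDIC = set("DE")
AROMATIC = set("FWY")


@dataclass
class TadFeatures:
    basic_fraction: float
    acidic_fraction: float
    aromatic_fraction: float
    mean_aromatic_position: float  # 0.0 = first residue, 1.0 = last residue
    mean_acidic_position: float    # same scale; NaN if the class is absent


def _fraction(seq: str, residues: set) -> float:
    """Fraction of positions in seq occupied by the given residue class."""
    return sum(1 for aa in seq if aa in residues) / len(seq)


def _mean_position(seq: str, residues: set) -> float:
    """Mean relative position (0..1) of the given residue class in seq."""
    positions = [i for i, aa in enumerate(seq) if aa in residues]
    if not positions or len(seq) < 2:
        return float("nan")
    return sum(positions) / len(positions) / (len(seq) - 1)


def tad_features(seq: str) -> TadFeatures:
    """Summarize composition and residue placement of a candidate sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return TadFeatures(
        basic_fraction=_fraction(seq, BASIC),
        acidic_fraction=_fraction(seq, ACIDIC),
        aromatic_fraction=_fraction(seq, AROMATIC),
        mean_aromatic_position=_mean_position(seq, AROMATIC),
        mean_acidic_position=_mean_position(seq, ACIDIC),
    )


if __name__ == "__main__":
    # Hypothetical sequence: acidic residues internal, aromatic residues at one end.
    print(tad_features("DDEDEDDEFW"))
```

Such descriptive features could serve as inputs to a classifier, but, as the abstract stresses, the rules are not absolute, so no hard thresholds are applied here.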
Affiliation(s)
- Bradley K Broyles
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Andrew T Gutierrez
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Theodore P Maris
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Daniel A Coil
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Thomas M Wagner
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN 47907, USA
| | - Caleb A Class
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| | - Alexandre M Erkine
- College of Pharmacy and Health Sciences, Butler University, Indianapolis, IN 46208, USA
| |
|
41
|
Juanes-Gusano D, Santos M, Reboto V, Alonso M, Rodríguez-Cabello JC. Self-assembling systems comprising intrinsically disordered protein polymers like elastin-like recombinamers. J Pept Sci 2021; 28:e3362. [PMID: 34545666 DOI: 10.1002/psc.3362] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 03/17/2021] [Revised: 07/02/2021] [Accepted: 07/13/2021] [Indexed: 12/19/2022]
Abstract
Despite lacking cooperatively folded structures under native conditions, numerous intrinsically disordered proteins (IDPs) nevertheless have great functional importance. These IDPs are hybrids containing both ordered and intrinsically disordered protein regions (IDPRs), the structure of which is highly flexible in this unfolded state. The conformational flexibility of these disordered systems favors transitions between disordered and ordered states triggered by intrinsic and extrinsic factors, folding into different dynamic molecular assemblies to enable proper protein functions. Indeed, prokaryotic enzymes present less disorder than eukaryotic enzymes, thus showing that this disorder is related to functional and structural complexity. Protein-based polymers that mimic these IDPs include the so-called elastin-like polypeptides (ELPs), which are inspired by the composition of natural elastin. Elastin-like recombinamers (ELRs) are ELPs produced using recombinant techniques and which can therefore be tailored for a specific application. One of the most widely used and studied characteristic structures in this field is the pentapeptide (VPGXG)n. The structural disorder in ELRs probably arises due to the high content of proline and glycine in the ELR backbone, because both these amino acids help to keep the polypeptide structure of elastomers disordered and hydrated. Moreover, the recombinant nature of these systems means that different sequences can be designed, including bioactive domains, to obtain specific structures for each application. Some of these structures, along with their applications as IDPs that self-assemble into functional vesicles or micelles from diblock copolymer ELRs, will be studied in the following sections. The incorporation of additional order- and disorder-promoting peptide/protein domains, such as α-helical coils or β-strands, in the ELR sequence, and their influence on self-assembly, will also be reviewed. In addition, chemically cross-linked systems with controllable order-disorder balance, and their role in biomineralization, will be discussed. Finally, we will review different multivalent IDPs-based coatings and films for different biomedical applications, such as spatially controlled cell adhesion, osseointegration, or biomaterial-associated infection (BAI).
Affiliation(s)
- Diana Juanes-Gusano
- BIOFORGE (Group for Advanced Materials and Nanobiotechnology) CIBER-BBN, Edificio Lucía, University of Valladolid, Valladolid, Spain
| | - Mercedes Santos
- BIOFORGE (Group for Advanced Materials and Nanobiotechnology) CIBER-BBN, Edificio Lucía, University of Valladolid, Valladolid, Spain
| | - Virginia Reboto
- BIOFORGE (Group for Advanced Materials and Nanobiotechnology) CIBER-BBN, Edificio Lucía, University of Valladolid, Valladolid, Spain
| | - Matilde Alonso
- BIOFORGE (Group for Advanced Materials and Nanobiotechnology) CIBER-BBN, Edificio Lucía, University of Valladolid, Valladolid, Spain
| | - José Carlos Rodríguez-Cabello
- BIOFORGE (Group for Advanced Materials and Nanobiotechnology) CIBER-BBN, Edificio Lucía, University of Valladolid, Valladolid, Spain
| |
|
42
|
Mazzocca M, Colombo E, Callegari A, Mazza D. Transcription factor binding kinetics and transcriptional bursting: What do we really know? Curr Opin Struct Biol 2021; 71:239-248. [PMID: 34481381 DOI: 10.1016/j.sbi.2021.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 06/14/2021] [Revised: 08/02/2021] [Accepted: 08/06/2021] [Indexed: 11/18/2022]
Abstract
In eukaryotes, transcription is a discontinuous process with mRNA being generated in bursts, after the binding of transcription factors (TFs) to regulatory elements on the genome. Live-cell single-molecule microscopy has highlighted that transcriptional bursting can be controlled by tuning TF/DNA binding kinetics. Yet the timescales of these two processes seem disconnected with TF/DNA interactions typically lasting orders of magnitude shorter than transcriptional bursts. To test models that could reconcile these discrepancies, reliable measurements of TF binding kinetics are needed, also accounting for the current limitations in performing these single-molecule measurements at specific regulatory elements. Here, we review the recent studies linking TF binding kinetics to transcriptional bursting and outline some current and future challenges that need to be addressed to provide a microscopic description of transcriptional regulation kinetics.
Affiliation(s)
- Matteo Mazzocca
- Experimental Imaging Center, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
| | - Emanuele Colombo
- Experimental Imaging Center, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
| | | | - Davide Mazza
- Experimental Imaging Center, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy.
| |
|
43
|
Bjarnason S, Ruidiaz SF, McIvor J, Mercadante D, Heidarsson PO. Protein intrinsic disorder on a dynamic nucleosomal landscape. Prog Mol Biol Transl Sci 2021; 183:295-354. [PMID: 34656332 DOI: 10.1016/bs.pmbts.2021.06.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/23/2022]
Abstract
The complex nucleoprotein landscape of the eukaryotic cell nucleus is rich in dynamic proteins that lack a stable three-dimensional structure. Many of these intrinsically disordered proteins operate directly on the first fundamental level of genome compaction: the nucleosome. Here we give an overview of how disordered interactions with and within nucleosomes shape the dynamics, architecture, and epigenetic regulation of the genetic material, controlling cellular transcription patterns. We highlight experimental and computational challenges in the study of protein disorder and illustrate how integrative approaches are increasingly unveiling the fine details of nuclear interaction networks. We finally dissect sequence properties encoded in disordered regions and assess common features of disordered nucleosome-binding proteins. As drivers of many critical biological processes, disordered proteins are integral to a comprehensive molecular view of the dynamic nuclear milieu.
Affiliation(s)
- Sveinn Bjarnason
- Department of Biochemistry, Science Institute, University of Iceland, Reykjavík, Iceland
| | - Sarah F Ruidiaz
- Department of Biochemistry, Science Institute, University of Iceland, Reykjavík, Iceland
| | - Jordan McIvor
- School of Chemical Science, University of Auckland, Auckland, New Zealand
| | - Davide Mercadante
- School of Chemical Science, University of Auckland, Auckland, New Zealand.
| | - Pétur O Heidarsson
- Department of Biochemistry, Science Institute, University of Iceland, Reykjavík, Iceland.
| |
|
44
|
Saad D, Paissoni C, Chaves-Sanjuan A, Nardini M, Mantovani R, Gnesutta N, Camilloni C. High Conformational Flexibility of the E2F1/DP1/DNA Complex. J Mol Biol 2021; 433:167119. [PMID: 34181981 DOI: 10.1016/j.jmb.2021.167119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/21/2021] [Revised: 06/17/2021] [Accepted: 06/22/2021] [Indexed: 10/21/2022]
Abstract
The E2F1 transcription factor is a master regulator of cell-cycle progression whose uncontrolled activation contributes to tumor cells growth. E2F1 binds DNA as a heterodimer with DP partners, resulting in a multi-domain quaternary-structure complex composed of DNA binding domains, a coiled coil domain and a marked box domain separated by short linkers. Building on the 3D knowledge of the single domains of E2F and DPs, we characterized the structure and dynamics of the complete E2F1/DP1/DNA complex by a combination of small-angle X-ray scattering and molecular dynamics simulations. It shows an asymmetric contribution of the dynamics of the two proteins. Namely, the coiled-coil domain leans toward the DP1 side of the complex; the DP1 loop between α2 and α3 of the DBD partially populates a helical structure leaning far from the DNA and in the same direction of the coiled-coil domain; and the N-terminal disordered region of DP1, rich in basic residues, contributes to DNA binding stabilization. Intriguingly, tumor mutations in the flexible regions of the complex suggest that perturbation of protein dynamics could affect protein function in a context-dependent way. Our data suggest fundamental contributions of DP proteins in distinct aspects of E2F biology.
Affiliation(s)
- Dana Saad
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Cristina Paissoni
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Antonio Chaves-Sanjuan
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Marco Nardini
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Roberto Mantovani
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy
| | - Nerina Gnesutta
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy.
| | - Carlo Camilloni
- Dipartimento di Bioscienze, Università degli Studi di Milano, Via Celoria 26, 20133 Milano, Italy.
| |
|
45
|
Grasso EM, Majumdar A, Wrabl JO, Frueh DP, Hilser VJ. Conserved allosteric ensembles in disordered proteins using TROSY/anti-TROSY R2-filtered spectroscopy. Biophys J 2021; 120:2498-2510. [PMID: 33901472 PMCID: PMC8390865 DOI: 10.1016/j.bpj.2021.04.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 01/13/2021] [Revised: 03/11/2021] [Accepted: 04/16/2021] [Indexed: 11/22/2022] Open
Abstract
Defining the role of intrinsic disorder in proteins in the myriad of biological processes with which it is involved represents a significant goal in modern biophysics. Toward this end, NMR is uniquely suited for molecular studies of dynamic and disordered regions, but studying these regions in concert with their more structured domains and binding partners presents spectroscopic challenges. Here, we investigate the interactions between the structured and disordered regions of the human glucocorticoid receptor (GR). To do this, we developed an NMR strategy that relies on a novel relaxation filter for the simultaneous study of structured and unstructured regions. Using this approach, we conducted a comparative analysis of three translational isoforms of GR containing a folded DNA-binding domain (DBD) and two disordered regions that flank the DBD, one of which varies in size in the different isoforms. Notably, we were able to assign resonances that had previously been inaccessible because of the spectral complexity of the translational isoforms, which in turn allowed us to 1) identify a region of the structured DBD that undergoes significant changes in the local chemical environment in the presence of the disordered region and 2) determine differences in the conformational ensembles of the disordered regions of the translational isoforms. Furthermore, an ensemble-based thermodynamic analysis of the isoforms reveals conserved patterns of stability within the N-terminal domain of GR that persist despite low sequence conservation. These studies provide an avenue for further investigations of the mechanistic underpinnings of the functional relevance of the translational isoforms of GR while also providing a general NMR strategy for studying systems containing both structured and disordered regions.
Affiliation(s)
- Emily M Grasso
- Department of Biology, Johns Hopkins University, Baltimore, Maryland; T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland
| | - Ananya Majumdar
- The Biomolecular NMR Center, Johns Hopkins University, Baltimore, Maryland
| | - James O Wrabl
- Department of Biology, Johns Hopkins University, Baltimore, Maryland
| | - Dominique P Frueh
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins School of Medicine, Baltimore, Maryland
| | - Vincent J Hilser
- Department of Biology, Johns Hopkins University, Baltimore, Maryland; T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland.
| |
|
46
|
Colinas M, Pollier J, Vaneechoutte D, Malat DG, Schweizer F, De Milde L, De Clercq R, Guedes JG, Martínez-Cortés T, Molina-Hidalgo FJ, Sottomayor M, Vandepoele K, Goossens A. Subfunctionalization of Paralog Transcription Factors Contributes to Regulation of Alkaloid Pathway Branch Choice in Catharanthus roseus. Front Plant Sci 2021; 12:687406. [PMID: 34113373 PMCID: PMC8186833 DOI: 10.3389/fpls.2021.687406] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 03/29/2021] [Accepted: 04/27/2021] [Indexed: 06/12/2023]
Abstract
Catharanthus roseus produces a diverse range of specialized metabolites of the monoterpenoid indole alkaloid (MIA) class in a heavily branched pathway. Recent great progress in identification of MIA biosynthesis genes revealed that the different pathway branch genes are expressed in a highly cell type- and organ-specific and stress-dependent manner. This implies a complex control by specific transcription factors (TFs), only partly revealed today. We generated and mined a comprehensive compendium of publicly available C. roseus transcriptome data for MIA pathway branch-specific TFs. Functional analysis was performed through extensive comparative gene expression analysis and profiling of over 40 MIA metabolites in the C. roseus flower petal expression system. We identified additional members of the known BIS and ORCA regulators. Further detailed study of the ORCA TFs suggests subfunctionalization of ORCA paralogs in terms of target gene-specific regulation and synergistic activity with the central jasmonate response regulator MYC2. Moreover, we identified specific amino acid residues within the ORCA DNA-binding domains that contribute to the differential regulation of some MIA pathway branches. Our results advance our understanding of TF paralog specificity for which, despite the common occurrence of closely related paralogs in many species, comparative studies are scarce.
Affiliation(s)
- Maite Colinas
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Jacob Pollier
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Metabolomics Core, Ghent, Belgium
| | - Dries Vaneechoutte
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Deniz G. Malat
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Fabian Schweizer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Liesbeth De Milde
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Rebecca De Clercq
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Joana G. Guedes
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Vairão, Portugal
- I3S-Instituto de Investigação e Inovação em Saúde, IBMC-Instituto de Biologia Molecular e Celular, Universidade do Porto, Porto, Portugal
- ICBAS–Instituto de Ciências Biomédicas Abel Salazar, Universidade do Porto, Porto, Portugal
| | - Teresa Martínez-Cortés
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Vairão, Portugal
| | - Francisco J. Molina-Hidalgo
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Mariana Sottomayor
- CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Vairão, Portugal
- Faculdade de Ciências, Universidade do Porto, Porto, Portugal
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Alain Goossens
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| |
|
47
|
Mayne CG, Toy W, Carlson KE, Bhatt T, Fanning SW, Greene GL, Katzenellenbogen BS, Chandarlapaty S, Katzenellenbogen JA, Tajkhorshid E. Defining the Energetic Basis for a Conformational Switch Mediating Ligand-Independent Activation of Mutant Estrogen Receptors in Breast Cancer. Mol Cancer Res 2021; 19:1559-1570. [PMID: 34021071 DOI: 10.1158/1541-7786.mcr-20-1017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 12/04/2020] [Revised: 04/07/2021] [Accepted: 05/10/2021] [Indexed: 12/25/2022]
Abstract
Although most primary estrogen receptor (ER)-positive breast cancers respond well to endocrine therapies, many relapse later as metastatic disease due to endocrine therapy resistance. Over one third of these are associated with mutations in the ligand-binding domain (LBD) that activate the receptor independent of ligand. We have used an array of advanced computational techniques rooted in molecular dynamics simulations, in concert with and validated by experiments, to characterize the molecular mechanisms by which specific acquired somatic point mutations give rise to ER constitutive activation. By comparing structural and energetic features of constitutively active mutants and ligand-bound forms of ER-LBD with unliganded wild-type (WT) ER, we characterize a spring force originating from strain in the Helix 11-12 loop of WT-ER, opposing folding of Helix 12 into the active conformation and keeping WT-ER off and disordered, with the ligand-binding pocket open for rapid ligand binding. We quantify ways in which this spring force is abrogated by activating mutations that latch (Y537S) or relax (D538G) the folded form of the loop, enabling formation of the active conformation without ligand binding. We also identify a new ligand-mediated hydrogen-bonding network that stabilizes the active, ligand-bound conformation of WT-ER LBD, and similarly stabilizes the active conformation of the ER mutants in the hormone-free state. IMPLICATIONS: Our investigations provide deep insight into the energetic basis for the structural mechanisms of receptor activation through mutation, exemplified here with ER in endocrine-resistant metastatic breast cancers, with potential application to other dysregulated receptor signaling due to driver mutations.
Affiliation(s)
- Christopher G Mayne
- Department of Biochemistry, University of Illinois at Urbana-Champaign, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Urbana, Illinois
| | - Weiyi Toy
- Memorial Sloan Kettering Cancer Center, Human Oncology and Pathogenesis Program, New York, New York
| | - Kathryn E Carlson
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Trusha Bhatt
- Memorial Sloan Kettering Cancer Center, Human Oncology and Pathogenesis Program, New York, New York
| | - Sean W Fanning
- Ben May Department for Cancer Research, University of Chicago, Chicago, Illinois
| | - Geoffrey L Greene
- Ben May Department for Cancer Research, University of Chicago, Chicago, Illinois
| | - Benita S Katzenellenbogen
- Department of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Sarat Chandarlapaty
- Memorial Sloan Kettering Cancer Center, Human Oncology and Pathogenesis Program, New York, New York
| | | | - Emad Tajkhorshid
- Department of Biochemistry, University of Illinois at Urbana-Champaign, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Urbana, Illinois; Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois
| |
|
48
|
Rodrigues JA, Espley RV, Allan AC. Genomic analysis uncovers functional variation in the C-terminus of anthocyanin-activating MYB transcription factors. Hortic Res 2021; 8:77. [PMID: 33790254 PMCID: PMC8012628 DOI: 10.1038/s41438-021-00514-1] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 12/06/2020] [Revised: 02/16/2021] [Accepted: 03/01/2021] [Indexed: 05/26/2023]
Abstract
MYB transcription factors regulate diverse aspects of plant development and secondary metabolism, often by partnering in transcriptional regulatory complexes. Here, we harness genomic resources to identify novel MYBs, thereby producing an updated eudicot MYB phylogeny with revised relationships among subgroups as well as new information on sequence variation in the disordered C-terminus of anthocyanin-activating MYBs. BLAST® and hidden Markov model scans of gene annotations identified a total of 714 MYB transcription factors across the genomes of four crops that span the eudicots: apple, grape, kiwifruit and tomato. Codon model-based phylogenetic inference identified novel members of previously defined subgroups, and the function of specific anthocyanin-activating subgroup 6 members was assayed transiently in tobacco leaves. Sequence conservation within subgroup 6 highlighted one previously described and two novel short linear motifs in the disordered C-terminal region. The novel motifs have a mix of hydrophobic and acidic residues and are predicted to be relatively ordered compared with flanking protein sequences. Comparison of motifs with the Eukaryotic Linear Motif database suggests roles in protein-protein interaction. Engineering of motifs and their flanking regions from strong anthocyanin activators into weak activators, and vice versa, affected function. We conclude that, although the MYB C-terminal sequence diverges greatly even within MYB clades, variation within the C-terminus at and near relatively ordered regions offers opportunities for exploring MYB function and developing superior alleles for plant breeding.
Affiliation(s)
- Jessica A Rodrigues
- The New Zealand Institute for Plant and Food Research Limited, 120 Mount Albert Road, Sandringham, Auckland, 1025, New Zealand
| | - Richard V Espley
- The New Zealand Institute for Plant and Food Research Limited, 120 Mount Albert Road, Sandringham, Auckland, 1025, New Zealand
| | - Andrew C Allan
- The New Zealand Institute for Plant and Food Research Limited, 120 Mount Albert Road, Sandringham, Auckland, 1025, New Zealand.
- School of Biological Sciences, University of Auckland, 3A Symonds St, Auckland, 1010, New Zealand.
| |
|
49
|
Jensen KS. Measuring and Analyzing Binding Kinetics of Coupled Folding and Binding Reactions Under Pseudo-First-Order Conditions. Methods Mol Biol 2021; 2141:629-650. [PMID: 32696381 DOI: 10.1007/978-1-0716-0524-0_32] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 02/13/2023]
Abstract
Many intrinsically disordered proteins (IDPs) adopt a well-defined structure upon binding to their interaction partners. Kinetic characterization is a requirement for the investigation of the dynamics and mechanisms of these folding-upon-binding reactions. Here a protocol is described for the investigation of binding kinetics of bimolecular binding and folding reactions of IDPs to their ligand partner under pseudo-first-order conditions using stopped-flow mixing and fluorescence detection.
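As a rough illustration of what pseudo-first-order analysis involves, the sketch below assumes the simplest one-step binding model, in which the ligand is in large excess and the observed rate constant depends linearly on ligand concentration: k_obs = k_on[L] + k_off. This is a textbook simplification and not the protocol from the chapter; coupled folding and binding reactions can deviate from linearity. The function name `fit_pseudo_first_order`, the units, and the synthetic data are assumptions made for this example.

```python
def fit_pseudo_first_order(ligand_conc, k_obs):
    """Least-squares fit of k_obs = k_on * [L] + k_off.

    ligand_conc: ligand concentrations (assumed to be in large excess)
    k_obs: observed rate constants, one per concentration
    Returns (k_on, k_off) as the slope and intercept of the fitted line.
    """
    n = len(ligand_conc)
    if n != len(k_obs) or n < 2:
        raise ValueError("need at least two paired measurements")
    mean_x = sum(ligand_conc) / n
    mean_y = sum(k_obs) / n
    sxx = sum((x - mean_x) ** 2 for x in ligand_conc)
    if sxx == 0:
        raise ValueError("ligand concentrations must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(ligand_conc, k_obs))
    k_on = sxy / sxx                 # slope: association rate constant
    k_off = mean_y - k_on * mean_x   # intercept: dissociation rate constant
    return k_on, k_off


if __name__ == "__main__":
    # Synthetic, noise-free data generated with k_on = 2.0 and k_off = 5.0
    # (hypothetical units: uM^-1 s^-1 and s^-1).
    conc = [1.0, 2.0, 4.0, 8.0]
    rates = [2.0 * c + 5.0 for c in conc]
    print(fit_pseudo_first_order(conc, rates))
```

With real stopped-flow data, each k_obs would itself come from fitting an exponential to a fluorescence trace, and a curved dependence on ligand concentration would suggest a multi-step mechanism rather than this linear model.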
Affiliation(s)
- Kristine Steen Jensen
- Department for Biophysical Chemistry, Center for Molecular Protein Science, LTH, Lund University, Lund, Sweden.
| |
|
50
|
Zhao B, Katuwawala A, Uversky VN, Kurgan L. IDPology of the living cell: intrinsic disorder in the subcellular compartments of the human cell. Cell Mol Life Sci 2021; 78:2371-2385. [PMID: 32997198 PMCID: PMC11071772 DOI: 10.1007/s00018-020-03654-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 06/08/2020] [Revised: 09/09/2020] [Accepted: 09/22/2020] [Indexed: 12/11/2022]
Abstract
Intrinsic disorder can be found in all proteomes of all kingdoms of life and in viruses, being particularly prevalent in the eukaryotes. We conduct a comprehensive analysis of the intrinsic disorder in the human proteins while mapping them into 24 compartments of the human cell. In agreement with previous studies, we show that human proteins are significantly enriched in disorder relative to a generic protein set that represents the protein universe. In fact, the fraction of proteins with long disordered regions and the average protein-level disorder content in the human proteome are about 3 times higher than in the protein universe. Furthermore, levels of intrinsic disorder in the majority of human subcellular compartments significantly exceed the average disorder content in the protein universe. Relative to the overall amount of disorder in the human proteome, proteins localized in the nucleus and cytoskeleton have significantly increased amounts of disorder, measured by both high disorder content and presence of multiple long intrinsically disordered regions. We empirically demonstrate that, on average, human proteins are assigned to 2.3 subcellular compartments, with proteins localized to few subcellular compartments being more disordered than the proteins that are localized to many compartments. Functionally, the disordered proteins localized in the most disorder-enriched subcellular compartments are primarily responsible for interactions with nucleic acids and protein partners. This is the first time disorder is comprehensively mapped into the human cell. Our observations add a missing piece to the puzzle of functional disorder and its organization inside the cell.
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL, 33612, USA.
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Russia.
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| |
|