1
|
Johnson CN, Sojitra KA, Sohn EJ, Moreno-Romero AK, Baudin A, Xu X, Mittal J, Libich DS. Insights into Molecular Diversity within the FUS/EWS/TAF15 Protein Family: Unraveling Phase Separation of the N-Terminal Low-Complexity Domain from RNA-Binding Protein EWS. J Am Chem Soc 2024; 146:8071-8085. [PMID: 38492239 PMCID: PMC11156192 DOI: 10.1021/jacs.3c12034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2024]
Abstract
The FET protein family, comprising FUS, EWS, and TAF15, plays crucial roles in mRNA maturation, transcriptional regulation, and DNA damage response. Clinically, they are linked to Ewing family tumors and neurodegenerative diseases such as amyotrophic lateral sclerosis. The fusion protein EWS::FLI1, the causative mutation of Ewing sarcoma, arises from a genomic translocation that fuses a portion of the low-complexity domain (LCD) of EWS (EWSLCD) with the DNA binding domain of the ETS transcription factor FLI1. This fusion protein modifies transcriptional programs and disrupts native EWS functions, such as splicing. The exact role of the intrinsically disordered EWSLCD remains a topic of active investigation, but its ability to phase separate and form biomolecular condensates is believed to be central to EWS::FLI1's oncogenic properties. Here, we used paramagnetic relaxation enhancement NMR, microscopy, and all-atom molecular dynamics (MD) simulations to better understand the self-association and phase separation tendencies of the EWSLCD. Our NMR data and mutational analysis suggest that a higher density and proximity of tyrosine residues amplify the likelihood of condensate formation. MD simulations revealed that the tyrosine-rich termini exhibit compact conformations with unique contact networks and provided critical input on the relationship between contacts formed within a single molecule (intramolecular) and inside the condensed phase (intermolecular). These findings enhance our understanding of FET proteins' condensate-forming capabilities and underline differences between EWS, FUS, and TAF15.
Collapse
Affiliation(s)
- Courtney N. Johnson
- Greehey Children’s Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
| | - Kandarp A Sojitra
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas 77843, United States
| | - Erich J. Sohn
- Greehey Children’s Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
| | - Alma K. Moreno-Romero
- Greehey Children’s Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
| | - Antoine Baudin
- Greehey Children’s Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
| | - Xiaoping Xu
- Greehey Children’s Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
| | - Jeetain Mittal
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas 77843, United States
- Department of Chemistry, Texas A&M University, College Station, Texas 77843, United States
- Interdisciplinary Graduate Program in Genetics and Genomics, Texas A&M University, College Station, Texas 77843, United States
| | - David S. Libich
- Greehey Children’s Cancer Research Institute, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, Texas, 78229, United States
| |
Collapse
|
2
|
Pietrek LM, Stelzl LS, Hummer G. Hierarchical Assembly of Single-Stranded RNA. J Chem Theory Comput 2024; 20:2246-2260. [PMID: 38361440 PMCID: PMC10938505 DOI: 10.1021/acs.jctc.3c01049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 12/09/2023] [Accepted: 01/25/2024] [Indexed: 02/17/2024]
Abstract
Single-stranded RNA (ssRNA) plays a major role in the flow of genetic information-most notably, in the form of messenger RNA (mRNA)-and in the regulation of biological processes. The highly dynamic nature of chains of unpaired nucleobases challenges structural characterizations of ssRNA by experiments or molecular dynamics (MD) simulations alike. Here, we use hierarchical chain growth (HCG) to construct ensembles of ssRNA chains. HCG assembles the structures of protein and nucleic acid chains from fragment libraries created by MD simulations. Applied to homo- and heteropolymeric ssRNAs of different lengths, we find that HCG produces structural ensembles that overall are in good agreement with diverse experiments, including nuclear magnetic resonance (NMR), small-angle X-ray scattering (SAXS), and single-molecule Förster resonance energy transfer (FRET). The agreement can be further improved by ensemble refinement using Bayesian inference of ensembles (BioEn). HCG can also be used to assemble RNA structures that combine base-paired and base-unpaired regions, as illustrated for the 5' untranslated region (UTR) of SARS-CoV-2 RNA.
Collapse
Affiliation(s)
- Lisa M. Pietrek
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Lukas S. Stelzl
- Faculty
of Biology, Johannes Gutenberg University
Mainz, Gresemundweg 2, 55128 Mainz, Germany
- KOMET
1, Institute of Physics, Johannes Gutenberg
University Mainz, 55099 Mainz, Germany
- Institute
of Molecular Biology (IMB), 55128 Mainz, Germany
| | - Gerhard Hummer
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
- Institute
for Biophysics, Goethe University, Max-von-Laue-Straße 9, 60438 Frankfurt am Main, Germany
| |
Collapse
|
3
|
Tesei G, Trolle AI, Jonsson N, Betz J, Knudsen FE, Pesce F, Johansson KE, Lindorff-Larsen K. Conformational ensembles of the human intrinsically disordered proteome. Nature 2024; 626:897-904. [PMID: 38297118 DOI: 10.1038/s41586-023-07004-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 12/19/2023] [Indexed: 02/02/2024]
Abstract
Intrinsically disordered proteins and regions (collectively, IDRs) are pervasive across proteomes in all kingdoms of life, help to shape biological functions and are involved in numerous diseases. IDRs populate a diverse set of transiently formed structures and defy conventional sequence-structure-function relationships1. Developments in protein science have made it possible to predict the three-dimensional structures of folded proteins at the proteome scale2. By contrast, there is a lack of knowledge about the conformational properties of IDRs, partly because the sequences of disordered proteins are poorly conserved and also because only a few of these proteins have been characterized experimentally. The inability to predict structural properties of IDRs across the proteome has limited our understanding of the functional roles of IDRs and how evolution shapes them. As a supplement to previous structural studies of individual IDRs3, we developed an efficient molecular model to generate conformational ensembles of IDRs and thereby to predict their conformational properties from sequences4,5. Here we use this model to simulate nearly all of the IDRs in the human proteome. Examining conformational ensembles of 28,058 IDRs, we show how chain compaction is correlated with cellular function and localization. We provide insights into how sequence features relate to chain compaction and, using a machine-learning model trained on our simulation data, show the conservation of conformational properties across orthologues. Our results recapitulate observations from previous studies of individual protein systems and exemplify how to link-at the proteome scale-conformational ensembles with cellular function and localization, amino acid sequence, evolutionary conservation and disease variants. Our freely available database of conformational properties will encourage further experimental investigation and enable the generation of hypotheses about the biological roles and evolution of IDRs.
Collapse
Affiliation(s)
- Giulio Tesei
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Anna Ida Trolle
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Nicolas Jonsson
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Johannes Betz
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Frederik E Knudsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Francesco Pesce
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kristoffer E Johansson
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|