1
|
Hadarovich A, Singh HR, Ghosh S, Scheremetjew M, Rostam N, Hyman AA, Toth-Petroczy A. PICNIC accurately predicts condensate-forming proteins regardless of their structural disorder across organisms. Nat Commun 2024; 15:10668. [PMID: 39663388 PMCID: PMC11634905 DOI: 10.1038/s41467-024-55089-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 11/28/2024] [Indexed: 12/13/2024] Open
Abstract
Biomolecular condensates are membraneless organelles that can concentrate hundreds of different proteins in cells to operate essential biological functions. However, accurate identification of their components remains challenging and biased towards proteins with high structural disorder content with focus on self-phase separating (driver) proteins. Here, we present a machine learning algorithm, PICNIC (Proteins Involved in CoNdensates In Cells) to classify proteins that localize to biomolecular condensates regardless of their role in condensate formation. PICNIC successfully predicts condensate members by learning amino acid patterns in the protein sequence and structure in addition to the intrinsic disorder. Extensive experimental validation of 24 positive predictions in cellulo shows an overall ~82% accuracy regardless of the structural disorder content of the tested proteins. While increasing disorder content is associated with organismal complexity, our analysis of 26 species reveals no correlation between predicted condensate proteome content and disorder content across organisms. Overall, we present a machine learning classifier to interrogate condensate components at whole-proteome levels across the tree of life.
Collapse
Affiliation(s)
- Anna Hadarovich
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
- Center for Systems Biology Dresden, 01307, Dresden, Germany
| | - Hari Raj Singh
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
| | - Soumyadeep Ghosh
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
- Center for Systems Biology Dresden, 01307, Dresden, Germany
| | - Maxim Scheremetjew
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
- Center for Systems Biology Dresden, 01307, Dresden, Germany
| | - Nadia Rostam
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
- Center for Systems Biology Dresden, 01307, Dresden, Germany
- Department of Biology, College of Science, University of Sulaimani, Sulaymaniyah, Iraq
| | - Anthony A Hyman
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
- Center for Systems Biology Dresden, 01307, Dresden, Germany
- Department of Biology, College of Science, University of Sulaimani, Sulaymaniyah, Iraq
| | - Agnes Toth-Petroczy
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany.
- Center for Systems Biology Dresden, 01307, Dresden, Germany.
- Cluster of Excellence Physics of Life, TU Dresden, 01062, Dresden, Germany.
| |
Collapse
|
2
|
Cagliani R, Forni D, Mozzi A, Fuchs R, Hagai T, Sironi M. Evolutionary analysis of ZAP and its cofactors identifies intrinsically disordered regions as central elements in host-pathogen interactions. Comput Struct Biotechnol J 2024; 23:3143-3154. [PMID: 39234301 PMCID: PMC11372611 DOI: 10.1016/j.csbj.2024.07.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 07/30/2024] [Accepted: 07/30/2024] [Indexed: 09/06/2024] Open
Abstract
The zinc-finger antiviral protein (ZAP) is an innate immunity sensor of non-self nucleic acids. Its antiviral activity is exerted through the physical interaction with different cofactors, including TRIM25, Riplet and KHNYN. Cellular proteins that interact with infectious agents are expected to be engaged in genetic conflicts that often result in their rapid evolution. To test this possibility and to identify the regions most strongly targeted by natural selection, we applied in silico molecular evolution tools to analyze the evolutionary history of ZAP and cofactors in four mammalian groups. We report evidence of positive selection in all genes and in most mammalian groups. On average, the intrinsically disordered regions (IDRs) embedded in the four proteins evolve significantly faster than folded domains and most positively selected sites fall within IDRs. In ZAP, the PARP domain also shows abundant signals of selection, and independent evolution in different mammalian groups suggests modulation of its ADP-ribose binding ability. Detailed analyses of the biophysical properties of IDRs revealed that chain compaction and conformational entropy are conserved across mammals. The IDRs in ZAP and KHNYN are particularly compact, indicating that they may promote phase separation (PS). In line with this hypothesis, we predicted several PS-promoting regions in ZAP and KHNYN, as well as in TRIM25. Positively selected sites are abundant in these regions, suggesting that PS may be important for the antiviral functions of these proteins and the evolutionary arms race with viruses. Our data shed light into the evolution of ZAP and cofactors and indicate that IDRs represent central elements in host-pathogen interactions.
Collapse
Affiliation(s)
- Rachele Cagliani
- Scientific Institute IRCCS E. MEDEA, Computational Biology Unit, Bosisio Parini 23842, Italy
| | - Diego Forni
- Scientific Institute IRCCS E. MEDEA, Computational Biology Unit, Bosisio Parini 23842, Italy
| | - Alessandra Mozzi
- Scientific Institute IRCCS E. MEDEA, Computational Biology Unit, Bosisio Parini 23842, Italy
| | - Rotem Fuchs
- Shmunis School of Biomedicine and Cancer Research, George S Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Tzachi Hagai
- Shmunis School of Biomedicine and Cancer Research, George S Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Manuela Sironi
- Scientific Institute IRCCS E. MEDEA, Computational Biology Unit, Bosisio Parini 23842, Italy
| |
Collapse
|
3
|
Haseltine WA, Patarca R. The RNA Revolution in the Central Molecular Biology Dogma Evolution. Int J Mol Sci 2024; 25:12695. [PMID: 39684407 DOI: 10.3390/ijms252312695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Revised: 11/24/2024] [Accepted: 11/25/2024] [Indexed: 12/18/2024] Open
Abstract
Human genome projects in the 1990s identified about 20,000 protein-coding sequences. We are now in the RNA revolution, propelled by the realization that genes determine phenotype beyond the foundational central molecular biology dogma, stating that inherited linear pieces of DNA are transcribed to RNAs and translated into proteins. Crucially, over 95% of the genome, initially considered junk DNA between protein-coding genes, encodes essential, functionally diverse non-protein-coding RNAs, raising the gene count by at least one order of magnitude. Most inherited phenotype-determining changes in DNA are in regulatory areas that control RNA and regulatory sequences. RNAs can directly or indirectly determine phenotypes by regulating protein and RNA function, transferring information within and between organisms, and generating DNA. RNAs also exhibit high structural, functional, and biomolecular interaction plasticity and are modified via editing, methylation, glycosylation, and other mechanisms, which bestow them with diverse intra- and extracellular functions without altering the underlying DNA. RNA is, therefore, currently considered the primary determinant of cellular to populational functional diversity, disease-linked and biomolecular structural variations, and cell function regulation. As demonstrated by RNA-based coronavirus vaccines' success, RNA technology is transforming medicine, agriculture, and industry, as did the advent of recombinant DNA technology in the 1980s.
Collapse
Affiliation(s)
- William A Haseltine
- Access Health International, 384 West Lane, Ridgefield, CT 06877, USA
- Feinstein Institutes for Medical Research, 350 Community Dr, Manhasset, NY 11030, USA
| | - Roberto Patarca
- Access Health International, 384 West Lane, Ridgefield, CT 06877, USA
- Feinstein Institutes for Medical Research, 350 Community Dr, Manhasset, NY 11030, USA
| |
Collapse
|
4
|
Hudaiberdiev S, Ovcharenko I. Functional characteristics and computational model of abundant hyperactive loci in the human genome. eLife 2024; 13:RP95170. [PMID: 39535534 PMCID: PMC11560132 DOI: 10.7554/elife.95170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024] Open
Abstract
Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.
Collapse
Affiliation(s)
- Sanjarbek Hudaiberdiev
- National Institute for Biotechnology and Information, National Library of Medicine, National Institutes of HealthBethesdaUnited States
| | - Ivan Ovcharenko
- National Institute for Biotechnology and Information, National Library of Medicine, National Institutes of HealthBethesdaUnited States
| |
Collapse
|
5
|
Morris OM, Toprakcioglu Z, Röntgen A, Cali M, Knowles TPJ, Vendruscolo M. Aggregation of the amyloid-β peptide (Aβ40) within condensates generated through liquid-liquid phase separation. Sci Rep 2024; 14:22633. [PMID: 39349560 PMCID: PMC11442885 DOI: 10.1038/s41598-024-72265-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Accepted: 09/05/2024] [Indexed: 10/02/2024] Open
Abstract
The deposition of the amyloid-β (Aβ) peptide into amyloid fibrils is a hallmark of Alzheimer's disease. Recently, it has been reported that some proteins can aggregate and form amyloids through an intermediate pathway involving a liquid-like condensed phase. These observations prompted us to investigate the phase space of Aβ. We thus explored the ability of Aβ to undergo liquid-liquid phase separation, and the subsequent liquid-to-solid transition that takes place within the resulting condensates. Through the use of microfluidic approaches, we observed that the 40-residue form of Αβ (Αβ40) can undergo liquid-liquid phase separation, and that accessing a liquid-like intermediate state enables Αβ40 to self-assemble and aggregate into amyloid fibrils through this pathway. These results prompt further studies to investigate the possible role of Αβ liquid-liquid phase separation and its subsequent aggregation in the context of Alzheimer's disease and more generally on neurodegenerative processes.
Collapse
Affiliation(s)
- Owen M Morris
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Zenon Toprakcioglu
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Alexander Röntgen
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Mariana Cali
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
| | - Tuomas P J Knowles
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK
- Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge, CB3 OHE, UK
| | - Michele Vendruscolo
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK.
| |
Collapse
|
6
|
Zhang S, Lim CM, Occhetta M, Vendruscolo M. AlphaFold2-based prediction of the co-condensation propensity of proteins. Proc Natl Acad Sci U S A 2024; 121:e2315005121. [PMID: 39133858 PMCID: PMC11348322 DOI: 10.1073/pnas.2315005121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 02/09/2024] [Indexed: 08/29/2024] Open
Abstract
The process of protein phase separation into liquid condensates has been implicated in the formation of membraneless organelles (MLOs), which selectively concentrate biomolecules to perform essential cellular functions. Although the importance of this process in health and disease is increasingly recognized, the experimental identification of proteins forming MLOs remains a complex challenge. In this study, we addressed this problem by harnessing the power of AlphaFold2 to perform computational predictions of the conformational properties of proteins from their amino acid sequences. We thus developed the CoDropleT (co-condensation into droplet transformer) method of predicting the propensity of co-condensation of protein pairs. The method was trained by combining experimental datasets of co-condensing proteins from the CD-CODE database with curated negative datasets of non-co-condensing proteins. To illustrate the performance of the method, we applied it to estimate the propensity of proteins to co-condense into MLOs. Our results suggest that CoDropleT could facilitate functional and therapeutic studies on protein condensation by predicting the composition of protein condensates.
Collapse
Affiliation(s)
- Shengyu Zhang
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, CambridgeCB2 1EW, United Kingdom
| | - Christine M. Lim
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, CambridgeCB2 1EW, United Kingdom
| | - Martina Occhetta
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, CambridgeCB2 1EW, United Kingdom
| | - Michele Vendruscolo
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, CambridgeCB2 1EW, United Kingdom
| |
Collapse
|
7
|
Middendorf L, Ravi Iyengar B, Eicholt LA. Sequence, Structure, and Functional Space of Drosophila De Novo Proteins. Genome Biol Evol 2024; 16:evae176. [PMID: 39212966 PMCID: PMC11363682 DOI: 10.1093/gbe/evae176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2024] [Indexed: 09/04/2024] Open
Abstract
During de novo emergence, new protein coding genes emerge from previously nongenic sequences. The de novo proteins they encode are dissimilar in composition and predicted biochemical properties to conserved proteins. However, functional de novo proteins indeed exist. Both identification of functional de novo proteins and their structural characterization are experimentally laborious. To identify functional and structured de novo proteins in silico, we applied recently developed machine learning based tools and found that most de novo proteins are indeed different from conserved proteins both in their structure and sequence. However, some de novo proteins are predicted to adopt known protein folds, participate in cellular reactions, and to form biomolecular condensates. Apart from broadening our understanding of de novo protein evolution, our study also provides a large set of testable hypotheses for focused experimental studies on structure and function of de novo proteins in Drosophila.
Collapse
Affiliation(s)
- Lasse Middendorf
- Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, 48149 Muenster, Germany
| | - Bharat Ravi Iyengar
- Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, 48149 Muenster, Germany
| | - Lars A Eicholt
- Institute for Evolution and Biodiversity, University of Muenster, Huefferstrasse 1, 48149 Muenster, Germany
| |
Collapse
|
8
|
Sood A, Zhang B. Preserving condensate structure and composition by lowering sequence complexity. Biophys J 2024; 123:1815-1826. [PMID: 38824391 PMCID: PMC11267431 DOI: 10.1016/j.bpj.2024.05.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 04/25/2024] [Accepted: 05/28/2024] [Indexed: 06/03/2024] Open
Abstract
Biomolecular condensates play a vital role in organizing cellular chemistry. They selectively partition biomolecules, preventing unwanted cross talk and buffering against chemical noise. Intrinsically disordered proteins (IDPs) serve as primary components of these condensates due to their flexibility and ability to engage in multivalent interactions, leading to spontaneous aggregation. Theoretical advancements are critical at connecting IDP sequences with condensate emergent properties to establish the so-called molecular grammar. We proposed an extension to the stickers and spacers model, incorporating heterogeneous, nonspecific pairwise interactions between spacers alongside specific interactions among stickers. Our investigation revealed that although spacer interactions contribute to phase separation and co-condensation, their nonspecific nature leads to disorganized condensates. Specific sticker-sticker interactions drive the formation of condensates with well-defined networked structures and molecular composition. We discussed how evolutionary pressures might emerge to affect these interactions, leading to the prevalence of low-complexity domains in IDP sequences. These domains suppress spurious interactions and facilitate the formation of biologically meaningful condensates.
Collapse
Affiliation(s)
- Amogh Sood
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts.
| |
Collapse
|
9
|
Kilgore HR, Chinn I, Mikhael PG, Mitnikov I, Van Dongen C, Zylberberg G, Afeyan L, Banani S, Wilson-Hawken S, Lee TI, Barzilay R, Young RA. Protein codes promote selective subcellular compartmentalization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.15.589616. [PMID: 38659952 PMCID: PMC11042338 DOI: 10.1101/2024.04.15.589616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Cells have evolved mechanisms to distribute ~10 billion protein molecules to subcellular compartments where diverse proteins involved in shared functions must efficiently assemble. Here, we demonstrate that proteins with shared functions share amino acid sequence codes that guide them to compartment destinations. A protein language model, ProtGPS, was developed that predicts with high performance the compartment localization of human proteins excluded from the training set. ProtGPS successfully guided generation of novel protein sequences that selectively assemble in targeted subcellular compartments. ProtGPS also identified pathological mutations that change this code and lead to altered subcellular localization of proteins. Our results indicate that protein sequences contain not only a folding code, but also a previously unrecognized code governing their distribution in specific cellular compartments.
Collapse
Affiliation(s)
- Henry R. Kilgore
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Itamar Chinn
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Peter G. Mikhael
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Ilan Mitnikov
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | | | - Guy Zylberberg
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Lena Afeyan
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Salman Banani
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Department of Pathology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Susana Wilson-Hawken
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Program of Computational & Systems Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Tong Ihn Lee
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Regina Barzilay
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Richard A. Young
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
10
|
Liang Q, Peng N, Xie Y, Kumar N, Gao W, Miao Y. MolPhase, an advanced prediction algorithm for protein phase separation. EMBO J 2024; 43:1898-1918. [PMID: 38565952 PMCID: PMC11065880 DOI: 10.1038/s44318-024-00090-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 02/27/2024] [Accepted: 03/14/2024] [Indexed: 04/04/2024] Open
Abstract
We introduce MolPhase, an advanced algorithm for predicting protein phase separation (PS) behavior that improves accuracy and reliability by utilizing diverse physicochemical features and extensive experimental datasets. MolPhase applies a user-friendly interface to compare distinct biophysical features side-by-side along protein sequences. By additional comparison with structural predictions, MolPhase enables efficient predictions of new phase-separating proteins and guides hypothesis generation and experimental design. Key contributing factors underlying MolPhase include electrostatic pi-interactions, disorder, and prion-like domains. As an example, MolPhase finds that phytobacterial type III effectors (T3Es) are highly prone to homotypic PS, which was experimentally validated in vitro biochemically and in vivo in plants, mimicking their injection and accumulation in the host during microbial infection. The physicochemical characteristics of T3Es dictate their patterns of association for multivalent interactions, influencing the material properties of phase-separating droplets based on the surrounding microenvironment in vivo or in vitro. Robust integration of MolPhase's effective prediction and experimental validation exhibit the potential to evaluate and explore how biomolecule PS functions in biological systems.
Collapse
Affiliation(s)
- Qiyu Liang
- School of Physical and Mathematical Sciences, Nanyang Technological University, 637371, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore, Singapore
| | - Nana Peng
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore, Singapore
| | - Yi Xie
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore, Singapore
| | - Nivedita Kumar
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore, Singapore
| | - Weibo Gao
- School of Physical and Mathematical Sciences, Nanyang Technological University, 637371, Singapore, Singapore
| | - Yansong Miao
- School of Biological Sciences, Nanyang Technological University, 637551, Singapore, Singapore.
- Institute for Digital Molecular Analytics and Science, Nanyang Technological University, 636921, Singapore, Singapore.
| |
Collapse
|
11
|
Thappeta Y, Cañas-Duarte SJ, Kallem T, Fragasso A, Xiang Y, Gray W, Lee C, Cegelski L, Jacobs-Wagner C. Glycogen phase separation drives macromolecular rearrangement and asymmetric division in E. coli. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.19.590186. [PMID: 38659787 PMCID: PMC11042326 DOI: 10.1101/2024.04.19.590186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Bacteria often experience nutrient limitation in nature and the laboratory. While exponential and stationary growth phases are well characterized in the model bacterium Escherichia coli, little is known about what transpires inside individual cells during the transition between these two phases. Through quantitative cell imaging, we found that the position of nucleoids and cell division sites becomes increasingly asymmetric during transition phase. These asymmetries were coupled with spatial reorganization of proteins, ribosomes, and RNAs to nucleoid-centric localizations. Results from live-cell imaging experiments, complemented with genetic and 13C whole-cell nuclear magnetic resonance spectroscopy studies, show that preferential accumulation of the storage polymer glycogen at the old cell pole leads to the observed rearrangements and asymmetric divisions. In vitro experiments suggest that these phenotypes are likely due to the propensity of glycogen to phase separate in crowded environments, as glycogen condensates exclude fluorescent proteins under physiological crowding conditions. Glycogen-associated differences in cell sizes between strains and future daughter cells suggest that glycogen phase separation allows cells to store large glucose reserves without counting them as cytoplasmic space.
Collapse
Affiliation(s)
- Yashna Thappeta
- Sarafan Chemistry, Engineering, and Medicine for Human Health Institute, Stanford University, Stanford, CA, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Silvia J. Cañas-Duarte
- Sarafan Chemistry, Engineering, and Medicine for Human Health Institute, Stanford University, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University, Stanford, USA
| | - Till Kallem
- Department of Chemistry, Stanford University, Stanford, CA, USA
| | - Alessio Fragasso
- Sarafan Chemistry, Engineering, and Medicine for Human Health Institute, Stanford University, Stanford, CA, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Yingjie Xiang
- Mechanical Engineering and Materials Science, Yale University, New Haven, CT
| | - William Gray
- Mechanical Engineering and Materials Science, Yale University, New Haven, CT
| | - Cheyenne Lee
- Mechanical Engineering and Materials Science, Yale University, New Haven, CT
| | | | - Christine Jacobs-Wagner
- Sarafan Chemistry, Engineering, and Medicine for Human Health Institute, Stanford University, Stanford, CA, USA
- Department of Biology, Stanford University, Stanford, CA, USA
- Department of Microbiology and Immunology, Stanford University, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University, Stanford, USA
| |
Collapse
|
12
|
Tesei G, Trolle AI, Jonsson N, Betz J, Knudsen FE, Pesce F, Johansson KE, Lindorff-Larsen K. Conformational ensembles of the human intrinsically disordered proteome. Nature 2024; 626:897-904. [PMID: 38297118 DOI: 10.1038/s41586-023-07004-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 12/19/2023] [Indexed: 02/02/2024]
Abstract
Intrinsically disordered proteins and regions (collectively, IDRs) are pervasive across proteomes in all kingdoms of life, help to shape biological functions and are involved in numerous diseases. IDRs populate a diverse set of transiently formed structures and defy conventional sequence-structure-function relationships1. Developments in protein science have made it possible to predict the three-dimensional structures of folded proteins at the proteome scale2. By contrast, there is a lack of knowledge about the conformational properties of IDRs, partly because the sequences of disordered proteins are poorly conserved and also because only a few of these proteins have been characterized experimentally. The inability to predict structural properties of IDRs across the proteome has limited our understanding of the functional roles of IDRs and how evolution shapes them. As a supplement to previous structural studies of individual IDRs3, we developed an efficient molecular model to generate conformational ensembles of IDRs and thereby to predict their conformational properties from sequences4,5. Here we use this model to simulate nearly all of the IDRs in the human proteome. Examining conformational ensembles of 28,058 IDRs, we show how chain compaction is correlated with cellular function and localization. We provide insights into how sequence features relate to chain compaction and, using a machine-learning model trained on our simulation data, show the conservation of conformational properties across orthologues. Our results recapitulate observations from previous studies of individual protein systems and exemplify how to link-at the proteome scale-conformational ensembles with cellular function and localization, amino acid sequence, evolutionary conservation and disease variants. Our freely available database of conformational properties will encourage further experimental investigation and enable the generation of hypotheses about the biological roles and evolution of IDRs.
Collapse
Affiliation(s)
- Giulio Tesei
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Anna Ida Trolle
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Nicolas Jonsson
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Johannes Betz
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Frederik E Knudsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Francesco Pesce
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kristoffer E Johansson
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
13
|
Sood A, Zhang B. Preserving condensate structure and composition by lowering sequence complexity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.29.569249. [PMID: 38076908 PMCID: PMC10705451 DOI: 10.1101/2023.11.29.569249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Biological condensates play a vital role in organizing cellular chemistry. They selectively partition biomolecules, preventing unwanted cross-talk and buffering against chemical noise. Intrinsically disordered proteins (IDPs) serve as primary components of these condensates due to their flexibility and ability to engage in multivalent, non-specific interactions, leading to spontaneous aggregation. Theoretical advancements are critical at connecting IDP sequences with condensate emergent properties to establish the so-called molecular grammar. We proposed an extension to the stickers and spacers model, incorporating non-specific pairwise interactions between spacers alongside specific interactions among stickers. Our investigation revealed that while spacer interactions contribute to phase separation and co-condensation, their non-specific nature leads to disorganized condensates. Specific sticker-sticker interactions drive the formation of condensates with well-defined structures and molecular composition. We discussed how evolutionary pressures might emerge to affect these interactions, leading to the prevalence of low complexity domains in IDP sequences. These domains suppress spurious interactions and facilitate the formation of biologically meaningful condensates.
Collapse
Affiliation(s)
- Amogh Sood
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
14
|
Abstract
RNA granules are mesoscale assemblies that form in the absence of limiting membranes. RNA granules contain factors for RNA biogenesis and turnover and are often assumed to represent specialized compartments for RNA biochemistry. Recent evidence suggests that RNA granules assemble by phase separation of subsoluble ribonucleoprotein (RNP) complexes that partially demix from the cytoplasm or nucleoplasm. We explore the possibility that some RNA granules are nonessential condensation by-products that arise when RNP complexes exceed their solubility limit as a consequence of cellular activity, stress, or aging. We describe the use of evolutionary and mutational analyses and single-molecule techniques to distinguish functional RNA granules from "incidental condensates."
Collapse
Affiliation(s)
- Andrea Putnam
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, Maryland 21205, USA
| | - Laura Thomas
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, Maryland 21205, USA
| | - Geraldine Seydoux
- Howard Hughes Medical Institute, Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, Maryland 21205, USA
| |
Collapse
|