1
|
Pintado-Grima C, Bárcenas O, Ventura S. Expanding the Landscape of Amyloid Sequences with CARs-DB: A Database of Polar Amyloidogenic Peptides from Disordered Proteins. Methods Mol Biol 2024; 2714:171-185. [PMID: 37676599 DOI: 10.1007/978-1-0716-3441-7_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Several databases collecting amyloidogenic regions have been released to provide information on protein sequences able to form amyloid fibrils. However, most of these resources are built with data from experiments that detect highly hydrophobic stretches located within transiently exposed protein segments. We recently demonstrated that cryptic amyloidogenic regions (CARs) of polar nature have the potential to form amyloid fibrils in vitro. Given the underrepresentation of these types of sequences in current amyloid databases, we developed CARs-DB, the first repository that collects thousands of predicted CARs from intrinsically disordered regions. This protocol chapter describes how to use CARs-DB to search for sequences of interest that might be connected to disease or functional protein-protein interactions. In addition, we provide study cases to illustrate the database's features to users. The CARs-DB is readily accessible at http://carsdb.ppmclab.com/ .
Collapse
Affiliation(s)
- Carlos Pintado-Grima
- Institut de Biotecnologia i de Biomedicina and Departament de Bioquímica i Biologia Molecular, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Oriol Bárcenas
- Institut de Biotecnologia i de Biomedicina and Departament de Bioquímica i Biologia Molecular, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Salvador Ventura
- Institut de Biotecnologia i de Biomedicina and Departament de Bioquímica i Biologia Molecular, Universitat Autònoma de Barcelona, Barcelona, Spain.
| |
Collapse
|
2
|
Barbosa Pereira PJ, Manso JA, Macedo-Ribeiro S. The structural plasticity of polyglutamine repeats. Curr Opin Struct Biol 2023; 80:102607. [PMID: 37178477 DOI: 10.1016/j.sbi.2023.102607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 04/11/2023] [Accepted: 04/12/2023] [Indexed: 05/15/2023]
Abstract
From yeast to humans, polyglutamine (polyQ) repeat tracts are found frequently in the proteome and are particularly prominent in the activation domains of transcription factors. PolyQ is a polymorphic motif that modulates functional protein-protein interactions and aberrant self-assembly. Expansion of the polyQ repeated sequences beyond critical physiological repeat length thresholds triggers self-assembly and is linked to severe pathological implications. This review provides an overview of the current knowledge on the structures of polyQ tracts in the soluble and aggregated states and discusses the influence of neighboring regions on polyQ secondary structure, aggregation, and fibril morphologies. The influence of the genetic context of the polyQ-encoding trinucleotides is briefly discussed as a challenge for future endeavors in this field.
Collapse
Affiliation(s)
- Pedro José Barbosa Pereira
- IBMC - Instituto de Biologia Molecular e Celular, Universidade do Porto, 4200-135, Porto, Portugal; Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135, Porto, Portugal.
| | - José A Manso
- IBMC - Instituto de Biologia Molecular e Celular, Universidade do Porto, 4200-135, Porto, Portugal; Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135, Porto, Portugal
| | - Sandra Macedo-Ribeiro
- IBMC - Instituto de Biologia Molecular e Celular, Universidade do Porto, 4200-135, Porto, Portugal; Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135, Porto, Portugal
| |
Collapse
|
3
|
Cryo-EM structure of hnRNPDL-2 fibrils, a functional amyloid associated with limb-girdle muscular dystrophy D3. Nat Commun 2023; 14:239. [PMID: 36646699 PMCID: PMC9842712 DOI: 10.1038/s41467-023-35854-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 01/04/2023] [Indexed: 01/18/2023] Open
Abstract
hnRNPDL is a ribonucleoprotein (RNP) involved in transcription and RNA-processing that hosts missense mutations causing limb-girdle muscular dystrophy D3 (LGMD D3). Mammalian-specific alternative splicing (AS) renders three natural isoforms, hnRNPDL-2 being predominant in humans. We present the cryo-electron microscopy structure of full-length hnRNPDL-2 amyloid fibrils, which are stable, non-toxic, and bind nucleic acids. The high-resolution amyloid core consists of a single Gly/Tyr-rich and highly hydrophilic filament containing internal water channels. The RNA binding domains are located as a solenoidal coat around the core. The architecture and activity of hnRNPDL-2 fibrils are reminiscent of functional amyloids, our results suggesting that LGMD D3 might be a loss-of-function disease associated with impaired fibrillation. Strikingly, the fibril core matches exon 6, absent in the soluble hnRNPDL-3 isoform. This provides structural evidence for AS controlling hnRNPDL assembly by precisely including/skipping an amyloid exon, a mechanism that holds the potential to generate functional diversity in RNPs.
Collapse
|
4
|
Pintado-Grima C, Santos J, Iglesias V, Manglano-Artuñedo Z, Pallarès I, Ventura S. Exploring cryptic amyloidogenic regions in prion-like proteins from plants. FRONTIERS IN PLANT SCIENCE 2023; 13:1060410. [PMID: 36726678 PMCID: PMC9885169 DOI: 10.3389/fpls.2022.1060410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 12/19/2022] [Indexed: 06/18/2023]
Abstract
Prion-like domains (PrLDs) are intrinsically disordered regions (IDRs) of low sequence complexity with a similar composition to yeast prion domains. PrLDs-containing proteins have been involved in different organisms' regulatory processes. Regions of moderate amyloid propensity within IDRs have been shown to assemble autonomously into amyloid fibrils. These sequences tend to be rich in polar amino acids and often escape from the detection of classical bioinformatics screenings that look for highly aggregation-prone hydrophobic sequence stretches. We defined them as cryptic amyloidogenic regions (CARs) and recently developed an integrated database that collects thousands of predicted CARs in IDRs. CARs seem to be evolutionary conserved among disordered regions because of their potential to stablish functional contacts with other biomolecules. Here we have focused on identifying and characterizing CARs in prion-like proteins (pCARs) from plants, a lineage that has been poorly studied in comparison with other prionomes. We confirmed the intrinsic amyloid potential for a selected pCAR from Arabidopsis thaliana and explored functional enrichments and compositional bias of pCARs in plant prion-like proteins.
Collapse
Affiliation(s)
- Carlos Pintado-Grima
- Departament de Bioquímica i Biologia Molecular, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Jaime Santos
- Departament de Bioquímica i Biologia Molecular, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Valentín Iglesias
- Departament de Bioquímica i Biologia Molecular, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Barcelona, Spain
- Barcelona Institute for Global Health, Barcelona Centre for International Health Research (ISGlobal, Hospital Clínic-Universitat de Barcelona), Barcelona, Spain
- Nanomalaria Group, Institute for Bioengineering of Catalonia (IBEC), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Zoe Manglano-Artuñedo
- Departament de Bioquímica i Biologia Molecular, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Irantzu Pallarès
- Departament de Bioquímica i Biologia Molecular, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Salvador Ventura
- Departament de Bioquímica i Biologia Molecular, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Barcelona, Spain
| |
Collapse
|
5
|
Dunn MJ, Shazib SUA, Simonton E, Slot JC, Anderson MZ. Architectural groups of a subtelomeric gene family evolve along distinct paths in Candida albicans. G3 (BETHESDA, MD.) 2022; 12:jkac283. [PMID: 36269198 PMCID: PMC9713401 DOI: 10.1093/g3journal/jkac283] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Accepted: 10/09/2022] [Indexed: 12/08/2023]
Abstract
Subtelomeres are dynamic genomic regions shaped by elevated rates of recombination, mutation, and gene birth/death. These processes contribute to formation of lineage-specific gene family expansions that commonly occupy subtelomeres across eukaryotes. Investigating the evolution of subtelomeric gene families is complicated by the presence of repetitive DNA and high sequence similarity among gene family members that prevents accurate assembly from whole genome sequences. Here, we investigated the evolution of the telomere-associated (TLO) gene family in Candida albicans using 189 complete coding sequences retrieved from 23 genetically diverse strains across the species. Tlo genes conformed to the 3 major architectural groups (α/β/γ) previously defined in the genome reference strain but significantly differed in the degree of within-group diversity. One group, Tloβ, was always found at the same chromosome arm with strong sequence similarity among all strains. In contrast, diverse Tloα sequences have proliferated among chromosome arms. Tloγ genes formed 7 primary clades that included each of the previously identified Tloγ genes from the genome reference strain with 3 Tloγ genes always found on the same chromosome arm among strains. Architectural groups displayed regions of high conservation that resolved newly identified functional motifs, providing insight into potential regulatory mechanisms that distinguish groups. Thus, by resolving intraspecies subtelomeric gene variation, it is possible to identify previously unknown gene family complexity that may underpin adaptive functional variation.
Collapse
Affiliation(s)
- Matthew J Dunn
- Department of Microbiology, The Ohio State University, Columbus, OH 43210, USA
| | - Shahed U A Shazib
- Department of Microbiology, The Ohio State University, Columbus, OH 43210, USA
| | - Emily Simonton
- Department of Microbiology, The Ohio State University, Columbus, OH 43210, USA
| | - Jason C Slot
- Department of Plant Pathology, The Ohio State University, Columbus, OH 43210, USA
| | - Matthew Z Anderson
- Department of Microbiology, The Ohio State University, Columbus, OH 43210, USA
- Department of Microbial Infection and Immunity, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
6
|
Prion-like low complexity regions enable avid virus-host interactions during HIV-1 infection. Nat Commun 2022; 13:5879. [PMID: 36202818 PMCID: PMC9537594 DOI: 10.1038/s41467-022-33662-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 09/27/2022] [Indexed: 12/24/2022] Open
Abstract
Cellular proteins CPSF6, NUP153 and SEC24C play crucial roles in HIV-1 infection. While weak interactions of short phenylalanine-glycine (FG) containing peptides with isolated capsid hexamers have been characterized, how these cellular factors functionally engage with biologically relevant mature HIV-1 capsid lattices is unknown. Here we show that prion-like low complexity regions (LCRs) enable avid CPSF6, NUP153 and SEC24C binding to capsid lattices. Structural studies revealed that multivalent CPSF6 assembly is mediated by LCR-LCR interactions, which are templated by binding of CPSF6 FG peptides to a subset of hydrophobic capsid pockets positioned along adjoining hexamers. In infected cells, avid CPSF6 LCR-mediated binding to HIV-1 cores is essential for functional virus-host interactions. The investigational drug lenacapavir accesses unoccupied hydrophobic pockets in the complex to potently impair HIV-1 inside the nucleus without displacing the tightly bound cellular cofactor from virus cores. These results establish previously undescribed mechanisms of virus-host interactions and antiviral action.
Collapse
|
7
|
Biró B, Zhao B, Kurgan L. Complementarity of the residue-level protein function and structure predictions in human proteins. Comput Struct Biotechnol J 2022; 20:2223-2234. [PMID: 35615015 PMCID: PMC9118482 DOI: 10.1016/j.csbj.2022.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 05/02/2022] [Accepted: 05/02/2022] [Indexed: 11/24/2022] Open
Abstract
Sequence-based predictors of the residue-level protein function and structure cover a broad spectrum of characteristics including intrinsic disorder, secondary structure, solvent accessibility and binding to nucleic acids. They were catalogued and evaluated in numerous surveys and assessments. However, methods focusing on a given characteristic are studied separately from predictors of other characteristics, while they are typically used on the same proteins. We fill this void by studying complementarity of a representative collection of methods that target different predictions using a large, taxonomically consistent, and low similarity dataset of human proteins. First, we bridge the gap between the communities that develop structure-trained vs. disorder-trained predictors of binding residues. Motivated by a recent study of the protein-binding residue predictions, we empirically find that combining the structure-trained and disorder-trained predictors of the DNA-binding and RNA-binding residues leads to substantial improvements in predictive quality. Second, we investigate whether diverse predictors generate results that accurately reproduce relations between secondary structure, solvent accessibility, interaction sites, and intrinsic disorder that are present in the experimental data. Our empirical analysis concludes that predictions accurately reflect all combinations of these relations. Altogether, this study provides unique insights that support combining results produced by diverse residue-level predictors of protein function and structure.
Collapse
Affiliation(s)
- Bálint Biró
- Institute of Genetics and Biotechnology, Hungarian University of Agriculture and Life Sciences, Gödöllő, Hungary
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
8
|
Behbahanipour M, García-Pardo J, Ventura S. Decoding the role of coiled-coil motifs in human prion-like proteins. Prion 2021; 15:143-154. [PMID: 34428113 PMCID: PMC8386614 DOI: 10.1080/19336896.2021.1961569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 07/22/2021] [Accepted: 07/25/2021] [Indexed: 11/28/2022] Open
Abstract
Prions are self-propagating proteins that cause fatal neurodegenerative diseases in humans. However, increasing evidence suggests that eukaryotic cells exploit prion conformational conversion for functional purposes. A recent study delineated a group of twenty prion-like proteins in humans, characterized by the presence of low-complexity glutamine-rich sequences with overlapping coiled-coil (CCs) motifs. This is the case of Mediator complex subunit 15 (MED15), which is overexpressed in a wide range of human cancers. Biophysical studies demonstrated that the prion-like domain (PrLD) of MED15 forms homodimers in solution, sustained by CCs interactions. Furthermore, the same coiled-coil (CC) region plays a crucial role in the PrLD structural transition to a transmissible β-sheet amyloid state. In this review, we discuss the role of CCs motifs and their contribution to amyloid transitions in human prion-like domains (PrLDs), while providing a comprehensive overview of six predicted human prion-like proteins involved in transcription, gene expression, or DNA damage response and associated with human disease, whose PrLDs contain or overlap with CCs sequences. Finally, we try to rationalize how these molecular signatures might relate to both their function and involvement in disease.
Collapse
Affiliation(s)
- Molood Behbahanipour
- Institut De Biotecnologia I De Biomedicina (Ibb) and Departament De Bioquímica I Biologia Molecular, Universitat Autónoma De Barcelona, Barcelona, Spain
| | - Javier García-Pardo
- Institut De Biotecnologia I De Biomedicina (Ibb) and Departament De Bioquímica I Biologia Molecular, Universitat Autónoma De Barcelona, Barcelona, Spain
| | - Salvador Ventura
- Institut De Biotecnologia I De Biomedicina (Ibb) and Departament De Bioquímica I Biologia Molecular, Universitat Autónoma De Barcelona, Barcelona, Spain
| |
Collapse
|