1
|
Dovidchenko NV, Lobanov MY, Galzitskaya OV. Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins? Biosystems 2024; 246:105357. [PMID: 39442908 DOI: 10.1016/j.biosystems.2024.105357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 09/30/2024] [Accepted: 10/20/2024] [Indexed: 10/25/2024]
Abstract
It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.
Collapse
Affiliation(s)
- Nikita V Dovidchenko
- Gamaleya Research Centre of Epidemiology and Microbiology, 123098, Moscow, Russia; Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
| | - Mikhail Yu Lobanov
- Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia
| | - Oxana V Galzitskaya
- Gamaleya Research Centre of Epidemiology and Microbiology, 123098, Moscow, Russia; Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia; Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
| |
Collapse
|
2
|
Chakrabarty B, Parekh N. DbStRiPs: Database of structural repeats in proteins. Protein Sci 2022; 31:23-36. [PMID: 33641184 PMCID: PMC8740836 DOI: 10.1002/pro.4052] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 02/11/2021] [Accepted: 02/15/2021] [Indexed: 01/03/2023]
Abstract
Recent interest in repeat proteins has arisen due to stable structural folds, high evolutionary conservation and repertoire of functions provided by these proteins. However, repeat proteins are poorly characterized because of high sequence variation between repeating units and structure-based identification and classification of repeats is desirable. Using a robust network-based pipeline, manual curation and Kajava's structure-based classification schema, we have developed a database of tandem structural repeats, Database of Structural Repeats in Proteins (DbStRiPs). A unique feature of this database is that available knowledge on sequence repeat families is incorporated by mapping Pfam classification scheme onto structural classification. Integration of sequence and structure-based classifications help in identifying different functional groups within the same structural subclass, leading to refinement in the annotation of repeat proteins. Analysis of complete Protein Data Bank revealed 16,472 repeat annotations in 15,141 protein chains, one previously uncharacterized novel protein repeat family (PRF), named left-handed beta helix, and 33 protein repeat clusters (PRCs). Based on their unique structural motif, ~79% of these repeat proteins are classified in one of the 14 PRFs or 33 PRCs, and the remaining are grouped as unclassified repeat proteins. Each repeat protein is provided with a detailed annotation in DbStRiPs that includes start and end boundaries of repeating units, copy number, secondary and tertiary structure view, repeat class/subclass, disease association, MSA of repeating units and cross-references to various protein pattern databases, human protein atlas and interaction resources. DbStRiPs provides easy search and download options to high-quality annotations of structural repeat proteins (URL: http://bioinf.iiit.ac.in/dbstrips/).
Collapse
Affiliation(s)
- Broto Chakrabarty
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information TechnologyHyderabadIndia
| | - Nita Parekh
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information TechnologyHyderabadIndia
| |
Collapse
|
3
|
Galzitskaya OV, Lobanov MY. Proteome-scale understanding of relationship between homo-repeat enrichments and protein aggregation properties. PLoS One 2018; 13:e0206941. [PMID: 30399196 PMCID: PMC6219797 DOI: 10.1371/journal.pone.0206941] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Accepted: 10/22/2018] [Indexed: 02/07/2023] Open
Abstract
Expansion of homo-repeats is a molecular basis for human neurological diseases. We are the first who studied the influence of homo-repeats with lengths larger than four amino acid residues on the aggregation properties of 1449683 proteins across 122 eukaryotic and bacterial proteomes. Only 15% of proteins (215481) include homo-repeats of such length. We demonstrated that RNA-binding proteins with a prion-like domain are enriched with homo-repeats in comparison with other non-redundant protein sequences and those in the PDB. We performed a bioinformatics analysis for these proteins and found that proteins with homo-repeats are on average two times longer than those in the whole database. Moreover, we are first to discover that as a rule, homo-repeats appear in proteins not alone but in pairs: hydrophobic and aromatic homo-repeats appear with similar ones, while homo-repeats with small, polar and charged amino acids appear together with different preferences. We elaborated a new complementary approach to demonstrate the influence of homo-repeats on their host protein aggregation properties. We have shown that addition of artificial homo-repeats to natural and random proteins results in intensification of aggregation properties of the proteins. The maximal effect is observed for the insertion of artificial homo-repeats with 5–6 residues, which is consistent with the minimal length of an amyloidogenic region. We have also demonstrated that the ability of proteins with homo-repeats to aggregate cannot be explained only by the presence of long homo-repeats in them. There should be other characteristics of proteins intensifying the aggregation property including such as the appearance of homo-repeats in pairs in the same protein. We are the first who elaborated a new approach to study the influence of homo-repeats present in proteins on their aggregation properties and performed an appropriate analysis of the large number of proteomes and proteins.
Collapse
Affiliation(s)
- Oxana V. Galzitskaya
- Group of Bioinformatics, Institute of Protein Research, Russian Academy of Science, Pushchino, Moscow Region, Russia
- * E-mail:
| | - Miсhail Yu. Lobanov
- Group of Bioinformatics, Institute of Protein Research, Russian Academy of Science, Pushchino, Moscow Region, Russia
| |
Collapse
|
4
|
Non-random distribution of homo-repeats: links with biological functions and human diseases. Sci Rep 2016; 6:26941. [PMID: 27256590 PMCID: PMC4891720 DOI: 10.1038/srep26941] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Accepted: 05/06/2016] [Indexed: 12/22/2022] Open
Abstract
The biological function of multiple repetitions of single amino acids, or homo-repeats, is largely unknown, but their occurrence in proteins has been associated with more than 20 hereditary diseases. Analysing 122 bacterial and eukaryotic genomes, we observed that the number of proteins containing homo-repeats is significantly larger than expected from theoretical estimates. Analysis of statistical significance indicates that the minimal size of homo-repeats varies with amino acid type and proteome. In an attempt to characterize proteins harbouring long homo-repeats, we found that those containing polar or small amino acids S, P, H, E, D, K, Q and N are enriched in structural disorder as well as protein- and RNA-interactions. We observed that E, S, Q, G, L, P, D, A and H homo-repeats are strongly linked with occurrence in human diseases. Moreover, S, E, P, A, Q, D and T homo-repeats are significantly enriched in neuronal proteins associated with autism and other disorders. We release a webserver for further exploration of homo-repeats occurrence in human pathology at http://bioinfo.protres.ru/hradis/.
Collapse
|
5
|
Li C, Nagel J, Androulakis S, Song J, Buckle AM. PolyQ 2.0: an improved version of PolyQ, a database of human polyglutamine proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw021. [PMID: 26980520 PMCID: PMC4792530 DOI: 10.1093/database/baw021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Accepted: 02/08/2016] [Indexed: 11/13/2022]
Abstract
Proteins with expanded polyglutamine (polyQ) repeats are involved in human neurodegenerative diseases, via a gain-of-function mechanism of neuronal toxicity involving protein conformational changes that result in the formation and deposition of β-sheet-rich aggregates. Aggregation is dependent on the context and properties of the host protein, such as domain context and location of the repeat tract. In order to explore this relationship in greater detail, here we describe PolyQ 2.0, an updated database that provides a comprehensive knowledgebase for human polyQ proteins. Compared with the previous PolyQ database, our new database provides a variety of substantial updates including detailed biological annotations and search options. Biological annotations in terms of domain context information, protein structural and functional annotation, single point mutations, predicted disordered regions, protein-protein interaction partners, metabolic/signaling pathways, post-translational modification sites and evolutionary information are made available. Several new database functionalities have also been provided, including search using multiple/combinatory keywords, and submission of new data entries. Also, several third-party plug-ins are employed to enhance data visualization in PolyQ 2.0. In PolyQ 2.0 the proteins are reclassified into 3 new categories and contain 9 reviewed disease-associated polyQ proteins, 105 reviewed non-disease polyQ proteins and 146 un-reviewed polyQ proteins (reviewed by UniProt curators). We envisage that this updated database will be a useful resource for functional and structural investigation of human polyQ proteins. Database URL: http://lightning.med.monash.edu/polyq2/.
Collapse
Affiliation(s)
- Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology
| | - Jeremy Nagel
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology
| | - Steve Androulakis
- Monash Bioinformatics Platform, Monash University, Melbourne, Vic. 3800, Australia, and
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology National Engineering Laboratory of Industrial Enzymes and Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Ashley M Buckle
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology
| |
Collapse
|
6
|
Li C, Ching Han Chang C, Nagel J, Porebski BT, Hayashida M, Akutsu T, Song J, Buckle AM. Critical evaluation of in silico methods for prediction of coiled-coil domains in proteins. Brief Bioinform 2016; 17:270-82. [PMID: 26177815 PMCID: PMC6078162 DOI: 10.1093/bib/bbv047] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Revised: 05/29/2015] [Indexed: 12/19/2022] Open
Abstract
Coiled-coils refer to a bundle of helices coiled together like strands of a rope. It has been estimated that nearly 3% of protein-encoding regions of genes harbour coiled-coil domains (CCDs). Experimental studies have confirmed that CCDs play a fundamental role in subcellular infrastructure and controlling trafficking of eukaryotic cells. Given the importance of coiled-coils, multiple bioinformatics tools have been developed to facilitate the systematic and high-throughput prediction of CCDs in proteins. In this article, we review and compare 12 sequence-based bioinformatics approaches and tools for coiled-coil prediction. These approaches can be categorized into two classes: coiled-coil detection and coiled-coil oligomeric state prediction. We evaluated and compared these methods in terms of their input/output, algorithm, prediction performance, validation methods and software utility. All the independent testing data sets are available at http://lightning.med.monash.edu/coiledcoil/. In addition, we conducted a case study of nine human polyglutamine (PolyQ) disease-related proteins and predicted CCDs and oligomeric states using various predictors. Prediction results for CCDs were highly variable among different predictors. Only two peptides from two proteins were confirmed to be CCDs by majority voting. Both domains were predicted to form dimeric coiled-coils using oligomeric state prediction. We anticipate that this comprehensive analysis will be an insightful resource for structural biologists with limited prior experience in bioinformatics tools, and for bioinformaticians who are interested in designing novel approaches for coiled-coil and its oligomeric state prediction.
Collapse
|
7
|
Thompson KW, Marquez SB, Lu L, Reisman D. Induction of functional Brm protein from Brm knockout mice. Oncoscience 2015; 2:349-61. [PMID: 26097869 PMCID: PMC4468321 DOI: 10.18632/oncoscience.153] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2015] [Accepted: 04/08/2015] [Indexed: 12/18/2022] Open
Abstract
Once the knockout of the Brm gene was found to be nontumorigenic in mice, the study of BRM's involvement in cancer seemed less important compared with that of its homolog, Brg1. This has likely contributed to the disparity that has been observed in the publication ratio between BRG1 and BRM. We show that a previously published Brm knockout mouse is an incomplete knockout whereby a truncated isoform of Brm is detected in normal tissue and in tumors. We show that this truncated Brm isoform has functionality comparable to wild type Brm. By immunohistochemistry (IHC), this truncated Brm is undetectable in normal lung tissue and is minimal to very low in Brmnull tumors. However, it is significant in a subset (~40%) of Brg1/Brm double knockout (DKO) tumors that robustly express this truncated BRM, which in part stems from an increase in Brm mRNA levels. Thus, it is likely that this mutant mouse model does not accurately reflect the role that Brm plays in cancer development. We suggest that the construction of a completely new mouse Brm knockout, where Brm is functionally absent, is needed to determine whether or not Brm is actually tumorigenic and if Brm might be a tumor suppressor.
Collapse
Affiliation(s)
- Kenneth W. Thompson
- Division of Hematology and Oncology, Department of Medicine, University of Florida, Gainesville, Florida, USA
| | - Stefanie B. Marquez
- Division of Hematology and Oncology, Department of Medicine, University of Florida, Gainesville, Florida, USA
| | - Li Lu
- Department of Pathology, University of Florida, Gainesville, Florida, USA
| | - David Reisman
- Division of Hematology and Oncology, Department of Medicine, University of Florida, Gainesville, Florida, USA
| |
Collapse
|
8
|
Mishra R, Thakur AK. Amyloid nanospheres from polyglutamine rich peptides: assemblage through an intermolecular salt bridge interaction. Org Biomol Chem 2015; 13:4155-9. [DOI: 10.1039/c4ob02589j] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Amyloid fiber formation by two polyglutamine peptides through a nucleation polymerization pathway. An intermolecular salt bridge between the positively charged lysine and the negatively charged glutamate induces the formation of nanospherical amyloids through a non-nucleated pathway.
Collapse
Affiliation(s)
- Rahul Mishra
- Department of Biological Sciences and Bioengineering
- Indian Institute of Technology
- Kanpur
- India
| | - Ashwani K. Thakur
- Department of Biological Sciences and Bioengineering
- Indian Institute of Technology
- Kanpur
- India
| |
Collapse
|
9
|
Atkin G, Paulson H. Ubiquitin pathways in neurodegenerative disease. Front Mol Neurosci 2014; 7:63. [PMID: 25071440 PMCID: PMC4085722 DOI: 10.3389/fnmol.2014.00063] [Citation(s) in RCA: 107] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Accepted: 06/19/2014] [Indexed: 12/11/2022] Open
Abstract
Control of proper protein synthesis, function, and turnover is essential for the health of all cells. In neurons these demands take on the additional importance of supporting and regulating the highly dynamic connections between neurons that are necessary for cognitive function, learning, and memory. Regulating multiple unique synaptic protein environments within a single neuron while maintaining cell health requires the highly regulated processes of ubiquitination and degradation of ubiquitinated proteins through the proteasome. In this review, we examine the effects of dysregulated ubiquitination and protein clearance on the handling of disease-associated proteins and neuronal health in the most common neurodegenerative diseases.
Collapse
Affiliation(s)
- Graham Atkin
- Department of Neurology, University of Michigan Ann Arbor, MI, USA
| | - Henry Paulson
- Department of Neurology, University of Michigan Ann Arbor, MI, USA
| |
Collapse
|
10
|
Wolfe KJ, Ren HY, Trepte P, Cyr DM. Polyglutamine-rich suppressors of huntingtin toxicity act upstream of Hsp70 and Sti1 in spatial quality control of amyloid-like proteins. PLoS One 2014; 9:e95914. [PMID: 24828240 PMCID: PMC4020751 DOI: 10.1371/journal.pone.0095914] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 04/01/2014] [Indexed: 11/30/2022] Open
Abstract
Protein conformational maladies such as Huntington Disease are characterized by accumulation of intracellular and extracellular protein inclusions containing amyloid-like proteins. There is an inverse correlation between proteotoxicity and aggregation, so facilitated protein aggregation appears cytoprotective. To define mechanisms for protective protein aggregation, a screen for suppressors of nuclear huntingtin (Htt103Q) toxicity was conducted. Nuclear Htt103Q is highly toxic and less aggregation prone than its cytosolic form, so we identified suppressors of cytotoxicity caused by Htt103Q tagged with a nuclear localization signal (NLS). High copy suppressors of Htt103Q-NLS toxicity include the polyQ-domain containing proteins Nab3, Pop2, and Cbk1, and each suppresses Htt toxicity via a different mechanism. Htt103Q-NLS appears to inactivate the essential functions of Nab3 in RNA processing in the nucleus. Function of Pop2 and Cbk1 is not impaired by nuclear Htt103Q, as their respective polyQ-rich domains are sufficient to suppress Htt103Q toxicity. Pop2 is a subunit of an RNA processing complex and is localized throughout the cytoplasm. Expression of just the Pop2 polyQ domain and an adjacent proline-rich stretch is sufficient to suppress Htt103Q toxicity. The proline-rich domain in Pop2 resembles an aggresome targeting signal, so Pop2 may act in trans to positively impact spatial quality control of Htt103Q. Cbk1 accumulates in discrete perinuclear foci and overexpression of the Cbk1 polyQ domain concentrates diffuse Htt103Q into these foci, which correlates with suppression of Htt toxicity. Protective action of Pop2 and Cbk1 in spatial quality control is dependent upon the Hsp70 co-chaperone Sti1, which packages amyloid-like proteins into benign foci. Protein:protein interactions between Htt103Q and its intracellular neighbors lead to toxic and protective outcomes. A subset of polyQ-rich proteins buffer amyloid toxicity by funneling toxic aggregation intermediates to the Hsp70/Sti1 system for spatial organization into benign species.
Collapse
Affiliation(s)
- Katie J. Wolfe
- Department of Cell Biology and Physiology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Hong Yu Ren
- Department of Cell Biology and Physiology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Philipp Trepte
- Neuroproteomics, Max Delbrueck Center for Molecular Medicine, Berlin, Germany
| | - Douglas M. Cyr
- Department of Cell Biology and Physiology, University of North Carolina, Chapel Hill, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
11
|
Di Domenico T, Potenza E, Walsh I, Parra RG, Giollo M, Minervini G, Piovesan D, Ihsan A, Ferrari C, Kajava AV, Tosatto SCE. RepeatsDB: a database of tandem repeat protein structures. Nucleic Acids Res 2013; 42:D352-7. [PMID: 24311564 PMCID: PMC3964956 DOI: 10.1093/nar/gkt1175] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
RepeatsDB (http://repeatsdb.bio.unipd.it/) is a database of annotated tandem repeat protein structures. Tandem repeats pose a difficult problem for the analysis of protein structures, as the underlying sequence can be highly degenerate. Several repeat types haven been studied over the years, but their annotation was done in a case-by-case basis, thus making large-scale analysis difficult. We developed RepeatsDB to fill this gap. Using state-of-the-art repeat detection methods and manual curation, we systematically annotated the Protein Data Bank, predicting 10 745 repeat structures. In all, 2797 structures were classified according to a recently proposed classification schema, which was expanded to accommodate new findings. In addition, detailed annotations were performed in a subset of 321 proteins. These annotations feature information on start and end positions for the repeat regions and units. RepeatsDB is an ongoing effort to systematically classify and annotate structural protein repeats in a consistent way. It provides users with the possibility to access and download high-quality datasets either interactively or programmatically through web services.
Collapse
Affiliation(s)
- Tomás Di Domenico
- Department of Biomedical Sciences, University of Padua, 35131 Padova, Italy, Department of Biological Chemistry, Universidad de Buenos Aires, Buenos Aires C1428EGA, Argentina, Department of Information Engineering, University of Padua, 35121 Padova, Italy, Department of Biosciences, COMSATS Institute of Information Technology, Sahiwal, Pakistan, Centre de Recherches de Biochimie Macromoléculaire, CNRS, 34293 Montpellier Cedex 5, France and Institut de Biologie Computationnelle, 34293 Montpellier Cedex 5, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Kubrycht J, Sigler K, Souček P, Hudeček J. Structures composing protein domains. Biochimie 2013; 95:1511-24. [DOI: 10.1016/j.biochi.2013.04.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 04/02/2013] [Indexed: 12/21/2022]
|
13
|
Almeida B, Fernandes S, Abreu IA, Macedo-Ribeiro S. Trinucleotide repeats: a structural perspective. Front Neurol 2013; 4:76. [PMID: 23801983 PMCID: PMC3687200 DOI: 10.3389/fneur.2013.00076] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2013] [Accepted: 06/04/2013] [Indexed: 11/29/2022] Open
Abstract
Trinucleotide repeat (TNR) expansions are present in a wide range of genes involved in several neurological disorders, being directly involved in the molecular mechanisms underlying pathogenesis through modulation of gene expression and/or the function of the RNA or protein it encodes. Structural and functional information on the role of TNR sequences in RNA and protein is crucial to understand the effect of TNR expansions in neurodegeneration. Therefore, this review intends to provide to the reader a structural and functional view of TNR and encoded homopeptide expansions, with a particular emphasis on polyQ expansions and its role at inducing the self-assembly, aggregation and functional alterations of the carrier protein, which culminates in neuronal toxicity and cell death. Detail will be given to the Machado-Joseph Disease-causative and polyQ-containing protein, ataxin-3, providing clues for the impact of polyQ expansion and its flanking regions in the modulation of ataxin-3 molecular interactions, function, and aggregation.
Collapse
Affiliation(s)
- Bruno Almeida
- Instituto de Biologia Molecular e Celular, Universidade do Porto , Porto , Portugal
| | | | | | | |
Collapse
|
14
|
Lyubchenko YL, Krasnoslobodtsev AV, Luca S. Fibrillogenesis of huntingtin and other glutamine containing proteins. Subcell Biochem 2012; 65:225-51. [PMID: 23225006 PMCID: PMC4226413 DOI: 10.1007/978-94-007-5416-4_10] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
This chapter focuses on the aggregation of glutamine containing peptides and proteins with an emphasis on huntingtin protein, whose aggregation leads to the development of Huntington's disease. The kinetics that leads to the formation of amyloids, the structure of aggregates of various types and the morphological mechanical properties of amyloid fibrils are described. The kinetics of amyloid fibril formation has been proposed to follow a nucleation dependent polymerization model, dependent upon the size of the nucleus. This model and the effect of the polyglutamine length on the nucleus size are reviewed. Aggregate structure is characterized at two different levels. The atomic-scale resolution structure of fibrillar and crystalline aggregates of polyglutamine containing proteins and peptides was determined by X-ray crystallography and solid-state nuclear magnetic resonance (NMR). The chapter outlines the results obtained by both these techniques. Atomic force microscopy (AFM) was instrumental in elucidating the morphology of fibrils, their organization and assembly. The chapter also discusses the high stability of amyloid fibrils, including their mechanical properties as revealed by AFM.
Collapse
Affiliation(s)
- Yuri L. Lyubchenko
- Department of Pharmaceutical Sciences, Nanoimaging Core Facility College of Pharmacy, University of Nebraska Medical Center, COP 1012, 986025 Nebraska Medical Center, Omaha, NE 68198-6025, USA
| | - Alexey V. Krasnoslobodtsev
- Department of Pharmaceutical Sciences, Nanoimaging Core Facility College of Pharmacy, University of Nebraska Medical Center, COP 1012, 986025 Nebraska Medical Center, Omaha, NE 68198-6025, USA
| | - Sorin Luca
- Department of Pharmaceutical Sciences, Nanoimaging Core Facility College of Pharmacy, University of Nebraska Medical Center, COP 1012, 986025 Nebraska Medical Center, Omaha, NE 68198-6025, USA
| |
Collapse
|
15
|
Luo H, Lin K, David A, Nijveen H, Leunissen JAM. ProRepeat: an integrated repository for studying amino acid tandem repeats in proteins. Nucleic Acids Res 2011; 40:D394-9. [PMID: 22102581 PMCID: PMC3245022 DOI: 10.1093/nar/gkr1019] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
ProRepeat (http://prorepeat.bioinformatics.nl/) is an integrated curated repository and analysis platform for in-depth research on the biological characteristics of amino acid tandem repeats. ProRepeat collects repeats from all proteins included in the UniProt knowledgebase, together with 85 completely sequenced eukaryotic proteomes contained within the RefSeq collection. It contains non-redundant perfect tandem repeats, approximate tandem repeats and simple, low-complexity sequences, covering the majority of the amino acid tandem repeat patterns found in proteins. The ProRepeat web interface allows querying the repeat database using repeat characteristics like repeat unit and length, number of repetitions of the repeat unit and position of the repeat in the protein. Users can also search for repeats by the characteristics of repeat containing proteins, such as entry ID, protein description, sequence length, gene name and taxon. ProRepeat offers powerful analysis tools for finding biological interesting properties of repeats, such as the strong position bias of leucine repeats in the N-terminus of eukaryotic protein sequences, the differences of repeat abundance among proteomes, the functional classification of repeat containing proteins and GC content constrains of repeats’ corresponding codons.
Collapse
Affiliation(s)
- Hong Luo
- Laboratory of Bioinformatics, Wageningen University and Research Centre, PO Box 569, 6700 AN Wageningen, Netherlands
| | | | | | | | | |
Collapse
|
16
|
The Rate of PolyQ-Mediated Aggregation Is Dramatically Affected by the Number and Location of Surrounding Domains. J Mol Biol 2011; 413:879-87. [DOI: 10.1016/j.jmb.2011.09.014] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2011] [Revised: 09/08/2011] [Accepted: 09/12/2011] [Indexed: 11/21/2022]
|
17
|
Ishii Y, Takizawa T, Iwasaki H, Fujita Y, Murakami M, Groppe JC, Tanaka K. Nucleotide polymorphisms in the canine Noggin gene and their distribution among dog (Canis lupus familiaris) breeds. Biochem Genet 2011; 50:12-8. [PMID: 21882044 PMCID: PMC3258390 DOI: 10.1007/s10528-011-9453-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Accepted: 04/28/2011] [Indexed: 11/30/2022]
Abstract
Noggin (NOG) is an important regulator for the signaling of bone morphogenetic proteins. In this study, we sequenced the complete coding sequence of the canine NOG gene and characterized the nucleotide polymorphisms. The sequence length varied from 717 to 729 bp, depending on the number of a 6-bp tandem repeat unit (GGCGCG), an insertion that has not been observed in other mammalian NOG genes investigated to date. It results in extensions of (Gly–Ala)3–5 in the putative NOG protein. To survey the distribution of these tandem repeat polymorphisms, we analyzed 126 individuals in seven dog breeds. We identified only three alleles: (GGCGCG)3, (GGCGCG)4, and (GGCGCG)5. Although the allele frequencies were remarkably different among the breeds, the three alleles were present in all seven of the breeds and did not show any deviation from Hardy–Weinberg equilibrium.
Collapse
Affiliation(s)
- Yuji Ishii
- Laboratory of Animal Biotechnology, School of Veterinary Medicine, Azabu University, Chuou-ku, Sagamihara, 252-5201, Japan
| | | | | | | | | | | | | |
Collapse
|
18
|
Saunders HM, Gilis D, Rooman M, Dehouck Y, Robertson AL, Bottomley SP. Flanking domain stability modulates the aggregation kinetics of a polyglutamine disease protein. Protein Sci 2011; 20:1675-81. [PMID: 21780213 DOI: 10.1002/pro.698] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2011] [Revised: 06/29/2011] [Accepted: 06/30/2011] [Indexed: 11/12/2022]
Abstract
Spinocerebellar Ataxia Type 3 (SCA3) is one of nine polyglutamine (polyQ) diseases that are all characterized by progressive neuronal dysfunction and the presence of neuronal inclusions containing aggregated polyQ protein, suggesting that protein misfolding is a key part of this disease. Ataxin-3, the causative protein of SCA3, contains a globular, structured N-terminal domain (the Josephin domain) and a flexible polyQ-containing C-terminal tail, the repeat-length of which modulates pathogenicity. It has been suggested that the fibrillogenesis pathway of ataxin-3 begins with a non-polyQ-dependent step mediated by Josephin domain interactions, followed by a polyQ-dependent step. To test the involvement of the Josephin domain in ataxin-3 fibrillogenesis, we have created both pathogenic and nonpathogenic length ataxin-3 variants with a stabilized Josephin domain, and have both stabilized and destabilized the isolated Josephin domain. We show that changing the thermodynamic stability of the Josephin domain modulates ataxin-3 fibrillogenesis. These data support the hypothesis that the first stage of ataxin-3 fibrillogenesis is caused by interactions involving the non-polyQ containing Josephin domain and that the thermodynamic stability of this domain is linked to the aggregation propensity of ataxin-3.
Collapse
Affiliation(s)
- Helen M Saunders
- Department of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | | | | | | | | | | |
Collapse
|