1
|
RepEx: A web server to extract sequence repeats from protein and DNA sequences. Comput Biol Chem 2018; 78:424-430. [PMID: 30598392 DOI: 10.1016/j.compbiolchem.2018.12.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 12/25/2018] [Indexed: 11/20/2022]
Abstract
Evolution builds up new genetic material from existing ones, not in random, but in highly ordered and eloquent patterns. Most of these sequence repeats are revelatory of valuable information contributing to areas of disease research and function of macromolecules, to name a few. In the age of next generation genome sequencing, rapid and efficient extraction of all unbiased sequence repeats from macromolecules is the need of the hour. In view of this reckoning, an online web-based computing server, RepEx, has been developed to extract and display all possible repeats for DNA and protein sequences. Apart from exact or identical repeats, the server has been designed adeptly to identify and extract degenerate, inverted, everted and mirror repeats from both DNA and protein sequences. The server has striking output displays, featuring interactive graphs and comprehensive output files. In addition, RepEx has been accoutered with an easy-to-use interface and search filters to facilitate a user-defined query or search and is freely available and accessible via the World Wide Web at http://bioserver2.physics.iisc.ac.in/RepEx/.
Collapse
|
2
|
Abstract
This review explores the presence and functions of polyglutamine (polyQ) in viral proteins. In mammals, mutations in polyQ segments (and CAG repeats at the nucleotide level) have been linked to neural disorders and ataxias. PolyQ regions in normal human proteins have documented functional roles, in transcription factors and, more recently, in regulating autophagy. Despite the high frequency of polyQ repeats in eukaryotic genomes, little attention has been given to the presence or possible role of polyQ sequences in virus genomes. A survey described here revealed that polyQ repeats occur rarely in RNA viruses, suggesting that they have detrimental effects on virus replication at the nucleotide or protein level. However, there have been sporadic reports of polyQ segments in potyviruses and in reptilian nidoviruses (among the largest RNA viruses known). Conserved polyQ segments are found in the regulatory control proteins of many DNA viruses. Variable length polyQ tracts are found in proteins that contribute to transmissibility (cowpox A-type inclusion protein (ATI)) and control of latency (herpes viruses). New longer-read sequencing methods, using original biological samples, should reveal more details on the presence and functional role of polyQ in viruses, as well as the nucleotide regions that encode them. Given the known toxic effects of polyQ repeats, the role of these segments in neurovirulent and tumorigenic viruses should be further explored.
Collapse
|
3
|
Involvement of non-polyalanine (polyA) residues in aggregation of polyA proteins: Clue for inhibition of aggregation. Comput Biol Chem 2014; 53PB:318-323. [PMID: 25462338 DOI: 10.1016/j.compbiolchem.2014.11.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2014] [Revised: 11/11/2014] [Accepted: 11/11/2014] [Indexed: 11/21/2022]
Abstract
Presence of polyalanine (polyA) stretches in some proteins is found to be associated with their aggregation, which causes disorders in various developmental processes. In this work, inherent propensities towards aggregation of some residues, which are not part of the polyA stretches, have been identified by using the primary sequences of seven polyA proteins with the help of Betascan, PASTA and Tango programs and explored unambiguously. This provides a basis for proposing molecular mechanism of this type of aggregation. Reported suppression of aggregation of polyA proteins by chaperones like HSP40 and HSP70 is substantiated through molecular docking. The hydrophobic residues of identified aggregating region are found to be interacting with hydrophobic surface of chaperones. This suggests a crucial clue for possible way to inhibit the aggregation of such proteins.
Collapse
|
4
|
Atanesyan L, Günther V, Dichtl B, Georgiev O, Schaffner W. Polyglutamine tracts as modulators of transcriptional activation from yeast to mammals. Biol Chem 2012; 393:63-70. [DOI: 10.1515/bc-2011-252] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 11/09/2011] [Indexed: 11/15/2022]
Abstract
Abstract
Microsatellite repeats are genetically unstable and subject to expansion and shrinkage. A subset of them, triplet repeats, can occur within the coding region and specify homomeric tracts of amino acids. Polyglutamine (polyQ) tracts are enriched in eukaryotic regulatory proteins, notably transcription factors, and we had shown before that they can contribute to transcriptional activation in mammalian cells. Here we generalize this finding by also including evolutionarily divergent organisms, namely, Drosophila and baker’s yeast. In all three systems, Gal4-based model transcription factors were more active if they harbored a polyQ tract, and the activity depended on the length of the tract. By contrast, a polyserine tract was inactive. PolyQs acted from either an internal or a C-terminal position, thus ruling out a merely structural ‘linker’ effect. Finally, a two-hybrid assay in mammalian cells showed that polyQ tracts can interact with each other, supporting the concept that a polyQ-containing transcription factor can recruit other factors with polyQ tracts or glutamine-rich activation domains. The widespread occurrence of polyQ repeats in regulatory proteins suggests a beneficial role; in addition to the contribution to transcriptional activity, their genetic instability might help a species to adapt to changing environmental conditions in a potentially reversible manner.
Collapse
|
5
|
Haerty W, Golding GB. Low-complexity sequences and single amino acid repeats: not just "junk" peptide sequences. Genome 2011; 53:753-62. [PMID: 20962881 DOI: 10.1139/g10-063] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
For decades proteins were thought to interact in a "lock and key" system, which led to the definition of a paradigm linking stable three-dimensional structure to biological function. As a consequence, any non-structured peptide was considered to be nonfunctional and to evolve neutrally. Surprisingly, the most commonly shared peptides between eukaryotic proteomes are low-complexity sequences that in most conditions do not present a stable three-dimensional structure. However, because these sequences evolve rapidly and because the size variation of a few of them can have deleterious effects, low-complexity sequences have been suggested to be the target of selection. Here we review evidence that supports the idea that these simple sequences should not be considered just "junk" peptides and that selection drives the evolution of many of them.
Collapse
Affiliation(s)
- Wilfried Haerty
- Biology Department, McMaster University, Hamilton, ON, Canada
| | | |
Collapse
|
6
|
Whan V, Hobbs M, McWilliam S, Lynn DJ, Lutzow YS, Khatkar M, Barendse W, Raadsma H, Tellam RL. Bovine proteins containing poly-glutamine repeats are often polymorphic and enriched for components of transcriptional regulatory complexes. BMC Genomics 2010; 11:654. [PMID: 21092319 PMCID: PMC3014979 DOI: 10.1186/1471-2164-11-654] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2010] [Accepted: 11/23/2010] [Indexed: 11/12/2022] Open
Abstract
Background About forty human diseases are caused by repeat instability mutations. A distinct subset of these diseases is the result of extreme expansions of polymorphic trinucleotide repeats; typically CAG repeats encoding poly-glutamine (poly-Q) tracts in proteins. Polymorphic repeat length variation is also apparent in human poly-Q encoding genes from normal individuals. As these coding sequence repeats are subject to selection in mammals, it has been suggested that normal variations in some of these typically highly conserved genes are implicated in morphological differences between species and phenotypic variations within species. At present, poly-Q encoding genes in non-human mammalian species are poorly documented, as are their functions and propensities for polymorphic variation. Results The current investigation identified 178 bovine poly-Q encoding genes (Q ≥ 5) and within this group, 26 genes with orthologs in both human and mouse that did not contain poly-Q repeats. The bovine poly-Q encoding genes typically had ubiquitous expression patterns although there was bias towards expression in epithelia, brain and testes. They were also characterised by unusually large sizes. Analysis of gene ontology terms revealed that the encoded proteins were strongly enriched for functions associated with transcriptional regulation and many contributed to physical interaction networks in the nucleus where they presumably act cooperatively in transcriptional regulatory complexes. In addition, the coding sequence CAG repeats in some bovine genes impacted mRNA splicing thereby generating unusual transcriptional diversity, which in at least one instance was tissue-specific. The poly-Q encoding genes were prioritised using multiple criteria for their likelihood of being polymorphic and then the highest ranking group was experimentally tested for polymorphic variation within a cattle diversity panel. Extensive and meiotically stable variation was identified. Conclusions Transcriptional diversity can potentially be generated in poly-Q encoding genes by the impact of CAG repeat tracts on mRNA alternative splicing. This effect, combined with the physical interactions of the encoded proteins in large transcriptional regulatory complexes suggests that polymorphic variations of proteins in these complexes have strong potential to affect phenotype.
Collapse
Affiliation(s)
- Vicki Whan
- CSIRO Livestock Industries, Queensland Bioscience Precinct, 306 Carmody Rd, St Lucia, Queensland 4067, Australia
| | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Mularoni L, Ledda A, Toll-Riera M, Albà MM. Natural selection drives the accumulation of amino acid tandem repeats in human proteins. Genome Res 2010; 20:745-54. [PMID: 20335526 DOI: 10.1101/gr.101261.109] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Amino acid tandem repeats are found in a large number of eukaryotic proteins. They are often encoded by trinucleotide repeats and exhibit high intra- and interspecies size variability due to the high mutation rate associated with replication slippage. The extent to which natural selection is important in shaping amino acid repeat evolution is a matter of debate. On one hand, their high frequency may simply reflect their high probability of expansion by slippage, and they could essentially evolve in a neutral manner. On the other hand, there is experimental evidence that changes in repeat size can influence protein-protein interactions, transcriptional activity, or protein subcellular localization, indicating that repeats could be functionally relevant and thus shaped by selection. To gauge the relative contribution of neutral and selective forces in amino acid repeat evolution, we have performed a comparative analysis of amino acid repeat conservation in a large set of orthologous proteins from 12 vertebrate species. As a neutral model of repeat evolution we have used sequences with the same DNA triplet composition as the coding sequences--and thus expected to be subject to the same mutational forces--but located in syntenic noncoding genomic regions. The results strongly indicate that selection has played a more important role than previously suspected in amino acid tandem repeat evolution, by increasing the repeat retention rate and by modulating repeat size. The data obtained in this study have allowed us to identify a set of 92 repeats that are postulated to play important functional roles due to their strong selective signature, including five cases with direct experimental evidence.
Collapse
Affiliation(s)
- Loris Mularoni
- Biomedical Informatics Research Programme (GRIB), Fundació Institut Municipal d'Investigació Mèdica, Barcelona 08003, Spain
| | | | | | | |
Collapse
|
8
|
Yenamandra SP, Sompallae R, Klein G, Kashuba E. Comparative analysis of the Epstein-Barr virus encoded nuclear proteins of EBNA-3 family. Comput Biol Med 2009; 39:1036-42. [PMID: 19762010 DOI: 10.1016/j.compbiomed.2009.08.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2009] [Revised: 08/12/2009] [Accepted: 08/18/2009] [Indexed: 10/20/2022]
Abstract
It is known that the EBNA-3 family proteins (EBNA-3, -4 and -6, alternative nomenclature EBNA-3A, B and C correspondingly) show a limited sequence similarity. We have analyzed EBNA-3 proteins both at the primary sequence and secondary structure levels. EBNA-3 and EBNA-4 were structurally more similar compared to other combinations with EBNA-6. We found "Stonin Homology Domain" profile in EBNA-4 and "Proline Rich Domain" in all EBNA-3 family of proteins. We have also found positive and negative charge clusters in all three proteins and mixed charge clusters in EBNA-3. Charged clusters are believed to play an important role in interactions with DNA or signaling proteins. Additionally, unique primary sequence repeats were found in all three proteins.
Collapse
Affiliation(s)
- Surya Pavan Yenamandra
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, S-17177 Stockholm, Sweden
| | | | | | | |
Collapse
|
9
|
Simon M, Hancock JM. Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins. Genome Biol 2009; 10:R59. [PMID: 19486509 PMCID: PMC2718493 DOI: 10.1186/gb-2009-10-6-r59] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2009] [Accepted: 06/01/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Amino acid repeats (AARs) are common features of protein sequences. They often evolve rapidly and are involved in a number of human diseases. They also show significant associations with particular Gene Ontology (GO) functional categories, particularly transcription, suggesting they play some role in protein function. It has been suggested recently that AARs play a significant role in the evolution of intrinsically unstructured regions (IURs) of proteins. We investigate the relationship between AAR frequency and evolution and their localization within proteins based on a set of 5,815 orthologous proteins from four mammalian (human, chimpanzee, mouse and rat) and a bird (chicken) genome. We consider two classes of AAR (tandem repeats and cryptic repeats: regions of proteins containing overrepresentations of short amino acid repeats). RESULTS Mammals show very similar repeat frequencies but chicken shows lower frequencies of many of the cryptic repeats common in mammals. Regions flanking tandem AARs evolve more rapidly than the rest of the protein containing the repeat and this phenomenon is more pronounced for non-conserved repeats than for conserved ones. GO associations are similar to those previously described for the mammals, but chicken cryptic repeats show fewer significant associations. Comparing the overlaps of AARs with IURs and protein domains showed that up to 96% of some AAR types are associated preferentially with IURs. However, no more than 15% of IURs contained an AAR. CONCLUSIONS Their location within IURs explains many of the evolutionary properties of AARs. Further study is needed on the types of IURs containing AARs.
Collapse
Affiliation(s)
- Michelle Simon
- Bioinformatics Group, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Harwell, Oxfordshire, OX11 0RD, UK
| | - John M Hancock
- Bioinformatics Group, MRC Harwell, Mammalian Genetics Unit, Harwell Science and Innovation Campus, Harwell, Oxfordshire, OX11 0RD, UK
| |
Collapse
|
10
|
Cole TJ. Glucocorticoid action and the development of selective glucocorticoid receptor ligands. ACTA ACUST UNITED AC 2007; 12:269-300. [PMID: 17045197 DOI: 10.1016/s1387-2656(06)12008-6] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Glucocorticoids are important endocrine regulators of a wide range of physiological systems ranging from respiratory development, immune function to responses to stress. Glucocorticoids in cells activate the cytoplasmic glucocorticoid receptor (GR) that dimerizes, translocates to the nucleus and functions as a ligand-dependent transcriptional regulator. Synthetic glucocorticoids such as dexamethasone and prednisolone have for decades been the cornerstone for the clinical treatment of inflammatory diseases, such as rheumatoid arthritis and asthma, and in some lymphoid cancers, yet its prolonged use has undesirable side effects such as obesity, diabetes, immune suppression and osteoporosis. Detailed knowledge on the mechanism of GR action has led to the development of novel selective glucocorticoid receptor modulators (SGRMs) that show promise of being efficacious for specific treatments of disease but with fewer side effects. SGRMs promote specific recruitment of transcriptional co-regulators that elicit specific gene responses and show promise of greater efficacy and specificity in treatment of inflammatory diseases and type-2 diabetes.
Collapse
Affiliation(s)
- Timothy J Cole
- Department of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia.
| |
Collapse
|
11
|
Mularoni L, Veitia RA, Albà MM. Highly constrained proteins contain an unexpectedly large number of amino acid tandem repeats. Genomics 2006; 89:316-25. [PMID: 17196365 DOI: 10.1016/j.ygeno.2006.11.011] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2006] [Revised: 10/30/2006] [Accepted: 11/22/2006] [Indexed: 11/16/2022]
Abstract
Single-amino-acid tandem repeats are very common in mammalian proteins but their function and evolution are still poorly understood. Here we investigate how the variability and prevalence of amino acid repeats are related to the evolutionary constraints operating on the proteins. We find a significant positive correlation between repeat size difference and protein nonsynonymous substitution rate in human and mouse orthologous genes. This association is observed for all the common amino acid repeat types and indicates that rapid diversification of repeat structures, involving both trinucleotide slippage and nucleotide substitutions, preferentially occurs in proteins subject to low selective constraints. However, strikingly, we also observe a significant negative correlation between the number of repeats in a protein and the gene nonsynonymous substitution rate, particularly for glutamine, glycine, and alanine repeats. This implies that proteins subject to strong selective constraints tend to contain an unexpectedly high number of repeats, which tend to be well conserved between the two species. This is consistent with a role for selection in the maintenance of a significant number of repeats. Analysis of the codon structure of the sequences encoding the repeats shows that codon purity is associated with high repeat size interspecific variability. Interestingly, polyalanine and polyglutamine repeats associated with disease show very distinctive features regarding the degree of repeat conservation and the protein sequence selective constraints.
Collapse
Affiliation(s)
- Loris Mularoni
- Research Unit on Biomedical Informatics, Institut Municipal d'Investigació Mèdica, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | | | | |
Collapse
|
12
|
Li X, Kahveci T. A Novel algorithm for identifying low-complexity regions in a protein sequence. ACTA ACUST UNITED AC 2006; 22:2980-7. [PMID: 17018537 DOI: 10.1093/bioinformatics/btl495] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION We consider the problem of identifying low-complexity regions (LCRs) in a protein sequence. LCRs are regions of biased composition, normally consisting of different kinds of repeats. RESULTS We define new complexity measures to compute the complexity of a sequence based on a given scoring matrix, such as BLOSUM 62. Our complexity measures also consider the order of amino acids in the sequence and the sequence length. We develop a novel graph-based algorithm called GBA to identify LCRs in a protein sequence. In the graph constructed for the sequence, each vertex corresponds to a pair of similar amino acids. Each edge connects two pairs of amino acids that can be grouped together to form a longer repeat. GBA finds short subsequences as LCR candidates by traversing this graph. It then extends them to find longer subsequences that may contain full repeats with low complexities. Extended subsequences are then post-processed to refine repeats to LCRs. Our experiments on real data show that GBA has significantly higher recall compared to existing algorithms, including 0j.py, CARD, and SEG. AVAILABILITY The program is available on request.
Collapse
Affiliation(s)
- Xuehui Li
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL 32611, USA.
| | | |
Collapse
|
13
|
Pasolini P, Costagliola D, Rocco L, Tinti F. Molecular organization of 5S rDNAs in Rajidae (Chondrichthyes): Structural features and evolution of piscine 5S rRNA genes and nontranscribed intergenic spacers. J Mol Evol 2006; 62:564-74. [PMID: 16612546 DOI: 10.1007/s00239-005-0118-z] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2005] [Accepted: 01/12/2006] [Indexed: 12/17/2022]
Abstract
The genomic and gene organisation of 5S rDNA clusters have been extensively characterized in bony fish and eukaryotes, providing general issues for understanding the molecular evolution of this multigene DNA family. By contrast, the 5S rDNA features have been rarely investigated in cartilaginous fish (only three species). Here, we provide evidence for a dual 5S rDNA gene system in the Rajidae by sequence analysis of the coding region (5S) and adjacent nontranscribed spacer (NTS) in five Mediterranean species of rays (Rajidae), and in a large number of piscine taxa including lampreys and bony fish. As documented in several bony fish, two functional 5S rDNA types were found here also in the rajid genome: a short one (I) and a long one (II), distinguished by distinct 5S and NTS sequences. That the ancestral piscine genome had these two 5S rDNA loci might be argued from the occurrence of homologous dual gene systems that exist in several fish taxa and from 5S phylogenetic relationships. An extensive analysis of NTS-II sequences of Rajidae and Dasyatidae revealed the occurrence of large simple sequence repeat (SSR) regions that are formed by microsatellite arrays. The localization and organization of SSR within the NTS-II are conserved in Rajiformes since the Upper Cretaceous. The direct correlation between the SSRs extension and the NTS length indicated that they might play a role in the maintenance of the larger 5S rDNA clusters in rays. The phylogenetic analysis indicated that NTS-II is a valuable systematic tool limited to distantly related taxa of Rajiformes.
Collapse
Affiliation(s)
- Paola Pasolini
- Molecular Genetics for Environmental and Fishery Resources Laboratory, Interdept. Centre of Research in Environmental Sciences, University of Bologna, Ravenna, Italy
| | | | | | | |
Collapse
|
14
|
van Limpt V, Schramm A, van Lakeman A, Sluis P, Chan A, van Noesel M, Baas F, Caron H, Eggert A, Versteeg R. The Phox2B homeobox gene is mutated in sporadic neuroblastomas. Oncogene 2005; 23:9280-8. [PMID: 15516980 DOI: 10.1038/sj.onc.1208157] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Neuroblastomas are embryonal tumours of the sympatho-adrenal lineage with a clinical course ranging from spontaneous regression to fatal progression. The Phox2B homeobox transcription factor functions in the differentiation of the sympatho-adrenal lineage. Targets of Phox2B are, for example, genes of the (nor)adrenalin synthesis route, like Dopamine Beta Hydroxylase (DBH). Congenital Central Hypoventilation Syndrome was recently found to result from Phox2B mutations and two such patients in addition developed neuroblastoma. A germline mutation in Phox2B was identified in a family with hereditary neuroblastoma. Here, we report the first analysis of Phox2B in a series of 237 sporadic neuroblastomas and 22 cell lines. Six frameshift mutations were found in exons 2 and 3; including one in cell line SK-N-SH. Two patients showed de novo constitutional mutations. One of them was diagnosed with Haddad syndrome. All analysed cases expressed the mutated and wild-type Phox2B alleles. Ectopic expression of TrkA, the Nerve Growth Factor receptor, strongly downregulated Phox2B and DBH expression in cell line SH-SY5Y. However, TrkA and Phox2B showed a positive correlation in a panel of 66 neuroblastoma tumours. Although Phox2B mutations are infrequent (2.3%), they implicate a role for the Phox2B pathway in oncogenesis.
Collapse
Affiliation(s)
- Vera van Limpt
- Department of Human Genetics, Academic Medical Center, University of Amsterdam, PO Box 22700, 1100 DE Amsterdam, The Netherlands
| | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Hancock JM, Simon M. Simple sequence repeats in proteins and their significance for network evolution. Gene 2005; 345:113-8. [PMID: 15716087 DOI: 10.1016/j.gene.2004.11.023] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2004] [Revised: 11/08/2004] [Accepted: 11/09/2004] [Indexed: 11/16/2022]
Abstract
Only 5-6% of mammalian genomes are genes; the remainders are made up primarily of transposable elements and different types of simple sequence repeat (SSRs) (micro- and minisatellites and cryptic repeats), which tend to accumulate in organisms with larger genomes. SSRs are also found at the level of protein sequences and may or may not be encoded by SSRs at the DNA sequence level. Studies of proteins containing SSRs indicate that they tend to belong to particular functional classes, particularly transcription factors and protein kinases. Protein SSRs coded for by pure codon repeats evolve rapidly while those encoded by mixtures of codons evolve slowly. We outline a conceptualization of how protein SSRs may arise and become fixed in proteins during evolution, and suggest that emergence and change in length of protein SSRs may affect the topology of protein interaction networks.
Collapse
Affiliation(s)
- John M Hancock
- Bioinformatics Group, MRC Mammalian Genetics Unit, Harwell, Oxfordshire OX11 0RD, UK.
| | | |
Collapse
|
16
|
Abstract
Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.
Collapse
Affiliation(s)
- M Mar Albà
- Grup de Recerca en Informàtica Biomèdica, Departament de Ciències Experimentals i de la Salut, Institut Municipal d'Investigació Mèdica, Universitat Pompeu Fabra, 08003 Barcelona, Spain.
| | | |
Collapse
|
17
|
Aurrekoetxea-Hernández K, Buetti E. Transforming growth factor beta enhances the glucocorticoid response of the mouse mammary tumor virus promoter through Smad and GA-binding proteins. J Virol 2004; 78:2201-11. [PMID: 14963116 PMCID: PMC369214 DOI: 10.1128/jvi.78.5.2201-2211.2004] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Tissue-specific transcription is advantageously investigated by using viral promoters, which are selected for compact regulatory elements. Mouse mammary tumor virus (MMTV) has adapted to specialized cell types and targets initially B lymphocytes. We previously showed that, in B-cell lines, glucocorticoid-induced MMTV transcription requires an ETS family factor, GA-binding protein (GABP), bound in tandem to the MMTV DNA next to the glucocorticoid receptor (GR). We now report that transforming growth factor beta (TGF-beta) superinduces this response up to 10-fold through binding of its effectors, Smads, between the GABP-binding motifs. The basal level was unaffected. The TGF-beta-glucocorticoid cooperation also depended on GR and GABP binding, was transferable to another promoter, and occurred both with transiently transfected and with integrated templates. Smad3 associated in vitro with GR, with GABPalpha (via the MH2 domain), and with GABPbeta, Smad4 only with GABPalpha. Interactions of Smad3 with GABP (when coexpressed or endogenous to B cells) were shown by coprecipitation and by mammalian two-hybrid assay. This composite DNA element integrates three signaling pathways deriving from TGF-beta, glucocorticoid hormones, and a unique ETS factor, and may allow MMTV to exploit factors from the milk. It may as well indicate novel possibilities for cellular regulatory networks.
Collapse
|
18
|
Abstract
We have identified Xenopus Id4, a member of the Id (inhibitor of differentiation/DNA binding) class of helix-loop-helix proteins. Id factors dimerize with general bHLH factors, preventing their interaction with tissue-specific bHLH factors, to inhibit premature differentiation. The presence of several Id proteins could reflect simple redundancy in function, or more interestingly, might suggest different activities for these proteins. During embryonic development, Xenopus Id4 is expressed in a number of neural tissues, including Rohon-Beard neurons, olfactory placode, eye primordia, and the trigeminal ganglia. It is also expressed in other organs, such as the pronephros and liver primordium. As embryogenesis progresses, it is expressed in the migrating melanocytes and lateral line structures. We compare the expression of Id4 mRNA with that of Id2 and Id3 and find that the Id genes are expressed in complementary patterns during neurogenesis, myogenesis, kidney development, in the tailbud, and in the migrating neural crest. To examine the regulation of Id gene expression during Xenopus neural development, we show that expression of Id3 and Id4 can be induced by overexpression of BMP4 in the whole embryo and in ectodermal explants. Expression of Id2, Id3, and Id4 in these explants is unaffected by the expression of FGF-8 or a dominant-negative Ras (N17ras), suggesting that Id genes are not regulated by the FGF signaling pathway in naive ectoderm. We also show that Notch signaling can activate Id2 and Id3 expression in the whole embryo. In contrast, Id4 expression in the Rohon-Beard cells is inhibited by activated Notch and increased by a dominant-negative Delta. This may reflect an increase in Rohon-Beard cells in response to inhibition of Notch signaling rather than transcriptional regulation of Id4. Finally, to compare the activities of Id2, Id3, and Id4, we use animal cap explants and in vivo overexpression to show that Id proteins can differentially inhibit the activities of neurogenin and neuroD, both neurogenic bHLH molecules and MyoD, a myogenic bHLH protein. Id4 is able to inhibit the activity all these bHLH molecules, Id2 inhibits MyoD and neuroD, while Id3 blocks only neuroD activity in our assays.
Collapse
Affiliation(s)
- Karen J Liu
- Department of Molecular and Cell Biology, 401 Barker Hall, University of California, Berkeley, CA 94720, USA
| | | |
Collapse
|
19
|
Lavoie H, Debeane F, Trinh QD, Turcotte JF, Corbeil-Girard LP, Dicaire MJ, Saint-Denis A, Pagé M, Rouleau GA, Brais B. Polymorphism, shared functions and convergent evolution of genes with sequences coding for polyalanine domains. Hum Mol Genet 2003; 12:2967-79. [PMID: 14519685 DOI: 10.1093/hmg/ddg329] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Mutations causing expansions of polyalanine domains are responsible for nine hereditary diseases. Other GC-rich sequences coding for some polyalanine domains were found to be polymorphic in human. These observations prompted us to identify all sequences in the human genome coding for polyalanine stretches longer than four alanines and establish their degree of polymorphism. We identified 494 annotated human proteins containing 604 polyalanine domains. Thirty-two percent (31/98) of tested sequences coding for more than seven alanines were polymorphic. The length of the polyalanine-coding sequence and its GCG or GCC repeat content are the major predictors of polymorphism. GCG codons are over-represented in human polyalanine coding sequences. Our data suggest that GCG and GCC codons play a key role in polyalanine-coding sequence appearance and polymorphism. The grouping by shared function of polyalanine-containing proteins in Homo sapiens, Drosophila melanogaster and Caenorhabditis elegans shows that the majority are involved in transcriptional regulation. Phylogenetic analyses of HOX, GATA and EVX protein families demonstrate that polyalanine domains arose independently in different members of these families, suggesting that convergent molecular evolution may have played a role. Finally polyalanine domains in vertebrates are conserved between mammals and are rarer and shorter in Gallus gallus and Danio rerio. Together our results show that the polymorphic nature of sequences coding for polyalanine domains makes them prime candidates for mutations in hereditary diseases and suggests that they have appeared in many different protein families through convergent evolution.
Collapse
Affiliation(s)
- Hugo Lavoie
- Laboratoire de Neurogénétique, Centre de Recherche du Centre Hospitalier de l'Université de Montréal, Québec, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Li YC, Korol AB, Fahima T, Beiles A, Nevo E. Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Mol Ecol 2002; 11:2453-65. [PMID: 12453231 DOI: 10.1046/j.1365-294x.2002.01643.x] [Citation(s) in RCA: 607] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Microsatellites, or tandem simple sequence repeats (SSR), are abundant across genomes and show high levels of polymorphism. SSR genetic and evolutionary mechanisms remain controversial. Here we attempt to summarize the available data related to SSR distribution in coding and noncoding regions of genomes and SSR functional importance. Numerous lines of evidence demonstrate that SSR genomic distribution is nonrandom. Random expansions or contractions appear to be selected against for at least part of SSR loci, presumably because of their effect on chromatin organization, regulation of gene activity, recombination, DNA replication, cell cycle, mismatch repair system, etc. This review also discusses the role of two putative mutational mechanisms, replication slippage and recombination, and their interaction in SSR variation.
Collapse
Affiliation(s)
- You-Chun Li
- Institute of Evolution, University of Haifa, Haifa 31905, Israel
| | | | | | | | | |
Collapse
|
21
|
Heeley RP, Rusconi SG, Sutcliffe RG, Kenyon CJ. Mutations flanking the polyglutamine repeat in the modulatory domain of rat glucocorticoid receptor lead to an increase in affinity for hormone. Endocr Res 2002; 28:217-29. [PMID: 12489571 DOI: 10.1081/erc-120015060] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
A polyglutamine repeat in the N-terminus of the rat glucocorticoid receptor shows polymorphism, with variants of Q2RQ5, Q2RQ15-21. We investigated whether these natural polymorphisms affect receptor function, and whether alleles with polyglutamine repeats shorter than Q2RQ5, between Q2RQ6-14, or longer than Q2RQ21 are not found naturally because they encode a dysfunctional receptor. Ligand binding and transactivation properties of sets of natural (Q2RQ5-Q2RQ21) and artificial (Q4-Q80) alleles were compared following expression in CV-1 cells. The sequence of artificial alleles at sites flanking the repeat region was altered slightly to facilitate cloning. Western blotting showed that all constructs expressed GR protein in CV-1 cells. When co-expressed with an MMTV-lacZ reporter plasmid, all GR proteins were shown to be transcriptionally active in the presence of hormone. Scatchard analysis of ligand binding curves showed that affinities for dexamethasone and corticosterone were not affected by variation in the polyglutamine repeat either the natural or artificial sets of alleles. However, affinities were greater for the artificial compared with the natural alleles (2-3-fold for dexametasone, p < 0.001; and 4-fold for corticosterone,p < 0.001). These differences provide evidence of a direct or indirect interaction within GR between the ligand binding domain and residues flanking the polyglutamine repeat of the N-terminal domain.
Collapse
Affiliation(s)
- Robert P Heeley
- Molecular Medicine Centre, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | | | | | | |
Collapse
|
22
|
St Pierre SE, Galindo MI, Couso JP, Thor S. Control ofDrosophilaimaginal disc development byrotundandroughened eye: differentially expressed transcripts of the same gene encoding functionally distinct zinc finger proteins. Development 2002; 129:1273-81. [PMID: 11874922 DOI: 10.1242/dev.129.5.1273] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The Drosophila rotund gene is required in the wings, antenna, haltere, proboscis and legs. A member of the Rac family of GTPases, denoted the rotund racGAP gene, was previously identified in the rotund region. However, previous studies indicated that rotund racGAP was not responsible for the rotund phenotypes and that the rotund gene had yet to be identified. We have isolated the rotund gene and show that it is a member of the Krüppel family of zinc finger genes. The adjacent roughened eye locus specifically affects the eye and is genetically separable from rotund. However, roughened eye and rotund are tightly linked, and we have therefore also isolated the roughened eye transcript. Intriguingly, we show that roughened eye is part of the rotund gene but is represented by a different transcript. The rotund and roughened eye transcripts result from the utilization of two different promoters that direct expression in non-overlapping domains in the larval imaginal discs. The predicted Rotund and Roughened Eye proteins share the same C-terminal region, including the zinc finger domain, but differ in their N-terminal regions. Each cDNA can rescue only the corresponding mutation and show negative effects when expressed in each others domain of expression. These results indicate that in addition to the differential expression of rotund and roughened eye, their proteins have distinct activities. rotund and roughened eye act downstream of early patterning genes such as dachshund and appear to be involved in Notch signaling by regulating Delta, scabrous and Serrate.
Collapse
Affiliation(s)
- Susan E St Pierre
- Department of Neurobiology, Harvard Medical School, 220 Longwood Avenue, Boston, MA 02115, USA
| | | | | | | |
Collapse
|
23
|
Albrecht AN, Schwabe GC, Stricker S, Böddrich A, Wanker EE, Mundlos S. The synpolydactyly homolog (spdh) mutation in the mouse -- a defect in patterning and growth of limb cartilage elements. Mech Dev 2002; 112:53-67. [PMID: 11850178 DOI: 10.1016/s0925-4773(01)00639-6] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
We have investigated the recessive mouse mutant synpolydactyly homolog (spdh) as a model for human synpolydactyly (SPD). As in human SPD, the spdh phenotype consists of central polydactyly, syndactyly and brachydactyly and is caused by the expansion of a polyalanine encoding repeat in the 5' region of the Hoxd13 gene. We performed a detailed phenotypic and functional analysis of spdh/spdh embryos using skeletal preparations, histology, in situ hybridization, BrdU labeling of proliferating cells, and in vitro expression studies. The absence of normal phalangeal joints and the misexpression of genes involved in joint formation demonstrate a role for Hox-genes in joint patterning. The spdh mutation results in abnormal limb pattering, defective chondrocyte differentiation, and in a drastic reduction in proliferation. Abnormal chondrocyte differentiation and proliferation persisted after birth and correlated with the expression of the mutant Hoxd13 and other Hox-genes during late-embryonic and postnatal growth.
Collapse
|
24
|
Rebuffat A, Bernasconi A, Ceppi M, Wehrli H, Verca SB, Ibrahim M, Frey BM, Frey FJ, Rusconi S. Selective enhancement of gene transfer by steroid-mediated gene delivery. Nat Biotechnol 2001; 19:1155-61. [PMID: 11731785 DOI: 10.1038/nbt1201-1155] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The incorporation of transgenes into the host cells' nuclei is problematic using conventional nonviral gene delivery technologies. Here we describe a strategy called steroid-mediated gene delivery (SMGD), which uses steroid receptors as shuttles to facilitate the uptake of transfected DNA into the nucleus. We use glucocorticoid receptors (GRs) as a model system with which to test the principle of SMGD. To this end, we synthesized and tested several bifunctional steroid derivatives, finally focusing on a compound named DR9NP, consisting of a dexamethasone backbone linked to a psoralen moiety using a nine-atom chemical spacer. DR9NP binds to the GR in either its free or DNA-crosslinked form, inducing the translocation of the GR to the nucleus. The expression of transfected DR9NP-decorated reporter plasmids is enhanced in dividing cells: expression of steroid-decorated reporter plasmids depends on the presence of the GR, is independent of the transactivation potential of the GR, and correlates with enhanced nuclear accumulation of the transgene in GR-positive cells. The SMGD effect is also observed in cells naturally expressing GRs and is significantly increased in nondividing cell cultures. We propose that SMGD could be used as a platform for selective targeting of transgenes in nonviral somatic gene transfer.
Collapse
Affiliation(s)
- A Rebuffat
- Division of Nephrology, Inselspital, Bern, Switzerland
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Hancock JM, Worthey EA, Santibáñez-Koref MF. A Role for Selection in Regulating the Evolutionary Emergence of Disease-Causing and Other Coding CAG Repeats in Humans and Mice. Mol Biol Evol 2001; 18:1014-23. [PMID: 11371590 DOI: 10.1093/oxfordjournals.molbev.a003873] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The evolutionary expansion of CAG repeats in human triplet expansion disease genes is intriguing because of their deleterious phenotype. In the past, this expansion has been suggested to reflect a broad genomewide expansion of repeats, which would imply that mutational and evolutionary processes acting on repeats differ between species. Here, we tested this hypothesis by analyzing repeat- and flanking-sequence evolution in 28 repeat-containing genes that had been sequenced in humans and mice and by considering overall lengths and distributions of CAG repeats in the two species. We found no evidence that these repeats were longer in humans than in mice. We also found no evidence for preferential accumulation of CAG repeats in the human genome relative to mice from an analysis of the lengths of repeats identified in sequence databases. We then investigated whether sequence properties, such as base and amino acid composition and base substitution rates, showed any relationship to repeat evolution. We found that repeat-containing genes were enriched in certain amino acids, presumably as the result of selection, but that this did not reflect underlying biases in base composition. We also found that regions near repeats showed higher nonsynonymous substitution rates than the remainder of the gene and lower nonsynonymous rates in genes that contained a repeat in both the human and the mouse. Higher rates of nonsynonymous mutation in the neighborhood of repeats presumably reflect weaker purifying selection acting in these regions of the proteins, while the very low rate of nonsynonymous mutation in proteins containing a CAG repeat in both species presumably reflects a high level of purifying selection. Based on these observations, we propose that the mutational processes giving rise to polyglutamine repeats in human and murine proteins do not differ. Instead, we propose that the evolution of polyglutamine repeats in proteins results from an interplay between mutational processes and selection.
Collapse
Affiliation(s)
- J M Hancock
- MRC Clinical Sciences Centre, Imperial College School of Medicine, Hammersmith Hospital, London, England.
| | | | | |
Collapse
|
26
|
Dean DA, Urban G, Aragon IV, Swingle M, Miller B, Rusconi S, Bueno M, Dean NM, Honkanen RE. Serine/threonine protein phosphatase 5 (PP5) participates in the regulation of glucocorticoid receptor nucleocytoplasmic shuttling. BMC Cell Biol 2001; 2:6. [PMID: 11389770 PMCID: PMC32197 DOI: 10.1186/1471-2121-2-6] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2000] [Accepted: 05/17/2001] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND In most cells glucocorticoid receptors (GR) reside predominantly in the cytoplasm. Upon hormone binding, the GR translocates into the nucleus, where the hormone-activated GR-complex regulates the transcription of GR-responsive genes. Serine/threonine protein phosphatase type 5 (PP5) associates with the GR-heat-shock protein-90 complex, and the suppression of PP5 expression with ISIS 15534 stimulates the activity of GR-responsive reporter plasmids, without affecting the binding of hormone to the GR. RESULTS To further characterize the mechanism by which PP5 affects GR-induced gene expression, we employed immunofluorescence microscopy to track the movement of a GR-green fluorescent fusion protein (GR-GFP) that retained hormone binding, nuclear translocation activity and specific DNA binding activity, but is incapable of transactivation. In the absence of glucocorticoids, GR-GFP localized mainly in the cytoplasm. Treatment with dexamethasone results in the efficient translocation of GR-GFPs into the nucleus. The nuclear accumulation of GR-GFP, without the addition of glucocorticoids, was also observed when the expression of PP5 was suppressed by treatment with ISIS 15534. In contrast, ISIS 15534 treatment had no apparent effect on calcium induced nuclear translocation of NFAT-GFP. CONCLUSION These studies suggest that PP5 participates in the regulation of glucocorticoid receptor nucleocytoplasmic shuttling, and that the GR-induced transcriptional activity observed when the expression of PP5 is suppressed by treatment with ISIS 15534 results from the nuclear accumulation of GR in a form that is capable of binding DNA yet still requires agonist to elicit maximal transcriptional activation.
Collapse
Affiliation(s)
- David A Dean
- Division of Pulmonary and Critical Care Medicine, Northwestern University, Chicago, IL 60611 USA
| | - Gudrun Urban
- Department of Biochemistry and Molecular Biology, University of South Alabama, Mobile, Alabama 36688 USA
| | - Ileana V Aragon
- Department of Biochemistry and Molecular Biology, University of South Alabama, Mobile, Alabama 36688 USA
| | - Mark Swingle
- Department of Biochemistry and Molecular Biology, University of South Alabama, Mobile, Alabama 36688 USA
| | - Beth Miller
- Department of Biochemistry and Molecular Biology, University of South Alabama, Mobile, Alabama 36688 USA
| | - Sandro Rusconi
- Institut de Biochimie, University of Fribourg, Perolles, CH-1700 Fribourg, Switzerland
| | - Manuel Bueno
- Institut de Biochimie, University of Fribourg, Perolles, CH-1700 Fribourg, Switzerland
| | - Nicholas M Dean
- Department of Pharmacology, ISIS Pharmaceuticals, Carlsbad, California 92008 USA
| | - Richard E Honkanen
- Department of Biochemistry and Molecular Biology, University of South Alabama, Mobile, Alabama 36688 USA
| |
Collapse
|
27
|
Lanz RB, McKenna NJ, Onate SA, Albrecht U, Wong J, Tsai SY, Tsai MJ, O'Malley BW. A steroid receptor coactivator, SRA, functions as an RNA and is present in an SRC-1 complex. Cell 1999; 97:17-27. [PMID: 10199399 DOI: 10.1016/s0092-8674(00)80711-4] [Citation(s) in RCA: 573] [Impact Index Per Article: 22.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Nuclear receptors play critical roles in the regulation of eukaryotic gene expression. We report the isolation and functional characterization of a novel transcriptional coactivator, termed steroid receptor RNA activator (SRA). SRA is selective for steroid hormone receptors and mediates transactivation via their amino-terminal activation function. We provide functional and mechanistic evidence that SRA acts as an RNA transcript; transfected SRA, unlike other steroid receptor coregulators, functions in the presence of cycloheximide, and SRA mutants containing multiple translational stop signals retain their ability to activate steroid receptor-dependent gene expression. Biochemical fractionation shows that SRA exists in distinct ribonucleoprotein complexes, one of which contains the nuclear receptor coactivator steroid receptor coactivator 1. We suggest that SRA may act to confer functional specificity upon multiprotein complexes recruited by liganded receptors during transcriptional activation.
Collapse
Affiliation(s)
- R B Lanz
- Department of Cell Biology, Baylor College of Medicine, Houston, Texas 77030, USA
| | | | | | | | | | | | | | | |
Collapse
|
28
|
Russell MW, Kemp P, Wang L, Brody LC, Izumo S. Molecular cloning of the human HAND2 gene. BIOCHIMICA ET BIOPHYSICA ACTA 1998; 1443:393-9. [PMID: 9878849 DOI: 10.1016/s0167-4781(98)00237-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We have cloned and characterized the coding sequence of the human HAND2 basic helix-loop-helix transcription factor. The amino acid sequence includes an amino-terminal polyalanine repeat which is precisely conserved in the rat HAND2 gene. Northern analysis indicates that the HAND2 transcript is 2.3 kb in length and strongly expressed in the human heart.
Collapse
MESH Headings
- Amino Acid Sequence
- Base Sequence
- Basic Helix-Loop-Helix Transcription Factors
- Chromosome Mapping
- Chromosomes, Human, Pair 4/genetics
- Cloning, Molecular
- DNA/chemistry
- DNA/genetics
- DNA, Complementary/chemistry
- DNA, Complementary/genetics
- DNA-Binding Proteins/genetics
- Gene Expression
- Genes/genetics
- Humans
- Molecular Sequence Data
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Sequence Alignment
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
- Tissue Distribution
- Transcription Factors/genetics
- Zebrafish Proteins
Collapse
Affiliation(s)
- M W Russell
- Department of Pediatrics and Communicable Diseases, University of Michigan, Ann Arbor, MI 48109,
| | | | | | | | | |
Collapse
|
29
|
Huse B, Verca SB, Matthey P, Rusconi S. Definition of a negative modulation domain in the human progesterone receptor. Mol Endocrinol 1998; 12:1334-42. [PMID: 9731702 DOI: 10.1210/mend.12.9.0164] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The progesterone receptor (PR) occurs in two major forms, the full-length PRB and the amino-truncated PRA, which lacks 164 amino-terminal residues. PRB functions as a strong transcriptional activator of progesterone-responsive genes, whereas PRA is inactive in several cell types where it may even act as a trans-dominant repressor of PRB and other steroid receptors, like the glucocorticoid receptor or, reportedly, the estrogen receptor. We initially observed that a PR deleted of its entire amino domain (PR538-C) is incapable of trans-repressing PRB or glucocorticoid receptor, suggesting that a negative modulation domain must be contained in the region between position 165 and 538. After testing progressive deletion mutants and chimeras, we demonstrate that this negative modulating domain is confined within 120 residues in the amino-terminal region and that it contains a subdomain of 40 residues that is crucial for intermolecular transrepression. Duplication, deletion, and transplantation of the negative modulation domain show that the negative modulation domain has only a limited functional autonomy. In our hands, transrepression of estrogen receptor could not be substantiated, and, under our conditions, at least an equimolar concentration of PRA expression plasmid is required for transrepression. Our deletion studies reveal domains that correlate with strong homology patches between the amino-terminal domains of mammalian and avian PR.
Collapse
Affiliation(s)
- B Huse
- Biochemistry Institute, Université de Fribourg, Pérolles, Switzerland
| | | | | | | |
Collapse
|
30
|
Persengiev SP, Kilpatrick DL. Characterization of a cDNA containing trinucleotide repeat sequences that is highly enriched in spermatogenic cells. Mol Reprod Dev 1997; 46:476-81. [PMID: 9094094 DOI: 10.1002/(sici)1098-2795(199704)46:4<476::aid-mrd5>3.0.co;2-l] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Trinucleotide repeat sequences have become of great interest due to their association with specific genetic disorders. Here we report the identification of a cDNA containing opa trinucleotide repeats from mouse testis, termed t-OPA. The opa repeat is contained within the longest open reading frame within the cDNA. Northern analysis demonstrated that four distinct t-OPA transcripts (1.6, 2.5, 3.6, 4.0 kilobases) are preferentially expressed in mouse and rat testis, with low expression in the pituitary, brain, and adrenal gland. Further, t-OPA RNAs were highly abundant in both pachytene spermatocytes and round spermatids and decreased in cytoplasts. Polysome profile analysis indicated that t-OPA mRNAs are translated in mouse testis with efficiencies similar to other transcripts expressed in late meiotic/early post-meiotic spermatogenic cells. These findings thus suggest a role for cell-specific mRNAs containing opa repeats during mouse spermatogenesis.
Collapse
Affiliation(s)
- S P Persengiev
- Neurobiology Group, Worcester Foundation for Biomedical Research, Shrewsbury, MA 01545, USA
| | | |
Collapse
|
31
|
Abstract
Most traits in biological populations appear to be under stabilizing selection, which acts to eliminate quantitative genetic variation. Yet, virtually all measured traits in biological populations continue to show significant quantitative genetic variation. The paradox can be resolved by postulating the existence of an abundant, though unspecified, source of mutations that has quantitative effects on phenotype, but does not reduce fitness. Does such a source actually exist? We propose that it does, in the form of repeat-number variation in SSRs (simple sequence repeats, of which the triplet repeats of human neurodegenerative diseases are a special case). Viewing SSRs as a major source of quantitative mutation has broad implications for understanding molecular processes of evolutionary adaptation, including the evolutionary control of the mutation process itself.
Collapse
Affiliation(s)
- Y Kashi
- Department of Food Engineering and Biotechnology, The Technion, Technion City, Haifa, Israel.
| | | | | |
Collapse
|
32
|
Delaunay F, Liden J, Gustafsson JA, Okret S. Glucocorticoid receptor lacking the tau 1 transactivation domain is a gene-specific regulator of the wild-type glucocorticoid-receptor activity. EUROPEAN JOURNAL OF BIOCHEMISTRY 1996; 242:839-45. [PMID: 9022717 DOI: 10.1111/j.1432-1033.1996.0839r.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The glucocorticoid receptor (GR) contains a major transactivation function (tau 1), located in the N-terminal domain. tau 1 contributes to about 80% of the ligand-inducible transcriptional activity of GR. In this study, we show that GR devoid of tau 1 (symbol: see text] GR) can inhibit activation of gene expression by wild-type GR but this does not occur for all target genes. Activation of the mouse mammary tumor virus promoter by wild-type GR in transiently transfected chinese hamster ovary (CHO) cells lacking endogenous GR was repressed by cotransfecting [symbol: see text] GR. This effect was proportional to the amount of transfected [symbol: see text] GR and was not due to squelching. A moderate expression level of stably transfected [symbol: see text] GR mutant was also shown to repress the transcriptional activity of endogenous GR present in rat skeletal myoblast L8 cells. Glucocorticoid mediated down regulation of endogenous GR gene expression can be blocked by the [symbol: see text] GR mutant in stably transfected L8 cells. In contrast, no inhibition was observed on glucocorticoid induction of the endogenous glutamine synthetase gene in L8 cells. However, glucocorticoid induction of a reporter gene driven by the chicken glutamine synthetase promoter was inhibited by [symbol: see text] GR in L8 cells. Stable expression of wild-type GR in CHO cells rendered the cells glucocorticoid responsive with regard to glutamine synthetase induction but coexpression of [symbol: see text] GR did not repress induction of the endogenous glutamine synthetase gene expression by wild-type GR. Expression of [symbol: see text] GR alone in CHO cells did not render the glutamine synthetase gene glucocorticoid responsive, indicating that [symbol: see text] GR has no transcriptional activity on the glutamine synthetase gene. We conclude from these results that the structure of glucocorticoid-response elements within target genes may be very critical for the ability of the mutant receptor to exhibit a dominant negative effect.
Collapse
Affiliation(s)
- F Delaunay
- Department of Medical Nutrition, Karolinska Institute, Huddinge University Hospital, Novum, Sweden.
| | | | | | | |
Collapse
|
33
|
Ohshima K, Kang S, Larson JE, Wells RD. Cloning, characterization, and properties of seven triplet repeat DNA sequences. J Biol Chem 1996; 271:16773-83. [PMID: 8663377 DOI: 10.1074/jbc.271.28.16773] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Several neuromuscular and neurodegenerative diseases are caused by genetically unstable triplet repeat sequences (CTG.CAG, CGG.CCG, or AAG.CTT) in or near the responsible genes. We implemented novel cloning strategies with chemically synthesized oligonucleotides to clone seven of the triplet repeat sequences (GTA.TAC, GAT.ATC, GTT.AAC, CAC.GTG, AGG.CCT, TCG.CGA, and AAG.CTT), and the adjoining paper (Ohshima, K., Kang, S., Larson, J. E., and Wells, R. D.(1996) J. Biol. Chem. 271, 16784-16791) describes studies on TTA.TAA. This approach in conjunction with in vivo expansion studies in Escherichia coli enabled the preparation of at least 81 plasmids containing the repeat sequences with lengths of approximately 16 up to 158 triplets in both orientations with varying extents of polymorphisms. The inserts were characterized by DNA sequencing as well as DNA polymerase pausings, two-dimensional agarose gel electrophoresis, and chemical probe analyses to evaluate the capacity to adopt negative supercoil induced non-B DNA conformations. AAG.CTT and AGG.CCT form intramolecular triplexes, and the other five repeat sequences do not form any previously characterized non-B structures. However, long tracts of TCG.CGA showed strong inhibition of DNA synthesis at specific loci in the repeats as seen in the cases of CTG.CAG and CGG.CCG (Kang, S., Ohshima, K., Shimizu, M., Amirhaeri, S., and Wells, R. D.(1995) J. Biol. Chem. 270, 27014-27021). This work along with other studies (Wells, R. D.(1996) J. Biol. Chem. 271, 2875-2878) on CTG.CAG, CGG.CCG, and TTA.TAA makes available long inserts of all 10 triplet repeat sequences for a variety of physical, molecular biological, genetic, and medical investigations. A model to explain the reduction in mRNA abundance in Friedreich's ataxia based on intermolecular triplex formation is proposed.
Collapse
Affiliation(s)
- K Ohshima
- Department of Biochemistry and Biophysics, Texas A&M University, Texas Medical Center, Houston, Texas 77030-3303, USA
| | | | | | | |
Collapse
|
34
|
Gortner G, Pfenninger M, Kahl G, Weising K. Northern blot analysis of simple repetitive sequence transcription in plants. Electrophoresis 1996; 17:1183-9. [PMID: 8855401 DOI: 10.1002/elps.1150170702] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The presence of simple repetitive sequence motifs in RNA from various plant species was probed by Northern blot analysis. Hybridization of total, poly(A)(+)- and poly(A)(-)-RNA with microsatellite-complementary oligonucleotide probes revealed distinct bands with most but not all probe/species combinations, demonstrating the presence of di-, tri- and tetranucleotide repeat motifs in plant transcripts. Only trinucleotide repeat-derived hybridization signals were found to be enriched in the poly(A)(+)-fraction. The quality of Northern blot signals proved to be highly dependent on hybridization stringency. Thus, under the stringency conditions usually applied for oligonucleotide hybridization, some probes [(GT)8, (CAC)5, (TCC)5, and (CCTA)4] cross-hybridized to bands corresponding in size to 18S and/or 26S rRNA. Cross-hybridization to rRNA was significantly reduced at higher stringencies. These results stress the importance of carefully adjusting the hybridization conditions in Northern blot analysis of simple sequence transcripts.
Collapse
Affiliation(s)
- G Gortner
- Department of Biology, University of Frankfurt am Main, Germany
| | | | | | | |
Collapse
|
35
|
|