1
|
Gupta K, Czerminski JT, Lawrence JB. Trisomy silencing by XIST: translational prospects and challenges. Hum Genet 2024; 143:843-855. [PMID: 38459355 PMCID: PMC11294271 DOI: 10.1007/s00439-024-02651-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 01/25/2024] [Indexed: 03/10/2024]
Abstract
XIST RNA is heavily studied for its role in fundamental epigenetics and X-chromosome inactivation; however, the translational potential of this singular RNA has been much less explored. This article combines elements of a review on XIST biology with our perspective on the translational prospects and challenges of XIST transgenics. We first briefly review aspects of XIST RNA basic biology that are key to its translational relevance, and then discuss recent efforts to develop translational utility of XIST for chromosome dosage disorders, particularly Down syndrome (DS). Remarkably, it was shown in vitro that expression of an XIST transgene inserted into one chromosome 21 can comprehensively silence that chromosome and "dosage compensate" Trisomy 21, the cause of DS. Here we summarize recent findings and discuss potential paths whereby ability to induce "trisomy silencing" can advance translational research for new therapeutic strategies. Despite its common nature, the underlying biology for various aspects of DS, including cell types and pathways impacted (and when), is poorly understood. Recent studies show that an inducible iPSC system to dosage-correct chromosome 21 can provide a powerful approach to unravel the cells and pathways directly impacted, and the developmental timing, information key to design pharmacotherapeutics. In addition, we discuss prospects of a more far-reaching and challenging possibility that XIST itself could be developed into a therapeutic agent, for targeted cellular "chromosome therapy". A few rare case studies of imbalanced X;autosome translocations indicate that natural XIST can rescue an otherwise lethal trisomy. The potential efficacy of XIST transgenes later in development faces substantial biological and technical challenges, although recent findings are encouraging, and technology is rapidly evolving. Hence, it is compelling to consider the transformative possibility that XIST-mediated chromosome therapy may ultimately be developed, for specific pathologies seen in DS, or other duplication disorders.
Collapse
Affiliation(s)
- Khusali Gupta
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01655, USA
| | - Jan T Czerminski
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01655, USA
- Medical Scientist Training Program, University of Massachusetts Chan Medical School, Worcester, MA, 01655, USA
| | - Jeanne B Lawrence
- Department of Neurology, University of Massachusetts Chan Medical School, Worcester, MA, 01655, USA.
- Department of Pediatrics, University of Massachusetts Chan Medical School, Worcester, MA, 01655, USA.
| |
Collapse
|
2
|
Haerter CAG, Blanco DR, Traldi JB, Feldberg E, Margarido VP, Lui RL. Are scattered microsatellites weak chromosomal markers? Guided mapping reveals new insights into Trachelyopterus (Siluriformes: Auchenipteridae) diversity. PLoS One 2023; 18:e0285388. [PMID: 37310952 DOI: 10.1371/journal.pone.0285388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 04/22/2023] [Indexed: 06/15/2023] Open
Abstract
The scattered distribution pattern of microsatellites is a challenging problem in fish cytogenetics. This type of array hinders the identification of useful patterns and the comparison between species, often resulting in over-limited interpretations that only label it as "scattered" or "widely distributed". However, several studies have shown that the distribution pattern of microsatellites is non-random. Thus, here we tested whether a scattered microsatellite could have distinct distribution patterns on homeologous chromosomes of closely related species. The clustered sites of 18S and 5S rDNA, U2 snRNA and H3/H4 histone genes were used as a guide to compare the (GATA)n microsatellite distribution pattern on the homeologous chromosomes of six Trachelyopterus species: T. coriaceus and Trachelyopterus aff. galeatus from the Araguaia River basin; T. striatulus, T. galeatus and T. porosus from the Amazonas River basin; and Trachelyopterus aff. coriaceus from the Paraguay River basin. Most species had similar patterns of the (GATA)n microsatellite in the histone genes and 5S rDNA carriers. However, we have found a chromosomal polymorphism of the (GATA)n sequence in the 18S rDNA carriers of Trachelyopterus galeatus, which is in Hard-Weinberg equilibrium and possibly originated through amplification events; and a chromosome polymorphism in Trachelyopterus aff. galeatus, which combined with an inversion polymorphism of the U2 snRNA in the same chromosome pair resulted in six possible cytotypes, which are in Hardy-Weinberg disequilibrium. Therefore, comparing the distribution pattern on homeologous chromosomes across the species, using gene clusters as a guide to identify it, seems to be an effective way to further the analysis of scattered microsatellites in fish cytogenetics.
Collapse
Affiliation(s)
| | | | - Josiane Baccarin Traldi
- Departamento de Genética, Instituto de Ciências Biológicas, Universidade Federal do Amazonas, Manaus, Brasil
| | | | - Vladimir Pavan Margarido
- Universidade Estadual do Oeste do Paraná, Centro de Ciências Biológicas e da Saúde, Cascavel, Paraná, Brasil
| | - Roberto Laridondo Lui
- Universidade Estadual do Oeste do Paraná, Centro de Ciências Biológicas e da Saúde, Cascavel, Paraná, Brasil
| |
Collapse
|
3
|
Gharesouran J, Hosseinzadeh H, Ghafouri-Fard S, Taheri M, Rezazadeh M. STRs: Ancient Architectures of the Genome beyond the Sequence. J Mol Neurosci 2021; 71:2441-2455. [PMID: 34056692 DOI: 10.1007/s12031-021-01850-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 04/22/2021] [Indexed: 01/24/2023]
Abstract
Short tandem repeats (STRs) are commonly defined as short runs of repetitive nucleotides, consisting of tandemly repeating 2-6- bp motif units, which are ubiquitously distributed throughout genomes. Functional STRs are polymorphic in the population, and their variations influence gene expression, which subsequently may result in pathogenic phenotypes. To understand STR phenotypic effects and their functional roles, we describe four different mutational mechanisms including the unequal crossing-over model, gene conversion, retrotransposition mechanism and replication slippage. Due to the multi-allelic nature, small length, abundance, high variability, codominant inheritance, nearly neutral evolution, extensive genome coverage and simple assaying of STRs, these markers are widely used in various types of biological research, including population genetics studies, genome mapping, molecular epidemiology, paternity analysis and gene flow studies. In this review, we focus on the current knowledge regarding STR genomic distribution, function, mutation and applications.
Collapse
Affiliation(s)
- Jalal Gharesouran
- Molecular Genetics Division, GMG center, Tabriz, Iran.,Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Hassan Hosseinzadeh
- Molecular Genetics Division, GMG center, Tabriz, Iran.,Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran.,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Soudeh Ghafouri-Fard
- Department of Medical Genetics, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohammad Taheri
- Skull Base Research Center, Loghman Hakim Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Maryam Rezazadeh
- Division of Medical Genetics, Tabriz Childrens Hospital, Tabriz University of Medical Sciences, Tabriz, Iran. .,Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
4
|
Creamer KM, Lawrence JB. XIST RNA: a window into the broader role of RNA in nuclear chromosome architecture. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160360. [PMID: 28947659 PMCID: PMC5627162 DOI: 10.1098/rstb.2016.0360] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/15/2017] [Indexed: 12/31/2022] Open
Abstract
XIST RNA triggers the transformation of an active X chromosome into a condensed, inactive Barr body and therefore provides a unique window into transitions of higher-order chromosome architecture. Despite recent progress, how XIST RNA localizes and interacts with the X chromosome remains poorly understood. Genetic engineering of XIST into a trisomic autosome demonstrates remarkable capacity of XIST RNA to localize and comprehensively silence that autosome. Thus, XIST does not require X chromosome-specific sequences but operates on mechanisms available genome-wide. Prior results suggested XIST localization is controlled by attachment to the insoluble nuclear scaffold. Our recent work affirms that scaffold attachment factor A (SAF-A) is involved in anchoring XIST, but argues against the view that SAF-A provides a unimolecular bridge between RNA and the chromosome. Rather, we suggest that a complex meshwork of architectural proteins interact with XIST RNA. Parallel work studying the territory of actively transcribed chromosomes suggests that repeat-rich RNA 'coats' euchromatin and may impact chromosome architecture in a manner opposite of XIST A model is discussed whereby RNA may not just recruit histone modifications, but more directly impact higher-order chromatin condensation via interaction with architectural proteins of the nucleus.This article is part of the themed issue 'X-chromosome inactivation: a tribute to Mary Lyon'.
Collapse
Affiliation(s)
- K M Creamer
- Department of Neurology and Pediatrics, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - J B Lawrence
- Department of Neurology and Pediatrics, University of Massachusetts Medical School, Worcester, MA 01655, USA
| |
Collapse
|
5
|
Bagshaw AT. Functional Mechanisms of Microsatellite DNA in Eukaryotic Genomes. Genome Biol Evol 2017; 9:2428-2443. [PMID: 28957459 PMCID: PMC5622345 DOI: 10.1093/gbe/evx164] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2017] [Indexed: 02/06/2023] Open
Abstract
Microsatellite repeat DNA is best known for its length mutability, which is implicated in several neurological diseases and cancers, and often exploited as a genetic marker. Less well-known is the body of work exploring the widespread and surprisingly diverse functional roles of microsatellites. Recently, emerging evidence includes the finding that normal microsatellite polymorphism contributes substantially to the heritability of human gene expression on a genome-wide scale, calling attention to the task of elucidating the mechanisms involved. At present, these are underexplored, but several themes have emerged. I review evidence demonstrating roles for microsatellites in modulation of transcription factor binding, spacing between promoter elements, enhancers, cytosine methylation, alternative splicing, mRNA stability, selection of transcription start and termination sites, unusual structural conformations, nucleosome positioning and modification, higher order chromatin structure, noncoding RNA, and meiotic recombination hot spots.
Collapse
|
6
|
Casa V, Runfola V, Micheloni S, Aziz A, Dilworth FJ, Gabellini D. Polycomb repressive complex 1 provides a molecular explanation for repeat copy number dependency in FSHD muscular dystrophy. Hum Mol Genet 2017; 26:753-767. [PMID: 28040729 PMCID: PMC5409123 DOI: 10.1093/hmg/ddw426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2016] [Accepted: 12/15/2016] [Indexed: 11/13/2022] Open
Abstract
Repression of repetitive elements is crucial to preserve genome integrity and has been traditionally ascribed to constitutive heterochromatin pathways. FacioScapuloHumeral Muscular Dystrophy (FSHD), one of the most common myopathies, is characterized by a complex interplay of genetic and epigenetic events. The main FSHD form is linked to a reduced copy number of the D4Z4 macrosatellite repeat on 4q35, causing loss of silencing and aberrant expression of the D4Z4-embedded DUX4 gene leading to disease. By an unknown mechanism, D4Z4 copy-number correlates with FSHD phenotype. Here we show that the DUX4 proximal promoter (DUX4p) is sufficient to nucleate the enrichment of both constitutive and facultative heterochromatin components and to mediate a copy-number dependent gene silencing. We found that both the CpG/GC dense DNA content and the repetitive nature of DUX4p arrays are important for their repressive ability. We showed that DUX4p mediates a copy number-dependent Polycomb Repressive Complex 1 (PRC1) recruitment, which is responsible for the copy-number dependent gene repression. Overall, we directly link genetic and epigenetic defects in FSHD by proposing a novel molecular explanation for the copy number-dependency in FSHD pathogenesis, and offer insight into the molecular functions of repeats in chromatin regulation.
Collapse
Affiliation(s)
- Valentina Casa
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy.,Università Vita-Salute San Raffaele, Milan 20132, Italy
| | - Valeria Runfola
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
| | - Stefano Micheloni
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
| | - Arif Aziz
- The Sprott Center for Stem Cell Research, Regenerative Medicine Program, Ottawa Hospital Research Institute, Ottawa, ON K1Y 4E9, Canada
| | - F Jeffrey Dilworth
- The Sprott Center for Stem Cell Research, Regenerative Medicine Program, Ottawa Hospital Research Institute, Ottawa, ON K1Y 4E9, Canada
| | - Davide Gabellini
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy.,Dulbecco Telethon Institute, Milan 20132, Italy
| |
Collapse
|
7
|
Hall LL, Lawrence JB. RNA as a fundamental component of interphase chromosomes: could repeats prove key? Curr Opin Genet Dev 2016; 37:137-147. [PMID: 27218204 DOI: 10.1016/j.gde.2016.04.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2015] [Revised: 04/01/2016] [Accepted: 04/06/2016] [Indexed: 11/29/2022]
Abstract
Beginning with the precedent of XIST RNA as a 'chromosomal RNA' (cRNA), there is growing interest in the possibility that a diversity of non-coding RNAs may function in chromatin. We review findings which lead us to suggest that RNA is essentially a widespread component of interphase chromosomes. Further, RNA likely contributes to architecture and regulation, with repeat-rich 'junk' RNA in euchromatin (ecRNA) promoting a more open chromatin state. Thousands of low-abundance nuclear RNAs have been reported, however it remains a challenge to determine which of these may function in chromatin. Recent findings indicate that repetitive sequences are enriched in chromosome-associated non-coding RNAs, and repeat-rich RNA shows unusual properties, including localization and stability, with similarities to XIST RNA. We suggest two frontiers in genome biology are emerging and may intersect: the broad contribution of RNA to interphase chromosomes and the distinctive properties of repeat-rich intronic or intergenic junk sequences that may play a role in chromosome structure and regulation.
Collapse
Affiliation(s)
- Lisa L Hall
- Department of Cell & Developmental Biology, University of Massachusetts Medical School, 55 Lake Avenue North, Worcester, MA 01655, USA
| | - Jeanne B Lawrence
- Department of Cell & Developmental Biology, University of Massachusetts Medical School, 55 Lake Avenue North, Worcester, MA 01655, USA.
| |
Collapse
|
8
|
Casanova M, Liyakat Ali TM, Rougeulle C. Enlightening the contribution of the dark matter to the X chromosome inactivation process in mammals. Semin Cell Dev Biol 2016; 56:48-57. [PMID: 27174438 DOI: 10.1016/j.semcdb.2016.05.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Revised: 05/04/2016] [Accepted: 05/04/2016] [Indexed: 02/07/2023]
Abstract
X-chromosome inactivation (XCI) in mammals represents an exceptional example of transcriptional co-regulation occurring at the level of an entire chromosome. XCI is considered as a means to compensate for gene dosage imbalance between sexes, yet the largest part of the chromosome is composed of repeated elements of different nature and origins. Here we consider XCI from a repeat point of view, interrogating the mechanisms for inactivating X chromosome-derived repeated sequences and discussing the contribution of repetitive elements to the silencing process itself and to its evolution.
Collapse
Affiliation(s)
- Miguel Casanova
- Sorbonne Paris Cité, Epigenetics and Cell Fate, UMR 7216 CNRS, Université Paris Diderot, Paris, France
| | | | - Claire Rougeulle
- Sorbonne Paris Cité, Epigenetics and Cell Fate, UMR 7216 CNRS, Université Paris Diderot, Paris, France.
| |
Collapse
|
9
|
Escape Artists of the X Chromosome. Trends Genet 2016; 32:348-359. [PMID: 27103486 DOI: 10.1016/j.tig.2016.03.007] [Citation(s) in RCA: 104] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Revised: 03/23/2016] [Accepted: 03/24/2016] [Indexed: 01/24/2023]
Abstract
Inactivation of one X chromosome in mammalian females achieves dosage compensation between XX females and XY males; however, over 15% of human X-linked genes continue to be expressed from the inactive X chromosome. New genomic methodologies have improved our identification and characterization of these escape genes, revealing the importance of DNA sequence, chromatin structure, and chromosome ultrastructure in regulating expression from an otherwise inactive chromosome. Study of these exceptions to the rule of silencing highlights the interconnectedness of chromatin and chromosome structure in X-chromosome inactivation (XCI). Recent advances also demonstrate the importance of these genes in sexually dimorphic disease risk, particularly cancer.
Collapse
|
10
|
Pinter SF. A Tale of Two Cities: How Xist and its partners localize to and silence the bicompartmental X. Semin Cell Dev Biol 2016; 56:19-34. [PMID: 27072488 DOI: 10.1016/j.semcdb.2016.03.023] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Revised: 03/30/2016] [Accepted: 03/30/2016] [Indexed: 10/22/2022]
Abstract
Sex chromosomal dosage compensation in mammals takes the form of X chromosome inactivation (XCI), driven by the non-coding RNA Xist. In contrast to dosage compensation systems of flies and worms, mammalian XCI has to restrict its function to the Xist-producing X chromosome, while leaving autosomes and active X untouched. The mechanisms behind the long-range yet cis-specific localization and silencing activities of Xist have long been enigmatic, but genomics, proteomics, super-resolution microscopy, and innovative genetic approaches have produced significant new insights in recent years. In this review, I summarize and integrate these findings with a particular focus on the redundant yet mutually reinforcing pathways that enable long-term transcriptional repression throughout the soma. This includes an exploration of concurrent epigenetic changes acting in parallel within two distinct compartments of the inactive X. I also examine how Polycomb repressive complexes 1 and 2 and macroH2A may bridge XCI establishment and maintenance. XCI is a remarkable phenomenon that operates across multiple scales, combining changes in nuclear architecture, chromosome topology, chromatin compaction, and nucleosome/nucleotide-level epigenetic cues. Learning how these pathways act in concert likely holds the answer to the riddle posed by Cattanach's and other autosomal translocations: What makes the X especially receptive to XCI?
Collapse
Affiliation(s)
- Stefan F Pinter
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, 263 Farmington Ave, Farmington, CT 06030-6403, USA.
| |
Collapse
|
11
|
Lawrence J, Telfer C. Interview: from Down's syndrome to basic epigenetics and back again. Epigenomics 2014; 5:611-4. [PMID: 24283875 DOI: 10.2217/epi.13.71] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Dr Jeanne Lawrence talks to Caroline Telfer, Commissioning Editor. Dr Jeanne Lawrence is an internationally recognized leader in the study of chromosome regulation by noncoding RNA and nuclear and genome organization. Her research bridges fundamental questions about genome regulation with clinical implications of recent advances in epigenetics. Her interest in chromosome structure and regulation has been a theme throughout her career and she has been honored for her work developing sensitive FISH technology for the detection of single copy genes, as well as RNAs. Her laboratory's publications include the initial demonstration of cell type-specific gene organization with nuclear subdomains; the novel biology of a noncoding RNA, XIST, which coats a whole X-chromosome to induce its silencing; and a new architectural role for a large noncoding RNA to scaffold a nuclear body. Her laboratory's work on epigenetic chromosome regulation in stem cells led to recent studies regarding unanticipated roles of repeat sequences in normal chromosome regulation and deregulation in cancer. Most recently, her laboratory has demonstrated a new approach to translate the basic mechanism of X-chromosome inactivation to correct a chromosomal dosage imbalance in patient-derived cells with trisomy 21 (Down's syndrome). Dr Lawrence has received awards from numerous agencies, including a Research Career Development Award from the National Center for Human Genome Research, career awards from the American Society of Cell Biology, the German Society for Biochemistry, the Muscular Dystrophy Association and a John Merck Fund Translational Research Award. She has served on the NIH National Advisory Council for Human Genome Research, numerous study sections and is currently a monitoring editor for the Journal of Cell Biology. Dr Lawrence has a BA in Biology and Music from Stephens College (MO, USA), a MS in Human Genetics and Genetic Counseling from Rutgers University (NJ, USA) and a PhD in Developmental Biology from Brown University (RI, USA). She is currently a Professor and Interim Chair of the Department of Cell and Developmental Biology at the University of Massachusetts Medical School (MA, USA).
Collapse
Affiliation(s)
- Jeanne Lawrence
- Department of Cell & Developmental Biology, University of Massachusetts Medical School, 55 Lake Avenue North, Worcester, MA 01655, USA.
| | | |
Collapse
|
12
|
Bala Tannan N, Brahmachary M, Garg P, Borel C, Alnefaie R, Watson CT, Thomas NS, Sharp AJ. DNA methylation profiling in X;autosome translocations supports a role for L1 repeats in the spread of X chromosome inactivation. Hum Mol Genet 2013; 23:1224-36. [PMID: 24186870 DOI: 10.1093/hmg/ddt553] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
X chromosome inactivation (XCI) is an epigenetic mechanism that silences the majority of genes on one X chromosome in females. Previous studies have suggested that the spread of XCI might be facilitated in part by common repeats such as long interspersed nuclear elements (LINEs). However, owing to the unusual sequence content of the X and the nonrandom distribution of genes that escape XCI, it has been unclear whether the correlation between repeat elements and XCI is a functional one. To test the hypothesis that the spread of XCI shows sequence specificity, we have analyzed the pattern of XCI in autosomal chromatin by performing DNA methylation profiling in six unbalanced X;autosome translocations. Using promoter hypermethylation as an epigenetic signature of XCI, we have determined the inactivation status of 1050 autosomal genes after translocation onto an inactive derivative X. By performing a comparative sequence analysis of autosomal genes that are either subject to or escape the X inactivation signal, we identified a number of common repetitive elements, including L1 and L2 LINEs, and DNA motifs that are significantly enriched around inactive autosomal genes. We show that these same motifs predominantly map to L1P repeat elements, are significantly enriched on the X chromosome versus the autosomes and also occur at higher densities around X-linked genes that are subject to X inactivation compared with those that escape X inactivation. These results are consistent with a potential causal relationship between DNA sequence features such as L1s and the spread of XCI, lending strong support to Mary Lyon's 'repeat hypothesis'.
Collapse
Affiliation(s)
- Neeta Bala Tannan
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, USA
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Cotton AM, Chen CY, Lam LL, Wasserman WW, Kobor MS, Brown CJ. Spread of X-chromosome inactivation into autosomal sequences: role for DNA elements, chromatin features and chromosomal domains. Hum Mol Genet 2013; 23:1211-23. [PMID: 24158853 PMCID: PMC4051349 DOI: 10.1093/hmg/ddt513] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
X-chromosome inactivation results in dosage equivalence between the X chromosome in males and females; however, over 15% of human X-linked genes escape silencing and these genes are enriched on the evolutionarily younger short arm of the X chromosome. The spread of inactivation onto translocated autosomal material allows the study of inactivation without the confounding evolutionary history of the X chromosome. The heterogeneity and reduced extent of silencing on autosomes are evidence for the importance of DNA elements underlying the spread of silencing. We have assessed DNA methylation in six unbalanced X-autosome translocations using the Illumina Infinium HumanMethylation450 array. Two to 42% of translocated autosomal genes showed this mark of silencing, with the highest degree of inactivation observed for trisomic autosomal regions. Generally, the extent of silencing was greatest close to the translocation breakpoint; however, silencing was detected well over 100 kb into the autosomal DNA. Alu elements were found to be enriched at autosomal genes that escaped from inactivation while L1s were enriched at subject genes. In cells without the translocation, there was enrichment of heterochromatic features such as EZH2 and H3K27me3 for those genes that become silenced when translocated, suggesting that underlying chromatin structure predisposes genes towards silencing. Additionally, the analysis of topological domains indicated physical clustering of autosomal genes of common inactivation status. Overall, our analysis indicated a complex interaction between DNA sequence, chromatin features and the three-dimensional structure of the chromosome.
Collapse
|
14
|
Hamarsheh O, Amro A. Characterization of simple sequence repeats (SSRs) from Phlebotomus papatasi (Diptera: Psychodidae) expressed sequence tags (ESTs). Parasit Vectors 2011; 4:189. [PMID: 21958493 PMCID: PMC3191335 DOI: 10.1186/1756-3305-4-189] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 09/29/2011] [Indexed: 10/31/2022] Open
Abstract
BACKGROUND Phlebotomus papatasi is a natural vector of Leishmania major, which causes cutaneous leishmaniasis in many countries. Simple sequence repeats (SSRs), or microsatellites, are common in eukaryotic genomes and are short, repeated nucleotide sequence elements arrayed in tandem and flanked by non-repetitive regions. The enrichment methods used previously for finding new microsatellite loci in sand flies remain laborious and time consuming; in silico mining, which includes retrieval and screening of microsatellites from large amounts of sequence data from sequence data bases using microsatellite search tools can yield many new candidate markers. RESULTS Simple sequence repeats (SSRs) were characterized in P. papatasi expressed sequence tags (ESTs) derived from a public database, National Center for Biotechnology Information (NCBI). A total of 42,784 sequences were mined, and 1,499 SSRs were identified with a frequency of 3.5% and an average density of 15.55 kb per SSR. Dinucleotide motifs were the most common SSRs, accounting for 67% followed by tri-, tetra-, and penta-nucleotide repeats, accounting for 31.1%, 1.5%, and 0.1%, respectively. The length of microsatellites varied from 5 to 16 repeats. Dinucleotide types; AG and CT have the highest frequency. Dinucleotide SSR-ESTs are relatively biased toward an excess of (AX)n repeats and a low GC base content. Forty primer pairs were designed based on motif lengths for further experimental validation. CONCLUSION The first large-scale survey of SSRs derived from P. papatasi is presented; dinucleotide SSRs identified are more frequent than other types. EST data mining is an effective strategy to identify functional microsatellites in P. papatasi.
Collapse
Affiliation(s)
- Omar Hamarsheh
- Department of Biological Sciences, Faculty of Science and Technology, Al-Quds University, PO Box 51000, Jerusalem, Palestine.
| | | |
Collapse
|
15
|
Hall LL, Lawrence JB. XIST RNA and architecture of the inactive X chromosome: implications for the repeat genome. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2011; 75:345-56. [PMID: 21447818 DOI: 10.1101/sqb.2010.75.030] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
XIST RNA paints and induces silencing of one X chromosome in mammalian female cells, providing a powerful model to investigate long-range chromosomal regulation. This chapter focuses on events downstream from the spread of XIST RNA across the interphase chromosome, to consider how this large noncoding RNA interacts with and silences a whole chromosome. Several lines of evidence are summarized that point to the involvement of repeat sequences in different aspects of the X-inactivation process. Although the "repeat genome" comprises close to half of the human genome, the potential for abundant repeats to contribute to genome regulation has been largely overlooked and may be underestimated. X inactivation has the potential to reveal roles of interspersed and other repeats in the genome. For example, evidence indicates that XIST RNA acts at the architectural level of the whole chromosome to induce formation of a silent core enriched for nongenic and repetitive (Cot-1) DNA, which corresponds to the DAPI-dense Barr body. Expression of repeat RNAs may contribute to chromosome remodeling, and evidence suggests that other types of repeat elements may be involved in escape from X inactivation. Despite great progress in decoding the rest of the genome, we suggest that the repeat genome may contain meaningful but complex language that remains to be better studied and understood.
Collapse
Affiliation(s)
- L L Hall
- Department of Cell Biology, University of Massachusetts Medical School, North Worcester, Massachusetts 01655, USA
| | | |
Collapse
|
16
|
Nguyen DK, Yang F, Kaul R, Alkan C, Antonellis A, Friery KF, Zhu B, de Jong PJ, Disteche CM. Clcn4-2 genomic structure differs between the X locus in Mus spretus and the autosomal locus in Mus musculus: AT motif enrichment on the X. Genome Res 2011; 21:402-9. [PMID: 21282478 DOI: 10.1101/gr.108563.110] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
In Mus spretus, the chloride channel 4 gene Clcn4-2 is X-linked and dosage compensated by X up-regulation and X inactivation, while in the closely related mouse species Mus musculus, Clcn4-2 has been translocated to chromosome 7. We sequenced Clcn4-2 in M. spretus and identified the breakpoints of the evolutionary translocation in the Mus lineage. Genetic and epigenetic differences were observed between the 5'ends of the autosomal and X-linked loci. Remarkably, Clcn4-2 introns have been truncated on chromosome 7 in M. musculus as compared with the X-linked loci from seven other eutherian mammals. Intron sequences specifically preserved in the X-linked loci were significantly enriched in AT-rich oligomers. Genome-wide analyses showed an overall enrichment in AT motifs unique to the eutherian X (except for genes that escape X inactivation), suggesting a role for these motifs in regulation of the X chromosome.
Collapse
Affiliation(s)
- Di Kim Nguyen
- Department of Pathology, University of Washington, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Kumar RP, Senthilkumar R, Singh V, Mishra RK. Repeat performance: how do genome packaging and regulation depend on simple sequence repeats? Bioessays 2010; 32:165-74. [PMID: 20091758 DOI: 10.1002/bies.200900111] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Non-coding DNA has consistently increased during evolution of higher eukaryotes. Since the number of genes has remained relatively static during the evolution of complex organisms, it is believed that increased degree of sophisticated regulation of genes has contributed to the increased complexity. A higher proportion of non-coding DNA, including repeats, is likely to provide more complex regulatory potential. Here, we propose that repeats play a regulatory role by contributing to the packaging of the genome during cellular differentiation. Repeats, and in particular the simple sequence repeats, are proposed to serve as landmarks that can target regulatory mechanisms to a large number of genomic sites with the help of very few factors and regulate the linked loci in a coordinated manner. Repeats may, therefore, function as common target sites for regulatory mechanisms involved in the packaging and dynamic compartmentalization of the chromatin into active and inactive regions during cellular differentiation.
Collapse
Affiliation(s)
- Ram Parikshan Kumar
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500 007, India
| | | | | | | |
Collapse
|
18
|
Goto Y, Kimura H. Inactive X chromosome-specific histone H3 modifications and CpG hypomethylation flank a chromatin boundary between an X-inactivated and an escape gene. Nucleic Acids Res 2010; 37:7416-28. [PMID: 19843608 PMCID: PMC2794193 DOI: 10.1093/nar/gkp860] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
In mammals, the dosage compensation of sex chromosomes between males and females is achieved by transcriptional inactivation of one of the two X chromosomes in females. However, a number of genes escape X-inactivation in humans. It remains poorly understood how the transcriptional activity of these ‘escape genes’ is maintained despite the chromosome-wide heterochromatin formation. To address this question, we analyzed a putative chromatin boundary between the inactivated RBM10 and an escape gene, UBA1/UBE1. Chromatin immunoprecipitation revealed that trimethylated histone H3 lysine 9 and H4 lysine 20 were enriched in the last exon through the proximal downstream region of RBM10, but were remarkably diminished at ∼2 kb upstream of the UBA1 transcription start site. Whereas RNA polymerase II was not loaded onto the intergenic region, CTCF (CCCTC binding factor) was enriched around the boundary, where some CpG sites were hypomethylated specifically on inactive X. These findings suggest that local DNA hypomethylation and CTCF binding are involved in the formation of a chromatin boundary, which protects the UBA1 escape gene against the chromosome-wide transcriptional silencing.
Collapse
Affiliation(s)
- Yuji Goto
- Nuclear Function and Dynamics Unit, Horizontal Medical Research Organization, Graduate School of Medicine, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan
| | | |
Collapse
|
19
|
Dementyeva EV, Shevchenko AI, Zakian SM. X-chromosome upregulation and inactivation: two sides of the dosage compensation mechanism in mammals. Bioessays 2009; 31:21-8. [PMID: 19153998 DOI: 10.1002/bies.080149] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Mammals have a very complex, tightly controlled, and developmentally regulated process of dosage compensation. One form of the process equalizes expression of the X-linked genes, present as a single copy in males (XY) and as two copies in females (XX), by inactivation of one of the two X-chromosomes in females. The second form of the process leads to balanced expression between the X-linked and autosomal genes by transcriptional upregulation of the active X in males and females. However, not all X-linked genes are absolutely balanced. This review is focused on the recent advances in studying the dosage compensation phenomenon in mammals.
Collapse
Affiliation(s)
- Elena V Dementyeva
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Department, Novosibirsk, Russia
| | | | | |
Collapse
|
20
|
Prothero KE, Stahl JM, Carrel L. Dosage compensation and gene expression on the mammalian X chromosome: one plus one does not always equal two. Chromosome Res 2009; 17:637-48. [PMID: 19802704 PMCID: PMC4941101 DOI: 10.1007/s10577-009-9063-9] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Counting chromosomes is not just simple math. Although normal males and females differ in sex chromosome content (XY vs. XX), X chromosome imbalance is tolerated because dosage compensation mechanisms have evolved to ensure functional equivalence. In mammals this is accomplished by two processes--X chromosome inactivation that silences most genes on one X chromosome in females, leading to functional X monosomy for most genes in both sexes, and X chromosome upregulation that results in increased gene expression on the single active X in males and females, equalizing dosage relative to autosomes. This review focuses on genes on the X chromosome, and how gene content, organization and expression levels can be influenced by these two processes. Special attention is given to genes that are not X inactivated, and are not necessarily fully dosage compensated. These genes that "escape" X inactivation are of medical importance as they explain phenotypes in individuals with sex chromosome aneuploidies and may impact normal traits and disorders that differ between men and women. Moreover, escape genes give insight into how X chromosome inactivation is spread and maintained on the X.
Collapse
Affiliation(s)
- Katie E. Prothero
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA 17033, USA
| | - Jill M. Stahl
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA 17033, USA
| | - Laura Carrel
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA 17033, USA
| |
Collapse
|
21
|
Sidhu SK, Minks J, Chang SC, Cotton AM, Brown CJ. X chromosome inactivation: heterogeneity of heterochromatin. Biochem Cell Biol 2008; 86:370-9. [PMID: 18923538 DOI: 10.1139/o08-100] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The silent X chromosome in mammalian females is a classic example of facultative heterochromatin, the term highlighting the compacted and inactive nature of the chromosome. However, it is now clear that the heterochromatin of the inactive X is not homogeneous--as indeed, not all genes on the inactive X are silenced. We summarize known features and events of X inactivation in different mouse and human model systems, and highlight the heterogeneity of chromatin along the inactive X. Characterizing this heterogeneity is likely to provide insight into the cis-acting sequences involved in X chromosome inactivation.
Collapse
Affiliation(s)
- Sharan K Sidhu
- Molecular Epigenetics Group, Department of Medical Genetics, University of British Columbia, 2350 Health Sciences Mall, Vancouver, BC V6T 1Z3, Canada
| | | | | | | | | |
Collapse
|
22
|
Van Laere AS, Coppieters W, Georges M. Characterization of the bovine pseudoautosomal boundary: Documenting the evolutionary history of mammalian sex chromosomes. Genome Res 2008; 18:1884-95. [PMID: 18981267 DOI: 10.1101/gr.082487.108] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Here, we report the sequence characterization of the bovine pseudoautosomal boundary (PAB) and its neighborhood. We demonstrate that it maps to the 5' end of the GPR143 gene, which has concomitantly lost upstream noncoding exons on the Y chromosome. We show that the bovine PAB was created approximately 20.7 million years ago by illegitimate intrachromatid recombination between inverted, ruminant-specific Bov-tA repeats. Accordingly, we demonstrate that cattle share their PAB with all other examined ruminants including sheep, but not with cetaceans or more distantly related mammals. We provide evidence that, since its creation, the ancestral ruminant PAB has been displaced by attrition, which occurs at variable rates in different species, and that it is capable of retreat by attrition erasure. We have estimated the ratio of male to female mutation rates in the Bovidae family as approximately 1.7, and we provide evidence that the mutation rate is higher in the recombining pseudoautosomal region than in the adjacent, nonrecombining gonosome-specific sequences.
Collapse
Affiliation(s)
- Anne-Sophie Van Laere
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Belgium
| | | | | |
Collapse
|
23
|
Straub T, Becker PB. DNA sequence and the organization of chromosomal domains. Curr Opin Genet Dev 2008; 18:175-80. [DOI: 10.1016/j.gde.2008.01.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2008] [Revised: 01/10/2008] [Accepted: 01/11/2008] [Indexed: 10/22/2022]
|
24
|
Arnau V, Gallach M, Marín I. Fast comparison of DNA sequences by oligonucleotide profiling. BMC Res Notes 2008; 1:5. [PMID: 18710530 PMCID: PMC2518268 DOI: 10.1186/1756-0500-1-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2008] [Accepted: 02/28/2008] [Indexed: 11/24/2022] Open
Abstract
Background The comparison of DNA sequences is a traditional problem in genomics and bioinformatics. Many new opportunities emerge due to the improvement of personal computers, allowing the implementation of novel strategies of analysis. Findings We describe a new program, called UVWORD, which determines the number of times that each DNA word present in a sequence (target) is found in a second sequence (source), a procedure that we have called oligonucleotide profiling. On a standard computer, the user may search for words of a size ranging from k = 1 to k = 14 nucleotides. Average counts for groups of contiguous words may also be established. The rate of analysis on standard computers is from 3.4 (k = 14) to 16 millions of words per second (1 ≤ k ≤ 8). This makes feasible the fast screening of even the longest known DNA molecules. Discussion We show that the combination of the ability of analyzing words of relatively long size, which occur very rarely by chance, and the fast speed of the program allows to perform novel types of screenings, complementary to those provided by standard programs such as BLAST. This method can be used to determine oligonucleotide content, to characterize the distribution of repetitive sequences in chromosomes, to determine the evolutionary conservation of sequences in different species, to establish regions of similar DNA among chromosomes or genomes, etc.
Collapse
Affiliation(s)
- Vicente Arnau
- Departmento de Informática. Universidad de Valencia, Spain. vicente@
| | | | | |
Collapse
|
25
|
Gallach M, Arnau V, Marín I. Global patterns of sequence evolution in Drosophila. BMC Genomics 2007; 8:408. [PMID: 17996078 PMCID: PMC2180185 DOI: 10.1186/1471-2164-8-408] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2007] [Accepted: 11/09/2007] [Indexed: 01/30/2023] Open
Abstract
Background Sequencing of the genomes of several Drosophila allows for the first precise analyses of how global sequence patterns change among multiple, closely related animal species. A basic question is whether there are characteristic features that differentiate chromosomes within a species or between different species. Results We explored the euchromatin of the chromosomes of seven Drosophila species to establish their global patterns of DNA sequence diversity. Between species, differences in the types and amounts of simple sequence repeats were found. Within each species, the autosomes have almost identical oligonucleotide profiles. However, X chromosomes and autosomes have, in all species, a qualitatively different composition. The X chromosomes are less complex than the autosomes, containing both a higher amount of simple DNA sequences and, in several cases, chromosome-specific repetitive sequences. Moreover, we show that the right arm of the X chromosome of Drosophila pseudoobscura, which evolved from an autosome 10 – 18 millions of years ago, has a composition which is identical to that of the original, left arm of the X chromosome. Conclusion The consistent differences among species, differences among X chromosomes and autosomes and the convergent evolution of X and neo-X chromosomes demonstrate that strong forces are acting on drosophilid genomes to generate peculiar chromosomal landscapes. We discuss the relationships of the patterns observed with differential recombination and mutation rates and with the process of dosage compensation.
Collapse
Affiliation(s)
- Miguel Gallach
- Instituto de Biomedicina de Valencia, Consejo Superior de Investigaciones Científicas (IBV-CSIC), Valencia, Spain.
| | | | | |
Collapse
|
26
|
Wutz A, Gribnau J. X inactivation Xplained. Curr Opin Genet Dev 2007; 17:387-93. [PMID: 17869504 DOI: 10.1016/j.gde.2007.08.001] [Citation(s) in RCA: 122] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2007] [Revised: 08/01/2007] [Accepted: 08/02/2007] [Indexed: 11/21/2022]
Abstract
Random inactivation of one of the two female X chromosomes establishes dosage compensation between XY males and XX females in placental mammals. X inactivation is controlled by the X inactivation center (Xic). Recent advances in genome sequencing show that the Xic has evolved from an ancestral vertebrate gene cluster in placental mammals and has undergone separate rearrangements in marsupials. The Xic ensures that all but one X chromosome per diploid genome are inactivated. Which chromosome remains active is randomly chosen. Pairing of Xic loci on the two X chromosomes and alternate states of the X chromosomes before inactivation have recently been implicated in the mechanism of random choice. Chromosome-wide silencing is then initiated by the noncoding Xist RNA, which evolved with the mammalian Xic and covers the inactive X chromosome.
Collapse
Affiliation(s)
- Anton Wutz
- Research Institute of Molecular Pathology, Dr. Bohr-Gasse 7, 1030 Vienna, Austria.
| | | |
Collapse
|
27
|
Ellegren H, Hultin-Rosenberg L, Brunström B, Dencker L, Kultima K, Scholz B. Faced with inequality: chicken do not have a general dosage compensation of sex-linked genes. BMC Biol 2007; 5:40. [PMID: 17883843 PMCID: PMC2099419 DOI: 10.1186/1741-7007-5-40] [Citation(s) in RCA: 190] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2007] [Accepted: 09/20/2007] [Indexed: 11/10/2022] Open
Abstract
Background The contrasting dose of sex chromosomes in males and females potentially introduces a large-scale imbalance in levels of gene expression between sexes, and between sex chromosomes and autosomes. In many organisms, dosage compensation has thus evolved to equalize sex-linked gene expression in males and females. In mammals this is achieved by X chromosome inactivation and in flies and worms by up- or down-regulation of X-linked expression, respectively. While otherwise widespread in systems with heteromorphic sex chromosomes, the case of dosage compensation in birds (males ZZ, females ZW) remains an unsolved enigma. Results Here, we use a microarray approach to show that male chicken embryos generally express higher levels of Z-linked genes than female birds, both in soma and in gonads. The distribution of male-to-female fold-change values for Z chromosome genes is wide and has a mean of 1.4–1.6, which is consistent with absence of dosage compensation and sex-specific feedback regulation of gene expression at individual loci. Intriguingly, without global dosage compensation, the female chicken has significantly lower expression levels of Z-linked compared to autosomal genes, which is not the case in male birds. Conclusion The pronounced sex difference in gene expression is likely to contribute to sexual dimorphism among birds, and potentially has implication to avian sex determination. Importantly, this report, together with a recent study of sex-biased expression in somatic tissue of chicken, demonstrates the first example of an organism with a lack of global dosage compensation, providing an unexpected case of a viable system with large-scale imbalance in gene expression between sexes.
Collapse
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
| | - Lina Hultin-Rosenberg
- Department of Evolutionary Biology, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
| | - Björn Brunström
- Department of Environmental Toxicology, Uppsala University, Norbyvägen 18A, SE-752 36 Uppsala, Sweden
| | - Lennart Dencker
- Department of Pharmaceutical Biosciences, Uppsala University, Box 594, SE-751 24 Uppsala, Sweden
| | - Kim Kultima
- Department of Pharmaceutical Biosciences, Uppsala University, Box 594, SE-751 24 Uppsala, Sweden
| | - Birger Scholz
- Department of Pharmaceutical Biosciences, Uppsala University, Box 594, SE-751 24 Uppsala, Sweden
| |
Collapse
|
28
|
Wutz A. Xist function: bridging chromatin and stem cells. Trends Genet 2007; 23:457-64. [PMID: 17681633 DOI: 10.1016/j.tig.2007.07.004] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2007] [Revised: 06/06/2007] [Accepted: 07/18/2007] [Indexed: 11/19/2022]
Abstract
In mammals, dosage compensation is achieved by transcriptional silencing of one of the two female X chromosomes. X inactivation is dynamically regulated in development. The non-coding Xist RNA localizes to the inactive X, initiates gene repression in the early embryo, and later stabilizes the inactive state. Different functions of Xist are observed depending on which epigenetic regulatory pathways are active in a given cell. Because Xist has evolved recently, with the origin of placental mammals, the underlying pathways are also important in regulating developmental control genes. This review emphasizes the opportunity that Xist provides to functionally define epigenetic transitions in development, to understand cell identity, pluripotency and stem cell differentiation.
Collapse
Affiliation(s)
- Anton Wutz
- Research Institute of Molecular Pathology, Dr. Bohr-Gasse 7, 1030 Vienna, Austria.
| |
Collapse
|
29
|
Abstract
How the mechanisms of dosage compensation distinguish the sex chromosomes from the autosomes has been something of a mystery. A recent study in Caenorhabditis elegans has identified clusters of two common DNA motifs as a cis-acting code for the recruitment of the DCC, the protein complex that mediates dosage compensation.
Collapse
Affiliation(s)
- Xinxian Deng
- Department of Pathology, University of Washington, Seattle, WA 98195, USA
| | | |
Collapse
|
30
|
Freeling M, Rapaka L, Lyons E, Pedersen B, Thomas BC. G-boxes, bigfoot genes, and environmental response: characterization of intragenomic conserved noncoding sequences in Arabidopsis. THE PLANT CELL 2007; 19:1441-57. [PMID: 17496117 PMCID: PMC1913728 DOI: 10.1105/tpc.107.050419] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2007] [Revised: 03/10/2007] [Accepted: 04/19/2007] [Indexed: 05/15/2023]
Abstract
A tetraploidy left Arabidopsis thaliana with 6358 pairs of homoeologs that, when aligned, generated 14,944 intragenomic conserved noncoding sequences (CNSs). Our previous work assembled these phylogenetic footprints into a database. We show that known transcription factor (TF) binding motifs, including the G-box, are overrepresented in these CNSs. A total of 254 genes spanning long lengths of CNS-rich chromosomes (Bigfoot) dominate this database. Therefore, we made subdatabases: one containing Bigfoot genes and the other containing genes with three to five CNSs (Smallfoot). Bigfoot genes are generally TFs that respond to signals, with their modal CNS positioned 3.1 kb 5' from the ATG. Smallfoot genes encode components of signal transduction machinery, the cytoskeleton, or involve transcription. We queried each subdatabase with each possible 7-nucleotide sequence. Among hundreds of hits, most were purified from CNSs, and almost all of those significantly enriched in CNSs had no experimental history. The 7-mers in CNSs are not 5'- to 3'-oriented in Bigfoot genes but are often oriented in Smallfoot genes. CNSs with one G-box tend to have two G-boxes. CNSs were shared with the homoeolog only and with no other gene, suggesting that binding site turnover impedes detection. Bigfoot genes may function in adaptation to environmental change.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA.
| | | | | | | | | |
Collapse
|
31
|
Robin S, Schbath S, Vandewalle V. Statistical tests to compare motif count exceptionalities. BMC Bioinformatics 2007; 8:84. [PMID: 17346349 PMCID: PMC1838430 DOI: 10.1186/1471-2105-8-84] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2006] [Accepted: 03/08/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. RESULTS We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. CONCLUSION The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use.
Collapse
Affiliation(s)
- Stéphane Robin
- INA PG/ENGREF/INRA, UMR518 Unité Mathématiques et Informatique Appliquées, 75005 Paris, France
| | - Sophie Schbath
- INRA, UR1077 Unité Mathématique, Informatique et Génome, 78350 Jouy-en-Josas, France
| | - Vincent Vandewalle
- INA PG/ENGREF/INRA, UMR518 Unité Mathématiques et Informatique Appliquées, 75005 Paris, France
| |
Collapse
|
32
|
Bush EC, Lahn BT. The evolution of word composition in metazoan promoter sequence. PLoS Comput Biol 2006; 2:e150. [PMID: 17083273 PMCID: PMC1630712 DOI: 10.1371/journal.pcbi.0020150] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2006] [Accepted: 09/27/2006] [Indexed: 12/02/2022] Open
Abstract
The field of molecular evolution provides many examples of the principle that molecular differences between species contain information about evolutionary history. One surprising case can be found in the frequency of short words in DNA: more closely related species have more similar word compositions. Interest in this has often focused on its utility in deducing phylogenetic relationships. However, it is also of interest because of the opportunity it provides for studying the evolution of genome function. Word-frequency differences between species change too slowly to be purely the result of random mutational drift. Rather, their slow pattern of change reflects the direct or indirect action of purifying selection and the presence of functional constraints. Many such constraints are likely to exist, and an important challenge is to distinguish them. Here we develop a method to do so by isolating the effects acting at different word sizes. We apply our method to 2-, 4-, and 8-base-pair (bp) words across several classes of noncoding sequence. Our major result is that similarities in 8-bp word frequencies scale with evolutionary time for regions immediately upstream of genes. This association is present although weaker in intronic sequence, but cannot be detected in intergenic sequence using our method. In contrast, 2-bp and 4-bp word frequencies scale with time in all classes of noncoding sequence. These results suggest that different genomic processes are involved at different word sizes. The pattern in 2-bp and 4-bp words may be due to evolutionary changes in processes such as DNA replication and repair, as has been suggested before. The pattern in 8-bp words may reflect evolutionary changes in gene-regulatory machinery, such as changes in the frequencies of transcription-factor binding sites, or in the affinity of transcription factors for particular sequences. One of the foundations of molecular evolution is the idea that more closely related species are more similar on the molecular level. One example that has been known for several years is the genomic composition of short words (i.e., short segments) of DNA. Given a sample of genome sequence, one can count the occurrences of all words of a certain length. It turns out that closely related species have more similar word frequencies. The pattern of how these frequencies change over evolutionary time is likely to be influenced by the many functions of the genome (coding for proteins, controlling gene expression, etc.). Bush and Lahn investigated the influence of genomic function on word-frequency variation in 13 animal genomes. Using a method designed to isolate the effects acting at particular word sizes, the authors examined how word frequencies vary in different categories of noncoding sequence. They found that interspecies patterns of word-frequency variation change depending on word size and sequence category. These results suggest that noncoding sequence is subject to different functional constraints depending on its location in the genome. An especially interesting possibility is that the patterns in longer words may reflect evolutionary changes in gene regulatory machinery.
Collapse
Affiliation(s)
- Eliot C Bush
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, United States of America
- * To whom correspondence should be addressed. E-mail: (ECB); (BTL)
| | - Bruce T Lahn
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, United States of America
- * To whom correspondence should be addressed. E-mail: (ECB); (BTL)
| |
Collapse
|
33
|
Carrel L, Park C, Tyekucheva S, Dunn J, Chiaromonte F, Makova KD. Genomic environment predicts expression patterns on the human inactive X chromosome. PLoS Genet 2006; 2:e151. [PMID: 17009873 PMCID: PMC1584270 DOI: 10.1371/journal.pgen.0020151] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2006] [Accepted: 08/03/2006] [Indexed: 11/18/2022] Open
Abstract
What genomic landmarks render most genes silent while leaving others expressed on the inactive X chromosome in mammalian females? To date, signals determining expression status of genes on the inactive X remain enigmatic despite the availability of complete genomic sequences. Long interspersed repeats (L1s), particularly abundant on the X, are hypothesized to spread the inactivation signal and are enriched in the vicinity of inactive genes. However, both L1s and inactive genes are also more prevalent in ancient evolutionary strata. Did L1s accumulate there because of their role in inactivation or simply because they spent more time on the rarely recombining X? Here we utilize an experimentally derived inactivation profile of the entire human X chromosome to uncover sequences important for its inactivation, and to predict expression status of individual genes. Focusing on Xp22, where both inactive and active genes reside within evolutionarily young strata, we compare neighborhoods of genes with different inactivation states to identify enriched oligomers. Occurrences of such oligomers are then used as features to train a linear discriminant analysis classifier. Remarkably, expression status is correctly predicted for 84% and 91% of active and inactive genes, respectively, on the entire X, suggesting that oligomers enriched in Xp22 capture most of the genomic signal determining inactivation. To our surprise, the majority of oligomers associated with inactivated genes fall within L1 elements, even though L1 frequency in Xp22 is low. Moreover, these oligomers are enriched in parts of L1 sequences that are usually underrepresented in the genome. Thus, our results strongly support the role of L1s in X inactivation, yet indicate that a chromatin microenvironment composed of multiple genomic sequence elements determines expression status of X chromosome genes.
Collapse
Affiliation(s)
- Laura Carrel
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, Pennsylvania, United States of America
- * To whom correspondence should be addressed. E-mail: (LC); (KDM)
| | - Chungoo Park
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Svitlana Tyekucheva
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - John Dunn
- Department of Pediatrics, Case Western Reserve University School of Medicine, Cleveland, Ohio, United States of America
| | - Francesca Chiaromonte
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Health Evaluation Sciences, Pennsylvania State University College of Medicine, Hershey, Pennsylvania, United States of America
| | - Kateryna D Makova
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
- * To whom correspondence should be addressed. E-mail: (LC); (KDM)
| |
Collapse
|
34
|
Wang Z, Willard HF, Mukherjee S, Furey TS. Evidence of influence of genomic DNA sequence on human X chromosome inactivation. PLoS Comput Biol 2006; 2:e113. [PMID: 16948528 PMCID: PMC1557588 DOI: 10.1371/journal.pcbi.0020113] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2006] [Accepted: 07/17/2006] [Indexed: 01/01/2023] Open
Abstract
A significant number of human X-linked genes escape X chromosome inactivation and are thus expressed from both the active and inactive X chromosomes. The basis for escape from inactivation and the potential role of the X chromosome primary DNA sequence in determining a gene's X inactivation status is unclear. Using a combination of the X chromosome sequence and a comprehensive X inactivation profile of more than 600 genes, two independent yet complementary approaches were used to systematically investigate the relationship between X inactivation and DNA sequence features. First, statistical analyses revealed that a number of repeat features, including long interspersed nuclear element (LINE) and mammalian-wide interspersed repeat repetitive elements, are significantly enriched in regions surrounding transcription start sites of genes that are subject to inactivation, while Alu repetitive elements and short motifs containing ACG/CGT are significantly enriched in those that escape inactivation. Second, linear support vector machine classifiers constructed using primary DNA sequence features were used to correctly predict the X inactivation status for >80% of all X-linked genes. We further identified a small set of features that are important for accurate classification, among which LINE-1 and LINE-2 content show the greatest individual discriminatory power. Finally, as few as 12 features can be used for accurate support vector machine classification. Taken together, these results suggest that features of the underlying primary DNA sequence of the human X chromosome may influence the spreading and/or maintenance of X inactivation.
Collapse
Affiliation(s)
- Zhong Wang
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Huntington F Willard
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Sayan Mukherjee
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Terrence S Furey
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
35
|
Clemson CM, Hall LL, Byron M, McNeil J, Lawrence JB. The X chromosome is organized into a gene-rich outer rim and an internal core containing silenced nongenic sequences. Proc Natl Acad Sci U S A 2006; 103:7688-93. [PMID: 16682630 PMCID: PMC1472506 DOI: 10.1073/pnas.0601069103] [Citation(s) in RCA: 168] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2006] [Indexed: 11/18/2022] Open
Abstract
We investigated whether genes escape X chromosome inactivation by positioning outside of the territory defined by XIST RNA. Results reveal an unanticipated higher order organization of genes and noncoding sequences. All 15 X-linked genes, regardless of activity, position on the border of the XIST RNA territory, which resides outside of the DAPI-dense Barr body. Although more strictly delineated on the inactive X chromosome (Xi), all genes localized predominantly to the outer rim of the Xi and active X chromosome. This outer rim is decorated only by X chromosome DNA paints and is excluded from both the XIST RNA and dense DAPI staining. The only DNA found well within the Barr body and XIST RNA territory was centromeric and Cot-1 DNA; hence, the core of the X chromosome essentially excludes genes and is composed primarily of noncoding repeat-rich DNA. Moreover, we show that this core of repetitive sequences is expressed throughout the nucleus yet is silenced throughout Xi, providing direct evidence for chromosome-wide regulation of "junk" DNA transcription. Collective results suggest that the Barr body, long presumed to be the physical manifestation of silenced genes, is in fact composed of a core of silenced noncoding DNA. Instead of acting at a local gene level, XIST RNA appears to interact with and silence core architectural elements to effectively condense and shut down the Xi.
Collapse
Affiliation(s)
- Christine Moulton Clemson
- Department of Cell Biology, University of Massachusetts Medical Center, 55 Lake Avenue North, Worcester, MA 01655
| | - Lisa L. Hall
- Department of Cell Biology, University of Massachusetts Medical Center, 55 Lake Avenue North, Worcester, MA 01655
| | - Meg Byron
- Department of Cell Biology, University of Massachusetts Medical Center, 55 Lake Avenue North, Worcester, MA 01655
| | - John McNeil
- Department of Cell Biology, University of Massachusetts Medical Center, 55 Lake Avenue North, Worcester, MA 01655
| | - Jeanne Bentley Lawrence
- Department of Cell Biology, University of Massachusetts Medical Center, 55 Lake Avenue North, Worcester, MA 01655
| |
Collapse
|
36
|
Ross MT, Bentley DR, Tyler-Smith C. The sequences of the human sex chromosomes. Curr Opin Genet Dev 2006; 16:213-8. [PMID: 16650760 DOI: 10.1016/j.gde.2006.04.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2006] [Accepted: 04/18/2006] [Indexed: 10/24/2022]
Abstract
The sequences of both of the human sex chromosomes and of a substantial part of the chimpanzee Y chromosome have now been determined, and most of the protein-coding genes have been identified. The X chromosome codes for more than 800 proteins but the Y chromosome for only approximately 60, illustrating their very different evolutionary histories since their origin from an autosomal pair approximately 300 million years ago and explaining their differential importance in disease. These sequences have provided the basis for understanding normal patterns of variation, such as the distribution of SNPs, and patterns of linkage disequilibrium. In addition, they have been useful for identifying variants associated with simple Mendelian disorders such as microphthalmia or mental retardation, and more complex disorders such as osteoporosis.
Collapse
Affiliation(s)
- Mark T Ross
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | | |
Collapse
|