1
Andrade Ruiz L, Kops GJPL, Sacristan C. Vertebrate centromere architecture: from chromatin threads to functional structures. Chromosoma 2024:10.1007/s00412-024-00823-z. [PMID: 38856923 DOI: 10.1007/s00412-024-00823-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/06/2024] [Revised: 05/21/2024] [Accepted: 05/27/2024] [Indexed: 06/11/2024]
Abstract
Centromeres are chromatin structures specialized in sister chromatid cohesion, kinetochore assembly, and microtubule attachment during chromosome segregation. The regional centromere of vertebrates consists of long regions of highly repetitive sequences occupied by the Histone H3 variant CENP-A, and which are flanked by pericentromeres. The three-dimensional organization of centromeric chromatin is paramount for its functionality and its ability to withstand spindle forces. Alongside CENP-A, key contributors to the folding of this structure include components of the Constitutive Centromere-Associated Network (CCAN), the protein CENP-B, and condensin and cohesin complexes. Despite its importance, the intricate architecture of the regional centromere of vertebrates remains largely unknown. Recent advancements in long-read sequencing, super-resolution and cryo-electron microscopy, and chromosome conformation capture techniques have significantly improved our understanding of this structure at various levels, from the linear arrangement of centromeric sequences and their epigenetic landscape to their higher-order compaction. In this review, we discuss the latest insights on centromere organization and place them in the context of recent findings describing a bipartite higher-order organization of the centromere.
Affiliation(s)
- Lorena Andrade Ruiz
  - Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences, Utrecht, Netherlands
  - University Medical Center Utrecht, Utrecht, Netherlands
  - Oncode Institute, Utrecht, Netherlands
- Geert J P L Kops
  - Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences, Utrecht, Netherlands
  - University Medical Center Utrecht, Utrecht, Netherlands
  - Oncode Institute, Utrecht, Netherlands
- Carlos Sacristan
  - Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences, Utrecht, Netherlands
  - University Medical Center Utrecht, Utrecht, Netherlands
  - Oncode Institute, Utrecht, Netherlands
2
Nassar R, Thompson L, Fouquerel E. Molecular mechanisms protecting centromeres from self-sabotage and implications for cancer therapy. NAR Cancer 2023; 5:zcad019. [PMID: 37180029 PMCID: PMC10167631 DOI: 10.1093/narcan/zcad019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/27/2023] [Revised: 03/27/2023] [Accepted: 04/20/2023] [Indexed: 05/15/2023] [Open Access]
Abstract
Centromeres play a crucial role in DNA segregation by mediating the cohesion and separation of sister chromatids during cell division. Centromere dysfunction, breakage or compromised centromeric integrity can generate aneuploidies and chromosomal instability, which are cellular features associated with cancer initiation and progression. Maintaining centromere integrity is thus essential for genome stability. However, the centromere itself is prone to DNA breaks, likely due to its intrinsically fragile nature. Centromeres are complex genomic loci that are composed of highly repetitive DNA sequences and secondary structures and require the recruitment and homeostasis of a centromere-associated protein network. The molecular mechanisms engaged to preserve centromere inherent structure and respond to centromeric damage are not fully understood and remain a subject of ongoing research. In this article, we provide a review of the currently known factors that contribute to centromeric dysfunction and the molecular mechanisms that mitigate the impact of centromere damage on genome stability. Finally, we discuss the potential therapeutic strategies that could arise from a deeper understanding of the mechanisms preserving centromere integrity.
Affiliation(s)
- Rim Nassar
  - UPMC Hillman Cancer Center, Department of Pharmacology and Chemical Biology, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15232, USA
- Lily Thompson
  - UPMC Hillman Cancer Center, Department of Pharmacology and Chemical Biology, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15232, USA
  - Department of Biochemistry and Molecular Biology, Thomas Jefferson University, Philadelphia, PA 19107, USA
- Elise Fouquerel
  - UPMC Hillman Cancer Center, Department of Pharmacology and Chemical Biology, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15232, USA
3
DiMeLo-seq: a long-read, single-molecule method for mapping protein-DNA interactions genome wide. Nat Methods 2022; 19:711-723. [PMID: 35396487 PMCID: PMC9189060 DOI: 10.1038/s41592-022-01475-6] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Received: 08/03/2021] [Accepted: 03/24/2022] [Indexed: 12/13/2022]
Abstract
Studies of genome regulation routinely use high-throughput DNA sequencing approaches to determine where specific proteins interact with DNA, and they rely on DNA amplification and short-read sequencing, limiting their quantitative application in complex genomic regions. To address these limitations, we developed Directed Methylation with Long-read sequencing (DiMeLo-seq), which uses antibody-tethered enzymes to methylate DNA near a target protein’s binding sites in situ. These exogenous methylation marks are then detected simultaneously with endogenous CpG methylation on unamplified DNA using long-read, single-molecule sequencing technologies. We optimized and benchmarked DiMeLo-seq by mapping chromatin-binding proteins and histone modifications across the human genome. Furthermore, we identified where centromere protein A (CENP-A) localizes within highly repetitive regions that were unmappable with short sequencing reads, and we estimated the density of CENP-A molecules along single chromatin fibers. DiMeLo-seq is a versatile method that provides multimodal, genome-wide information for investigating protein-DNA interactions.
4
Hartley GA, Okhovat M, O'Neill RJ, Carbone L. Comparative analyses of gibbon centromeres reveal dynamic genus specific shifts in repeat composition. Mol Biol Evol 2021; 38:3972-3992. [PMID: 33983366 PMCID: PMC8382927 DOI: 10.1093/molbev/msab148] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 12/12/2022] [Open Access]
Abstract
Centromeres are functionally conserved chromosomal loci essential for proper chromosome segregation during cell division, yet they show high sequence diversity across species. Despite their variation, a near universal feature of centromeres is the presence of repetitive sequences, such as DNA satellites and transposable elements (TEs). Because of their rapidly evolving karyotypes, gibbons represent a compelling model to investigate divergence of functional centromere sequences across short evolutionary timescales. In this study, we use ChIP-seq, RNA-seq, and fluorescence in situ hybridization to comprehensively investigate the centromeric repeat content of the four extant gibbon genera (Hoolock, Hylobates, Nomascus, and Siamang). In all gibbon genera, we find that CENP-A nucleosomes and the DNA-proteins that interface with the inner kinetochore preferentially bind retroelements of broad classes rather than satellite DNA. A previously identified gibbon-specific composite retrotransposon, LAVA, known to be expanded within the centromere regions of one gibbon genus (Hoolock), displays centromere- and species-specific sequence differences, potentially as a result of its co-option to a centromeric function. When dissecting centromere satellite composition, we discovered the presence of the retroelement-derived macrosatellite SST1 in multiple centromeres of Hoolock, whereas alpha-satellites represent the predominant satellite in the other genera, further suggesting an independent evolutionary trajectory for Hoolock centromeres. Finally, using de novo assembly of centromere sequences, we determined that transcripts originating from gibbon centromeres recapitulate the species-specific TE composition. Combined, our data reveal dynamic shifts in the repeat content that define gibbon centromeres and coincide with the extensive karyotypic diversity within this lineage.
Affiliation(s)
- Gabrielle A Hartley
  - Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269
- Mariam Okhovat
  - Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, 97239
- Rachel J O'Neill
  - Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269
  - Institute for Systems Genomics, University of Connecticut, Storrs, CT, 06269
  - Department of Genomics and Genome Sciences, UConn Health, Farmington, CT, 06030
- Lucia Carbone
  - Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, 97239
  - Division of Genetics, Oregon National Primate Research Center, Beaverton, OR, 97006
  - Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR, 97239
  - Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR, 97239
5
Lopes M, Louzada S, Gama-Carvalho M, Chaves R. Genomic Tackling of Human Satellite DNA: Breaking Barriers through Time. Int J Mol Sci 2021; 22:4707. [PMID: 33946766 PMCID: PMC8125562 DOI: 10.3390/ijms22094707] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/31/2021] [Revised: 04/24/2021] [Accepted: 04/27/2021] [Indexed: 12/12/2022] [Open Access]
Abstract
(Peri)centromeric repetitive sequences and, more specifically, satellite DNA (satDNA) sequences, constitute a major human genomic component. SatDNA sequences can vary on a large number of features, including nucleotide composition, complexity, and abundance. Several satDNA families have been identified and characterized in the human genome through time, albeit at different speeds. Human satDNA families present a high degree of sub-variability, leading to the definition of various subfamilies with different organization and clustered localization. Evolution of satDNA analysis has enabled the progressive characterization of satDNA features. Despite recent advances in the sequencing of centromeric arrays, comprehensive genomic studies to assess their variability are still required to provide accurate and proportional representation of satDNA (peri)centromeric/acrocentric short arm sequences. Approaches combining multiple techniques have been successfully applied and seem to be the path to follow for generating integrated knowledge in the promising field of human satDNA biology.
Affiliation(s)
- Mariana Lopes
  - Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
  - Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal
- Sandra Louzada
  - Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
  - Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal
- Margarida Gama-Carvalho
  - Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal
- Raquel Chaves
  - Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
  - Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal
6
Thakur J, Packiaraj J, Henikoff S. Sequence, Chromatin and Evolution of Satellite DNA. Int J Mol Sci 2021; 22:4309. [PMID: 33919233 PMCID: PMC8122249 DOI: 10.3390/ijms22094309] [Citation(s) in RCA: 85] [Impact Index Per Article: 28.3] [Received: 04/01/2021] [Revised: 04/16/2021] [Accepted: 04/17/2021] [Indexed: 12/15/2022] [Open Access]
Abstract
Satellite DNA consists of abundant tandem repeats that play important roles in cellular processes, including chromosome segregation, genome organization and chromosome end protection. Most satellite DNA repeat units are either of nucleosomal length or 5–10 bp long and occupy centromeric, pericentromeric or telomeric regions. Due to high repetitiveness, satellite DNA sequences have largely been absent from genome assemblies. Although few conserved satellite-specific sequence motifs have been identified, DNA curvature, dyad symmetries and inverted repeats are features of various satellite DNAs in several organisms. Satellite DNA sequences are either embedded in highly compact gene-poor heterochromatin or specialized chromatin that is distinct from euchromatin. Nevertheless, some satellite DNAs are transcribed into non-coding RNAs that may play important roles in satellite DNA function. Intriguingly, satellite DNAs are among the most rapidly evolving genomic elements, such that a large fraction is species-specific in most organisms. Here we describe the different classes of satellite DNA sequences, their satellite-specific chromatin features, and how these features may contribute to satellite DNA biology and evolution. We also discuss how the evolution of functional satellite DNA classes may contribute to speciation in plants and animals.
Affiliation(s)
- Jitendra Thakur
  - Department of Biology, Emory University, Atlanta, GA 30322, USA
  - Corresponding author
- Jenika Packiaraj
  - Department of Biology, Emory University, Atlanta, GA 30322, USA
- Steven Henikoff
  - Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
  - Fred Hutchinson Cancer Research Center, Howard Hughes Medical Institute, Seattle, WA 98109, USA
7
Smith OK, Limouse C, Fryer KA, Teran NA, Sundararajan K, Heald R, Straight AF. Identification and characterization of centromeric sequences in Xenopus laevis. Genome Res 2021; 31:958-967. [PMID: 33875480 PMCID: PMC8168581 DOI: 10.1101/gr.267781.120] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 06/23/2020] [Accepted: 04/08/2021] [Indexed: 11/24/2022]
Abstract
Centromeres play an essential function in cell division by specifying the site of kinetochore formation on each chromosome for mitotic spindle attachment. Centromeres are defined epigenetically by the histone H3 variant Centromere Protein A (Cenpa). Cenpa nucleosomes maintain the centromere by designating the site for new Cenpa assembly after dilution by replication. Vertebrate centromeres assemble on tandem arrays of repetitive sequences, but the function of repeat DNA in centromere formation has been challenging to dissect due to the difficulty in manipulating centromeres in cells. Xenopus laevis egg extracts assemble centromeres in vitro, providing a system for studying centromeric DNA functions. However, centromeric sequences in Xenopus laevis have not been extensively characterized. In this study, we combine Cenpa ChIP-seq with a k-mer based analysis approach to identify the Xenopus laevis centromere repeat sequences. By in situ hybridization, we show that Xenopus laevis centromeres contain diverse repeat sequences, and we map the centromere position on each Xenopus laevis chromosome using the distribution of centromere-enriched k-mers. Our identification of Xenopus laevis centromere sequences enables previously unapproachable centromere genomic studies. Our approach should be broadly applicable for the analysis of centromere and other repetitive sequences in any organism.
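The k-mer based idea mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names (`count_kmers`, `enriched_kmers`), the default `k`, the pseudocount, and the ratio threshold are all assumptions chosen for illustration. The sketch counts k-mers in reads from a ChIP sample and from an input control, then reports k-mers whose normalized frequency is higher in the ChIP sample.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every overlapping k-mer across a list of read sequences."""
    counts = Counter()
    for read in reads:
        seq = read.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if "N" not in kmer:  # skip k-mers containing ambiguous bases
                counts[kmer] += 1
    return counts


def enriched_kmers(chip_reads, input_reads, k=5, min_ratio=2.0, pseudocount=1.0):
    """Return {kmer: ratio} for k-mers over-represented in ChIP versus input.

    Hypothetical scoring: each count gets a pseudocount and is divided by the
    total k-mer count of its own library before the ratio is taken.
    """
    chip = count_kmers(chip_reads, k)
    ctrl = count_kmers(input_reads, k)
    chip_total = sum(chip.values()) or 1  # avoid division by zero on empty input
    ctrl_total = sum(ctrl.values()) or 1
    result = {}
    for kmer, n in chip.items():
        chip_freq = (n + pseudocount) / chip_total
        ctrl_freq = (ctrl.get(kmer, 0) + pseudocount) / ctrl_total
        ratio = chip_freq / ctrl_freq
        if ratio >= min_ratio:
            result[kmer] = ratio
    return result
```

In a real analysis the reads would come from FASTQ files and k would be much larger than in a toy example; the threshold would also need to be calibrated against the data rather than fixed in advance.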
Affiliation(s)
- Owen K Smith
  - Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305-5307, USA
  - Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, California 94305, USA
- Charles Limouse
  - Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305-5307, USA
- Kelsey A Fryer
  - Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305-5307, USA
  - Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120, USA
- Nicole A Teran
  - Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120, USA
- Kousik Sundararajan
  - Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305-5307, USA
- Rebecca Heald
  - Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California 94720-3200, USA
- Aaron F Straight
  - Department of Biochemistry, Stanford University School of Medicine, Stanford, California 94305-5307, USA
8
Discovery of 33mer in chromosome 21 - the largest alpha satellite higher order repeat unit among all human somatic chromosomes. Sci Rep 2019; 9:12629. [PMID: 31477765 PMCID: PMC6718397 DOI: 10.1038/s41598-019-49022-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 01/25/2019] [Accepted: 08/13/2019] [Indexed: 11/10/2022] [Open Access]
Abstract
The centromere is important for segregation of chromosomes during cell division in eukaryotes. Its destabilization results in chromosomal missegregation and aneuploidy, hallmarks of cancers and birth defects. In primate genomes, centromeres contain tandem repeats of ~171 bp alpha satellite DNA, commonly organized into higher order repeats (HORs). In spite of their crucial importance, satellites have been understudied because of gaps in sequencing, the genomic “black holes”. Bioinformatic studies of genomic sequences open possibilities to revolutionize understanding of repetitive DNA datasets. Here, using the robust Global Repeat Map algorithm, we identified in the hg38 sequence of human chromosome 21 a complete ensemble of alpha satellite HORs with six long repeat units (≥20 mers), five of them novel. The novel 33mer HOR has the longest HOR unit identified so far among all somatic chromosomes, and the novel 23mer reverse HOR is located far from the centromere. We also discovered that, in the hg38 assembly, the 33mer sequences in chromosomes 21, 13, 14, and 22 are 100% identical but nearby gaps are present, which seems to call for additional, more precise sequencing. Chromosome 21 is of significant interest for deciphering the molecular basis of Down syndrome and of aneuploidies in general. Since chromosome identifier probes are largely based on the detection of higher order alpha satellite repeats, the distinctions identified here between alpha satellite HORs in chromosomes 21 and 13 might lead to a unique chromosome 21 probe in molecular cytogenetics, which would find utility in diagnostics. It is expected that its complete sequence analysis will have profound implications for understanding the pathogenesis of diseases and the development of new therapeutic approaches.
9
McNulty SM, Sullivan BA. Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Res 2018; 26:115-138. [PMID: 29974361 DOI: 10.1007/s10577-018-9582-3] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Received: 05/03/2018] [Accepted: 06/14/2018] [Indexed: 02/05/2023]
Abstract
Repetitive DNA, formerly referred to by the misnomer "junk DNA," comprises a majority of the human genome. One class of this DNA, alpha satellite, comprises up to 10% of the genome. Alpha satellite is enriched at all human centromere regions and is competent for de novo centromere assembly. Because of the highly repetitive nature of alpha satellite, it has been difficult to achieve genome assemblies at centromeres using traditional next-generation sequencing approaches, and thus, centromeres represent gaps in the current human genome assembly. Moreover, alpha satellite DNA is transcribed into repetitive noncoding RNA and contributes to a large portion of the transcriptome. Recent efforts to characterize these transcripts and their function have uncovered pivotal roles for satellite RNA in genome stability, including silencing "selfish" DNA elements and recruiting centromere and kinetochore proteins. This review will describe the genomic and epigenetic features of alpha satellite DNA, discuss recent findings of noncoding transcripts produced from distinct alpha satellite arrays, and address current progress in the functional understanding of this oft-neglected repetitive sequence. We will discuss unique challenges of studying human satellite DNAs and RNAs and point toward new technologies that will continue to advance our understanding of this largely untapped portion of the genome.
Affiliation(s)
- Shannon M McNulty
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA
- Beth A Sullivan
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA
  - Division of Human Genetics, Duke University Medical Center, Durham, NC, 27710, USA
10
McNulty SM, Sullivan LL, Sullivan BA. Human Centromeres Produce Chromosome-Specific and Array-Specific Alpha Satellite Transcripts that Are Complexed with CENP-A and CENP-C. Dev Cell 2017; 42:226-240.e6. [PMID: 28787590 DOI: 10.1016/j.devcel.2017.07.001] [Citation(s) in RCA: 130] [Impact Index Per Article: 18.6] [Received: 01/30/2017] [Revised: 05/24/2017] [Accepted: 07/03/2017] [Indexed: 11/28/2022]
Abstract
Human centromeres are defined by alpha satellite DNA arrays that are distinct and chromosome specific. Most human chromosomes contain multiple alpha satellite arrays that are competent for centromere assembly. Here, we show that human centromeres are defined by chromosome-specific RNAs linked to underlying organization of distinct alpha satellite arrays. Active and inactive arrays on the same chromosome produce discrete sets of transcripts in cis. Non-coding RNAs produced from active arrays are complexed with CENP-A and CENP-C, while inactive-array transcripts associate with CENP-B and are generally less stable. Loss of CENP-A does not affect transcript abundance or stability. However, depletion of array-specific RNAs reduces CENP-A and CENP-C at the targeted centromere via faulty CENP-A loading, arresting cells before mitosis. This work shows that each human alpha satellite array produces a unique set of non-coding transcripts, and RNAs present at active centromeres are necessary for kinetochore assembly and cell-cycle progression.
Affiliation(s)
- Shannon M McNulty
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
- Lori L Sullivan
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
- Beth A Sullivan
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
  - Division of Human Genetics, Duke University Medical Center, Durham, NC 27710, USA
11
Miga KH. The Promises and Challenges of Genomic Studies of Human Centromeres. Prog Mol Subcell Biol 2017; 56:285-304. [PMID: 28840242 DOI: 10.1007/978-3-319-58592-5_12] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Indexed: 12/22/2022]
Abstract
Human centromeres are genomic regions that act as sites of kinetochore assembly to ensure proper chromosome segregation during mitosis and meiosis. Although the biological importance of centromeres in genome stability, and ultimately, cell viability is well understood, the complete sequence content and organization in these multi-megabase-sized regions remain unknown. The lack of a high-resolution reference assembly inhibits standard bioinformatics protocols, and as a result, sequence-based studies involving human centromeres lag far behind the advances made for the non-repetitive sequences in the human genome. In this chapter, I introduce what is known about the genomic organization in the highly repetitive regions spanning human centromeres, and discuss the challenges these sequences pose for assembly, alignment, and data interpretation. Overcoming these obstacles is expected to usher in a new era for centromere genomics, which will offer new discoveries in basic cell biology and human biomedical research.
Affiliation(s)
- Karen H Miga
  - Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA, USA
12
Thakur J, Henikoff S. CENPT bridges adjacent CENPA nucleosomes on young human α-satellite dimers. Genome Res 2016; 26:1178-87. [PMID: 27384170 PMCID: PMC5052034 DOI: 10.1101/gr.204784.116] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Received: 01/24/2016] [Accepted: 06/29/2016] [Indexed: 12/15/2022]
Abstract
Nucleosomes containing the CenH3 (CENPA or CENP-A) histone variant replace H3 nucleosomes at centromeres to provide a foundation for kinetochore assembly. CENPA nucleosomes are part of the constitutive centromere associated network (CCAN) that forms the inner kinetochore on which outer kinetochore proteins assemble. Two components of the CCAN, CENPC and the histone-fold protein CENPT, provide independent connections from the ∼171-bp centromeric α-satellite repeat units to the outer kinetochore. However, the spatial relationship between CENPA nucleosomes and these two branches remains unclear. To address this issue, we use a base-pair resolution genomic readout of protein-protein interactions, comparative chromatin immunoprecipitation (ChIP) with sequencing, together with sequential ChIP, to infer the in vivo molecular architecture of the human CCAN. In contrast to the currently accepted model in which CENPT associates with H3 nucleosomes, we find that CENPT is centered over the CENPB box between two well-positioned CENPA nucleosomes on the most abundant centromeric young α-satellite dimers and interacts with the CENPB/CENPC complex. Upon cross-linking, the entire CENPA/CENPB/CENPC/CENPT complex is nuclease-protected over an α-satellite dimer that comprises the fundamental unit of centromeric chromatin. We conclude that CENPA/CENPC and CENPT pathways for kinetochore assembly are physically integrated over young α-satellite dimers.
Affiliation(s)
- Jitendra Thakur
  - Howard Hughes Medical Institute, Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
- Steven Henikoff
  - Howard Hughes Medical Institute, Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
13
Matylla-Kulinska K, Tafer H, Weiss A, Schroeder R. Functional repeat-derived RNAs often originate from retrotransposon-propagated ncRNAs. Wiley Interdiscip Rev RNA 2014; 5:591-600. [PMID: 25045147 PMCID: PMC4233971 DOI: 10.1002/wrna.1243] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Received: 12/20/2013] [Revised: 04/15/2014] [Accepted: 04/22/2014] [Indexed: 12/19/2022]
Abstract
The human genome is scattered with repetitive sequences, and the ENCODE project revealed that 60–70% of the genomic DNA is transcribed into RNA. As a consequence, the human transcriptome contains a large portion of repeat-derived RNAs (repRNAs). Here, we present a hypothesis for the evolution of novel functional repeat-derived RNAs from non-coding RNAs (ncRNAs) by retrotransposition. Upon amplification, the ncRNAs can diversify in sequence and subsequently evolve new activities, which can result in novel functions. Non-coding transcripts derived from highly repetitive regions can therefore serve as a reservoir for the evolution of novel functional RNAs. We base our hypothetical model on observations reported for short interspersed nuclear elements derived from 7SL RNA and tRNAs, α satellites derived from snoRNAs and SL RNAs derived from U1 small nuclear RNA. Furthermore, we present novel putative human repeat-derived ncRNAs obtained by the comparison of the Dfam and Rfam databases, as well as several examples in other species. We hypothesize that novel functional ncRNAs can derive also from other repetitive regions and propose Genomic SELEX as a tool for their identification.
Affiliation(s)
- Katarzyna Matylla-Kulinska
  - Department of Biochemistry and Cell Biology, Max F. Perutz Laboratories, University of Vienna, Vienna, Austria
14
Abstract
The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function.
Affiliation(s)
- Megan E. Aldrup-MacDonald
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
  - Division of Human Genetics, Duke University, Durham, NC 27710, USA
- Beth A. Sullivan
  - Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA
  - Division of Human Genetics, Duke University, Durham, NC 27710, USA
  - Author to whom correspondence should be addressed; Tel.: +1-919-684-9038
15
Shang WH, Hori T, Martins N, Toyoda A, Misu S, Monma N, Hiratani I, Maeshima K, Ikeo K, Fujiyama A, Kimura H, Earnshaw W, Fukagawa T. Chromosome engineering allows the efficient isolation of vertebrate neocentromeres. Dev Cell 2013; 24:635-48. [PMID: 23499358 PMCID: PMC3925796 DOI: 10.1016/j.devcel.2013.02.009] [Citation(s) in RCA: 137] [Impact Index Per Article: 12.5] [Received: 08/13/2012] [Revised: 01/21/2013] [Accepted: 02/15/2013] [Indexed: 01/04/2023]
Abstract
Centromeres are specified by sequence-independent epigenetic mechanisms in most organisms. Rarely, centromere repositioning results in neocentromere formation at ectopic sites. However, the mechanisms governing how and where neocentromeres form are unknown. Here, we established a chromosome-engineering system in chicken DT40 cells that allowed us to efficiently isolate neocentromere-containing chromosomes. Neocentromeres appear to be structurally and functionally equivalent to native centromeres. Chromatin immunoprecipitation sequencing (ChIP-seq) analysis with 18 neocentromeres revealed that the centromere-specific histone H3 variant CENP-A occupies an ∼40 kb region at each neocentromere, which has no preference for specific DNA sequence motifs. Furthermore, we found that neocentromeres were not associated with histone modifications H3K9me3, H3K4me2, and H3K36me3 or with early replication timing. Importantly, low but significant levels of CENP-A are detected around endogenous centromeres, which are capable of seeding neocentromere assembly if the centromere core is removed. In summary, our experimental system provides valuable insights for understanding how neocentromeres form.
Affiliation(s)
- Wei-Hao Shang
- Department of Molecular Genetics, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Tetsuya Hori
- Department of Molecular Genetics, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Nuno M.C. Martins
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh, EH9 3JR, UK
- Atsushi Toyoda
- Comparative Genomics Laboratory, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Sadahiko Misu
- Cell Innovation Project, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Norikazu Monma
- Cell Innovation Project, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Ichiro Hiratani
- Laboratory of Biological Macromolecules, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Kazuhiro Maeshima
- Laboratory of Biological Macromolecules, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Kazuho Ikeo
- Cell Innovation Project, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- Asao Fujiyama
- Comparative Genomics Laboratory, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
- National Institute of Informatics, Hitotsubashi, Chiyoda-ku, Tokyo 101-8430, Japan
- Hiroshi Kimura
- Graduate School of Frontier Biosciences, Osaka University, 1-3 Yamada-oka, Suita, Osaka 565-0871, Japan
- William C. Earnshaw
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, King’s Buildings, Mayfield Road, Edinburgh, EH9 3JR, UK
- Tatsuo Fukagawa
- Department of Molecular Genetics, National Institute of Genetics and The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka 411-8540, Japan
|
16
|
Abstract
Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance.
|
17
|
Hypomethylation of LINE-1, and not centromeric SAT-α, is associated with centromeric instability in head and neck squamous cell carcinoma. Cell Oncol (Dordr) 2012; 35:259-67. [DOI: 10.1007/s13402-012-0085-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Accepted: 05/16/2012] [Indexed: 10/28/2022] Open
|
18
|
The evolutionary life cycle of the resilient centromere. Chromosoma 2012; 121:327-40. [PMID: 22527114 DOI: 10.1007/s00412-012-0369-6] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Received: 02/22/2012] [Revised: 03/20/2012] [Accepted: 03/20/2012] [Indexed: 12/13/2022]
Abstract
The centromere is a chromosomal structure that is essential for the accurate segregation of replicated eukaryotic chromosomes to daughter cells. In most centromeres, the underlying DNA is principally made up of repetitive DNA elements, such as tandemly repeated satellite DNA and retrotransposable elements. Paradoxically, for such an essential genomic region, the DNA is rapidly evolving both within and between species. In this review, we show that the centromere locus is a resilient structure that can undergo evolutionary cycles of birth, growth, maturity, death and resurrection. The birth phase is highlighted by examples in humans and other organisms where centromere DNA deletions or chromosome rearrangements can trigger the epigenetic assembly of neocentromeres onto genomic sites without typical features of centromere DNA. In addition, functional centromeres can be generated in the laboratory using various methodologies. Recent mapping of the foundation centromere mark, the histone H3 variant CENP-A, onto near-complete genomes has uncovered examples of new centromeres which have not accumulated centromere repeat DNA. During the growth period of the centromere, repeat DNA begins to appear at some, but not all, loci. The maturity stage is characterised by centromere repeat accumulation, expansions and contractions and the rapid evolution of the centromere DNA between chromosomes of the same species and between species. This stage provides inherent centromere stability, facilitated by repression of gene activity and meiotic recombination at and around the centromeres. Death to a centromere can result from genomic instability precipitating rearrangements, deletions, accumulation of mutations and the loss of essential centromere binding proteins. Surprisingly, ancestral centromeres can undergo resurrection either in the field or in the laboratory, via as yet poorly understood mechanisms. 
The underlying principle for the preservation of a centromeric evolutionary life cycle is to provide resilience and perpetuity for the all-important structure and function of the centromere.
|
19
|
Lee HR, Hayden KE, Willard HF. Organization and molecular evolution of CENP-A-associated satellite DNA families in a basal primate genome. Genome Biol Evol 2011; 3:1136-49. [PMID: 21828373 PMCID: PMC3194837 DOI: 10.1093/gbe/evr083] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Indexed: 12/27/2022] Open
Abstract
Centromeric regions in many complex eukaryotic species contain highly repetitive satellite DNAs. Despite the diversity of centromeric DNA sequences among species, the functional centromeres in all species studied to date are marked by CENP-A, a centromere-specific histone H3 variant. Although it is well established that families of multimeric higher-order alpha satellite are conserved at the centromeres of human and great ape chromosomes and that diverged monomeric alpha satellite is found in old and new world monkey genomes, little is known about the organization, function, and evolution of centromeric sequences in more distant primates, including lemurs. Aye-Aye (Daubentonia madagascariensis) is a basal primate and is located at a key position in the evolutionary tree to study centromeric satellite transitions in primate genomes. Using the approach of chromatin immunoprecipitation with antibodies directed to CENP-A, we have identified two satellite families, Daubentonia madagascariensis Aye-Aye 1 (DMA1) and Daubentonia madagascariensis Aye-Aye 2 (DMA2), related to each other but unrelated in sequence to alpha satellite or any other previously described primate or mammalian satellite DNA families. Here, we describe the initial genomic and phylogenetic organization of DMA1 and DMA2 and present evidence of higher-order repeats in Aye-Aye centromeric domains, providing an opportunity to study the emergence of chromosome-specific modes of satellite DNA evolution in primate genomes.
Affiliation(s)
- Hye-Ran Lee
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, USA
|
20
|
Heterochromatin is required for normal distribution of Neurospora crassa CenH3. Mol Cell Biol 2011; 31:2528-42. [PMID: 21505064 DOI: 10.1128/mcb.01285-10] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.6] [Indexed: 11/20/2022] Open
Abstract
Centromeres serve as platforms for the assembly of kinetochores and are essential for nuclear division. Here we identified Neurospora crassa centromeric DNA by chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) of DNA associated with tagged versions of the centromere foundation proteins CenH3 (CENP-A) and CEN-C (CENP-C) and the kinetochore protein CEN-T (CENP-T). On each chromosome we found an ∼150- to 300-kbp region of enrichment for all three proteins. These regions correspond to intervals predicted to be centromeric DNA by genetic mapping and DNA sequence analyses. By ChIP-seq we found extensive colocalization of CenH3, CEN-C, CEN-T, and histone H3K9 trimethylation (H3K9me3). In contrast, H3K4me2, which has been found at the cores of plant, fission yeast, Drosophila, and mammalian centromeres, was not enriched in Neurospora centromeric DNA. DNA methylation was most pronounced at the periphery of centromeric DNA. Mutation of dim-5, which encodes an H3K9 methyltransferase responsible for nearly all H3K9me3, resulted in altered distribution of CenH3-green fluorescent protein (GFP). Similarly, CenH3-GFP distribution was altered in the absence of HP1, the chromodomain protein that binds to H3K9me3. We conclude that eukaryotes with regional centromeres make use of different strategies for maintenance of CenH3 at centromeres, and we suggest a model in which centromere proteins nucleate at the core but require DIM-5 and HP1 for spreading.
|
21
|
Paar V, Glunčić M, Basar I, Rosandić M, Paar P, Cvitković M. Large Tandem, Higher Order Repeats and Regularly Dispersed Repeat Units Contribute Substantially to Divergence Between Human and Chimpanzee Y Chromosomes. J Mol Evol 2010; 72:34-55. [DOI: 10.1007/s00239-010-9401-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Received: 04/05/2010] [Accepted: 10/25/2010] [Indexed: 10/18/2022]
|
22
|
|
23
|
Vermaak D, Malik HS. Multiple roles for heterochromatin protein 1 genes in Drosophila. Annu Rev Genet 2009; 43:467-92. [PMID: 19919324 DOI: 10.1146/annurev-genet-102108-134802] [Citation(s) in RCA: 118] [Impact Index Per Article: 7.9] [Indexed: 11/09/2022]
Abstract
Heterochromatin is the gene-poor, transposon-rich, late-replicating chromatin compartment that was first cytologically defined more than 70 years ago. The identification of heterochromatin protein 1 (HP1) paved the way for a molecular dissection of this important component of complex eukaryotic genomes. Although initial studies revealed HP1's key role in heterochromatin maintenance and function, more recent studies have discovered a role for HP1 in numerous processes including, surprisingly, euchromatic gene expression. Drosophila genomes possess at least five HP1 paralogs that have significantly different roles, ranging from canonical heterochromatic function at pericentric and telomeric regions to exclusive localization and regulation of euchromatic genes. They also possess paralogs exclusively involved in defending the germline against mobile elements. Pursuing a survey of recent genetic and evolutionary findings, we highlight how Drosophila genomes represent the best opportunity to dissect the diversity and incredible versatility of HP1 proteins in organizing and protecting eukaryotic genomes.
Affiliation(s)
- Danielle Vermaak
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
|
24
|
Oliver PL, Goodstadt L, Bayes JJ, Birtle Z, Roach KC, Phadnis N, Beatson SA, Lunter G, Malik HS, Ponting CP. Accelerated evolution of the Prdm9 speciation gene across diverse metazoan taxa. PLoS Genet 2009; 5:e1000753. [PMID: 19997497 PMCID: PMC2779102 DOI: 10.1371/journal.pgen.1000753] [Citation(s) in RCA: 208] [Impact Index Per Article: 13.9] [Received: 05/22/2009] [Accepted: 11/04/2009] [Indexed: 12/12/2022] Open
Abstract
The onset of prezygotic and postzygotic barriers to gene flow between populations is a hallmark of speciation. One of the earliest postzygotic isolating barriers to arise between incipient species is the sterility of the heterogametic sex in interspecies hybrids. Four genes that underlie hybrid sterility have been identified in animals: Odysseus, JYalpha, and Overdrive in Drosophila and Prdm9 (Meisetz) in mice. Mouse Prdm9 encodes a protein with a KRAB motif, a histone methyltransferase domain and several zinc fingers. The difference of a single zinc finger distinguishes Prdm9 alleles that cause hybrid sterility from those that do not. We find that concerted evolution and positive selection have rapidly altered the number and sequence of Prdm9 zinc fingers across 13 rodent genomes. The patterns of positive selection in Prdm9 zinc fingers imply that rapid evolution has acted on the interface between the Prdm9 protein and the DNA sequences to which it binds. Similar patterns are apparent for Prdm9 zinc fingers for diverse metazoans, including primates. Indeed, allelic variation at the DNA-binding positions of human PRDM9 zinc fingers shows significant association with decreased risk of infertility. Prdm9 thus plays a role in determining male sterility both between species (mouse) and within species (human). The recurrent episodes of positive selection acting on Prdm9 suggest that the DNA sequences to which it binds must also be evolving rapidly. Our findings do not identify the nature of the underlying DNA sequences, but argue against the proposed role of Prdm9 as an essential transcription factor in mouse meiosis. We propose a hypothetical model in which incompatibilities between Prdm9-binding specificity and satellite DNAs provide the molecular basis for Prdm9-mediated hybrid sterility. We suggest that Prdm9 should be investigated as a candidate gene in other instances of hybrid sterility in metazoans.
Speciation, the process by which one species splits into two, involves reproductive barriers between previously interbreeding populations. The question of how speciation occurs has rightly occupied the attention of biologists since before Darwin's “On the Origin of Species.” Studies of recently diverged species have revealed the presence of hybrid sterility genes (colloquially referred to as “speciation genes”), alleles of which are associated with sterility of interspecies hybrids. Mouse Prdm9 is the only known such gene in vertebrate animals. Here we report that the Prdm9 protein has evolved extremely rapidly in its DNA-binding domain, comprising an array of “zinc fingers.” This suggests that hybrid sterility may arise from a mismatch between the DNA-binding specificity of Prdm9 and rapidly evolving DNA. We propose that Prdm9 binds to satellite-DNA repeats evolving rapidly within and between different species. Prdm9 evolution is unusual because other hybrid sterility genes appear only to evolve rapidly in isolated bursts, whereas Prdm9 has evolved rapidly over 700 million years, in many rodent species, diverse primates and other metazoans. This leads to the tantalizing possibility that Prdm9 may have served as a “speciation gene” on other occasions in metazoan evolution, a possibility that will now need to be investigated.
Affiliation(s)
- Peter L. Oliver
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- Leo Goodstadt
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- Joshua J. Bayes
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, California, United States of America
- Zoë Birtle
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- Kevin C. Roach
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Nitin Phadnis
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Scott A. Beatson
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, Australia
- Gerton Lunter
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- Harmit S. Malik
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Chris P. Ponting
- Medical Research Council Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
|
25
|
Pertile MD, Graham AN, Choo KHA, Kalitsis P. Rapid evolution of mouse Y centromere repeat DNA belies recent sequence stability. Genome Res 2009; 19:2202-13. [PMID: 19737860 DOI: 10.1101/gr.092080.109] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Indexed: 11/24/2022]
Abstract
The Y centromere sequence of house mouse, Mus musculus, remains unknown despite our otherwise significant knowledge of the genome sequence of this important mammalian model organism. Here, we report the complete molecular characterization of the C57BL/6J chromosome Y centromere, which comprises a highly diverged minor satellite-like sequence (designated Ymin) with higher-order repeat (HOR) sequence organization previously undescribed at mouse centromeres. The Ymin array is approximately 90 kb in length and resides within a single BAC clone that provides sequence information spanning an endogenous animal centromere for the first time. By exploiting direct patrilineal inheritance of the Y chromosome, we demonstrate stability of the Y centromere DNA structure spanning at least 175 inbred generations to beyond the time of domestication of the East Asian M.m. molossinus "fancy" mouse through which the Y chromosome was first introduced into the classical inbred laboratory mouse strains. Despite this stability, at least three unequal genetic exchange events have altered Ymin HOR unit length and sequence structure since divergence of the ancestral Mus musculus subspecies around 900,000 yr ago, with major turnover of the HOR arrays driving rapid divergence of sequence and higher-order structure at the mouse Y centromere. A comparative sequence analysis between the human and chimpanzee centromeres indicates a similar rapid divergence of the primate Y centromere. Our data point to a unique DNA sequence and organizational architecture for the mouse Y centromere that has evolved independently of all other mouse centromeres.
Affiliation(s)
- Mark D Pertile
- Murdoch Childrens Research Institute, Victoria, Australia
|
26
|
|
27
|
Rosandić M, Glunčić M, Paar V, Basar I. The role of alphoid higher order repeats (HORs) in the centromere folding. J Theor Biol 2008; 254:555-60. [DOI: 10.1016/j.jtbi.2008.06.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 12/02/2007] [Revised: 05/13/2008] [Accepted: 06/06/2008] [Indexed: 10/21/2022]
|
28
|
Abstract
Centromeres are the elements of chromosomes that assemble the proteinaceous kinetochore, maintain sister chromatid cohesion, regulate chromosome attachment to the spindle, and direct chromosome movement during cell division. Although the functions of centromeres and the proteins that contribute to their complex structure and function are conserved in eukaryotes, centromeric DNA diverges rapidly. Human centromeres are particularly complicated. Here, we review studies on the organization of homogeneous arrays of chromosome-specific alpha-satellite repeats and evolutionary links among eukaryotic centromeric sequences. We also discuss epigenetic mechanisms of centromere identity that confer structural and functional features of the centromere through DNA-protein interactions and post-translational modifications, producing centromere-specific chromatin signatures. The assembly and organization of human centromeres, the contributions of satellite DNA to centromere identity and diversity, and the mechanism whereby centromeres are distinguished from the rest of the genome reflect ongoing puzzles in chromosome biology.
Affiliation(s)
- Mary G Schueler
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
|
29
|
Ma J, Bennetzen JL. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice. Proc Natl Acad Sci U S A 2006; 103:383-8. [PMID: 16381819 PMCID: PMC1326179 DOI: 10.1073/pnas.0509810102] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.1] [Indexed: 11/18/2022] Open
Abstract
Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region (approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.
Affiliation(s)
- Jianxin Ma
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
|
30
|
Ma J, Jackson SA. Retrotransposon accumulation and satellite amplification mediated by segmental duplication facilitate centromere expansion in rice. Genome Res 2005; 16:251-9. [PMID: 16354755 PMCID: PMC1361721 DOI: 10.1101/gr.4583106] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Indexed: 12/28/2022]
Abstract
The abundance of repetitive DNA varies greatly across centromeres within an individual or between different organisms. To shed light on the molecular mechanisms of centromere repeat proliferation, we performed structural analysis of LTR-retrotransposons, mostly centromere retrotransposons of rice (CRRs), and phylogenetic analysis of CentO satellite repeats harbored in the core region of the rice chromosome 4 centromere (CEN4). The data obtained demonstrate that the CRRs in the centromeric region we investigated have been enriched more significantly by recent rounds of segmental duplication than by original integration of active elements, suggesting that segmental duplication is an important process for CRR accumulation in the centromeric region. Our results also indicate that segmental duplication of large arrays of satellite repeats is primarily responsible for the amplification of satellite repeats, contributing to rapid reshuffling of CentO satellites. Intercentromere satellite homogenization was revealed by genome-wide comparison of CentO satellite monomers. However, a 10-bp duplication present in nearly half of the CEN4 monomers was found to be completely absent in rice centromere 8 (CEN8), suggesting that CEN4 and CEN8 may represent two different stages in the evolution of rice centromeres. These observations, obtained from the only complex eukaryotic centromeres to have been completely sequenced thus far, depict the evolutionary dynamics of rice centromeres with respect to the nature, timing, and process of centromeric repeat amplification.
Affiliation(s)
- Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
|
31
|
Abstract
Alpha-satellite is a family of tandemly repeated sequences found at all normal human centromeres. In addition to its significance for understanding centromere function, alpha-satellite is also a model for concerted evolution, as alpha-satellite repeats are more similar within a species than between species. There are two types of alpha-satellite in the human genome; while both are made up of approximately 171-bp monomers, they can be distinguished by whether monomers are arranged in extremely homogeneous higher-order, multimeric repeat units or exist as more divergent monomeric alpha-satellite that lacks any multimeric periodicity. In this study, as a model to examine the genomic and evolutionary relationships between these two types, we have focused on the chromosome 17 centromeric region that has reached both higher-order and monomeric alpha-satellite in the human genome assembly. Monomeric and higher-order alpha-satellites on chromosome 17 are phylogenetically distinct, consistent with a model in which higher-order evolved independently of monomeric alpha-satellite. Comparative analysis between human chromosome 17 and the orthologous chimpanzee chromosome indicates that monomeric alpha-satellite is evolving at approximately the same rate as the adjacent non-alpha-satellite DNA. However, higher-order alpha-satellite is less conserved, suggesting different evolutionary rates for the two types of alpha-satellite.
Affiliation(s)
- M Katharine Rudd
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
|
32
|
Abstract
Centromeres represent the final frontier of eukaryotic genomes. Although they are defining features of chromosomes--the points at which spindle microtubules attach--the fundamental features that distinguish them from other parts of the chromosome remain mysterious. The function of centromeres is conserved throughout eukaryotic biology, but their DNA sequences are not. Rather, accumulating evidence favors chromatin-based centromeric identification. To understand how centromeric identity is maintained, researchers have studied DNA-protein interactions at native centromeres and ectopic "neocentromeres". Other studies have taken a comparative approach focusing on centromere-specific proteins, of which mammalian CENP-A and CENP-C are the prototypes. Elucidating the assembly and structure of chromatin at centromeres remain key challenges.
Affiliation(s)
- Steven Henikoff
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, PO Box 19024, Seattle, WA 98109-1024, USA.
|
33
|
Martens JHA, O'Sullivan RJ, Braunschweig U, Opravil S, Radolf M, Steinlein P, Jenuwein T. The profile of repeat-associated histone lysine methylation states in the mouse epigenome. EMBO J 2005; 24:800-12. [PMID: 15678104 PMCID: PMC549616 DOI: 10.1038/sj.emboj.7600545] [Citation(s) in RCA: 512] [Impact Index Per Article: 26.9] [Received: 11/03/2004] [Accepted: 12/13/2004] [Indexed: 12/12/2022] Open
Abstract
Histone lysine methylation has been shown to index silenced chromatin regions at, for example, pericentric heterochromatin or of the inactive X chromosome. Here, we examined the distribution of repressive histone lysine methylation states over the entire family of DNA repeats in the mouse genome. Using chromatin immunoprecipitation in a cluster analysis representing repetitive elements, our data demonstrate the selective enrichment of distinct H3-K9, H3-K27 and H4-K20 methylation marks across tandem repeats (e.g. major and minor satellites), DNA transposons, retrotransposons, long interspersed nucleotide elements and short interspersed nucleotide elements. Tandem repeats, but not the other repetitive elements, give rise to double-stranded (ds) RNAs that are further elevated in embryonic stem (ES) cells lacking the H3-K9-specific Suv39h histone methyltransferases. Importantly, although H3-K9 tri- and H4-K20 trimethylation appear stable at the satellite repeats, many of the other repeat-associated repressive marks vary in chromatin of differentiated ES cells or of embryonic trophoblasts and fibroblasts. Our data define a profile of repressive histone lysine methylation states for the repetitive complement of four distinct mouse epigenomes and suggest tandem repeats and dsRNA as primary triggers for more stable chromatin imprints.
Affiliation(s)
- Joost H A Martens
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Roderick J O'Sullivan
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Ulrich Braunschweig
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Susanne Opravil
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Martin Radolf
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Peter Steinlein
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Thomas Jenuwein
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Vienna, Austria
- Research Institute of Molecular Pathology (IMP), The Vienna Biocenter, Dr Bohrgasse 7, 1030 Vienna, Austria. Tel.: +43 1 797 30 474; Fax: +43 1 798 7153; E-mail:
34
Basu J, Stromberg G, Compitello G, Willard HF, Van Bokkelen G. Rapid creation of BAC-based human artificial chromosome vectors by transposition with synthetic alpha-satellite arrays. Nucleic Acids Res 2005; 33:587-96. [PMID: 15673719 PMCID: PMC548352 DOI: 10.1093/nar/gki207] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Indexed: 01/14/2023]
Abstract
Efficient construction of BAC-based human artificial chromosomes (HACs) requires optimization of each key functional unit as well as development of techniques for the rapid and reliable manipulation of high-molecular weight BAC vectors. Here, we have created synthetic chromosome 17-derived alpha-satellite arrays, based on the 16-monomer repeat length typical of natural D17Z1 arrays, in which the consensus CENP-B box elements are either completely absent (0/16 monomers) or increased in density (16/16 monomers) compared to D17Z1 alpha-satellite (5/16 monomers). Using these vectors, we show that the presence of CENP-B box elements is a requirement for efficient de novo centromere formation and that increasing the density of CENP-B box elements may enhance the efficiency of de novo centromere formation. Furthermore, we have developed a novel, high-throughput methodology that permits the rapid conversion of any genomic BAC target into a HAC vector by transposon-mediated modification with synthetic alpha-satellite arrays and other key functional units. Taken together, these approaches offer the potential to significantly advance the utility of BAC-based HACs for functional annotation of the genome and for applications in gene transfer.
Affiliation(s)
- Joydeep Basu
- Institute for Genome Sciences and Policy, Duke University CIEMAS Room 2379, 101 Science Drive, Durham, NC 27708, USA.