1. Zhang Y, Chu J, Cheng H, Li H. De novo reconstruction of satellite repeat units from sequence data. Genome Res 2023; 33:gr.278005.123. [PMID: 37918962 PMCID: PMC10760446 DOI: 10.1101/gr.278005.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/19/2023] [Accepted: 10/18/2023] [Indexed: 11/04/2023]
Abstract
Satellite DNA are long tandemly repeating sequences in a genome and may be organized as high-order repeats (HORs). They are enriched in centromeres and are challenging to assemble. Existing algorithms for identifying satellite repeats either require the complete assembly of satellites or only work for simple repeat structures without HORs. Here we describe Satellite Repeat Finder (SRF), a new algorithm for reconstructing satellite repeat units and HORs from accurate reads or assemblies without prior knowledge on repeat structures. Applying SRF to real sequence data, we show that SRF could reconstruct known satellites in human and well-studied model organisms. We also find satellite repeats are pervasive in various other species, accounting for up to 12% of their genome contents but are often underrepresented in assemblies. With the rapid progress in genome sequencing, SRF will help the annotation of new genomes and the study of satellite DNA evolution even if such repeats are not fully assembled.
Affiliation(s)
- Yujie Zhang
  - Harvard School of Public Health, Boston, Massachusetts 02115, USA
- Justin Chu
  - Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
  - Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
- Haoyu Cheng
  - Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
  - Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
- Heng Li
  - Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
  - Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
2. Bzikadze AV, Pevzner PA. UniAligner: a parameter-free framework for fast sequence alignment. Nat Methods 2023; 20:1346-1354. [PMID: 37580559 DOI: 10.1038/s41592-023-01970-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2022] [Accepted: 07/05/2023] [Indexed: 08/16/2023]
Abstract
Even though the recent advances in 'complete genomics' revealed the previously inaccessible genomic regions, analysis of variations in centromeres and other extra-long tandem repeats (ETRs) faces an algorithmic challenge since there are currently no tools for accurate sequence comparison of ETRs. Counterintuitively, the classical alignment approaches, such as the Smith-Waterman algorithm, fail to construct biologically adequate alignments of ETRs. We present UniAligner, the parameter-free sequence alignment algorithm with sequence-dependent alignment scoring that automatically changes for any pair of compared sequences. UniAligner prioritizes matches of rare substrings that are more likely to be relevant to the evolutionary relationship between two sequences. We apply UniAligner to estimate the mutation rates in human centromeres, and quantify the extremely high rate of large duplications and deletions in centromeres. This high rate suggests that centromeres may represent some of the most rapidly evolving regions of the human genome with respect to their structural organization.
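The idea of favoring rare substrings in tandem repeats can be illustrated with a toy sketch. This is not UniAligner's algorithm or scoring scheme; the function names, the k-mer length, and the rarity threshold below are assumptions chosen only for illustration. The sketch finds k-mers that occur in both sequences but are rare in each, which in a tandem array tend to mark the few positions that are not part of the repeated unit.

```python
from collections import Counter
from typing import List


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every k-mer (substring of length k) in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def rare_shared_kmers(a: str, b: str, k: int = 4, max_count: int = 1) -> List[str]:
    """Return k-mers found in both sequences that occur at most
    max_count times in each (max_count is a hypothetical threshold)."""
    counts_a, counts_b = kmer_counts(a, k), kmer_counts(b, k)
    return sorted(
        kmer for kmer in counts_a
        if kmer in counts_b
        and counts_a[kmer] <= max_count
        and counts_b[kmer] <= max_count
    )


# Toy arrays of the unit "ACGT", each carrying the same short insertion
# ("TTG") at a different position within the array.
seq_a = "ACGT" * 3 + "TTG" + "ACGT" * 2
seq_b = "ACGT" * 2 + "TTG" + "ACGT" * 3

anchors = rare_shared_kmers(seq_a, seq_b)
print(anchors)
```

In this toy input the abundant repeat-unit k-mers are filtered out, and only the k-mers overlapping the insertion remain as candidate anchors for matching the two arrays.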
Affiliation(s)
- Andrey V Bzikadze
  - Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, La Jolla, CA, USA
- Pavel A Pevzner
  - Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
3. Ivanova NG, Kartavtseva IV, Stefanova VN, Ostromyshenskii DI, Podgornaya OI. Tandem Repeat Diversity in Two Closely Related Hamster Species—The Chinese Hamster (Cricetulus griseus) and Striped Hamster (Cricetulus barabensis). Biomedicines 2022; 10:biomedicines10040925. [PMID: 35453675 PMCID: PMC9025346 DOI: 10.3390/biomedicines10040925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/02/2022] [Revised: 04/12/2022] [Accepted: 04/13/2022] [Indexed: 11/16/2022] [Open access]
Abstract
The Chinese hamster (Cricetulus griseus) and striped hamster (Cricetulus barabensis) are very closely related species with similar karyotypes. The karyotypes differ from each other by one Robertsonian rearrangement and X-chromosome morphology. The level of the tandem repeat (TR) sequences’ evolutional variability is high. The aim of the current work was to trace the TR distribution on the chromosomes of two very closely related species. The striped hamster genome has not yet been sequenced. We classified the Chinese hamster TR in the assemblies available and then compared the mode of the TR distribution in closely related species. Chinese and striped hamsters are separate species due to the relative species specificity of Chinese hamster TR and prominent differences in the TR distribution in both species. The TR variation observed within homologous striped hamster chromosomes is caused by a lack of inbreeding in natural populations. The set of TR tested could be used to examine the CHO lines’ instability that has been observed in heterochromatic regions.
Affiliation(s)
- Nadezhda G. Ivanova
  - Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia
- Irina V. Kartavtseva
  - Laboratory of Evolutionary Zoology, Federal Scientific Center of the East Asia Terrestrial Biodiversity, Vladivostok 690022, Russia
- Vera N. Stefanova
  - Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia
- Dmitrii I. Ostromyshenskii
  - Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia
- Olga I. Podgornaya
  - Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia
  - Department of Cytology and Histology, Faculty of Biology, St. Petersburg State University, Saint Petersburg 199034, Russia
4. Poignet M, Johnson Pokorná M, Altmanová M, Majtánová Z, Dedukh D, Albrecht T, Reif J, Osiejuk TS, Reifová R. Comparison of Karyotypes in Two Hybridizing Passerine Species: Conserved Chromosomal Structure but Divergence in Centromeric Repeats. Front Genet 2021; 12:768987. [PMID: 34938317 PMCID: PMC8687609 DOI: 10.3389/fgene.2021.768987] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 09/01/2021] [Accepted: 11/10/2021] [Indexed: 11/23/2022] [Open access]
Abstract
Changes in chromosomal structure involving chromosomal rearrangements or copy number variation of specific sequences can play an important role in speciation. Here, we explored the chromosomal structure of two hybridizing passerine species; the common nightingale (Luscinia megarhynchos) and the thrush nightingale (Luscinia luscinia), using conventional cytogenetic approaches, immunostaining of meiotic chromosomes, fluorescence in situ hybridization as well as comparative genomic hybridization (CGH). We found that the two nightingale species show conserved karyotypes with the same diploid chromosome number of 2n = 84. In addition to standard chromosomes, both species possessed a small germline restricted chromosome of similar size as a microchromosome. Just a few subtle changes in chromosome morphology were observed between the species, suggesting that only a limited number of chromosomal rearrangements occurred after the species divergence. The interspecific CGH experiment suggested that the two nightingale species might have diverged in centromeric repetitive sequences in most macro- and microchromosomes. In addition, some chromosomes showed changes in copy number of centromeric repeats between the species. The observation of very similar karyotypes in the two nightingale species is consistent with a generally slow rate of karyotype evolution in birds. The divergence of centromeric sequences between the two species could theoretically cause meiotic drive or reduced fertility in interspecific hybrids. Nevertheless, further studies are needed to evaluate the potential role of chromosomal structural variations in nightingale speciation.
Affiliation(s)
- Manon Poignet
  - Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
- Martina Johnson Pokorná
  - Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
  - Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
  - Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Marie Altmanová
  - Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
  - Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Zuzana Majtánová
  - Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Dmitry Dedukh
  - Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov, Czech Republic
- Tomáš Albrecht
  - Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
  - Institute of Vertebrate Biology, Czech Academy of Sciences, Brno, Czech Republic
- Jiří Reif
  - Institute for Environmental Studies, Faculty of Science, Charles University, Prague, Czech Republic
  - Department of Zoology and Laboratory of Ornithology, Faculty of Science, Palacký University, Olomouc, Czech Republic
- Tomasz S. Osiejuk
  - Department of Behavioural Ecology, Institute of Environmental Biology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland
- Radka Reifová
  - Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
5. Construction of stable mouse artificial chromosome from native mouse chromosome 10 for generation of transchromosomic mice. Sci Rep 2021; 11:20050. [PMID: 34625612 PMCID: PMC8501010 DOI: 10.1038/s41598-021-99535-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/01/2021] [Accepted: 09/22/2021] [Indexed: 12/16/2022] [Open access]
Abstract
Mammalian artificial chromosomes derived from native chromosomes have been applied to biomedical research and development by generating cell sources and transchromosomic (Tc) animals. Human artificial chromosome (HAC) is a precedent chromosomal vector which achieved generation of valuable humanized animal models for fully human antibody production and human pharmacokinetics. While humanized Tc animals created by HAC vector have attained significant contributions, there was a potential issue to be addressed regarding stability in mouse tissues, especially highly proliferating hematopoietic cells. Mouse artificial chromosome (MAC) vectors derived from native mouse chromosome 11 demonstrated improved stability, and they were utilized for humanized Tc mouse production as a standard vector. In mouse, however, stability of MAC vector derived from native mouse chromosome other than mouse chromosome 11 remains to be evaluated. To clarify the potential of mouse centromeres in the additional chromosomes, we constructed a new MAC vector from native mouse chromosome 10 to evaluate the stability in Tc mice. The new MAC vector was transmitted through germline and stably maintained in the mouse tissues without any apparent abnormalities. Through this study, the potential of additional mouse centromere was demonstrated for Tc mouse production, and new MAC is expected to be used for various applications.
6. Flynn JM, Brown EJ, Clark AG. Copy number evolution in simple and complex tandem repeats across the C57BL/6 and C57BL/10 inbred mouse lines. G3 Genes|Genomes|Genetics 2021; 11:6287064. [PMID: 34849804 PMCID: PMC8496272 DOI: 10.1093/g3journal/jkab184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 02/27/2021] [Accepted: 05/25/2021] [Indexed: 12/01/2022]
Abstract
Simple sequence tandem repeats are among the most rapidly evolving compartments of the genome. Some repeat expansions are associated with mammalian disease or meiotic segregation distortion, yet the rates of copy number change across generations are not well known. Here, we use 14 distinct sublineages of the C57BL/6 and C57BL/10 inbred mouse strains, which have been evolving independently over about 300 generations, to estimate the rates of copy number changes in genome-wide tandem repeats. Rates of change varied across repeats and across lines. Notably, CAG, whose expansions in coding regions are associated with many neurological and genetic disorders, was highly stable in copy number, likely indicating stabilizing selection. Rates of change were positively correlated with copy number, but the direction and magnitude of changes varied across lines. Some mouse lines experienced consistent losses or gains across most simple repeats, but this did not correlate with copy number changes in complex repeats. Rates of copy number change were similar between simple repeats and the more abundant complex repeats after normalization by copy number. Finally, the Y-specific centromeric repeat had a fourfold higher rate of change than the homologous centromeric repeat on other chromosomes. Structural differences in satellite complexity, or restriction to the Y chromosome and elevated mutation rates of the male germline, may explain the higher rate of change. Overall, our work underscores the mutational fluidity of long tandem arrays of repeats, and the correlations and constraints between genome-wide tandem repeats, which suggest that turnover is not a completely neutral process.
Affiliation(s)
- Jullien M Flynn
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Emily J Brown
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Andrew G Clark
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
7. Nath S, Shaw DE, White MA. Improved contiguity of the threespine stickleback genome using long-read sequencing. G3 Genes|Genomes|Genetics 2021; 11:6114463. [PMID: 33598708 PMCID: PMC8022941 DOI: 10.1093/g3journal/jkab007] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 11/04/2020] [Accepted: 12/27/2020] [Indexed: 12/28/2022]
Abstract
While the cost and time for assembling a genome has drastically decreased, it still remains a challenge to assemble a highly contiguous genome. These challenges are rapidly being overcome by the integration of long-read sequencing technologies. Here, we use long-read sequencing to improve the contiguity of the threespine stickleback fish (Gasterosteus aculeatus) genome, a prominent genetic model species. Using Pacific Biosciences sequencing, we assembled a highly contiguous genome of a freshwater fish from Paxton Lake. Using contigs from this genome, we were able to fill over 76.7% of the gaps in the existing reference genome assembly, improving contiguity over fivefold. Our gap filling approach was highly accurate, validated by 10X Genomics long-distance linked-reads. In addition to closing a majority of gaps, we were able to assemble segments of telomeres and centromeres throughout the genome. This highlights the power of using long sequencing reads to assemble highly repetitive and difficult to assemble regions of genomes. This latest genome build has been released through a newly designed community genome browser that aims to consolidate the growing number of genomics datasets available for the threespine stickleback fish.
Affiliation(s)
- Shivangi Nath
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
- Daniel E Shaw
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
- Michael A White
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
8. Peichel CL, McCann SR, Ross JA, Naftaly AFS, Urton JR, Cech JN, Grimwood J, Schmutz J, Myers RM, Kingsley DM, White MA. Assembly of the threespine stickleback Y chromosome reveals convergent signatures of sex chromosome evolution. Genome Biol 2020; 21:177. [PMID: 32684159 PMCID: PMC7368989 DOI: 10.1186/s13059-020-02097-x] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Received: 01/20/2020] [Accepted: 07/08/2020] [Indexed: 01/15/2023] [Open access]
Abstract
BACKGROUND: Heteromorphic sex chromosomes have evolved repeatedly across diverse species. Suppression of recombination between X and Y chromosomes leads to degeneration of the Y chromosome. The progression of degeneration is not well understood, as complete sequence assemblies of heteromorphic Y chromosomes have only been generated across a handful of taxa with highly degenerate sex chromosomes. Here, we describe the assembly of the threespine stickleback (Gasterosteus aculeatus) Y chromosome, which is less than 26 million years old and at an intermediate stage of degeneration. Our previous work identified that the non-recombining region between the X and the Y spans approximately 17.5 Mb on the X chromosome.
RESULTS: We combine long-read sequencing with a Hi-C-based proximity guided assembly to generate a 15.87 Mb assembly of the Y chromosome. Our assembly is concordant with cytogenetic maps and Sanger sequences of over 90 Y chromosome BAC clones. We find three evolutionary strata on the Y chromosome, consistent with the three inversions identified by our previous cytogenetic analyses. The threespine stickleback Y shows convergence with more degenerate sex chromosomes in the retention of haploinsufficient genes and the accumulation of genes with testis-biased expression, many of which are recent duplicates. However, we find no evidence for large amplicons identified in other sex chromosome systems. We also report an excellent candidate for the master sex-determination gene: a translocated copy of Amh (Amhy).
CONCLUSIONS: Together, our work shows that the evolutionary forces shaping sex chromosomes can cause relatively rapid changes in the overall genetic architecture of Y chromosomes.
Affiliation(s)
- Catherine L. Peichel
  - Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
  - Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland
- Shaugnessy R. McCann
  - Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
- Joseph A. Ross
  - Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
  - Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195 USA
- James R. Urton
  - Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
  - Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195 USA
- Jennifer N. Cech
  - Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
  - Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195 USA
- Jane Grimwood
  - HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806 USA
- Jeremy Schmutz
  - HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806 USA
- Richard M. Myers
  - HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806 USA
- David M. Kingsley
  - Department of Developmental Biology and Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305 USA
- Michael A. White
  - Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
  - Department of Genetics, University of Georgia, Athens, GA 30602 USA
9. Ivanova NG, Stefanova VN, Ostromyshenskii DI, Podgornaya OI. Tandem Repeats in the Genome of Sus scrofa, Their Localization on Chromosomes and in the Spermatogenic Cell Nuclei. Russ J Genet 2019. [DOI: 10.1134/s102279541907007x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/23/2022]
10. Palacios-Gimenez OM, Bardella VB, Lemos B, Cabral-de-Mello DC. Satellite DNAs are conserved and differentially transcribed among Gryllus cricket species. DNA Res 2018; 25:137-147. [PMID: 29096008 PMCID: PMC5909420 DOI: 10.1093/dnares/dsx044] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 06/19/2017] [Accepted: 10/19/2017] [Indexed: 11/21/2022] [Open access]
Abstract
Satellite DNA (satDNA) is an abundant class of non-coding repetitive DNA that is preferentially found as tandemly repeated arrays in gene-poor heterochromatin but is also present in gene-rich euchromatin. Here, we used DNA- and RNA-seq from Gryllus assimilis to address the content and transcriptional patterns of satDNAs. We also mapped RNA-seq libraries for other Gryllus species against the satDNAs found in G. assimilis and G. bimaculatus genomes to investigate their evolutionary conservation and transcriptional profiles in Gryllus. Through DNA-seq read clustering analysis using RepeatExplorer, dotplots analysis and fluorescence in situ hybridization mapping, we found that ∼4% of the G. assimilis genome is represented by 11 well-defined A + T-rich satDNA families. These are mainly located in heterochromatic areas, with some repeats able to form high-order repeat structures. By in silico transcriptional analysis we identified satDNAs that are conserved in Gryllus but differentially transcribed. The data regarding satDNA presence in G. assimilis genome were discussed in an evolutionary context, with transcriptional data enabling comparisons between sexes and across tissues when possible. We discuss hypotheses for the conservation and transcription of satDNAs in Gryllus, which might result from their role in sexual differentiation at the chromatin level, heterochromatin formation and centromeric function.
Affiliation(s)
- Octavio Manuel Palacios-Gimenez
  - Departamento de Biologia, Instituto de Biociências/IB, UNESP-Univ Estadual Paulista, Rio Claro, São Paulo, Brazil
  - Program in Molecular and Integrative Physiological Sciences, Department of Environmental Health, Harvard University T. H. Chan School of Public Health, Boston, MA 02115, USA
- Vanessa Bellini Bardella
  - Departamento de Biologia, Instituto de Biociências/IB, UNESP-Univ Estadual Paulista, Rio Claro, São Paulo, Brazil
- Bernardo Lemos
  - Program in Molecular and Integrative Physiological Sciences, Department of Environmental Health, Harvard University T. H. Chan School of Public Health, Boston, MA 02115, USA
11. Vlahovic I, Gluncic M, Rosandic M, Ugarkovic Ð, Paar V. Regular Higher Order Repeat Structures in Beetle Tribolium castaneum Genome. Genome Biol Evol 2018; 9:2668-2680. [PMID: 27492235 PMCID: PMC5737470 DOI: 10.1093/gbe/evw174] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Accepted: 07/21/2016] [Indexed: 02/07/2023] [Open access]
Abstract
Higher order repeats (HORs) containing tandems of primary and secondary repeat units (head-to-tail “tandem within tandem pattern”), referred to as regular HORs, are typical for primate alpha satellite DNAs and most pronounced in human genome. Regular HORs are known to be a result of recent evolutionary processes. In non-primate genomes mostly so called complex HORs have been found, without head to tail tandem of primary repeat units. In beetle Tribolium castaneum, considered as a model case for genome studies, large tandem repeats have been identified, but no HORs have been reported. Here, using our novel robust repeat finding algorithm Global Repeat Map, we discover two regular and six complex HORs in T. castaneum. In organizational pattern, the integrity and homogeneity of regular HORs in T. castaneum resemble human regular HORs (with T. castaneum monomers different from human alpha satellite monomers), involving a wider range of monomer lengths than in human HORs. Similar regular higher order repeat structures have previously not been found in insects. Some of these novel HORs in T. castaneum appear as most regular among known HORs in non-primate genomes, although with substantial riddling. This is intriguing, in particular from the point of view of role of non-coding repeats in modulation of gene expression.
Affiliation(s)
- Ines Vlahovic
  - Faculty of Science, University of Zagreb, Zagreb, Croatia
- Matko Gluncic
  - Faculty of Science, University of Zagreb, Zagreb, Croatia
- Vladimir Paar
  - Faculty of Science, University of Zagreb, Zagreb, Croatia
  - Croatian Academy of Sciences and Arts, Zagreb, Croatia
12. Using human artificial chromosomes to study centromere assembly and function. Chromosoma 2017; 126:559-575. [DOI: 10.1007/s00412-017-0633-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Received: 04/20/2017] [Revised: 06/12/2017] [Accepted: 06/13/2017] [Indexed: 12/13/2022]
13. Park L. Evidence of Recent Intricate Adaptation in Human Populations. PLoS One 2016; 11:e0165870. [PMID: 27992444 PMCID: PMC5167553 DOI: 10.1371/journal.pone.0165870] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 06/10/2016] [Accepted: 10/19/2016] [Indexed: 11/18/2022] [Open access]
Abstract
Recent human adaptations have shaped population differentiation in genomic regions containing putative functional variants, mostly located in predicted regulatory elements. However, their actual functionalities and the underlying mechanism of recent adaptation remain poorly understood. In the current study, regions of genes and repeats were investigated for functionality depending on the degree of population differentiation, FST or ΔDAF (a difference in derived allele frequency). The high FST in the 5´ or 3´ untranslated regions (UTRs), in particular, confirmed that population differences arose mainly from differences in regulation. Expression quantitative trait loci (eQTL) analyses using lymphoblastoid cell lines indicated that the majority of the highly population-specific regions represented cis- and/or trans-eQTL. However, groups having the highest ΔDAFs did not necessarily have higher proportions of eQTL variants; in these groups, the patterns were complex, indicating recent intricate adaptations. The results indicated that East Asian (EAS) and European populations (EUR) experienced mutual selection pressures. The mean derived allele frequency of the high ΔDAF groups suggested that EAS and EUR underwent strong adaptation; however, the African population in Africa (AFR) experienced slight, yet broad, adaptation. The DAF distributions of variants in the gene regions showed clear selective pressure in each population, which implies the existence of more recent regulatory adaptations in cells other than lymphoblastoid cell lines. In-depth analysis of population-differentiated regions indicated that the coding gene, RNF135, represented a trans-regulation hotspot via cis-regulation by the population-specific variants in the region of selective sweep. Together, the results provide strong evidence of actual intricate adaptation of human populations via regulatory manipulation.
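The ΔDAF measure mentioned above is defined as a difference in derived allele frequency between populations. A minimal sketch of that arithmetic follows; the allele counts and helper names are made up for illustration and are not taken from the study.

```python
from typing import Tuple


def derived_allele_frequency(derived: int, total: int) -> float:
    """Fraction of sampled chromosomes carrying the derived allele."""
    if total <= 0:
        raise ValueError("total number of sampled chromosomes must be positive")
    if not 0 <= derived <= total:
        raise ValueError("derived count must lie between 0 and total")
    return derived / total


def delta_daf(pop_a: Tuple[int, int], pop_b: Tuple[int, int]) -> float:
    """Difference in derived allele frequency (population A minus population B).

    Each argument is a (derived_count, total_count) pair.
    """
    return derived_allele_frequency(*pop_a) - derived_allele_frequency(*pop_b)


# Hypothetical counts at one variant site: (derived alleles, chromosomes sampled).
eas_counts = (180, 200)
afr_counts = (20, 200)

d = delta_daf(eas_counts, afr_counts)
print(f"delta DAF = {d:.2f}")
```

A value near +1 or -1 indicates that the derived allele is close to fixed in one population and nearly absent in the other, while a value near 0 indicates similar frequencies.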
Affiliation(s)
- Leeyoung Park
  - Natural Science Research Institute, Yonsei University, Seoul, Korea
14. Cacheux L, Ponger L, Gerbault-Seureau M, Richard FA, Escudé C. Diversity and distribution of alpha satellite DNA in the genome of an Old World monkey: Cercopithecus solatus. BMC Genomics 2016; 17:916. [PMID: 27842493 PMCID: PMC5109768 DOI: 10.1186/s12864-016-3246-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 04/14/2016] [Accepted: 11/02/2016] [Indexed: 11/10/2022] [Open access]
Abstract
Background: Alpha satellite is the major repeated DNA element of primate centromeres. Evolution of these tandemly repeated sequences has led to the existence of numerous families of monomers exhibiting specific organizational patterns. The limited amount of information available in non-human primates is a restriction to the understanding of the evolutionary dynamics of alpha satellite DNA.
Results: We carried out the targeted high-throughput sequencing of alpha satellite monomers and dimers from the Cercopithecus solatus genome, an Old World monkey from the Cercopithecini tribe. Computational approaches were used to infer the existence of sequence families and to study how these families are organized with respect to each other. While previous studies had suggested that alpha satellites in Old World monkeys were poorly diversified, our analysis provides evidence for the existence of at least four distinct families of sequences within the studied species and of higher order organizational patterns. Fluorescence in situ hybridization using oligonucleotide probes that are able to target each family in a specific way showed that the different families had distinct distributions on chromosomes and were not homogeneously distributed between chromosomes.
Conclusions: Our new approach provides an unprecedented and comprehensive view of the diversity and organization of alpha satellites in a species outside the hominoid group. We consider these data with respect to previously known alpha satellite families and to potential mechanisms for satellite DNA evolution. Applying this approach to other species will open new perspectives regarding the integration of satellite DNA into comparative genomic and cytogenetic studies.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Loïc Ponger
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Florence Anne Richard
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France; Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.
|
15
|
Large deletion on the Y-chromosome long arm (Yq) of C57BL/6JBomTac inbred mice. Mamm Genome 2016; 28:31-37. [DOI: 10.1007/s00335-016-9669-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Received: 05/03/2016] [Accepted: 10/19/2016] [Indexed: 10/20/2022]
|
16
|
Cech JN, Peichel CL. Centromere inactivation on a neo-Y fusion chromosome in threespine stickleback fish. Chromosome Res 2016; 24:437-450. [PMID: 27553478 DOI: 10.1007/s10577-016-9535-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Received: 06/21/2016] [Revised: 08/14/2016] [Accepted: 08/16/2016] [Indexed: 02/07/2023]
Abstract
Having one and only one centromere per chromosome is essential for proper chromosome segregation during both mitosis and meiosis. Chromosomes containing two centromeres are known as dicentric and often mis-segregate during cell division, resulting in aneuploidy or chromosome breakage. Dicentric chromosomes can be stabilized by centromere inactivation, a process which reestablishes monocentric chromosomes. However, little is known about this process in naturally occurring dicentric chromosomes. Using a combination of fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on metaphase chromosome spreads, we demonstrate that centromere inactivation has evolved on a neo-Y chromosome fusion in the Japan Sea threespine stickleback fish (Gasterosteus nipponicus). We found that the centromere derived from the ancestral Y chromosome has been inactivated. Our data further suggest that there have been genetic changes to this centromere in the two million years since the formation of the neo-Y chromosome, but it remains unclear whether these genetic changes are a cause or consequence of centromere inactivation.
Affiliation(s)
- Jennifer N Cech
- Divisions of Basic Sciences and Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave North, Mailstop C2-023, Seattle, WA, 98109, USA
- Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA, 98195, USA
| | - Catherine L Peichel
- Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012, Bern, Switzerland.
|
17
|
Kuznetsova IS, Ostromyshenskii DI, Komissarov AS, Prusov AN, Waisertreiger IS, Gorbunova AV, Trifonov VA, Ferguson-Smith MA, Podgornaya OI. LINE-related component of mouse heterochromatin and complex chromocenters’ composition. Chromosome Res 2016; 24:309-23. [DOI: 10.1007/s10577-016-9525-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 01/20/2016] [Revised: 03/23/2016] [Accepted: 04/17/2016] [Indexed: 10/21/2022]
|
18
|
Hall AB, Papathanos PA, Sharma A, Cheng C, Akbari OS, Assour L, Bergman NH, Cagnetti A, Crisanti A, Dottorini T, Fiorentini E, Galizi R, Hnath J, Jiang X, Koren S, Nolan T, Radune D, Sharakhova MV, Steele A, Timoshevskiy VA, Windbichler N, Zhang S, Hahn MW, Phillippy AM, Emrich SJ, Sharakhov IV, Tu ZJ, Besansky NJ. Radical remodeling of the Y chromosome in a recent radiation of malaria mosquitoes. Proc Natl Acad Sci U S A 2016; 113:E2114-23. [PMID: 27035980 PMCID: PMC4839409 DOI: 10.1073/pnas.1525164113] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Indexed: 11/18/2022] Open
Abstract
Y chromosomes control essential male functions in many species, including sex determination and fertility. However, because of obstacles posed by repeat-rich heterochromatin, knowledge of Y chromosome sequences is limited to a handful of model organisms, constraining our understanding of Y biology across the tree of life. Here, we leverage long single-molecule sequencing to determine the content and structure of the nonrecombining Y chromosome of the primary African malaria mosquito, Anopheles gambiae. We find that the An. gambiae Y consists almost entirely of a few massively amplified, tandemly arrayed repeats, some of which can recombine with similar repeats on the X chromosome. Sex-specific genome resequencing in a recent species radiation, the An. gambiae complex, revealed rapid sequence turnover within An. gambiae and among species. Exploiting 52 sex-specific An. gambiae RNA-Seq datasets representing all developmental stages, we identified a small repertoire of Y-linked genes that lack X gametologs and are not Y-linked in any other species except An. gambiae, with the notable exception of YG2, a candidate male-determining gene. YG2 is the only gene conserved and exclusive to the Y in all species examined, yet sequence similarity to YG2 is not detectable in the genome of a more distant mosquito relative, suggesting rapid evolution of Y chromosome genes in this highly dynamic genus of malaria vectors. The extensive characterization of the An. gambiae Y provides a long-awaited foundation for studying male mosquito biology, and will inform novel mosquito control strategies based on the manipulation of Y chromosomes.
Affiliation(s)
- Andrew Brantley Hall
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Philippos-Aris Papathanos
- Section of Genomics and Genetics, Department of Experimental Medicine, University of Perugia, 06132 Perugia, Italy; Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Atashi Sharma
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Changde Cheng
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556
| | - Omar S Akbari
- Department of Entomology, Riverside Center for Disease Vector Research, Institute for Integrative Genome Biology, University of California, Riverside, CA 92521
| | - Lauren Assour
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Nicholas H Bergman
- National Biodefense Analysis and Countermeasures Center, Frederick, MD 21702
| | - Alessia Cagnetti
- Section of Genomics and Genetics, Department of Experimental Medicine, University of Perugia, 06132 Perugia, Italy
| | - Andrea Crisanti
- Section of Genomics and Genetics, Department of Experimental Medicine, University of Perugia, 06132 Perugia, Italy; Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Tania Dottorini
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Elisa Fiorentini
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Roberto Galizi
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Jonathan Hnath
- National Biodefense Analysis and Countermeasures Center, Frederick, MD 21702
| | - Xiaofang Jiang
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892
| | - Tony Nolan
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Diane Radune
- National Biodefense Analysis and Countermeasures Center, Frederick, MD 21702
| | - Maria V Sharakhova
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Laboratory of Evolutionary Cytogenetics, Tomsk State University, Tomsk 634050, Russia
| | - Aaron Steele
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Vladimir A Timoshevskiy
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Nikolai Windbichler
- Department of Life Sciences, Imperial College London, London SW7 2AZ, United Kingdom
| | - Simo Zhang
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405
| | - Matthew W Hahn
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405; Department of Biology, Indiana University, Bloomington, IN 47405
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892
| | - Scott J Emrich
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Igor V Sharakhov
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Laboratory of Evolutionary Cytogenetics, Tomsk State University, Tomsk 634050, Russia;
| | - Zhijian Jake Tu
- The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061
| | - Nora J Besansky
- Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556; Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556;
|
19
|
Nambiar M, Smith GR. Repression of harmful meiotic recombination in centromeric regions. Semin Cell Dev Biol 2016; 54:188-97. [PMID: 26849908 DOI: 10.1016/j.semcdb.2016.01.042] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Received: 11/23/2015] [Accepted: 01/27/2016] [Indexed: 11/16/2022]
Abstract
During the first division of meiosis, segregation of homologous chromosomes reduces the chromosome number by half. In most species, sister chromatid cohesion and reciprocal recombination (crossing-over) between homologous chromosomes are essential to provide tension to signal proper chromosome segregation during the first meiotic division. Crossovers are not distributed uniformly throughout the genome and are repressed at and near the centromeres. Rare crossovers that occur too near or in the centromere interfere with proper segregation and can give rise to aneuploid progeny, which can be severely defective or inviable. We review here how crossing-over occurs and how it is prevented in and around the centromeres. Molecular mechanisms of centromeric repression are only now being elucidated. However, rapid advances in understanding crossing-over, chromosome structure, and centromere functions promise to explain how potentially deleterious crossovers are avoided in certain chromosomal regions while allowing beneficial crossovers in others.
Affiliation(s)
- Mridula Nambiar
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA, United States.
| | - Gerald R Smith
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA, United States.
|
20
|
Ohzeki JI, Larionov V, Earnshaw WC, Masumoto H. Genetic and epigenetic regulation of centromeres: a look at HAC formation. Chromosome Res 2015; 23:87-103. [PMID: 25682171 DOI: 10.1007/s10577-015-9470-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Indexed: 12/29/2022]
Abstract
The centromere is a specialized chromosomal locus required for accurate chromosome segregation. A specific histone H3 variant, CENP-A, assembles at centromeres. CENP-A is required for kinetochore protein assembly and is an epigenetic marker for the maintenance of a functional centromere. Human CENP-A chromatin normally assembles on α-satellite DNA (alphoid DNA), a centromeric repetitive sequence. Using alphoid DNA arrays, human artificial chromosomes (HACs) have been constructed in human HT1080 cells and used to dissect the requirements for CENP-A assembly on DNA sequence. However, centromere formation is not a simple genetic event. In other commonly used human cell lines, such as HeLa and U2OS cells, no functional de novo centromere formation occurs efficiently with the same centromeric alphoid DNA sequences. Recent studies using protein tethering combined with the HAC system and/or genetic manipulation have revealed that epigenetic chromatin regulation mechanisms are also involved in the CENP-A chromatin assembly pathway and subsequent centromere/kinetochore formation. We summarize the DNA sequence requirements for CENP-A assembly and discuss the epigenetic regulation of human centromeres.
Affiliation(s)
- Jun-ichirou Ohzeki
- Laboratory of Cell Engineering, Department of Frontier Research, Kazusa DNA Research Institute, 2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan
|
21
|
Cech JN, Peichel CL. Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus). Chromosome Res 2015; 23:767-79. [PMID: 26424612 DOI: 10.1007/s10577-015-9495-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Received: 07/24/2015] [Revised: 09/11/2015] [Accepted: 09/17/2015] [Indexed: 01/09/2023]
Abstract
Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chromatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
Affiliation(s)
- Jennifer N Cech
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave North, Mailstop C2-023, Seattle, WA, 98109, USA; Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA, 98195, USA
| | - Catherine L Peichel
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave North, Mailstop C2-023, Seattle, WA, 98109, USA.
|
22
|
Garavís M, Escaja N, Gabelica V, Villasante A, González C. Centromeric Alpha-Satellite DNA Adopts Dimeric i-Motif Structures Capped by AT Hoogsteen Base Pairs. Chemistry 2015; 21:9816-24. [PMID: 26013031 DOI: 10.1002/chem.201500448] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Received: 02/03/2015] [Indexed: 11/11/2022]
Abstract
Human centromeric alpha-satellite DNA is composed of tandem arrays of two types of 171 bp monomers: type A and type B. The differences between these types are concentrated in a 17 bp region of the monomer called the A/B box. Here, we have determined the solution structure of the C-rich strand of the two main variants of the human alpha-satellite A box. We show that, under acidic conditions, the C-rich strands of two A boxes self-recognize and form a head-to-tail dimeric i-motif stabilized by four intercalated hemi-protonated C:C(+) base pairs. Interestingly, the stack of C:C(+) base pairs is capped by T:T and Hoogsteen A:T base pairs. The two main variants of the A box adopt a similar three-dimensional structure, although the residues involved in the formation of the i-motif core are different in each case. Together with previous studies showing that the B box (known as the CENP-B box) also forms dimeric i-motif structures, our finding of this non-canonical structure in the A box shows that centromeric alpha satellites in all human chromosomes are able to form i-motifs, which consequently raises the possibility that these structures may play a role in the structural organization of the centromere.
Affiliation(s)
- Miguel Garavís
- Instituto de Química Física Rocasolano, CSIC, Serrano 119, 28006 Madrid (Spain); Centro de Biología Molecular, "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, Nicolás Cabrera 1, 28049 Madrid (Spain)
| | - Núria Escaja
- Departament de Química Orgànica and IBUB, Universitat de Barcelona, Martí i Franquès 1-11, 08028 Barcelona (Spain)
| | - Valérie Gabelica
- Univ. Bordeaux, ARNA Laboratory, IECB, 2 rue Robert Escarpit 33600 Pessac (France); Inserm, ARNA Laboratory, 146 Rue Leo Saignat, 33000 Bordeaux (France)
| | - Alfredo Villasante
- Centro de Biología Molecular, "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, Nicolás Cabrera 1, 28049 Madrid (Spain)
| | - Carlos González
- Instituto de Química Física Rocasolano, CSIC, Serrano 119, 28006 Madrid (Spain).
|
23
|
Soh YQS, Alföldi J, Pyntikova T, Brown LG, Graves T, Minx PJ, Fulton RS, Kremitzki C, Koutseva N, Mueller JL, Rozen S, Hughes JF, Owens E, Womack JE, Murphy WJ, Cao Q, de Jong P, Warren WC, Wilson RK, Skaletsky H, Page DC. Sequencing the mouse Y chromosome reveals convergent gene acquisition and amplification on both sex chromosomes. Cell 2014; 159:800-13. [PMID: 25417157 DOI: 10.1016/j.cell.2014.09.052] [Citation(s) in RCA: 213] [Impact Index Per Article: 21.3] [Received: 06/18/2014] [Revised: 09/04/2014] [Accepted: 09/22/2014] [Indexed: 01/27/2023]
Abstract
We sequenced the MSY (male-specific region of the Y chromosome) of the C57BL/6J strain of the laboratory mouse Mus musculus. In contrast to theories that Y chromosomes are heterochromatic and gene poor, the mouse MSY is 99.9% euchromatic and contains about 700 protein-coding genes. Only 2% of the MSY derives from the ancestral autosomes that gave rise to the mammalian sex chromosomes. Instead, all but 45 of the MSY's genes belong to three acquired, massively amplified gene families that have no homologs on primate MSYs but do have acquired, amplified homologs on the mouse X chromosome. The complete mouse MSY sequence brings to light dramatic forces in sex chromosome evolution: lineage-specific convergent acquisition and amplification of X-Y gene families, possibly fueled by antagonism between acquired X-Y homologs. The mouse MSY sequence presents opportunities for experimental studies of a sex-specific chromosome in its entirety, in a genetically tractable model organism.
Affiliation(s)
- Y Q Shirleen Soh
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Jessica Alföldi
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | | | - Laura G Brown
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
| | - Tina Graves
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Patrick J Minx
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Robert S Fulton
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Colin Kremitzki
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Natalia Koutseva
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | - Jacob L Mueller
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | - Steve Rozen
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA
| | | | - Elaine Owens
- College of Veterinary Medicine and Biomedical Sciences, 4458 Texas A&M University, College Station, TX 77843, USA
| | - James E Womack
- College of Veterinary Medicine and Biomedical Sciences, 4458 Texas A&M University, College Station, TX 77843, USA
| | - William J Murphy
- College of Veterinary Medicine and Biomedical Sciences, 4458 Texas A&M University, College Station, TX 77843, USA
| | - Qing Cao
- BACPAC Resources, Children's Hospital Oakland, 747 52nd Street, Oakland, CA 94609, USA
| | - Pieter de Jong
- BACPAC Resources, Children's Hospital Oakland, 747 52nd Street, Oakland, CA 94609, USA
| | - Wesley C Warren
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Richard K Wilson
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA
| | - Helen Skaletsky
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA
| | - David C Page
- Whitehead Institute, 9 Cambridge Center, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142, USA.
|
24
|
Nergadze SG, Belloni E, Piras FM, Khoriauli L, Mazzagatti A, Vella F, Bensi M, Vitelli V, Giulotto E, Raimondi E. Discovery and comparative analysis of a novel satellite, EC137, in horses and other equids. Cytogenet Genome Res 2014; 144:114-23. [PMID: 25342230 DOI: 10.1159/000368138] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Accepted: 06/12/2014] [Indexed: 11/19/2022] Open
Abstract
Centromeres are the sites of kinetochore assembly and spindle fiber attachment and consist of protein-DNA complexes in which the DNA component is typically characterized by the presence of extended arrays of tandem repeats called satellite DNA. Here, we describe the isolation and characterization of a 137-bp-long new satellite DNA sequence from the horse genome (EC137), which is also present, even if less abundant, in the domestic donkey, the Grevy's zebra and the Burchelli's zebra. We investigated the chromosomal distribution of the EC137 sequence in these 4 species. Moreover, we analyzed its architectural organization by high-resolution FISH. The position of this sequence with respect to the primary constriction and in relation to the 2 major horse satellite tandem repeats (37 cen and 2PI) on horse chromosomes suggests that the new centromeric equine satellite is an accessory DNA element, presumably contributing to the organization of pericentromeric chromatin. FISH on combed DNA fibers reveals that the EC137 satellite is organized in relatively short stretches (2-8 kb) which are strictly intermingled within 37 cen or 2PI arrays. This arrangement suggests that interchanges between satellite families are a frequent occurrence in the horse genome.
Affiliation(s)
- Solomon G Nergadze
- Department of Biology and Biotechnology 'L. Spallanzani', University of Pavia, Pavia, Italy
|
25
|
The Robertsonian phenomenon in the house mouse: mutation, meiosis and speciation. Chromosoma 2014; 123:529-44. [PMID: 25053180 DOI: 10.1007/s00412-014-0477-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Received: 04/18/2014] [Revised: 07/08/2014] [Accepted: 07/09/2014] [Indexed: 01/01/2023]
Abstract
Many different chromosomal races with reduced chromosome number due to the presence of Robertsonian fusion metacentrics have been described in western Europe and northern Africa, within the distribution area of the western house mouse Mus musculus domesticus. This subspecies of house mouse has become the ideal model for studies to elucidate the processes of chromosome mutation and fixation that lead to the formation of chromosomal races and for studies on the impact of chromosome heterozygosities on reproductive isolation and speciation. In this review, we briefly describe the history of the discovery of the first and subsequent metacentric races in house mice; then, we focus on the molecular composition of the centromeric regions involved in chromosome fusion to examine the molecular characteristics that may explain the great variability of the karyotype that house mice show. The influence that metacentrics exert on the nuclear architecture of the male meiocytes and the consequences on meiotic progression are described to illustrate the impact that chromosomal heterozygosities exert on fertility of house mice, which is of relevance to reproductive isolation and speciation. The evolutionary significance of the Robertsonian phenomenon in the house mouse is discussed in the final section of this review.
|
26
|
Salih BF, Teif VB, Tripathi V, Trifonov EN. Strong nucleosomes of mouse genome including recovered centromeric sequences. J Biomol Struct Dyn 2014; 33:1164-75. [PMID: 24998943 DOI: 10.1080/07391102.2014.938700] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Indexed: 10/25/2022]
Abstract
Recently discovered strong nucleosomes (SNs) characterized by visibly periodical DNA sequences have been found to concentrate in centromeres of Arabidopsis thaliana and in transient meiotic centromeres of Caenorhabditis elegans. To find out whether such affiliation of SNs to centromeres is a more general phenomenon, we studied SNs of Mus musculus. The publicly available genome sequences of mouse, as well as of practically all other eukaryotes, do not include the centromere regions, which are difficult to assemble because of a large amount of repeat sequences in the centromeres and pericentromeric regions. We recovered those missing sequences using the data from MNase-seq experiments in mouse embryonic stem cells, where the sequence of DNA inside nucleosomes, including missing regions, was determined by 100-bp paired-end sequencing. Those nucleosome sequences that do not match the published genome sequence would largely belong to the centromeres. By evaluating SN densities in centromeres and in non-centromeric regions, we conclude that mouse SNs concentrate in the centromeres of telocentric mouse chromosomes, with ~3.9 times excess compared to their density in the rest of the genome. The remaining non-centromeric SNs are harbored mainly by introns and intergenic regions, by retro-transposons, in particular. The centromeric involvement of the SNs opens new horizons for the chromosome and centromere structure studies.
Affiliation(s)
- Bilal F Salih
- Genome Diversity Center, Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel
|
27
|
Kim JH, Zhang T, Wong NC, Davidson N, Maksimovic J, Oshlack A, Earnshaw WC, Kalitsis P, Hudson DF. Condensin I associates with structural and gene regulatory regions in vertebrate chromosomes. Nat Commun 2014; 4:2537. [PMID: 24088984 DOI: 10.1038/ncomms3537] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Received: 05/17/2013] [Accepted: 09/03/2013] [Indexed: 11/09/2022] Open
Abstract
The condensin complex is essential for correct packaging and segregation of chromosomes during mitosis and meiosis in all eukaryotes. To date, the genome-wide location and the nature of condensin-binding sites have remained elusive in vertebrates. Here we report the genome-wide map of condensin I in chicken DT40 cells. Unexpectedly, we find that condensin I binds predominantly to promoter sequences in mitotic cells. We also find a striking enrichment at both centromeres and telomeres, highlighting the importance of the complex in chromosome segregation. Taken together, the results show that condensin I is largely absent from heterochromatic regions. This map of the condensin I binding sites on the chicken genome reveals that patterns of condensin distribution on chromosomes are conserved from prokaryotes, through yeasts to vertebrates. Thus in three kingdoms of life, condensin is enriched on promoters of actively transcribed genes and at loci important for chromosome segregation.
Affiliation(s)
- Ji Hun Kim
- Murdoch Childrens Research Institute, Royal Children's Hospital, Parkville, Melbourne, Victoria 3052, Australia; Department of Paediatrics, University of Melbourne, Parkville, Melbourne, Victoria 3052, Australia
|
28
|
Graham AN, Kalitsis P. Chromosome Y centromere array deletion leads to impaired centromere function. PLoS One 2014; 9:e86875. [PMID: 24466276 PMCID: PMC3899357 DOI: 10.1371/journal.pone.0086875] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 11/12/2013] [Accepted: 12/17/2013] [Indexed: 11/18/2022] Open
Abstract
The centromere is an essential chromosomal structure that is required for the faithful distribution of replicated chromosomes to daughter cells. Defects in the centromere can compromise the stability of chromosomes resulting in segregation errors. We have characterised the centromeric structure of the spontaneous mutant mouse strain, BALB/cWt, which exhibits a high rate of Y chromosome instability. The Y centromere DNA array shows a de novo interstitial deletion and a reduction in the level of the foundation centromere protein, CENP-A, when compared to the non-deleted centromere array in the progenitor strain. These results suggest there is a lower threshold limit of centromere size that ensures full kinetochore function during cell division.
Affiliation(s)
- Alison N. Graham
- Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
- Paul Kalitsis
- Murdoch Childrens Research Institute, Melbourne, Victoria, Australia
- Department of Paediatrics, University of Melbourne, Melbourne, Victoria, Australia
|
29
|
Sex-biased chromatin and regulatory cross-talk between sex chromosomes, autosomes, and mitochondria. Biol Sex Differ 2014; 5:2. [PMID: 24422881 PMCID: PMC3907150 DOI: 10.1186/2042-6410-5-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Received: 11/04/2013] [Accepted: 12/29/2013] [Indexed: 02/07/2023] Open
Abstract
Several autoimmune and neurological diseases exhibit a sex bias, but discerning the causes and mechanisms of these biases has been challenging. Sex differences begin to manifest themselves in early embryonic development, and gonadal differentiation further bifurcates the male and female phenotypes. Even at this early stage, however, there is evidence that males and females respond to environmental stimuli differently, and the divergent phenotypic responses may have consequences later in life. The effect of prenatal nutrient restriction illustrates this point, as adult women exposed to prenatal restrictions exhibited increased risk factors of cardiovascular disease, while men exposed to the same condition did not. Recent research has examined the roles of sex-specific genes, hormones, chromosomes, and the interactions among them in mediating sex-biased phenotypes. Such research has identified testosterone, for example, as a possible protective agent against autoimmune disorders and an XX chromosome complement as a susceptibility factor in murine models of lupus and multiple sclerosis. Sex-biased chromatin is an additional and likely important component. Research suggesting a role for X and Y chromosome heterochromatin in regulating epigenetic states of autosomes has highlighted unorthodox mechanisms of gene regulation. The crosstalk between the Y chromosomes and autosomes may be further mediated by the mitochondria. The organelles have solely maternal transmission and exert differential effects on males and females. Altogether, research supports the notion that the interaction between sex-biased elements might exert novel regulatory functions in the genome and contribute to sex-specific susceptibilities to autoimmune and neurological diseases.
|
30
|
Cazaux B, Catalan J, Justy F, Escudé C, Desmarais E, Britton-Davidian J. Evolution of the structure and composition of house mouse satellite DNA sequences in the subgenus Mus (Rodentia: Muridea): a cytogenomic approach. Chromosoma 2013; 122:209-20. [PMID: 23515652 DOI: 10.1007/s00412-013-0402-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Received: 04/21/2012] [Revised: 01/29/2013] [Accepted: 02/23/2013] [Indexed: 12/16/2022]
Abstract
The composition and orientation of the house mouse satellite DNA sequences (minor, major, TLC) were investigated by a FISH and CO-FISH approach in 11 taxa belonging to three clades of the subgenus Mus. Using a phylogenetic framework, our results highlighted two distribution patterns. The TLC satellite, the most recently discovered satellite, was present in all clades but varied quantitatively among species. This distribution supported its appearance in the ancestor of the subgenus followed by independent evolution in species of each clade. In contrast, the minor and major satellites occurred in only two clades of the subgenus indicating the simultaneous and recent amplification of these sequences. In addition, although qualitative differences in the composition and orientation of the satellite sequences were observed among the taxa, none of the features studied were unique to the house mouse and could account for the extensive chromosomal plasticity evidenced in Mus musculus domesticus.
Affiliation(s)
- B Cazaux
- Institut des Sciences de l'Evolution, Université Montpellier 2, cc065, Pl. E. Bataillon, 34095 Montpellier Cedex 05, France
|
31
|
Podgornaya O, Gavrilova E, Stephanova V, Demin S, Komissarov A. Large tandem repeats make up the chromosome bar code: a hypothesis. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2013; 90:1-30. [PMID: 23582200 DOI: 10.1016/b978-0-12-410523-2.00001-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 01/21/2023]
Abstract
Much of tandem repeats' functional nature in any genome remains enigmatic because there are only a few tools available for dissecting and elucidating the functions of repeated DNA. The large tandem repeat arrays (satellite DNA) found in two mouse whole-genome shotgun assemblies were classified into 4 superfamilies, 8 families, and 62 subfamilies. With the simplified variant of chromosome positioning of different tandem repeats, we noticed the nonuniform distribution instead of the positions reported for mouse major and minor satellites. It is visible that each chromosome possesses a kind of unique code made up of different large tandem repeats. The reference genomes allow marking only internal tandem repeats, and even with such limited data, the colored "bar code" made up of tandem repeats is visible. We suppose that tandem repeats bear the mechanism for chromosomes to recognize the regions to be associated. The associations, initially established via RNA, become fixed by histone modifications (the histone or chromatin code) and specific proteins. In such a way, associations, being at the beginning flexible and regulated, that is, adjustable, appear as irreversible and inheritable in cell generations. Tandem repeat multiformity tunes the developed nuclei 3D pattern by sequential steps of associations. Tandem repeats-based chromosome bar code could be the carrier of the genome structural information; that is, the order of precise tandem repeat association is the DNA morphogenetic program. Tandem repeats are the cores of the distinct 3D structures postulated in "gene gating" hypothesis.
|
32
|
Abstract
Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance.
|
33
|
Normal DNA methylation dynamics in DICER1-deficient mouse embryonic stem cells. PLoS Genet 2012; 8:e1002919. [PMID: 22969435 PMCID: PMC3435250 DOI: 10.1371/journal.pgen.1002919] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Received: 01/10/2012] [Accepted: 07/09/2012] [Indexed: 11/19/2022] Open
Abstract
Reduced DNA methylation has been reported in DICER1-deficient mouse ES cells. Reductions seen at pericentric satellite repeats have suggested that siRNAs are required for the proper assembly of heterochromatin. More recent studies have postulated that the reduced methylation is an indirect effect: the loss of Mir290 cluster miRNAs leads to upregulation of the transcriptional repressor RBL2 that targets the downregulation of DNA methyltransferase (Dnmt) genes. However, the observations have been inconsistent. We surmised that the inconsistency could be related to cell line “age,” given that DNA methylation is lost progressively with passage in DNMT-deficient ES cells. We therefore subjected Dicer1−/− ES cells to two experimental regimes to rigorously test the level of functional DNMT activity. First, we cultured them for a prolonged period. If DNMT activity was reduced, further losses of methylation would occur. Second, we measured their DNMT activity in a rebound DNA methylation assay: DNA methylation was stripped from Cre/loxP conditionally mutant Dicer1 ES cells using a shRNA targeting Dnmt1 mRNA. Cre expression then converted these cells to Dicer1−/−, allowing for DNMT1 recovery and forcing the cells to remethylate in the absence of RNAi. In both cases, we found functional DNMT activity to be normal. Finally, we also show that the level of RBL2 protein is not at excess levels in Dicer1−/− ES cells as has been assumed. These studies reveal that reduced functional DNMT activity is not a salient feature of DICER1-deficient ES cells. We suggest that the reduced DNA methylation sometimes observed in these cells could be due to stochastic alterations in DNA methylation patterns that could offer growth or survival advantages in culture, or to the dysregulation of pathways acting in opposition to the DNMT pathway. In mammalian cells, DNA methylation is required for the maintenance of genome stability. 
Recent studies have shown that the genome-wide levels of DNA methylation can be reduced in DICER1-deficient mouse embryonic stem (ES) cells, suggesting that the activity of DNA methylating enzymes (DNMTs) may be regulated by small RNA molecules. The enzyme DICER1 catalyses the production of these small RNAs that serve as sequence-specific guides for modifying chromatin or transcription. However, these observations of defective DNA methylation have been inconsistent. We surmised that this inconsistency could be due to cell line “age,” because it can take many cell divisions before reduced DNMT activity may result in loss of DNA methylation. To test this idea, we rigorously assayed the functional level of DNMT activity in DICER1-deficient ES cells. First, we tested their ability to maintain DNA methylation over prolonged culture. Second, we tested their ability to rebound in DNA methylation after first stripping it from the genome. In both cases functional DNMT activity was entirely normal. We suggest that losses of DNA methylation sometimes seen in DICER1-deficient ES cells are stochastic and could involve cell line adaptation.
|
34
|
Hayden KE, Willard HF. Composition and organization of active centromere sequences in complex genomes. BMC Genomics 2012; 13:324. [PMID: 22817545 PMCID: PMC3422206 DOI: 10.1186/1471-2164-13-324] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Received: 02/21/2012] [Accepted: 07/20/2012] [Indexed: 01/13/2023] Open
Abstract
Background Centromeres are sites of chromosomal spindle attachment during mitosis and meiosis. While the sequence basis for centromere identity remains a subject of considerable debate, one approach is to examine the genomic organization at these active sites that are correlated with epigenetic marks of centromere function. Results We have developed an approach to characterize both satellite and non-satellite centromeric sequences that are missing from current assemblies in complex genomes, using the dog genome as an example. Combining this genomic reference with an epigenetic dataset corresponding to sequences associated with the histone H3 variant centromere protein A (CENP-A), we identify active satellite sequence domains that appear to be both functionally and spatially distinct within the overall definition of satellite families. Conclusions These findings establish a genomic and epigenetic foundation for exploring the functional role of centromeric sequences in the previously sequenced dog genome and provide a model for similar studies within the context of less-characterized genomes.
Affiliation(s)
- Karen E Hayden
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC, USA.
|
35
|
The evolutionary life cycle of the resilient centromere. Chromosoma 2012; 121:327-40. [PMID: 22527114 DOI: 10.1007/s00412-012-0369-6] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Received: 02/22/2012] [Revised: 03/20/2012] [Accepted: 03/20/2012] [Indexed: 12/13/2022]
Abstract
The centromere is a chromosomal structure that is essential for the accurate segregation of replicated eukaryotic chromosomes to daughter cells. In most centromeres, the underlying DNA is principally made up of repetitive DNA elements, such as tandemly repeated satellite DNA and retrotransposable elements. Paradoxically, for such an essential genomic region, the DNA is rapidly evolving both within and between species. In this review, we show that the centromere locus is a resilient structure that can undergo evolutionary cycles of birth, growth, maturity, death and resurrection. The birth phase is highlighted by examples in humans and other organisms where centromere DNA deletions or chromosome rearrangements can trigger the epigenetic assembly of neocentromeres onto genomic sites without typical features of centromere DNA. In addition, functional centromeres can be generated in the laboratory using various methodologies. Recent mapping of the foundation centromere mark, the histone H3 variant CENP-A, onto near-complete genomes has uncovered examples of new centromeres which have not accumulated centromere repeat DNA. During the growth period of the centromere, repeat DNA begins to appear at some, but not all, loci. The maturity stage is characterised by centromere repeat accumulation, expansions and contractions and the rapid evolution of the centromere DNA between chromosomes of the same species and between species. This stage provides inherent centromere stability, facilitated by repression of gene activity and meiotic recombination at and around the centromeres. Death to a centromere can result from genomic instability precipitating rearrangements, deletions, accumulation of mutations and the loss of essential centromere binding proteins. Surprisingly, ancestral centromeres can undergo resurrection either in the field or in the laboratory, via as yet poorly understood mechanisms. 
The underlying principle for the preservation of a centromeric evolutionary life cycle is to provide resilience and perpetuity for the all-important structure and function of the centromere.
|
36
|
Burrack LS, Berman J. Flexibility of centromere and kinetochore structures. Trends Genet 2012; 28:204-12. [PMID: 22445183 DOI: 10.1016/j.tig.2012.02.003] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Received: 12/05/2011] [Revised: 02/13/2012] [Accepted: 02/15/2012] [Indexed: 12/14/2022]
Abstract
Centromeres, and the kinetochores that assemble on them, are essential for accurate chromosome segregation. Diverse centromere organization patterns and kinetochore structures have evolved in eukaryotes ranging from yeast to humans. In addition, centromere DNA and kinetochore position can vary even within individual cells. This flexibility is manifested in several ways: centromere DNA sequences evolve rapidly, kinetochore positions shift in response to altered chromosome structure, and kinetochore complex numbers change in response to fluctuations in kinetochore protein levels. Despite their differences, all of these diverse structures promote efficient chromosome segregation. This robustness is inherent to chromosome segregation mechanisms and balances genome stability with adaptability. In this review, we explore the mechanisms and consequences of centromere and kinetochore flexibility as well as the benefits and limitations of different experimental model systems for their study.
Affiliation(s)
- Laura S Burrack
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55405, USA
|
37
|
Green LC, Kalitsis P, Chang TM, Cipetic M, Kim JH, Marshall O, Turnbull L, Whitchurch CB, Vagnarelli P, Samejima K, Earnshaw WC, Choo KHA, Hudson DF. Contrasting roles of condensin I and condensin II in mitotic chromosome formation. J Cell Sci 2012; 125:1591-604. [PMID: 22344259 DOI: 10.1242/jcs.097790] [Citation(s) in RCA: 123] [Impact Index Per Article: 10.3] [Indexed: 12/16/2022] Open
Abstract
In vertebrates, two condensin complexes exist, condensin I and condensin II, which have differing but unresolved roles in organizing mitotic chromosomes. To dissect accurately the role of each complex in mitosis, we have made and studied the first vertebrate conditional knockouts of the genes encoding condensin I subunit CAP-H and condensin II subunit CAP-D3 in chicken DT40 cells. Live-cell imaging reveals highly distinct segregation defects. CAP-D3 (condensin II) knockout results in masses of chromatin-containing anaphase bridges. CAP-H (condensin I)-knockout anaphases have a more subtle defect, with chromatids showing fine chromatin fibres that are associated with failure of cytokinesis and cell death. Super-resolution microscopy reveals that condensin-I-depleted mitotic chromosomes are wider and shorter, with a diffuse chromosome scaffold, whereas condensin-II-depleted chromosomes retain a more defined scaffold, with chromosomes more stretched and seemingly lacking in axial rigidity. We conclude that condensin II is required primarily to provide rigidity by establishing an initial chromosome axis around which condensin I can arrange loops of chromatin.
Affiliation(s)
- Lydia C Green
- Murdoch Childrens Research Institute, Royal Children's Hospital, Melbourne, Victoria 3052, Australia
|
38
|
Komissarov AS, Gavrilova EV, Demin SJ, Ishov AM, Podgornaya OI. Tandemly repeated DNA families in the mouse genome. BMC Genomics 2011; 12:531. [PMID: 22035034 PMCID: PMC3218096 DOI: 10.1186/1471-2164-12-531] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Received: 02/05/2011] [Accepted: 10/28/2011] [Indexed: 12/23/2022] Open
Abstract
Background Functional and morphological studies of tandem DNA repeats, which make up a large portion of most genomes, are mostly limited due to the incomplete characterization of these genome elements. We report here a genome-wide analysis of the large tandem repeats (TR) found in the mouse genome assemblies. Results Using a bioinformatics approach, we identified large TR with array size more than 3 kb in two mouse whole genome shotgun (WGS) assemblies. Large TR were classified based on sequence similarity, chromosome position, monomer length, array variability, and GC content; we identified four superfamilies, eight families, and 62 subfamilies - including 60 not previously described. 1) The superfamily of centromeric minor satellite is only found in the unassembled part of the reference genome. 2) The pericentromeric major satellite is the most abundant superfamily and reveals high order repeat structure. 3) Transposable elements related superfamily contains two families. 4) The superfamily of heterogeneous tandem repeats includes four families. One family is found only in the WGS, while two families represent tandem repeats with either single or multi locus location. Despite multi locus location, TRPC-21A-MM is placed into a separate family due to its abundance, strictly pericentromeric location, and resemblance to big human satellites. To confirm our data, we next performed in situ hybridization with three repeats from distinct families. TRPC-21A-MM probe hybridized to chromosomes 3 and 17, multi locus TR-22A-MM probe hybridized to ten chromosomes, and single locus TR-54B-MM probe hybridized with the long loops that emerge from chromosome ends. In addition to those predicted in silico, several extra chromosomes were positive for TR by in situ analysis, potentially indicating inaccurate genome assembly of the heterochromatic genome regions. Conclusions Chromosome-specific TR had been predicted for mouse but no reliable cytogenetic probes were available before. 
We report a new analysis that identified in silico and confirmed in situ the chromosome 3/17-specific probe TRPC-21-MM. Thus, the new classification has proven to be a useful tool for continuing genome study, while annotated TR can be a valuable source of cytogenetic probes for chromosome recognition.
|
39
|
Abstract
Reverse transcriptases have shaped genomes in many ways. A remarkable example of this shaping is found on telomeres of the genus Drosophila, where retrotransposons have a vital role in chromosome structure. Drosophila lacks telomerase; instead, three telomere-specific retrotransposons maintain chromosome ends. Repeated transpositions to chromosome ends produce long head to tail arrays of these elements. In both form and function, these arrays are analogous to the arrays of repeats added by telomerase to chromosomes in other organisms. Distantly related Drosophila exhibit this variant mechanism of telomere maintenance, which was established before the separation of extant Drosophila species. Nevertheless, the telomere-specific elements still have the hallmarks that characterize non-long terminal repeat (non-LTR) retrotransposons; they have also acquired characteristics associated with their roles at telomeres. These telomeric retrotransposons have shaped the Drosophila genome, but they have also been shaped by the genome. Here, we discuss ways in which these three telomere-specific retrotransposons have been modified for their roles in Drosophila chromosomes.
|
40
|
Bensasson D. Evidence for a high mutation rate at rapidly evolving yeast centromeres. BMC Evol Biol 2011; 11:211. [PMID: 21767380 PMCID: PMC3155921 DOI: 10.1186/1471-2148-11-211] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Received: 04/11/2011] [Accepted: 07/18/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Although their role in cell division is essential, centromeres evolve rapidly in animals, plants and yeasts. Unlike the complex centromeres of plants and animals, the point centromeres of Saccharomyces yeasts can be readily sequenced to distinguish amongst the possible explanations for fast centromere evolution. RESULTS Using DNA sequences of all 16 centromeres from 34 strains of Saccharomyces cerevisiae and population genomic data from Saccharomyces paradoxus, I show that centromeres in both species evolve 3 times more rapidly even than selectively unconstrained DNA. Exceptionally high levels of polymorphism seen in multiple yeast populations suggest that rapid centromere evolution does not result from the repeated selective sweeps expected under meiotic drive. I further show that there is little evidence for crossing-over or gene conversion within centromeres, although there is clear evidence for recombination in their immediate vicinity. Finally I show that the mutation spectrum at centromeres is consistent with the pattern of spontaneous mutation elsewhere in the genome. CONCLUSIONS These results indicate that rapid centromere evolution is a common phenomenon in yeast species. Furthermore, these results suggest that rapid centromere evolution does not result from the mutagenic effect of gene conversion, but from a generalised increase in the mutation rate, perhaps arising from the unusual chromatin structure at centromeres in yeast and other eukaryotes.
|
41
|
Abstract
Repeated DNA in heterochromatin presents enormous difficulties for whole-genome sequencing; hence, sequence organization in a significant portion of the genomes of multicellular organisms is relatively unknown. Two sequenced BACs now allow us to compare telomeric retrotransposon arrays from Drosophila melanogaster telomeres with an array of telomeric retrotransposons that transposed into the centromeric region of the Y chromosome >13 MYA, providing a unique opportunity to compare the structural evolution of this retrotransposon in two contexts. We find that these retrotransposon arrays, both heterochromatic, are maintained quite differently, resulting in sequence organizations that apparently reflect different roles in the two chromosomal environments. The telomere array has grown only by transposition of new elements to the chromosome end; the centromeric array instead has grown by repeated amplifications of segments of the original telomere array. Many elements in the telomere have been variably 5'-truncated apparently by gradual erosion and irregular deletions of the chromosome end; however, a significant fraction (4 and possibly 5 or 6 of 15 elements examined) remain complete and capable of further retrotransposition. In contrast, each element in the centromere region has lost ≥ 40% of its sequence by internal, rather than terminal, deletions, and no element retains a significant part of the original coding region. Thus the centromeric array has been restructured to resemble the highly repetitive satellite sequences typical of centromeres in multicellular organisms, whereas, over a similar or longer time period, the telomere array has maintained its ability to provide retrotransposons competent to extend telomere ends.
|
42
|
Abstract
Data from maize show that centromeres strongly suppress crossing over and instead undergo frequent genetic exchange in the form of gene conversion. Centromeres are the most dynamic regions of the genome, yet they are typified by little or no crossing over, making it difficult to explain the origin of this diversity. To address this question, we developed a novel CENH3 ChIP display method that maps kinetochore footprints over transposon-rich areas of centromere cores. A high level of polymorphism made it possible to map a total of 238 within-centromere markers using maize recombinant inbred lines. Over half of the markers were shown to interact directly with kinetochores (CENH3) by chromatin immunoprecipitation. Although classical crossing over is fully suppressed across CENH3 domains, two gene conversion events (i.e., non-crossover marker exchanges) were identified in a mapping population. A population genetic analysis of 53 diverse inbreds suggests that historical gene conversion is widespread in maize centromeres, occurring at a rate >1×10⁻⁵/marker/generation. We conclude that gene conversion accelerates centromere evolution by facilitating sequence exchange among chromosomes. Centromeres, which harbor the attachment points for microtubules during cell division, are characterized by repetitive DNA, paucity of genes, and almost complete suppression of crossing over. The repetitive DNA within centromeres appears to evolve much faster than would be expected for genetically inert regions, however. Current explanations for this rapid evolution tend to be theoretical. On the one hand there are arguments that subtle forms of selection on selfish repeat sequences can explain the rapid rate of change, while on the other hand it seems plausible that some form of accelerated neutral evolution is occurring. Here, we address this question in maize, which is known for its excellent genetic mapping resources. 
We first developed a method for identifying hundreds of single copy markers in centromeres and confirmed that they lie within functional domains by using a chromatin immunoprecipitation assay for kinetochore protein CENH3. All markers were mapped in relation to each other. The data show that, whereas classical crossing over is suppressed, there is extensive genetic exchange in the form of gene conversion (by which short segments of one chromosome are copied onto the other). These results were confirmed by demonstrating that similar short exchange tracts are common among the centromeres from multiple diverse inbred lines of maize. Our study suggests that centromere diversity can be at least partially attributed to a high rate of previously “hidden” genetic exchange within the core kinetochore domains.
|