1
|
Chen YL, Jones AN, Crawford A, Sattler M, Ettinger A, Torres-Padilla ME. Determinants of minor satellite RNA function in chromosome segregation in mouse embryonic stem cells. J Cell Biol 2024; 223:e202309027. [PMID: 38625077 PMCID: PMC11022885 DOI: 10.1083/jcb.202309027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 03/06/2024] [Accepted: 03/29/2024] [Indexed: 04/17/2024] Open
Abstract
The centromere is a fundamental higher-order structure in chromosomes ensuring their faithful segregation upon cell division. Centromeric transcripts have been described in several species and suggested to participate in centromere function. However, low sequence conservation of centromeric repeats appears inconsistent with a role in recruiting highly conserved centromeric proteins. Here, we hypothesized that centromeric transcripts may function through a secondary structure rather than sequence conservation. Using mouse embryonic stem cells (ESCs), we show that an imbalance in the levels of forward or reverse minor satellite (MinSat) transcripts leads to severe chromosome segregation defects. We further show that MinSat RNA adopts a stem-loop secondary structure, which is conserved in human α-satellite transcripts. We identify an RNA binding region in CENPC and demonstrate that MinSat transcripts function through the structured region of the RNA. Importantly, mutants that disrupt MinSat secondary structure do not cause segregation defects. We propose that the conserved role of centromeric transcripts relies on their secondary RNA structure.
Collapse
Affiliation(s)
- Yung-Li Chen
- Institute of Epigenetics and Stem Cells (IES), Helmholtz Munich, München, Germany
| | - Alisha N. Jones
- Institute of Structural Biology, Molecular Targets and Therapeutics Center, Helmholtz Munich, Neuherberg, Germany
| | - Amy Crawford
- Department of Chemistry, New York University, New York, NY, USA
| | - Michael Sattler
- Institute of Structural Biology, Molecular Targets and Therapeutics Center, Helmholtz Munich, Neuherberg, Germany
- Department of Bioscience, Bavarian NMR Center, School of Natural Sciences, Technical University of Munich, Garching, Germany
| | - Andreas Ettinger
- Institute of Epigenetics and Stem Cells (IES), Helmholtz Munich, München, Germany
| | - Maria-Elena Torres-Padilla
- Institute of Epigenetics and Stem Cells (IES), Helmholtz Munich, München, Germany
- Faculty of Biology, Ludwig-Maximilians Universität, München, Germany
| |
Collapse
|
2
|
Arora UP, Dumont BL. Molecular evolution of the mammalian kinetochore complex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.600994. [PMID: 38979348 PMCID: PMC11230421 DOI: 10.1101/2024.06.27.600994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Mammalian centromeres are satellite-rich chromatin domains that serve as sites for kinetochore complex assembly. Centromeres are highly variable in sequence and satellite organization across species, but the processes that govern the co-evolutionary dynamics between rapidly evolving centromeres and their associated kinetochore proteins remain poorly understood. Here, we pursue a course of phylogenetic analyses to investigate the molecular evolution of the complete kinetochore complex across primate and rodent species with divergent centromere repeat sequences and features. We show that many protein components of the core centromere associated network (CCAN) harbor signals of adaptive evolution, consistent with their intimate association with centromere satellite DNA and roles in the stability and recruitment of additional kinetochore proteins. Surprisingly, CCAN and outer kinetochore proteins exhibit comparable rates of adaptive divergence, suggesting that changes in centromere DNA can ripple across the kinetochore to drive adaptive protein evolution within distant domains of the complex. Our work further identifies kinetochore proteins subject to lineage-specific adaptive evolution, including rapidly evolving proteins in species with centromere satellites characterized by higher-order repeat structure and lacking CENP-B boxes. Thus, features of centromeric chromatin beyond the linear DNA sequence may drive selection on kinetochore proteins. Overall, our work spotlights adaptively evolving proteins with diverse centromere-associated functions, including centromere chromatin structure, kinetochore protein assembly, kinetochore-microtubule association, cohesion maintenance, and DNA damage response pathways. These adaptively evolving kinetochore protein candidates present compelling opportunities for future functional investigations exploring how their concerted changes with centromere DNA ensure the maintenance of genome stability.
Collapse
Affiliation(s)
- Uma P Arora
- The Jackson Laboratory, 600 Main Street, Bar Harbor ME 04609
- Tufts University, Graduate School of Biomedical Sciences, 136 Harrison Ave, Boston MA 02111
| | - Beth L Dumont
- The Jackson Laboratory, 600 Main Street, Bar Harbor ME 04609
- Tufts University, Graduate School of Biomedical Sciences, 136 Harrison Ave, Boston MA 02111
- Graduate School of Biomedical Science and Engineering, The University of Maine, Orono, Maine, 04469
| |
Collapse
|
3
|
Logsdon GA, Rozanski AN, Ryabov F, Potapova T, Shepelev VA, Catacchio CR, Porubsky D, Mao Y, Yoo D, Rautiainen M, Koren S, Nurk S, Lucas JK, Hoekzema K, Munson KM, Gerton JL, Phillippy AM, Ventura M, Alexandrov IA, Eichler EE. The variation and evolution of complete human centromeres. Nature 2024; 629:136-145. [PMID: 38570684 PMCID: PMC11062924 DOI: 10.1038/s41586-024-07278-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 03/07/2024] [Indexed: 04/05/2024]
Abstract
Human centromeres have been traditionally very difficult to sequence and assemble owing to their repetitive nature and large size1. As a result, patterns of human centromeric variation and models for their evolution and function remain incomplete, despite centromeres being among the most rapidly mutating regions2,3. Here, using long-read sequencing, we completely sequenced and assembled all centromeres from a second human genome and compared it to the finished reference genome4,5. We find that the two sets of centromeres show at least a 4.1-fold increase in single-nucleotide variation when compared with their unique flanks and vary up to 3-fold in size. Moreover, we find that 45.8% of centromeric sequence cannot be reliably aligned using standard methods owing to the emergence of new α-satellite higher-order repeats (HORs). DNA methylation and CENP-A chromatin immunoprecipitation experiments show that 26% of the centromeres differ in their kinetochore position by >500 kb. To understand evolutionary change, we selected six chromosomes and sequenced and assembled 31 orthologous centromeres from the common chimpanzee, orangutan and macaque genomes. Comparative analyses reveal a nearly complete turnover of α-satellite HORs, with characteristic idiosyncratic changes in α-satellite HORs for each species. Phylogenetic reconstruction of human haplotypes supports limited to no recombination between the short (p) and long (q) arms across centromeres and reveals that novel α-satellite HORs share a monophyletic origin, providing a strategy to estimate the rate of saltatory amplification and mutation of human centromeric DNA.
Collapse
Affiliation(s)
- Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Department of Genetics, Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Allison N Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Fedor Ryabov
- Masters Program in National Research University Higher School of Economics, Moscow, Russia
| | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | - Claudia R Catacchio
- Department of Biosciences, Biotechnology and Environment, University of Bari Aldo Moro, Bari, Italy
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Yafei Mao
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - DongAhn Yoo
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Oxford Nanopore Technologies, Oxford, United Kingdom
| | - Julian K Lucas
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA, USA
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mario Ventura
- Department of Biosciences, Biotechnology and Environment, University of Bari Aldo Moro, Bari, Italy
| | - Ivan A Alexandrov
- Department of Human Molecular Genetics and Biochemistry, Tel Aviv University, Tel Aviv, Israel
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Dan David Center for Human Evolution and Biohistory Research, Tel Aviv University, Tel Aviv, Israel
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
4
|
Ramakrishnan Chandra J, Kalidass M, Demidov D, Dabravolski SA, Lermontova I. The role of centromeric repeats and transcripts in kinetochore assembly and function. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:982-996. [PMID: 37665331 DOI: 10.1111/tpj.16445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 08/09/2023] [Accepted: 08/18/2023] [Indexed: 09/05/2023]
Abstract
Centromeres are the chromosomal domains, where the kinetochore protein complex is formed, mediating proper segregation of chromosomes during cell division. Although the function of centromeres has remained conserved during evolution, centromeric DNA is highly variable, even in closely related species. In addition, the composition of the kinetochore complexes varies among organisms. Therefore, it is assumed that the centromeric position is determined epigenetically, and the centromeric histone H3 (CENH3) serves as an epigenetic marker. The loading of CENH3 onto centromeres depends on centromere-licensing factors, chaperones, and transcription of centromeric repeats. Several proteins that regulate CENH3 loading and kinetochore assembly interact with the centromeric transcripts and DNA in a sequence-independent manner. However, the functional aspects of these interactions are not fully understood. This review discusses the variability of centromeric sequences in different organisms and the regulation of their transcription through the RNA Pol II and RNAi machinery. The data suggest that the interaction of proteins involved in CENH3 loading and kinetochore assembly with centromeric DNA and transcripts plays a role in centromere, and possibly neocentromere, formation in a sequence-independent manner.
Collapse
Affiliation(s)
| | - Manikandan Kalidass
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstrasse 3, D-06466, Seeland, Germany
| | - Dmitri Demidov
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstrasse 3, D-06466, Seeland, Germany
| | - Siarhei A Dabravolski
- Department of Biotechnology Engineering, Braude Academic College of Engineering, Snunit 51, Karmiel, 2161002, Israel
| | - Inna Lermontova
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstrasse 3, D-06466, Seeland, Germany
| |
Collapse
|
5
|
Chaisson MJP, Sulovari A, Valdmanis PN, Miller DE, Eichler EE. Advances in the discovery and analyses of human tandem repeats. Emerg Top Life Sci 2023; 7:361-381. [PMID: 37905568 PMCID: PMC10806765 DOI: 10.1042/etls20230074] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 10/18/2023] [Accepted: 10/18/2023] [Indexed: 11/02/2023]
Abstract
Long-read sequencing platforms provide unparalleled access to the structure and composition of all classes of tandemly repeated DNA from STRs to satellite arrays. This review summarizes our current understanding of their organization within the human genome, their importance with respect to disease, as well as the advances and challenges in understanding their genetic diversity and functional effects. Novel computational methods are being developed to visualize and associate these complex patterns of human variation with disease, expression, and epigenetic differences. We predict accurate characterization of this repeat-rich form of human variation will become increasingly relevant to both basic and clinical human genetics.
Collapse
Affiliation(s)
- Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, U.S.A
- The Genomic and Epigenomic Regulation Program, USC Norris Cancer Center, University of Southern California, Los Angeles, CA 90089, U.S.A
| | - Arvis Sulovari
- Computational Biology, Cajal Neuroscience Inc, Seattle, WA 98102, U.S.A
| | - Paul N Valdmanis
- Division of Medical Genetics, Department of Medicine, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, U.S.A
| | - Danny E Miller
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, U.S.A
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA 98195, U.S.A
- Department of Pediatrics, University of Washington, Seattle, WA 98195, U.S.A
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, U.S.A
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, U.S.A
| |
Collapse
|
6
|
Arora UP, Sullivan BA, Dumont BL. Variation in the CENP-A sequence association landscape across diverse inbred mouse strains. Cell Rep 2023; 42:113178. [PMID: 37742188 PMCID: PMC10873113 DOI: 10.1016/j.celrep.2023.113178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 04/25/2023] [Accepted: 09/08/2023] [Indexed: 09/26/2023] Open
Abstract
Centromeres are crucial for chromosome segregation, but their underlying sequences evolve rapidly, imposing strong selection for compensatory changes in centromere-associated kinetochore proteins to assure the stability of genome transmission. While this co-evolution is well documented between species, it remains unknown whether population-level centromere diversity leads to functional differences in kinetochore protein association. Mice (Mus musculus) exhibit remarkable variation in centromere size and sequence, but the amino acid sequence of the kinetochore protein CENP-A is conserved. Here, we apply k-mer-based analyses to CENP-A chromatin profiling data from diverse inbred mouse strains to investigate the interplay between centromere variation and kinetochore protein sequence association. We show that centromere sequence diversity is associated with strain-level differences in both CENP-A positioning and sequence preference along the mouse core centromere satellite. Our findings reveal intraspecies sequence-dependent differences in CENP-A/centromere association and open additional perspectives for understanding centromere-mediated variation in genome stability.
Collapse
Affiliation(s)
- Uma P Arora
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA; Graduate School of Biomedical Sciences, Tufts University, 136 Harrison Avenue, Boston, MA 02111, USA.
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, 213 Research Drive, Box 3054, Durham, NC 27710, USA
| | - Beth L Dumont
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA; Graduate School of Biomedical Sciences, Tufts University, 136 Harrison Avenue, Boston, MA 02111, USA; Graduate School of Biomedical Science and Engineering, University of Maine, 5775 Stodder Hall, Room 46, Orono, ME 04469, USA.
| |
Collapse
|
7
|
Logsdon GA, Rozanski AN, Ryabov F, Potapova T, Shepelev VA, Mao Y, Rautiainen M, Koren S, Nurk S, Porubsky D, Lucas JK, Hoekzema K, Munson KM, Gerton JL, Phillippy AM, Alexandrov IA, Eichler EE. The variation and evolution of complete human centromeres. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.30.542849. [PMID: 37398417 PMCID: PMC10312506 DOI: 10.1101/2023.05.30.542849] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
We completely sequenced and assembled all centromeres from a second human genome and used two reference sets to benchmark genetic, epigenetic, and evolutionary variation within centromeres from a diversity panel of humans and apes. We find that centromere single-nucleotide variation can increase by up to 4.1-fold relative to other genomic regions, with the caveat that up to 45.8% of centromeric sequence, on average, cannot be reliably aligned with current methods due to the emergence of new α-satellite higher-order repeat (HOR) structures and two to threefold differences in the length of the centromeres. The extent to which this occurs differs depending on the chromosome and haplotype. Comparing the two sets of complete human centromeres, we find that eight harbor distinctly different α-satellite HOR array structures and four contain novel α-satellite HOR variants in high abundance. DNA methylation and CENP-A chromatin immunoprecipitation experiments show that 26% of the centromeres differ in their kinetochore position by at least 500 kbp-a property not readily associated with novel α-satellite HORs. To understand evolutionary change, we selected six chromosomes and sequenced and assembled 31 orthologous centromeres from the common chimpanzee, orangutan, and macaque genomes. Comparative analyses reveal nearly complete turnover of α-satellite HORs, but with idiosyncratic changes in structure characteristic to each species. Phylogenetic reconstruction of human haplotypes supports limited to no recombination between the p- and q-arms of human chromosomes and reveals that novel α-satellite HORs share a monophyletic origin, providing a strategy to estimate the rate of saltatory amplification and mutation of human centromeric DNA.
Collapse
Affiliation(s)
- Glennis A. Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Allison N. Rozanski
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Fedor Ryabov
- Masters Program in National Research University Higher School of Economics, Moscow, Russia
| | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | - Yafei Mao
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Julian K. Lucas
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Katherine M. Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ivan A. Alexandrov
- Department of Human Molecular Genetics and Biochemistry, Tel Aviv University, Tel Aviv, Israel
- Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Dan David Center for Human Evolution and Biohistory Research, Tel Aviv University, Tel Aviv, Israel
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
8
|
Copley KE, Shorter J. Repetitive elements in aging and neurodegeneration. Trends Genet 2023; 39:381-400. [PMID: 36935218 PMCID: PMC10121923 DOI: 10.1016/j.tig.2023.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/12/2023] [Accepted: 02/14/2023] [Indexed: 03/19/2023]
Abstract
Repetitive elements (REs), such as transposable elements (TEs) and satellites, comprise much of the genome. Here, we review how TEs and (peri)centromeric satellite DNA may contribute to aging and neurodegenerative disorders, including amyotrophic lateral sclerosis (ALS). Alterations in RE expression, retrotransposition, and chromatin microenvironment may shorten lifespan, elicit neurodegeneration, and impair memory and movement. REs may cause these phenotypes via DNA damage, protein sequestration, insertional mutagenesis, and inflammation. We discuss several TE families, including gypsy, HERV-K, and HERV-W, and how TEs interact with various factors, including transactive response (TAR) DNA-binding protein 43 kDa (TDP-43) and the siRNA and piwi-interacting (pi)RNA systems. Studies of TEs in neurodegeneration have focused on Drosophila and, thus, further examination in mammals is needed. We suggest that therapeutic silencing of REs could help mitigate neurodegenerative disorders.
Collapse
Affiliation(s)
- Katie E Copley
- Department of Biochemistry and Biophysics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA; Neuroscience Graduate Group, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA
| | - James Shorter
- Department of Biochemistry and Biophysics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA; Neuroscience Graduate Group, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
9
|
Harringmeyer OS, Hoekstra HE. Chromosomal inversion polymorphisms shape the genomic landscape of deer mice. Nat Ecol Evol 2022; 6:1965-1979. [PMID: 36253543 PMCID: PMC9715431 DOI: 10.1038/s41559-022-01890-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 08/17/2022] [Indexed: 12/15/2022]
Abstract
Chromosomal inversions are an important form of structural variation that can affect recombination, chromosome structure and fitness. However, because inversions can be challenging to detect, the prevalence and hence the significance of inversions segregating within species remains largely unknown, especially in natural populations of mammals. Here, by combining population-genomic and long-read sequencing analyses in a single, widespread species of deer mouse (Peromyscus maniculatus), we identified 21 polymorphic inversions that are large (1.5-43.8 Mb) and cause near-complete suppression of recombination when heterozygous (0-0.03 cM Mb-1). We found that inversion breakpoints frequently occur in centromeric and telomeric regions and are often flanked by long inverted repeats (0.5-50 kb), suggesting that they probably arose via ectopic recombination. By genotyping inversions in populations across the species' range, we found that the inversions are often widespread and do not harbour deleterious mutational loads, and many are likely to be maintained as polymorphisms by divergent selection. Comparisons of forest and prairie ecotypes of deer mice revealed 13 inversions that contribute to differentiation between populations, of which five exhibit significant associations with traits implicated in local adaptation. Taken together, these results show that inversion polymorphisms have a significant impact on recombination, genome structure and genetic diversity in deer mice and likely facilitate local adaptation across the widespread range of this species.
Collapse
Affiliation(s)
- Olivia S Harringmeyer
- Department of Organismic & Evolutionary Biology, Department of Molecular & Cellular Biology, Museum of Comparative Zoology and Howard Hughes Medical Institute, Harvard University, Cambridge, MA, USA.
| | - Hopi E Hoekstra
- Department of Organismic & Evolutionary Biology, Department of Molecular & Cellular Biology, Museum of Comparative Zoology and Howard Hughes Medical Institute, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
10
|
Despot-Slade E, Širca S, Mravinac B, Castagnone-Sereno P, Plohl M, Meštrović N. Satellitome analyses in nematodes illuminate complex species history and show conserved features in satellite DNAs. BMC Biol 2022; 20:259. [PMCID: PMC9673304 DOI: 10.1186/s12915-022-01460-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 11/07/2022] [Indexed: 11/19/2022] Open
Abstract
Abstract
Background
Satellite DNAs (satDNAs) are tandemly repeated non-coding DNA sequences that belong to the most abundant and the fastest evolving parts of the eukaryotic genome. A satellitome represents the collection of different satDNAs in a genome. Due to extreme diversity and methodological difficulties to characterize and compare satDNA collection in complex genomes, knowledge on their putative functional constraints and capacity to participate in genome evolution remains rather elusive. SatDNA transcripts have been detected in many species, however comparative studies of satDNA transcriptome between species are extremely rare.
Results
We conducted a genome-wide survey and comparative analyses of satellitomes among different closely related Meloidogyne spp. nematodes. The evolutionary trends of satDNAs suggest that each round of proposed polyploidization in the evolutionary history is concomitant with the addition of a new set of satDNAs in the satellitome of any particular Meloidogyne species. Successive incorporation of new sets of satDNAs in the genome along the process of polyploidization supports multiple hybridization events as the main factor responsible for the formation of these species. Through comparative analyses of 83 distinct satDNAs, we found a CENP-B box-like sequence motif conserved among 11 divergent satDNAs (similarity ranges from 36 to 74%). We also found satDNAs that harbor a splice leader (SL) sequence which, in spite of overall divergence, shows conservation across species in two putative functional regions, the 25-nt SL exon and the Sm binding site. Intra- and interspecific comparative expression analyses of the complete satDNA set in the analyzed Meloidogyne species revealed transcription profiles including a subset of 14 actively transcribed satDNAs. Among those, 9 show active transcription in every species where they are found in the genome and throughout developmental stages.
Conclusions
Our results demonstrate the feasibility and power of comparative analysis of the non-coding repetitive genome for elucidation of the origin of species with a complex history. Although satDNAs generally evolve extremely quickly, the comparative analyses of 83 satDNAs detected in the analyzed Meloidogyne species revealed conserved sequence features in some satDNAs suggesting sequence evolution under selective pressure. SatDNAs that are actively transcribed in related genomes and throughout nematode development support the view that their expression is not stochastic.
Collapse
|
11
|
Haig D. Paradox lost: Concerted evolution and centromeric instability: Centromeres are hospitable habitats for repeats that evolve adaptations for proliferation within the nucleus sometimes at organismal cost.: Centromeres are hospitable habitats for repeats that evolve adaptations for proliferation within the nucleus sometimes at organismal cost. Bioessays 2022; 44:e2200023. [PMID: 35748194 DOI: 10.1002/bies.202200023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 06/07/2022] [Accepted: 06/09/2022] [Indexed: 11/11/2022]
Abstract
Homologous centromeres compete for segregation to the secondary oocyte nucleus at female meiosis I. Centromeric repeats also compete with each other to populate centromeres in mitotic cells of the germline and have become adapted to use the recombinational machinery present at centromeres to promote their own propagation. Repeats are not needed at centromeres, rather centromeres appear to be hospitable habitats for the colonization and proliferation of repeats. This is probably an indirect consequence of two distinctive features of centromeric DNA. Centromeres are subject to breakage by the mechanical forces exerted by microtubules and meiotic crossing-over is suppressed. Centromeric proteins acting in trans are under selection to mitigate the costs of centromeric repeats acting in cis. Collateral costs of mitotic competition at centromeres may help to explain the high rates of aneuploidy observed in early human embryos.
Collapse
Affiliation(s)
- David Haig
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| |
Collapse
|
12
|
Population Scale Analysis of Centromeric Satellite DNA Reveals Highly Dynamic Evolutionary Patterns and Genomic Organization in Long-Tailed and Rhesus Macaques. Cells 2022; 11:cells11121953. [PMID: 35741082 PMCID: PMC9221937 DOI: 10.3390/cells11121953] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 06/12/2022] [Accepted: 06/14/2022] [Indexed: 02/04/2023] Open
Abstract
Centromeric satellite DNA (cen-satDNA) consists of highly divergent repeat monomers, each approximately 171 base pairs in length. Here, we investigated the genetic diversity in the centromeric region of two primate species: long-tailed (Macaca fascicularis) and rhesus (Macaca mulatta) macaques. Fluorescence in situ hybridization and bioinformatic analysis showed the chromosome-specific organization and dynamic nature of cen-satDNAsequences, and their substantial diversity, with distinct subfamilies across macaque populations, suggesting increased turnovers. Comparative genomics identified high level polymorphisms spanning a 120 bp deletion region and a remarkable interspecific variability in cen-satDNA size and structure. Population structure analysis detected admixture patterns within populations, indicating their high divergence and rapid evolution. However, differences in cen-satDNA profiles appear to not be involved in hybrid incompatibility between the two species. Our study provides a genomic landscape of centromeric repeats in wild macaques and opens new avenues for exploring their impact on the adaptive evolution and speciation of primates.
Collapse
|
13
|
Ivanova NG, Kartavtseva IV, Stefanova VN, Ostromyshenskii DI, Podgornaya OI. Tandem Repeat Diversity in Two Closely Related Hamster Species—The Chinese Hamster (Cricetulus griseus) and Striped Hamster (Cricetulus barabensis). Biomedicines 2022; 10:biomedicines10040925. [PMID: 35453675 PMCID: PMC9025346 DOI: 10.3390/biomedicines10040925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 04/12/2022] [Accepted: 04/13/2022] [Indexed: 11/16/2022] Open
Abstract
The Chinese hamster (Cricetulus griseus) and striped hamster (Cricetulus barabensis) are very closely related species with similar karyotypes. The karyotypes differ from each other by one Robertsonian rearrangement and X-chromosome morphology. The level of the tandem repeat (TR) sequences’ evolutional variability is high. The aim of the current work was to trace the TR distribution on the chromosomes of two very closely related species. The striped hamster genome has not yet been sequenced. We classified the Chinese hamster TR in the assemblies available and then compared the mode of the TR distribution in closely related species. Chinese and striped hamsters are separate species due to the relative species specificity of Chinese hamster TR and prominent differences in the TR distribution in both species. The TR variation observed within homologous striped hamster chromosomes is caused by a lack of inbreeding in natural populations. The set of TR tested could be used to examine the CHO lines’ instability that has been observed in heterochromatic regions.
Collapse
Affiliation(s)
- Nadezhda G. Ivanova
- Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia; (V.N.S.); (D.I.O.); (O.I.P.)
- Correspondence:
| | - Irina V. Kartavtseva
- Laboratory of Evolutionary Zoology, Federal Scientific Center of the East Asia Terrestrial Biodiversity, Vladivostok 690022, Russia;
| | - Vera N. Stefanova
- Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia; (V.N.S.); (D.I.O.); (O.I.P.)
| | - Dmitrii I. Ostromyshenskii
- Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia; (V.N.S.); (D.I.O.); (O.I.P.)
| | - Olga I. Podgornaya
- Laboratory of Noncoding DNA, Institute of Cytology RAS, Saint Petersburg 194064, Russia; (V.N.S.); (D.I.O.); (O.I.P.)
- Department of Cytology and Histology, Faculty of Biology, St. Petersburg State University, Saint Petersburg 199034, Russia
| |
Collapse
|
14
|
Bruijnesteijn J, van der Wiel M, de Groot NG, Bontrop RE. Rapid Characterization of Complex Killer Cell Immunoglobulin-Like Receptor (KIR) Regions Using Cas9 Enrichment and Nanopore Sequencing. Front Immunol 2021; 12:722181. [PMID: 34594334 PMCID: PMC8476923 DOI: 10.3389/fimmu.2021.722181] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Accepted: 08/27/2021] [Indexed: 12/24/2022] Open
Abstract
Long-read sequencing approaches have considerably improved the quality and contiguity of genome assemblies. Such platforms bear the potential to resolve even extremely complex regions, such as multigenic immune families and repetitive stretches of DNA. Deep sequencing coverage, however, is required to overcome low nucleotide accuracy, especially in regions with high homopolymer density, copy number variation, and sequence similarity, such as the MHC and KIR gene clusters of the immune system. Therefore, we have adapted a targeted enrichment protocol in combination with long-read sequencing to efficiently annotate complex KIR gene regions. Using Cas9 endonuclease activity, segments of the KIR gene cluster were enriched and sequenced on an Oxford Nanopore Technologies platform. This provided sufficient coverage to accurately resolve and phase highly complex KIR haplotypes. Our strategy eliminates PCR-induced amplification errors, facilitates rapid characterization of large and complex multigenic regions, including its epigenetic footprint, and is applicable in multiple species, even in the absence of a reference genome.
Collapse
Affiliation(s)
- Jesse Bruijnesteijn
- Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, Netherlands
| | - Marit van der Wiel
- Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, Netherlands
| | - Natasja G de Groot
- Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, Netherlands
| | - Ronald E Bontrop
- Comparative Genetics and Refinement, Biomedical Primate Research Centre, Rijswijk, Netherlands.,Theoretical Biology and Bioinformatics, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
15
|
Valeri MP, Dias GB, do Espírito Santo AA, Moreira CN, Yonenaga-Yassuda Y, Sommer IB, Kuhn GCS, Svartman M. First Description of a Satellite DNA in Manatees' Centromeric Regions. Front Genet 2021; 12:694866. [PMID: 34504514 PMCID: PMC8421680 DOI: 10.3389/fgene.2021.694866] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 07/30/2021] [Indexed: 11/18/2022] Open
Abstract
Trichechus manatus and Trichechus inunguis are the two Sirenia species that occur in the Americas. Despite their increasing extinction risk, many aspects of their biology remain understudied, including the repetitive DNA fraction of their genomes. Here we used the sequenced genome of T. manatus and TAREAN to identify satellite DNAs (satDNAs) in this species. We report the first description of TMAsat, a satDNA comprising ~0.87% of the genome, with ~684bp monomers and centromeric localization. In T. inunguis, TMAsat showed similar monomer length, chromosome localization and conserved CENP-B box-like motifs as in T. manatus. We also detected this satDNA in the Dugong dugon and in the now extinct Hydrodamalis gigas genomes. The neighbor-joining tree shows that TMAsat sequences from T. manatus, T. inunguis, D. dugon, and H. gigas lack species-specific clusters, which disagrees with the predictions of concerted evolution. We detected a divergent TMAsat-like homologous sequence in elephants and hyraxes, but not in other mammals, suggesting this sequence was already present in the common ancestor of Paenungulata, and later became a satDNA in the Sirenians. This is the first description of a centromeric satDNA in manatees and will facilitate the inclusion of Sirenia in future studies of centromeres and satDNA biology.
Collapse
Affiliation(s)
- Mirela Pelizaro Valeri
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Guilherme Borges Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| | - Alice Alves do Espírito Santo
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Camila Nascimento Moreira
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Yatiyo Yonenaga-Yassuda
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Iara Braga Sommer
- Centro Nacional de Pesquisa e Conservação da Biodiversidade Marinha do Nordeste, Instituto Chico Mendes de Conservação da Biodiversidade, Brasília, Brazil
| | - Gustavo C. S. Kuhn
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Marta Svartman
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| |
Collapse
|
16
|
Despot-Slade E, Mravinac B, Širca S, Castagnone-Sereno P, Plohl M, Meštrović N. The Centromere Histone Is Conserved and Associated with Tandem Repeats Sharing a Conserved 19-bp Box in the Holocentromere of Meloidogyne Nematodes. Mol Biol Evol 2021; 38:1943-1965. [PMID: 33399875 PMCID: PMC8097292 DOI: 10.1093/molbev/msaa336] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Although centromeres have conserved function, centromere-specific histone H3 (CenH3) and centromeric DNA evolve rapidly. The centromere drive model explains this phenomenon as a consequence of the conflict between fast-evolving DNA and CenH3, suggesting asymmetry in female meiosis as a crucial factor. We characterized evolution of the CenH3 protein in three closely related, polyploid mitotic parthenogenetic species of the Meloidogyne incognita group, and in the distantly related meiotic parthenogen Meloidogyne hapla. We identified duplication of the CenH3 gene in a putative sexual ancestral Meloidogyne. We found that one CenH3 (αCenH3) remained conserved in all extant species, including in distant Meloidogyne hapla, whereas the other evolved rapidly and under positive selection into four different CenH3 variants. This pattern of CenH3 evolution in Meloidogyne species suggests the subspecialization of CenH3s in ancestral sexual species. Immunofluorescence performed on mitotic Meloidogyne incognita revealed a dominant role of αCenH3 on its centromere, whereas the other CenH3s have lost their function in mitosis. The observed αCenH3 chromosome distribution disclosed cluster-like centromeric organization. The ChIP-Seq analysis revealed that in M. incognita αCenH3-associated DNA dominantly comprises tandem repeats, composed of divergent monomers which share a completely conserved 19-bp long box. Conserved αCenH3-associated DNA is also confirmed in the related mitotic Meloidogyne incognita group species suggesting preservation of both centromere protein and DNA constituents. We hypothesize that the absence of centromere drive in mitosis might allow for CenH3 and its associated DNA to achieve an equilibrium in which they can persist for long periods of time.
Collapse
Affiliation(s)
| | | | - Saša Širca
- Agricultural Institute Slovenia, Ljubljana, Slovenia
| | | | | | | |
Collapse
|
17
|
Arora UP, Charlebois C, Lawal RA, Dumont BL. Population and subspecies diversity at mouse centromere satellites. BMC Genomics 2021; 22:279. [PMID: 33865332 PMCID: PMC8052823 DOI: 10.1186/s12864-021-07591-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 04/08/2021] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Mammalian centromeres are satellite-rich chromatin domains that execute conserved roles in kinetochore assembly and chromosome segregation. Centromere satellites evolve rapidly between species, but little is known about population-level diversity across these loci. RESULTS We developed a k-mer based method to quantify centromere copy number and sequence variation from whole genome sequencing data. We applied this method to diverse inbred and wild house mouse (Mus musculus) genomes to profile diversity across the core centromere (minor) satellite and the pericentromeric (major) satellite repeat. We show that minor satellite copy number varies more than 10-fold among inbred mouse strains, whereas major satellite copy numbers span a 3-fold range. In contrast to widely held assumptions about the homogeneity of mouse centromere repeats, we uncover marked satellite sequence heterogeneity within single genomes, with diversity levels across the minor satellite exceeding those at the major satellite. Analyses in wild-caught mice implicate subspecies and population origin as significant determinants of variation in satellite copy number and satellite heterogeneity. Intriguingly, we also find that wild-caught mice harbor dramatically reduced minor satellite copy number and elevated satellite sequence heterogeneity compared to inbred strains, suggesting that inbreeding may reshape centromere architecture in pronounced ways. CONCLUSION Taken together, our results highlight the power of k-mer based approaches for probing variation across repetitive regions, provide an initial portrait of centromere variation across Mus musculus, and lay the groundwork for future functional studies on the consequences of natural genetic variation at these essential chromatin domains.
Collapse
Affiliation(s)
- Uma P Arora
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME, 04609, USA.
- Tufts University, Graduate School of Biomedical Sciences, 136 Harrison Ave, Boston, MA, 02111, USA.
| | | | | | - Beth L Dumont
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME, 04609, USA.
- Tufts University, Graduate School of Biomedical Sciences, 136 Harrison Ave, Boston, MA, 02111, USA.
| |
Collapse
|
18
|
Horse Clinical Cytogenetics: Recurrent Themes and Novel Findings. Animals (Basel) 2021; 11:ani11030831. [PMID: 33809432 PMCID: PMC8001954 DOI: 10.3390/ani11030831] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 03/12/2021] [Accepted: 03/13/2021] [Indexed: 12/17/2022] Open
Abstract
Clinical cytogenetic studies in horses have been ongoing for over half a century and clearly demonstrate that chromosomal disorders are among the most common non-infectious causes of decreased fertility, infertility, and congenital defects. Large-scale cytogenetic surveys show that almost 30% of horses with reproductive or developmental problems have chromosome aberrations, whereas abnormal karyotypes are found in only 2-5% of the general population. Among the many chromosome abnormalities reported in the horse, most are unique or rare. However, all surveys agree that there are two recurrent conditions: X-monosomy and SRY-negative XY male-to-female sex reversal, making up approximately 35% and 11% of all chromosome abnormalities, respectively. The two are signature conditions for the horse and rare or absent in other domestic species. The progress in equine genomics and the development of molecular tools, have qualitatively improved clinical cytogenetics today, allowing for refined characterization of aberrations and understanding the underlying molecular mechanisms. While cutting-edge genomics tools promise further improvements in chromosome analysis, they will not entirely replace traditional cytogenetics, which still is the most straightforward, cost-effective, and fastest approach for the initial evaluation of potential breeding animals and horses with reproductive or developmental disorders.
Collapse
|
19
|
Ivanova NG, Ostromyshenskii D, Podgornaya O. Tandem Repeat-Based Probes Support the Loop Model of Pericentromere Packing. Cytogenet Genome Res 2021; 161:93-102. [PMID: 33601374 DOI: 10.1159/000513228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Accepted: 08/18/2020] [Indexed: 11/19/2022] Open
Abstract
Constitutive heterochromatin is the most mysterious part of the eukaryotic genome. It forms vital chromosome regions such as the centromeric and the pericentromeric ones. The main component of heterochromatic regions are tandem repeats (TR), and their specific organization complicates assembly, annotation, and mapping of these regions. Unannotated and unmapped TR arrays are still present in database contigs. In this study, we used a set of TR in the genomes of the pig (Sus scrofa) and the Chinese hamster (Cricetulus griseus) identified with the help of bioinformatics techniques and determined the specificity of the designed probes. The signal of the 4 pig TR probes in spermatogenic cells was often ring-shaped, especially in primary spermatocytes. The rings were located in the regions relatively weakly stained with DAPI. The unique assembly of the centromeric region was traced using the hamster meiotic chromosomes. The probe specific to chromosome 5 was used. Two signals, arranged as rings, were seen at the pachytene stage, similar to those in the pig spermatogenic cells. In the spermatogenic cells of both pig and hamster, the rings appeared on the chromosomes with pericentromeric TR probes. Our observations support the loop model of the centromeric region, the size of the loops being about 50 kb.
Collapse
Affiliation(s)
- Nadezhda G Ivanova
- Laboratory of Non-coding DNA, Institute of Cytology RAS, St. Petersburg, Russian Federation,
| | | | - Olga Podgornaya
- Laboratory of Non-coding DNA, Institute of Cytology RAS, St. Petersburg, Russian Federation.,Department of Cytology and Histology, St. Petersburg State University, St. Petersburg, Russian Federation
| |
Collapse
|
20
|
The structure, function and evolution of a complete human chromosome 8. Nature 2021; 593:101-107. [PMID: 33828295 PMCID: PMC8099727 DOI: 10.1038/s41586-021-03420-7] [Citation(s) in RCA: 169] [Impact Index Per Article: 56.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 03/04/2021] [Indexed: 02/07/2023]
Abstract
The complete assembly of each human chromosome is essential for understanding human biology and evolution1,2. Here we use complementary long-read sequencing technologies to complete the linear assembly of human chromosome 8. Our assembly resolves the sequence of five previously long-standing gaps, including a 2.08-Mb centromeric α-satellite array, a 644-kb copy number polymorphism in the β-defensin gene cluster that is important for disease risk, and an 863-kb variable number tandem repeat at chromosome 8q21.2 that can function as a neocentromere. We show that the centromeric α-satellite array is generally methylated except for a 73-kb hypomethylated region of diverse higher-order α-satellites enriched with CENP-A nucleosomes, consistent with the location of the kinetochore. In addition, we confirm the overall organization and methylation pattern of the centromere in a diploid human genome. Using a dual long-read sequencing approach, we complete high-quality draft assemblies of the orthologous centromere from chromosome 8 in chimpanzee, orangutan and macaque to reconstruct its evolutionary history. Comparative and phylogenetic analyses show that the higher-order α-satellite structure evolved in the great ape ancestor with a layered symmetry, in which more ancient higher-order repeats locate peripherally to monomeric α-satellites. We estimate that the mutation rate of centromeric satellite DNA is accelerated by more than 2.2-fold compared to the unique portions of the genome, and this acceleration extends into the flanking sequence.
Collapse
|
21
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
22
|
Mendoza MN, Schalnus SA, Thomson B, Bellone RR, Juras R, Raudsepp T. Novel Complex Unbalanced Dicentric X-Autosome Rearrangement in a Thoroughbred Mare with a Mild Effect on the Phenotype. Cytogenet Genome Res 2020; 160:597-609. [PMID: 33152736 DOI: 10.1159/000511236] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 08/11/2020] [Indexed: 11/19/2022] Open
Abstract
Complex structural X chromosome abnormalities are rare in humans and animals, and not recurrent. Yet, each case provides a fascinating opportunity to evaluate X chromosome content and functional status in relation to the effect on the phenotype. Here, we report the first equine case of a complex unbalanced X-autosome rearrangement in a healthy but short in stature Thoroughbred mare. Studies of about 200 cells by chromosome banding and FISH revealed an abnormal 2n = 63,X,der(X;16) karyotype with a large dicentric derivative chromosome (der). The der was comprised of normal Xp material, a palindromic duplication of Xq12q21, and a translocation of chromosome 16 to the inverted Xq12q21 segment by the centromere, whereas the distal Xq22q29 was deleted from the der. Microsatellite genotyping determined a paternal origin of the der. While there was no option to experimentally investigate the status of X chromosome inactivation (XCI), the observed mild phenotype of this case suggested the following scenario to retain an almost normal genetic balance: active normal X, inactivated X-portion of the der, but without XCI spreading into the translocated chromosome 16. Cases like this present unique resources to acquire information about species-specific features of X regulation and the role of X-linked genes in development, health, and disease.
Collapse
Affiliation(s)
- Mayra N Mendoza
- Estación Experimental Agraria Chincha, Dirección de Recursos Genéticos y Biotecnología, Instituto Nacional de Innovación Agraria, Ica, Peru
| | - Sam A Schalnus
- Hagyard Equine Medical Institute, Lexington, Kentucky, USA
| | - Bitsy Thomson
- Hagyard Equine Medical Institute, Lexington, Kentucky, USA
| | - Rebecca R Bellone
- Department of Population Health and Reproduction, Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California, USA
| | - Rytis Juras
- Molecular Cytogenetics Laboratory, College of Veterinary Medicine and Biomedical Sciences,Texas A&M University, College Station, Texas, USA
| | - Terje Raudsepp
- Molecular Cytogenetics Laboratory, College of Veterinary Medicine and Biomedical Sciences,Texas A&M University, College Station, Texas, USA,
| |
Collapse
|
23
|
Sena RS, Heringer P, Valeri MP, Pereira VS, Kuhn GCS, Svartman M. Identification and characterization of satellite DNAs in two-toed sloths of the genus Choloepus (Megalonychidae, Xenarthra). Sci Rep 2020; 10:19202. [PMID: 33154538 PMCID: PMC7644632 DOI: 10.1038/s41598-020-76199-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 10/19/2020] [Indexed: 11/09/2022] Open
Abstract
Choloepus, the only extant genus of the Megalonychidae family, is composed of two living species of two-toed sloths: Choloepus didactylus and C. hoffmanni. In this work, we identified and characterized the main satellite DNAs (satDNAs) in the sequenced genomes of these two species. SATCHO1, the most abundant satDNA in both species, is composed of 117 bp tandem repeat sequences. The second most abundant satDNA, SATCHO2, is composed of ~ 2292 bp tandem repeats. Fluorescence in situ hybridization in C. hoffmanni revealed that both satDNAs are located in the centromeric regions of all chromosomes, except the X. In fact, these satDNAs present some centromeric characteristics in their sequences, such as dyad symmetries predicted to form secondary structures. PCR experiments indicated the presence of SATCHO1 sequences in two other Xenarthra species: the tree-toed sloth Bradypus variegatus and the anteater Myrmecophaga tridactyla. Nevertheless, SATCHO1 is present as large tandem arrays only in Choloepus species, thus likely representing a satDNA exclusively in this genus. Our results reveal interesting features of the satDNA landscape in Choloepus species with the potential to aid future phylogenetic studies in Xenarthra and mammalian genomes in general.
Collapse
Affiliation(s)
- Radarane Santos Sena
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Pedro Heringer
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Mirela Pelizaro Valeri
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | | | - Gustavo C S Kuhn
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Marta Svartman
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil.
| |
Collapse
|
24
|
Peichel CL, McCann SR, Ross JA, Naftaly AFS, Urton JR, Cech JN, Grimwood J, Schmutz J, Myers RM, Kingsley DM, White MA. Assembly of the threespine stickleback Y chromosome reveals convergent signatures of sex chromosome evolution. Genome Biol 2020; 21:177. [PMID: 32684159 PMCID: PMC7368989 DOI: 10.1186/s13059-020-02097-x] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 07/08/2020] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Heteromorphic sex chromosomes have evolved repeatedly across diverse species. Suppression of recombination between X and Y chromosomes leads to degeneration of the Y chromosome. The progression of degeneration is not well understood, as complete sequence assemblies of heteromorphic Y chromosomes have only been generated across a handful of taxa with highly degenerate sex chromosomes. Here, we describe the assembly of the threespine stickleback (Gasterosteus aculeatus) Y chromosome, which is less than 26 million years old and at an intermediate stage of degeneration. Our previous work identified that the non-recombining region between the X and the Y spans approximately 17.5 Mb on the X chromosome. RESULTS We combine long-read sequencing with a Hi-C-based proximity guided assembly to generate a 15.87 Mb assembly of the Y chromosome. Our assembly is concordant with cytogenetic maps and Sanger sequences of over 90 Y chromosome BAC clones. We find three evolutionary strata on the Y chromosome, consistent with the three inversions identified by our previous cytogenetic analyses. The threespine stickleback Y shows convergence with more degenerate sex chromosomes in the retention of haploinsufficient genes and the accumulation of genes with testis-biased expression, many of which are recent duplicates. However, we find no evidence for large amplicons identified in other sex chromosome systems. We also report an excellent candidate for the master sex-determination gene: a translocated copy of Amh (Amhy). CONCLUSIONS Together, our work shows that the evolutionary forces shaping sex chromosomes can cause relatively rapid changes in the overall genetic architecture of Y chromosomes.
Collapse
Affiliation(s)
- Catherine L. Peichel
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
- Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland
| | - Shaugnessy R. McCann
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
| | - Joseph A. Ross
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
- Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195 USA
| | | | - James R. Urton
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
- Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195 USA
| | - Jennifer N. Cech
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
- Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195 USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806 USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806 USA
| | - Richard M. Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806 USA
| | - David M. Kingsley
- Department of Developmental Biology and Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305 USA
| | - Michael A. White
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109 USA
- Department of Genetics, University of Georgia, Athens, GA 30602 USA
| |
Collapse
|
25
|
Gamba R, Fachinetti D. From evolution to function: Two sides of the same CENP-B coin? Exp Cell Res 2020; 390:111959. [DOI: 10.1016/j.yexcr.2020.111959] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 03/07/2020] [Accepted: 03/12/2020] [Indexed: 10/24/2022]
|
26
|
Li B, Li Z, Lu C, Chang L, Zhao D, Shen G, Kusakabe T, Xia Q, Zhao P. Heat Shock Cognate 70 Functions as A Chaperone for the Stability of Kinetochore Protein CENP-N in Holocentric Insect Silkworms. Int J Mol Sci 2019; 20:ijms20235823. [PMID: 31756960 PMCID: PMC6929194 DOI: 10.3390/ijms20235823] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 11/18/2019] [Accepted: 11/18/2019] [Indexed: 01/09/2023] Open
Abstract
The centromere, in which kinetochore proteins are assembled, plays an important role in the accurate congression and segregation of chromosomes during cell mitosis. Although the function of the centromere and kinetochore is conserved from monocentric to holocentric, the DNA sequences of the centromere and components of the kinetochore are varied among different species. Given the lack of core centromere protein A (CENP-A) and CENP-C in the lepidopteran silkworm Bombyx mori, which possesses holocentric chromosomes, here we investigated the role of CENP-N, another important member of the centromere protein family essential for kinetochore assembly. For the first time, cellular localization and RNA interference against CENP-N have confirmed its kinetochore function in silkworms. To gain further insights into the regulation of CENP-N in the centromere, we analyzed the affinity-purified complex of CENP-N by mass spectrometry and identified 142 interacting proteins. Among these factors, we found that the chaperone protein heat shock cognate 70 (HSC70) is able to regulate the stability of CENP-N by prohibiting ubiquitin-proteasome pathway, indicating that HSC70 could control cell cycle-regulated degradation of CENP-N at centromeres. Altogether, the present work will provide a novel clue to understand the regulatory mechanism for the kinetochore activity of CENP-N during the cell cycle.
Collapse
Affiliation(s)
- Bingqian Li
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| | - Zhiqing Li
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
- Correspondence:
| | - Chenchen Lu
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| | - Li Chang
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| | - Dongchao Zhao
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| | - Guanwang Shen
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| | - Takahiro Kusakabe
- Laboratory of Insect Genome Science, Kyushu University Graduate School of Bioresource and Bioenvironmental Sciences, Fukuoka 819-0395, Japan;
| | - Qingyou Xia
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| | - Ping Zhao
- Biological Science Research Center, Southwest University, Chongqing 400715, China; (B.L.); (C.L.); (L.C.); (D.Z.); (G.S.); (Q.X.); (P.Z.)
- Chongqing Key Laboratory of Sericultural Science, Chongqing Engineering and Technology Research Center for Novel Silk Materials, Southwest University, Chongqing 400715, China
| |
Collapse
|
27
|
Centromere Repeats: Hidden Gems of the Genome. Genes (Basel) 2019; 10:genes10030223. [PMID: 30884847 PMCID: PMC6471113 DOI: 10.3390/genes10030223] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 03/07/2019] [Accepted: 03/11/2019] [Indexed: 01/08/2023] Open
Abstract
Satellite DNAs are now regarded as powerful and active contributors to genomic and chromosomal evolution. Paired with mobile transposable elements, these repetitive sequences provide a dynamic mechanism through which novel karyotypic modifications and chromosomal rearrangements may occur. In this review, we discuss the regulatory activity of satellite DNA and their neighboring transposable elements in a chromosomal context with a particular emphasis on the integral role of both in centromere function. In addition, we discuss the varied mechanisms by which centromeric repeats have endured evolutionary processes, producing a novel, species-specific centromeric landscape despite sharing a ubiquitously conserved function. Finally, we highlight the role these repetitive elements play in the establishment and functionality of de novo centromeres and chromosomal breakpoints that underpin karyotypic variation. By emphasizing these unique activities of satellite DNAs and transposable elements, we hope to disparage the conventional exemplification of repetitive DNA in the historically-associated context of ‘junk’.
Collapse
|
28
|
Podgornaya OI, Ostromyshenskii DI, Enukashvily NI. Who Needs This Junk, or Genomic Dark Matter. BIOCHEMISTRY (MOSCOW) 2018; 83:450-466. [PMID: 29626931 DOI: 10.1134/s0006297918040156] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Centromeres (CEN), pericentromeric regions (periCEN), and subtelomeric regions (subTel) comprise the areas of constitutive heterochromatin (HChr). Tandem repeats (TRs or satellite DNA) are the main components of HChr forming no less than 10% of the mouse and human genome. HChr is assembled within distinct structures in the interphase nuclei of many species - chromocenters. In this review, the main classes of HChr repeat sequences are considered in the order of their number increase in the sequencing reads of the mouse chromocenters (ChrmC). TRs comprise ~70% of ChrmC occupying the first place. Non-LTR (-long terminal repeat) retroposons (mainly LINE, long interspersed nuclear element) are the next (~11%), and endogenous retroviruses (ERV; LTR-containing) are in the third position (~9%). HChr is not enriched with ERV in comparison with the whole genome, but there are differences in distribution of certain elements: while MaLR-like elements (ERV3) are dominant in the whole genome, intracisternal A-particles and corresponding LTR (ERV2) are prevalent in HChr. Most of LINE in ChrmC is represented by the 2-kb fragment at the end of the 2nd open reading frame and its flanking regions. Almost all tandem repeats classified as CEN or periCEN are contained in ChrmC. Our previous classification revealed 60 new mouse TR families with 29 of them being absent in ChrmC, which indicates their location on chromosome arms. TR transcription is necessary for maintenance of heterochromatic status of the HChr genome part. A burst of TR transcription is especially important in embryogenesis and other cases of radical changes in the cell program, including carcinogenesis. The recently discovered mechanism of epigenetic regulation with noncoding sequences transcripts, long noncoding RNA, and its role in embryogenesis and pluripotency maintenance is discussed.
Collapse
Affiliation(s)
- O I Podgornaya
- Institute of Cytology, Russian Academy of Sciences, St. Petersburg, 194064, Russia.
| | | | | |
Collapse
|
29
|
Vlahovic I, Gluncic M, Rosandic M, Ugarkovic Ð, Paar V. Regular Higher Order Repeat Structures in Beetle Tribolium castaneum Genome. Genome Biol Evol 2018; 9:2668-2680. [PMID: 27492235 PMCID: PMC5737470 DOI: 10.1093/gbe/evw174] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/21/2016] [Indexed: 02/07/2023] Open
Abstract
Higher order repeats (HORs) containing tandems of primary and secondary repeat units (head-to-tail “tandem within tandem pattern”), referred to as regular HORs, are typical for primate alpha satellite DNAs and most pronounced in human genome. Regular HORs are known to be a result of recent evolutionary processes. In non-primate genomes mostly so called complex HORs have been found, without head to tail tandem of primary repeat units. In beetle Tribolium castaneum, considered as a model case for genome studies, large tandem repeats have been identified, but no HORs have been reported. Here, using our novel robust repeat finding algorithm Global Repeat Map, we discover two regular and six complex HORs in T. castaneum. In organizational pattern, the integrity and homogeneity of regular HORs in T. castaneum resemble human regular HORs (with T. castaneum monomers different from human alpha satellite monomers), involving a wider range of monomer lengths than in human HORs. Similar regular higher order repeat structures have previously not been found in insects. Some of these novel HORs in T. castaneum appear as most regular among known HORs in non-primate genomes, although with substantial riddling. This is intriguing, in particular from the point of view of role of non-coding repeats in modulation of gene expression.
Collapse
Affiliation(s)
- Ines Vlahovic
- Faculty of Science, University of Zagreb, Zagreb, Croatia
| | - Matko Gluncic
- Faculty of Science, University of Zagreb, Zagreb, Croatia
| | | | | | - Vladimir Paar
- Faculty of Science, University of Zagreb, Zagreb, Croatia.,Croatian Academy of Sciences and Arts, Zagreb, Croatia
| |
Collapse
|
30
|
Ostromyshenskii DI, Chernyaeva EN, Kuznetsova IS, Podgornaya OI. Mouse chromocenters DNA content: sequencing and in silico analysis. BMC Genomics 2018; 19:151. [PMID: 29458329 PMCID: PMC5819297 DOI: 10.1186/s12864-018-4534-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 02/06/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Chromocenters are defined as a punctate condensed blocks of chromatin in the interphase cell nuclei of certain cell types with unknown biological significance. In recent years a progress in revealing of chromocenters protein content has been made although the details of DNA content within constitutive heterochromatin still remain unclear. It is known that these regions are enriched in tandem repeats (TR) and transposable elements. Quick improvement of genome sequencing does not help to assemble the heterochromatic regions due to lack of appropriate bioinformatics techniques. RESULTS Chromocenters DNA have been isolated by a biochemical approach from mouse liver cells nuclei and sequenced on the Illumina MiSeq resulting in ChrmC dataset. Analysis of ChrmC dataset by the bioinformatics tools available revealed that the major component of chromocenter DNA are TRs: ~ 66% MaSat and ~ 4% MiSat. Other previously classified TR families constitute ~ 1% of ChrmC dataset. About 6% of chromocenters DNA are mostly unannotated sequences. In the contigs assembled with IDBA_UD there are many fragments of heterochromatic Y-chromosome, rDNA and other pseudo-genes and non-coding DNA. A protein coding sfi1 homolog gene fragment was also found in contigs. The Sfi1 homolog gene is located on the chromosome 11 in the reference genome very close to the Golden Pass Gap (a ~ 3 Mb empty region reserved to the pericentromeric region) and proves the purity of chromocenters isolation. The second major fraction are non-LTR retroposons (SINE and LINE) with overwhelming majority of LINE - ~ 11% of ChrmC. Most of the LINE fragments are from the ~ 2 kb region at the end of the 2nd ORF and its' flanking region. The precise LINEs' segment of ~ 2 kb is the necessary mouse constitutive heterohromatin component together with TR. The third most abundant fraction are ERVs. The ERV distribution in chromocenters differs from the whole genome: IAP (ERV2 class) is the most numerous in ChrmC while MaLR (ERV3 class) prevails in the reference genome. IAP and its LTR also prevail in TR containing contigs extracted from the WGS dataset. In silico prediction of IAP and LINE fragments in chromocenters was confirmed by direct fluorescent in situ hybridization (FISH). CONCLUSION Our data of chromocenters' DNA (ChrmC) sequencing demonstrate that IAP with LTR and a precise ~ 2 kb fragment of LINE represent a substantial fraction of mouse chromocenters (constitutive heteroсhromatin) along with TRs.
Collapse
Affiliation(s)
- Dmitrii I Ostromyshenskii
- Institute of Cytology RAS, St.-Petersburg, 194064, Russia.
- Far Eastern Federal University, Vladivostok, 690922, Russia.
| | | | - Inna S Kuznetsova
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Olga I Podgornaya
- Institute of Cytology RAS, St.-Petersburg, 194064, Russia
- Far Eastern Federal University, Vladivostok, 690922, Russia
- St Petersburg State University, St Petersburg, 199034, Russia
| |
Collapse
|
31
|
Klein SJ, O'Neill RJ. Transposable elements: genome innovation, chromosome diversity, and centromere conflict. Chromosome Res 2018; 26:5-23. [PMID: 29332159 PMCID: PMC5857280 DOI: 10.1007/s10577-017-9569-5] [Citation(s) in RCA: 106] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 12/05/2017] [Accepted: 12/12/2017] [Indexed: 12/21/2022]
Abstract
Although it was nearly 70 years ago when transposable elements (TEs) were first discovered “jumping” from one genomic location to another, TEs are now recognized as contributors to genomic innovations as well as genome instability across a wide variety of species. In this review, we illustrate the ways in which active TEs, specifically retroelements, can create novel chromosome rearrangements and impact gene expression, leading to disease in some cases and species-specific diversity in others. We explore the ways in which eukaryotic genomes have evolved defense mechanisms to temper TE activity and the ways in which TEs continue to influence genome structure despite being rendered transpositionally inactive. Finally, we focus on the role of TEs in the establishment, maintenance, and stabilization of critical, yet rapidly evolving, chromosome features: eukaryotic centromeres. Across centromeres, specific types of TEs participate in genomic conflict, a balancing act wherein they are actively inserting into centromeric domains yet are harnessed for the recruitment of centromeric histones and potentially new centromere formation.
Collapse
Affiliation(s)
- Savannah J Klein
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269, USA
| | - Rachel J O'Neill
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269, USA.
| |
Collapse
|
32
|
Larsen PA, Harris RA, Liu Y, Murali SC, Campbell CR, Brown AD, Sullivan BA, Shelton J, Brown SJ, Raveendran M, Dudchenko O, Machol I, Durand NC, Shamim MS, Aiden EL, Muzny DM, Gibbs RA, Yoder AD, Rogers J, Worley KC. Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus). BMC Biol 2017; 15:110. [PMID: 29145861 PMCID: PMC5689209 DOI: 10.1186/s12915-017-0439-6] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Accepted: 10/10/2017] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications. Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly. METHODS We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome. RESULTS We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies. CONCLUSIONS We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness. Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes.
Collapse
Affiliation(s)
- Peter A. Larsen
- Department of Biology, Duke University, Durham, NC 27708 USA
| | - R. Alan Harris
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Yue Liu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
| | - Shwetha C. Murali
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
- Present address: Department of Genome Sciences, University of Washington, Seattle, WA 98195 USA
| | | | - Adam D. Brown
- Department of Pharmacology and Cancer Biology, Duke University, Durham, NC 27710 USA
- Present address: Bristol Myers-Squibb, 420 W Round Grove Rd, Lewisville, TX 75067 USA
| | - Beth A. Sullivan
- Department of Molecular Genetics and Microbiology, Duke University, Durham, NC 27710 USA
| | - Jennifer Shelton
- Kansas State University Bioinformatics Center, Division of Biology, Kansas State University, Manhattan, KS 66506 USA
- Present address: New York Genome Center, 101 Avenue of the Americas, New York, NY 10013 USA
| | - Susan J. Brown
- Kansas State University Bioinformatics Center, Division of Biology, Kansas State University, Manhattan, KS 66506 USA
| | | | - Olga Dudchenko
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
- The Center for Theoretical Biological Physics, Rice University, Houston, TX 77005 USA
- Department of Computer Science, Rice University, Houston, TX 77005 USA
| | - Ido Machol
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
- The Center for Theoretical Biological Physics, Rice University, Houston, TX 77005 USA
- Department of Computer Science, Rice University, Houston, TX 77005 USA
| | - Neva C. Durand
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
- The Center for Theoretical Biological Physics, Rice University, Houston, TX 77005 USA
- Department of Computer Science, Rice University, Houston, TX 77005 USA
| | - Muhammad S. Shamim
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
- The Center for Theoretical Biological Physics, Rice University, Houston, TX 77005 USA
- Department of Computer Science, Rice University, Houston, TX 77005 USA
| | - Erez Lieberman Aiden
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
- The Center for Theoretical Biological Physics, Rice University, Houston, TX 77005 USA
- Department of Computer Science, Rice University, Houston, TX 77005 USA
| | - Donna M. Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Richard A. Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Anne D. Yoder
- Department of Biology, Duke University, Durham, NC 27708 USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| | - Kim C. Worley
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030 USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030 USA
| |
Collapse
|
33
|
Garrido-Ramos MA. Satellite DNA: An Evolving Topic. Genes (Basel) 2017; 8:genes8090230. [PMID: 28926993 PMCID: PMC5615363 DOI: 10.3390/genes8090230] [Citation(s) in RCA: 222] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 09/12/2017] [Accepted: 09/13/2017] [Indexed: 12/22/2022] Open
Abstract
Satellite DNA represents one of the most fascinating parts of the repetitive fraction of the eukaryotic genome. Since the discovery of highly repetitive tandem DNA in the 1960s, a lot of literature has extensively covered various topics related to the structure, organization, function, and evolution of such sequences. Today, with the advent of genomic tools, the study of satellite DNA has regained a great interest. Thus, Next-Generation Sequencing (NGS), together with high-throughput in silico analysis of the information contained in NGS reads, has revolutionized the analysis of the repetitive fraction of the eukaryotic genomes. The whole of the historical and current approaches to the topic gives us a broad view of the function and evolution of satellite DNA and its role in chromosomal evolution. Currently, we have extensive information on the molecular, chromosomal, biological, and population factors that affect the evolutionary fate of satellite DNA, knowledge that gives rise to a series of hypotheses that get on well with each other about the origin, spreading, and evolution of satellite DNA. In this paper, I review these hypotheses from a methodological, conceptual, and historical perspective and frame them in the context of chromosomal organization and evolution.
Collapse
Affiliation(s)
- Manuel A Garrido-Ramos
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain.
| |
Collapse
|
34
|
de Sotero-Caio CG, Cabral-de-Mello DC, Calixto MDS, Valente GT, Martins C, Loreto V, de Souza MJ, Santos N. Centromeric enrichment of LINE-1 retrotransposons and its significance for the chromosome evolution of Phyllostomid bats. Chromosome Res 2017; 25:313-325. [PMID: 28916913 DOI: 10.1007/s10577-017-9565-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2017] [Revised: 08/24/2017] [Accepted: 08/28/2017] [Indexed: 10/18/2022]
Abstract
Despite their ubiquitous incidence, little is known about the chromosomal distribution of long interspersed elements (LINEs) in mammalian genomes. Phyllostomid bats, characterized by lineages with distinct trends of chromosomal evolution coupled with remarkable ecological and taxonomic diversity, represent good models to understand how these repetitive sequences contribute to the evolution of genome architecture and its link to lineage diversification. To test the hypothesis that LINE-1 sequences were important modifiers of bat genome architecture, we characterized the distribution of LINE-1-derived sequences on genomes of 13 phyllostomid species within a phylogenetic framework. We found massive accumulation of LINE-1 elements in the centromeres of most species: a rare phenomenon on mammalian genomes. We hypothesize that expansion of these elements has occurred early in the radiation of phyllostomids and recurred episodically. LINE-1 expansions on centromeric heterochromatin probably spurred chromosomal change before the radiation of phyllostomids into the extant 11 subfamilies and contributed to the high degree of karyotypic variation observed among different lineages. Understanding centromere architecture in a variety of taxa promises to explain how lineage-specific changes on centromere structure can contribute to karyotypic diversity while not disrupting functional constraints for proper cell division.
Collapse
Affiliation(s)
- Cibele Gomes de Sotero-Caio
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil. .,Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA.
| | - Diogo Cavalcanti Cabral-de-Mello
- Departamento de Biologia, Grupo de Estudos em Citogenômica e Evolução Animal, UNESP-Universidade Estadual Paulista, Instituto de Biociências, Rio Claro, SP, Brazil
| | - Merilane da Silva Calixto
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil.,Centro de Saúde e Tecnologia, Unidade Acadêmica de Ciências Biológicas, UFCG-Universidade Federal de Campina Grande, Patos, PB, Brazil
| | - Guilherme Targino Valente
- Departamento de Bioprocessos e Biotecnologia da Faculdade de Ciências Agronômicas, UNESP-Universidade Estadual Paulista, Botucatu, SP, Brazil
| | - Cesar Martins
- Departamento de Morfologia, Laboratório Genômica Integrativa, UNESP-Universidade Estadual Paulista, Botucatu, SP, Brazil
| | - Vilma Loreto
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil
| | - Maria José de Souza
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil
| | - Neide Santos
- Departamento de Genética, Laboratório de Genética e Citogenética Animal e Humana, UFPE-Universidade Federal de Pernambuco, Av. da Engenharia s/n; Cidade Universitária, Recife, PE, CEP:50740-600, Brazil
| |
Collapse
|
35
|
Gent JI, Wang N, Dawe RK. Stable centromere positioning in diverse sequence contexts of complex and satellite centromeres of maize and wild relatives. Genome Biol 2017. [PMID: 28637491 PMCID: PMC5480163 DOI: 10.1186/s13059-017-1249-4] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Background Paradoxically, centromeres are known both for their characteristic repeat sequences (satellite DNA) and for being epigenetically defined. Maize (Zea mays mays) is an attractive model for studying centromere positioning because many of its large (~2 Mb) centromeres are not dominated by satellite DNA. These centromeres, which we call complex centromeres, allow for both assembly into reference genomes and for mapping short reads from ChIP-seq with antibodies to centromeric histone H3 (cenH3). Results We found frequent complex centromeres in maize and its wild relatives Z. mays parviglumis, Z. mays mexicana, and particularly Z. mays huehuetenangensis. Analysis of individual plants reveals minor variation in the positions of complex centromeres among siblings. However, such positional shifts are stochastic and not heritable, consistent with prior findings that centromere positioning is stable at the population level. Centromeres are also stable in multiple F1 hybrid contexts. Analysis of repeats in Z. mays and other species (Zea diploperennis, Zea luxurians, and Tripsacum dactyloides) reveals tenfold differences in abundance of the major satellite CentC, but similar high levels of sequence polymorphism in individual CentC copies. Deviation from the CentC consensus has little or no effect on binding of cenH3. Conclusions These data indicate that complex centromeres are neither a peculiarity of cultivation nor inbreeding in Z. mays. While extensive arrays of CentC may be the norm for other Zea and Tripsacum species, these data also reveal that a wide diversity of DNA sequences and multiple types of genetic elements in and near centromeres support centromere function and constrain centromere positions. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1249-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jonathan I Gent
- Department of Plant Biology, University of Georgia, Athens, USA
| | - Na Wang
- Department of Plant Biology, University of Georgia, Athens, USA
| | - R Kelly Dawe
- Department of Plant Biology, University of Georgia, Athens, USA. .,Department of Genetics, University of Georgia, Athens, USA.
| |
Collapse
|
36
|
Vogel H, Jähnert M, Stadion M, Matzke D, Scherneck S, Schürmann A. A vast genomic deletion in the C56BL/6 genome affects different genes within the Ifi200 cluster on chromosome 1 and mediates obesity and insulin resistance. BMC Genomics 2017; 18:172. [PMID: 28201990 PMCID: PMC5312539 DOI: 10.1186/s12864-017-3552-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 02/03/2017] [Indexed: 04/09/2023] Open
Abstract
Background Obesity, the excessive accumulation of body fat, is a highly heritable and genetically heterogeneous disorder. The complex, polygenic basis for the disease consisting of a network of different gene variants is still not completely known. Results In the current study we generated a BAC library of the obese-prone NZO strain to clarify the genomic alteration within the gene cluster Ifi200 on chr.1 including Ifi202b, an obesity gene that is in contrast to NZO not expressed in the lean B6 mouse. With the PacBio sequencing data of NZO BAC clones we identified a deletion spanning approximately 261.8 kb in the B6 reference genome. The deletion affects different members of the Ifi200 gene family which also includes the original first exon and 5′-regulatory parts of the Ifi202b gene and suggests to be the relevant cause of its expression deficiency in B6. In addition, the generation and characterization of congenic mice carrying the critical fragment on the B6 background demonstrate its crucial role for obesity and insulin resistance. Conclusions Our data reveal the reconstruction of a complex genomic region on mouse chr.1 resulting from deletions and duplications of Ifi200 genes and suggest to be relevant for the development of obesity. The results further demonstrate the complexity of the disease and highlight the importance for studying rare genetic variants as they can be causal for large effects.
Collapse
Affiliation(s)
- Heike Vogel
- Department of Experimental Diabetology, German Institute of Human Nutrition Potsdam-Rehbruecke, Arthur-Scheunert Allee 114-116, D-14558, Nuthetal, Germany.,German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764, München-Neuherberg, Germany
| | - Markus Jähnert
- Department of Experimental Diabetology, German Institute of Human Nutrition Potsdam-Rehbruecke, Arthur-Scheunert Allee 114-116, D-14558, Nuthetal, Germany.,German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764, München-Neuherberg, Germany
| | - Mandy Stadion
- Department of Experimental Diabetology, German Institute of Human Nutrition Potsdam-Rehbruecke, Arthur-Scheunert Allee 114-116, D-14558, Nuthetal, Germany.,German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764, München-Neuherberg, Germany
| | - Daniela Matzke
- Department of Experimental Diabetology, German Institute of Human Nutrition Potsdam-Rehbruecke, Arthur-Scheunert Allee 114-116, D-14558, Nuthetal, Germany.,German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764, München-Neuherberg, Germany
| | - Stephan Scherneck
- Institute of Pharmacology and Toxicology, University of Braunschweig, Mendelssohnstr. 1, 38106, Braunschweig, Germany
| | - Annette Schürmann
- Department of Experimental Diabetology, German Institute of Human Nutrition Potsdam-Rehbruecke, Arthur-Scheunert Allee 114-116, D-14558, Nuthetal, Germany. .,German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764, München-Neuherberg, Germany.
| |
Collapse
|
37
|
Vozdova M, Kubickova S, Cernohorska H, Fröhlich J, Rubes J. Satellite DNA Sequences in Canidae and Their Chromosome Distribution in Dog and Red Fox. Cytogenet Genome Res 2017; 150:118-127. [PMID: 28122375 DOI: 10.1159/000455081] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/04/2016] [Indexed: 11/19/2022] Open
Abstract
Satellite DNA is a characteristic component of mammalian centromeric heterochromatin, and a comparative analysis of its evolutionary dynamics can be used for phylogenetic studies. We analysed satellite and satellite-like DNA sequences available in NCBI for 4 species of the family Canidae (red fox, Vulpes vulpes, VVU; domestic dog, Canis familiaris, CFA; arctic fox, Vulpes lagopus, VLA; raccoon dog, Nyctereutes procyonoides procyonoides, NPR) by comparative sequence analysis, which revealed 86-90% intraspecies and 76-79% interspecies similarity. Comparative fluorescence in situ hybridisation in the red fox and dog showed signals of the red fox satellite probe in canine and vulpine autosomal centromeres, on VVUY, B chromosomes, and in the distal parts of VVU9q and VVU10p which were shown to contain nucleolus organiser regions. The CFA satellite probe stained autosomal centromeres only in the dog. The CFA satellite-like DNA did not show any significant sequence similarity with the satellite DNA of any species analysed and was localised to the centromeres of 9 canine chromosome pairs. No significant heterochromatin block was detected on the B chromosomes of the red fox. Our results show extensive heterogeneity of satellite sequences among Canidae and prove close evolutionary relationships between the red and arctic fox.
Collapse
Affiliation(s)
- Miluse Vozdova
- Central European Institute of Technology - Veterinary Research Institute, Brno, Czech Republic
| | | | | | | | | |
Collapse
|
38
|
Giulotto E, Raimondi E, Sullivan KF. The Unique DNA Sequences Underlying Equine Centromeres. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2017; 56:337-354. [PMID: 28840244 DOI: 10.1007/978-3-319-58592-5_14] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Centromeres are highly distinctive genetic loci whose function is specified largely by epigenetic mechanisms. Understanding the role of DNA sequences in centromere function has been a daunting task due to the highly repetitive nature of centromeres in animal chromosomes. The discovery of a centromere devoid of satellite DNA in the domestic horse consolidated observations on the epigenetic nature of centromere identity, showing that entirely natural chromosomes could function without satellite DNA cues. Horses belong to the genus Equus which exhibits a very high degree of evolutionary plasticity in centromere position and DNA sequence composition. Examination of horses has revealed that the position of the satellite-free centromere is variable among individuals. Analysis of centromere location and composition in other Equus species, including domestic donkey and zebras, confirms that the satellite-less configuration of centromeres is common in this group which has undergone particularly rapid karyotype evolution. These features have established the equids as a new mammalian system in which to investigate the molecular organization, dynamics and evolutionary behaviour of centromeres.
Collapse
Affiliation(s)
- Elena Giulotto
- Dipartimento di Biologia e Biotecnologie, Università di Pavia, Via Ferrata 1, 27100, Pavia, Italy.
| | - Elena Raimondi
- Dipartimento di Biologia e Biotecnologie, Università di Pavia, Via Ferrata 1, 27100, Pavia, Italy
| | - Kevin F Sullivan
- National University of Ireland Galway, University Road, Galway, Ireland
| |
Collapse
|
39
|
Dumont M, Fachinetti D. DNA Sequences in Centromere Formation and Function. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2017; 56:305-336. [PMID: 28840243 DOI: 10.1007/978-3-319-58592-5_13] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Faithful chromosome segregation during cell division depends on the centromere, a complex DNA/protein structure that links chromosomes to spindle microtubules. This chromosomal domain has to be marked throughout cell division and its chromosomal localization preserved across cell generations. From fission yeast to human, centromeres are established on a series of repetitive DNA sequences and on specialized centromeric chromatin. This chromatin is enriched with the histone H3 variant, named CENP-A, that was demonstrated to be the epigenetic mark that maintains centromere identity and function indefinitely. Although centromere identity is thought to be exclusively epigenetic, the presence of specific DNA sequences in the majority of eukaryotes and of the centromeric protein CENP-B that binds to these sequences, suggests the existence of a genetic component as well. In this review, we will highlight the importance of centromeric sequences for centromere formation and function, and discuss the centromere DNA sequence/CENP-B paradox.
Collapse
Affiliation(s)
- M Dumont
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005, Paris, France
| | - D Fachinetti
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005, Paris, France.
| |
Collapse
|
40
|
Suntronpong A, Kugou K, Masumoto H, Srikulnath K, Ohshima K, Hirai H, Koga A. CENP-B box, a nucleotide motif involved in centromere formation, occurs in a New World monkey. Biol Lett 2016; 12:20150817. [PMID: 27029836 DOI: 10.1098/rsbl.2015.0817] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2015] [Accepted: 03/04/2016] [Indexed: 01/01/2023] Open
Abstract
Centromere protein B (CENP-B) is one of the major proteins involved in centromere formation, binding to centromeric repetitive DNA by recognizing a 17 bp motif called the CENP-B box. Hominids (humans and great apes) carry large numbers of CENP-B boxes in alpha satellite DNA (AS, the major centromeric repetitive DNA of simian primates). Only negative results have been reported regarding the presence of the CENP-B box in other primate taxa. Consequently, it is widely believed that the CENP-B box is confined, within primates, to the hominids. We report here that the common marmoset, a New World monkey, contains an abundance of CENP-B boxes in its AS. First, in a long contig sequence we constructed and analysed, we identified the motif in 17 of the 38 alpha satellite repeat units. We then sequenced terminal regions of additional clones and found the motif in many of them. Immunostaining of marmoset cells demonstrated that CENP-B binds to DNA in the centromeric regions of chromosomes. Therefore, functional CENP-B boxes are not confined to hominids. Our results indicate that the efficiency of identification of the CENP-B box may depend largely on the sequencing methods used, and that the CENP-B box in centromeric repetitive DNA may be more common than researchers previously thought.
Collapse
Affiliation(s)
- Aorarat Suntronpong
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan Faculty of Science, Kasetsart University, Bangkok 10900, Thailand
| | - Kazuto Kugou
- Department of Frontier Research, Kazusa DNA Research Institute, Kisarazu 292-0818, Japan
| | - Hiroshi Masumoto
- Department of Frontier Research, Kazusa DNA Research Institute, Kisarazu 292-0818, Japan
| | | | - Kazuhiko Ohshima
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama 526-0829, Japan
| | - Hirohisa Hirai
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan
| | - Akihiko Koga
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan
| |
Collapse
|
41
|
Zhu Z, Gui S, Jin J, Yi R, Wu Z, Qian Q, Ding Y. The NnCenH3 protein and centromeric DNA sequence profiles of Nelumbo nucifera Gaertn. (sacred lotus) reveal the DNA structures and dynamics of centromeres in basal eudicots. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 87:568-582. [PMID: 27227686 DOI: 10.1111/tpj.13219] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2016] [Revised: 05/15/2016] [Accepted: 05/23/2016] [Indexed: 06/05/2023]
Abstract
Centromeres on eukaryotic chromosomes consist of large arrays of DNA repeats that undergo very rapid evolution. Nelumbo nucifera Gaertn. (sacred lotus) is a phylogenetic relict and an aquatic perennial basal eudicot. Studies concerning the centromeres of this basal eudicot species could provide ancient evolutionary perspectives. In this study, we characterized the centromeric marker protein NnCenH3 (sacred lotus centromere-specific histone H3 variant), and used a chromatin immunoprecipitation (ChIP)-based technique to recover the NnCenH3 nucleosome-associated sequences of sacred lotus. The properties of the centromere-binding protein and DNA sequences revealed notable divergence between sacred lotus and other flowering plants, including the following factors: (i) an NnCenH3 alternative splicing variant comprising only a partial centromere-targeting domain, (ii) active genes with low transcription levels in the NnCenH3 nucleosomal regions, and (iii) the prevalence of the Ty1/copia class of long terminal repeat (LTR) retrotransposons in the centromeres of sacred lotus chromosomes. In addition, the dynamic natures of the centromeric region showed that some of the centromeric repeat DNA sequences originated from telomeric repeats, and a pair of centromeres on the dicentric chromosome 1 was inactive in the metaphase cells of sacred lotus. Our characterization of the properties of centromeric DNA structure within the sacred lotus genome describes a centromeric profile in ancient basal eudicots and might provide evidence of the origins and evolution of centromeres. Furthermore, the identification of centromeric DNA sequences is of great significance for the assembly of the sacred lotus genome.
Collapse
Affiliation(s)
- Zhixuan Zhu
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Songtao Gui
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Jing Jin
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Rong Yi
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Zhihua Wu
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Qian Qian
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | - Yi Ding
- Department of Genetics, State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China.
| |
Collapse
|
42
|
Cech JN, Peichel CL. Centromere inactivation on a neo-Y fusion chromosome in threespine stickleback fish. Chromosome Res 2016; 24:437-450. [PMID: 27553478 DOI: 10.1007/s10577-016-9535-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Revised: 08/14/2016] [Accepted: 08/16/2016] [Indexed: 02/07/2023]
Abstract
Having one and only one centromere per chromosome is essential for proper chromosome segregation during both mitosis and meiosis. Chromosomes containing two centromeres are known as dicentric and often mis-segregate during cell division, resulting in aneuploidy or chromosome breakage. Dicentric chromosome can be stabilized by centromere inactivation, a process which reestablishes monocentric chromosomes. However, little is known about this process in naturally occurring dicentric chromosomes. Using a combination of fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on metaphase chromosome spreads, we demonstrate that centromere inactivation has evolved on a neo-Y chromosome fusion in the Japan Sea threespine stickleback fish (Gasterosteus nipponicus). We found that the centromere derived from the ancestral Y chromosome has been inactivated. Our data further suggest that there have been genetic changes to this centromere in the two million years since the formation of the neo-Y chromosome, but it remains unclear whether these genetic changes are a cause or consequence of centromere inactivation.
Collapse
Affiliation(s)
- Jennifer N Cech
- Divisions of Basic Sciences and Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave North, Mailstop C2-023, Seattle, WA, 98109, USA
- Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA, 98195, USA
| | - Catherine L Peichel
- Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012, Bern, Switzerland.
| |
Collapse
|
43
|
Kugou K, Hirai H, Masumoto H, Koga A. Formation of functional CENP-B boxes at diverse locations in repeat units of centromeric DNA in New World monkeys. Sci Rep 2016; 6:27833. [PMID: 27292628 PMCID: PMC4904201 DOI: 10.1038/srep27833] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 05/25/2016] [Indexed: 12/17/2022] Open
Abstract
Centromere protein B, which is involved in centromere formation, binds to centromeric repetitive DNA by recognizing a nucleotide motif called the CENP-B box. Humans have large numbers of CENP-B boxes in the centromeric repetitive DNA of their autosomes and X chromosome. The current understanding is that these CENP-B boxes are located at identical positions in the repeat units of centromeric DNA. Great apes also have CENP-B boxes in locations that are identical to humans. The purpose of the present study was to examine the location of CENP-B box in New World monkeys. We recently identified CENP-B box in one species of New World monkeys (marmosets). In this study, we found functional CENP-B boxes in CENP-A-assembled repeat units of centromeric DNA in 2 additional New World monkeys (squirrel monkeys and tamarins) by immunostaining and ChIP-qPCR analyses. The locations of the 3 CENP-B boxes in the repeat units differed from one another. The repeat unit size of centromeric DNA of New World monkeys (340–350 bp) is approximately twice that of humans and great apes (171 bp). This might be, associated with higher-order repeat structures of centromeric DNA, a factor for the observed variation in the CENP-B box location in New World monkeys.
Collapse
Affiliation(s)
- Kazuto Kugou
- Department of Frontier Research, Kazusa DNA Research Institute, Kisarazu 292-0818, Japan
| | - Hirohisa Hirai
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan
| | - Hiroshi Masumoto
- Department of Frontier Research, Kazusa DNA Research Institute, Kisarazu 292-0818, Japan
| | - Akihiko Koga
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan
| |
Collapse
|
44
|
Cech JN, Peichel CL. Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus). Chromosome Res 2015; 23:767-79. [PMID: 26424612 DOI: 10.1007/s10577-015-9495-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Revised: 09/11/2015] [Accepted: 09/17/2015] [Indexed: 01/09/2023]
Abstract
Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chomatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
Collapse
Affiliation(s)
- Jennifer N Cech
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave North, Mailstop C2-023, Seattle, WA, 98109, USA.,Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA, 98195, USA
| | - Catherine L Peichel
- Divisions of Human Biology and Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave North, Mailstop C2-023, Seattle, WA, 98109, USA.
| |
Collapse
|
45
|
Tarrant-Elorza M, Rossetto CC, Pari GS. Maintenance and replication of the human cytomegalovirus genome during latency. Cell Host Microbe 2015; 16:43-54. [PMID: 25011107 DOI: 10.1016/j.chom.2014.06.006] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Revised: 03/26/2014] [Accepted: 05/01/2014] [Indexed: 11/17/2022]
Abstract
Human cytomegalovirus (HCMV) can establish latent infection in hematopoietic progenitor cells (HPCs) or CD14 (+) monocytes. While circularized viral genomes are observed during latency, how viral genomes persist or which viral factors contribute to genome maintenance and/or replication is unclear. Previously, we identified a HCMV cis-acting viral maintenance element (TR element) and showed that HCMV IE1 exon 4 mRNA is expressed in latently infected HPCs. We now show that a smaller IE1 protein species (IE1x4) is expressed in latently infected HPCs. IE1x4 protein expression is required for viral genome persistence and maintenance and replication of a TR element containing plasmid (pTR). Both IE1x4 and the cellular transcription factor Sp1 interact with the TR, and inhibition of Sp1 binding abrogates pTR amplification. Further, IE1x4 interacts with Topoisomerase IIβ (TOPOIIβ), whose activity is required for pTR amplification. These results identify a HCMV latency-specific factor that promotes viral chromosome maintenance and replication.
Collapse
Affiliation(s)
- Margaret Tarrant-Elorza
- University of Nevada School of Medicine, 1664 North Virginia Street/MS320, Reno, NV 89557, USA
| | - Cyprian C Rossetto
- University of Nevada School of Medicine, 1664 North Virginia Street/MS320, Reno, NV 89557, USA
| | - Gregory S Pari
- University of Nevada School of Medicine, 1664 North Virginia Street/MS320, Reno, NV 89557, USA.
| |
Collapse
|
46
|
Louzada S, Vieira-da-Silva A, Mendes-da-Silva A, Kubickova S, Rubes J, Adega F, Chaves R. A novel satellite DNA sequence in the Peromyscus genome (PMSat): Evolution via copy number fluctuation. Mol Phylogenet Evol 2015; 92:193-203. [PMID: 26103000 DOI: 10.1016/j.ympev.2015.06.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2014] [Revised: 06/11/2015] [Accepted: 06/12/2015] [Indexed: 12/16/2022]
Abstract
Satellite DNAs (satDNA) are tandemly arrayed repeated sequences largely present in eukaryotic genomes, which play important roles in genome evolution and function, and therefore, their analysis is vital. Here, we describe the isolation of a novel satellite DNA family (PMSat) from the rodent Peromyscus eremicus (Cricetidae, Rodentia), which is located in pericentromeric regions and exhibits a typical satellite DNA genome organization. Orthologous PMSat sequences were isolated and characterized from three species belonging to Cricetidae: Cricetus cricetus, Phodopus sungorus and Microtus arvalis. In these species, PMSat is highly conserved, with the absence of fixed species-specific mutations. Strikingly, different numbers of copies of this sequence were found among the species, suggesting evolution by copy number fluctuation. Repeat units of PMSat were also found in the Peromyscus maniculatus bairdii BioProject, but our results suggest that these repeat units are from genome regions outside the pericentromere. The remarkably high evolutionary sequence conservation along with the preservation of a few numbers of copies of this sequence in the analyzed genomes may suggest functional significance but a different sequence nature/organization. Our data highlight that repeats are difficult to analyze due to the limited tools available to dissect genomes and the fact that assemblies do not cover regions of constitutive heterochromatin.
Collapse
Affiliation(s)
- Sandra Louzada
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal
| | - Ana Vieira-da-Silva
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal
| | - Ana Mendes-da-Silva
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal
| | | | - Jiri Rubes
- Veterinary Research Institute, Brno, Czech Republic
| | - Filomena Adega
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal
| | - Raquel Chaves
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal.
| |
Collapse
|
47
|
Nergadze SG, Belloni E, Piras FM, Khoriauli L, Mazzagatti A, Vella F, Bensi M, Vitelli V, Giulotto E, Raimondi E. Discovery and comparative analysis of a novel satellite, EC137, in horses and other equids. Cytogenet Genome Res 2014; 144:114-23. [PMID: 25342230 DOI: 10.1159/000368138] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/12/2014] [Indexed: 11/19/2022] Open
Abstract
Centromeres are the sites of kinetochore assembly and spindle fiber attachment and consist of protein-DNA complexes in which the DNA component is typically characterized by the presence of extended arrays of tandem repeats called satellite DNA. Here, we describe the isolation and characterization of a 137-bp-long new satellite DNA sequence from the horse genome (EC137), which is also present, even if less abundant, in the domestic donkey, the Grevy's zebra and the Burchelli's zebra. We investigated the chromosomal distribution of the EC137 sequence in these 4 species. Moreover, we analyzed its architectural organization by high-resolution FISH. The position of this sequence with respect to the primary constriction and in relation to the 2 major horse satellite tandem repeats (37 cen and 2PI) on horse chromosomes suggests that the new centromeric equine satellite is an accessory DNA element, presumably contributing to the organization of pericentromeric chromatin. FISH on combed DNA fibers reveals that the EC137 satellite is organized in relatively short stretches (2-8 kb) which are strictly intermingled within 37 cen or 2PI arrays. This arrangement suggests that interchanges between satellite families are a frequent occurrence in the horse genome.
Collapse
Affiliation(s)
- Solomon G Nergadze
- Department of Biology and Biotechnology 'L. Spallanzani', University of Pavia, Pavia, Italy
| | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Tessereau C, Lesecque Y, Monnet N, Buisson M, Barjhoux L, Léoné M, Feng B, Goldgar DE, Sinilnikova OM, Mousset S, Duret L, Mazoyer S. Estimation of the RNU2 macrosatellite mutation rate by BRCA1 mutation tracing. Nucleic Acids Res 2014; 42:9121-30. [PMID: 25034697 PMCID: PMC4132748 DOI: 10.1093/nar/gku639] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Large tandem repeat sequences have been poorly investigated as severe technical limitations and their frequent absence from the genome reference hinder their analysis. Extensive allelotyping of this class of variation has not been possible until now and their mutational dynamics are still poorly known. In order to estimate the mutation rate of a macrosatellite, we analysed in detail the RNU2 locus, which displays at least 50 different alleles containing 5-82 copies of a 6.1 kb repeat unit. Mining data from the 1000 Genomes Project allowed us to precisely estimate copy numbers of the RNU2 repeat unit using read depth of coverage. This further revealed significantly different mean values in various recent modern human populations, favoring a scenario of fast evolution of this locus. Its proximity to a disease gene with numerous founder mutations, BRCA1, within the same linkage disequilibrium block, offered the unique opportunity to trace RNU2 arrays over a large timescale. Analysis of the transmission of RNU2 arrays associated with one ‘private’ mutation in an extended kindred and four founder mutations in multiple kindreds gave an estimation by maximum likelihood of 5 × 10−3 mutations per generation, which is close to that of microsatellites.
Collapse
Affiliation(s)
- Chloé Tessereau
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France Genomic Vision, Bagneux, Paris, France
| | - Yann Lesecque
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
| | - Nastasia Monnet
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| | - Monique Buisson
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| | - Laure Barjhoux
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| | - Mélanie Léoné
- Unité Mixte de Génétique Constitutionnelle des Cancers Fréquents, Hospices Civils de Lyon/Centre Léon Bérard, Lyon, France
| | - Bingjian Feng
- Department of Dermatology and Huntsman Cancer Institute University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - David E Goldgar
- Department of Dermatology and Huntsman Cancer Institute University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - Olga M Sinilnikova
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France Unité Mixte de Génétique Constitutionnelle des Cancers Fréquents, Hospices Civils de Lyon/Centre Léon Bérard, Lyon, France
| | - Sylvain Mousset
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
| | - Sylvie Mazoyer
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
| |
Collapse
|
49
|
Altemose N, Miga KH, Maggioni M, Willard HF. Genomic characterization of large heterochromatic gaps in the human genome assembly. PLoS Comput Biol 2014; 10:e1003628. [PMID: 24831296 PMCID: PMC4022460 DOI: 10.1371/journal.pcbi.1003628] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 03/26/2014] [Indexed: 01/24/2023] Open
Abstract
The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3). The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat families cannot be readily applied to complex satellite repeats such as HSat2,3, which lack a consistent repeat unit reference sequence. Here we present an alignment-free method to characterize complex satellites using whole-genome shotgun read datasets. Utilizing this approach, we classify HSat2,3 sequences into fourteen subfamilies and predict their chromosomal distributions, resulting in a comprehensive satellite reference database to further enable genomic studies of heterochromatic regions. We also identify 1.3 Mb of non-repetitive sequence interspersed with HSat2,3 across 17 unmapped assembly scaffolds, including eight annotated gene predictions. Finally, we apply our satellite reference database to high-throughput sequence data from 396 males to estimate array size variation of the predominant HSat3 array on the Y chromosome, confirming that satellite array sizes can vary between individuals over an order of magnitude (7 to 98 Mb) and further demonstrating that array sizes are distributed differently within distinct Y haplogroups. In summary, we present a novel framework for generating initial reference databases for unassembled genomic regions enriched with complex satellite DNA, and we further demonstrate the utility of these reference databases for studying patterns of sequence variation within human populations. At least 5–10% of the human genome remains unassembled, unmapped, and poorly characterized. The reference assembly annotates these missing regions as multi-megabase heterochromatic gaps, found primarily near centromeres and on the short arms of the acrocentric chromosomes. This missing fraction of the genome consists predominantly of long arrays of near-identical tandem repeats called satellite DNA. Due to the repetitive nature of satellite DNA, sequence assembly algorithms cannot uniquely align overlapping sequence reads, and thus satellite-rich domains have been omitted from the reference assembly and from most genome-wide studies of variation and function. Existing methods for analyzing some satellite DNAs cannot be easily extended to a large portion of satellites whose repeat structures are complex and largely uncharacterized, such as Human Satellites 2 and 3 (HSat2,3). Here we characterize HSat2,3 using a novel approach that does not depend on having a well-defined repeat structure. By classifying genome-wide HSat2,3 sequences into subfamilies and localizing them to chromosomes, we have generated an initial HSat2,3 genomic reference, which serves as a critical foundation for future studies of variation and function in these regions. This approach should be generally applicable to other classes of satellite DNA, in both the human genome and other complex genomes.
Collapse
Affiliation(s)
- Nicolas Altemose
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| | - Karen H. Miga
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
- * E-mail:
| | - Mauro Maggioni
- Department of Mathematics, Duke University, Durham, North Carolina, United States of America
| | - Huntington F. Willard
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| |
Collapse
|
50
|
Zagga AD, Ahmed HOON, Ismail SM, Tadros AA. Molecular sex identification of dry human teeth specimens from Sokoto, Northwestern Nigeria. J Forensic Dent Sci 2014; 6:132-8. [PMID: 25125922 PMCID: PMC4130016 DOI: 10.4103/0975-1475.132544] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND The advent of molecular techniques has revolutionized the ability of scientists to estimate the sex of individuals. Forensic odontology plays an important role in establishing the sex of victims with bodies mutilated beyond recognition due to major disaster. The genetic difference between males and females is defined by the presence or absence of the Y-chromosome. The use of alphoid-repeat primers in sex estimation was first applied on dried blood. Generally, the X, Y alphoid repeats blind test attest to the accuracy of genetic testing, and also point the potential for occasional error in morphometric sexing. AIM To estimate genetic sex of dry human teeth specimens from Sokoto, Northwestern Nigeria, using polymerase chain reaction (PCR). MATERIALS AND METHODS A single-blind study of DNA analysis for sex estimation of nine dry human teeth specimens from Sokoto, Northwestern Nigeria, through PCR, using alphoid repeats primers, was undertaken. RESULTS The genetic sex of each group of the teeth samples were accurately (100%) identified. For each group of teeth, PCR Sensitivity = 100%, Specificity = 0%, Predictive value of positive test = 100%, Predictive value of negative test = 0%, False positive rate = 0%, False negative rate = 0%, Efficiency of test = 100%. Fisher's exact probability test P = 1. Z-test: z- and P values were invalid. CONCLUSION This study has demonstrated the successful use of alphoid-repeat primers in genetic sex identification of human dry teeth samples from Sokoto, Northwestern Nigeria. This is the first known study estimating the sex of human dry teeth specimens by means of PCR in Nigeria. There is need for further studies in Nigeria to complement the findings of this study.
Collapse
Affiliation(s)
- AD Zagga
- Department of Anatomy, College of Health Sciences, Usmanu Danfodiyo University, Sokoto, Nigeria
| | - H. OON Ahmed
- Department of Paediatrics, College of Health Sciences, Usmanu Danfodiyo University, Sokoto, Nigeria
| | - SM Ismail
- Department of Medical Molecular Genetics, Division of Human Genetics and Genome Research, National Research Centre, Cairo, Egypt
| | - AA Tadros
- Department of Anatomy, College of Health Sciences, Usmanu Danfodiyo University, Sokoto, Nigeria
| |
Collapse
|