1
|
Filliaux S, Bertelsen C, Baughman H, Komives E, Lyubchenko Y. The Interaction of NF-κB Transcription Factor with Centromeric Chromatin. J Phys Chem B 2024; 128:5803-5813. [PMID: 38860885 DOI: 10.1021/acs.jpcb.3c08388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2024]
Abstract
Centromeric chromatin is a subset of chromatin structure and governs chromosome segregation. The centromere is composed of both CENP-A nucleosomes (CENP-Anuc) and H3 nucleosomes (H3nuc) and is enriched with alpha-satellite (α-sat) DNA repeats. These CENP-Anuc have a different structure than H3nuc, decreasing the base pairs (bp) of wrapped DNA from 147 bp for H3nuc to 121 bp for CENP-Anuc. All these factors can contribute to centromere function. We investigated the interaction of H3nuc and CENP-Anuc with NF-κB, a crucial transcription factor in regulating immune response and inflammation. We utilized atomic force microscopy (AFM) to characterize complexes of both types of nucleosomes with NF-κB. We found that NF-κB unravels H3nuc, removing more than 20 bp of DNA, and that NF-κB binds to the nucleosomal core. Similar results were obtained for the truncated variant of NF-κB comprised only of the Rel homology domain and missing the transcription activation domain (TAD), suggesting that RelATAD is not critical in unraveling H3nuc. By contrast, NF-κB did not bind to or unravel CENP-Anuc. These findings with different affinities for two types of nucleosomes to NF-κB may have implications for understanding the mechanisms of gene expression in bulk and centromere chromatin.
Collapse
Affiliation(s)
- Shaun Filliaux
- Department of Pharmaceutical Sciences, University of Nebraska Medical Center, Omaha, Nebraska 68198-6025, United States
| | - Chloe Bertelsen
- Department of Pharmaceutical Sciences, University of Nebraska Medical Center, Omaha, Nebraska 68198-6025, United States
| | - Hannah Baughman
- Department of Chemistry and Biochemistry, UC San Diego, La Jolla, California 92093-0378, United States
| | - Elizabeth Komives
- Department of Chemistry and Biochemistry, UC San Diego, La Jolla, California 92093-0378, United States
| | - Yuri Lyubchenko
- Department of Pharmaceutical Sciences, University of Nebraska Medical Center, Omaha, Nebraska 68198-6025, United States
| |
Collapse
|
2
|
Packiaraj J, Thakur J. DNA satellite and chromatin organization at mouse centromeres and pericentromeres. Genome Biol 2024; 25:52. [PMID: 38378611 PMCID: PMC10880262 DOI: 10.1186/s13059-024-03184-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 02/12/2024] [Indexed: 02/22/2024] Open
Abstract
BACKGROUND Centromeres are essential for faithful chromosome segregation during mitosis and meiosis. However, the organization of satellite DNA and chromatin at mouse centromeres and pericentromeres is poorly understood due to the challenges of assembling repetitive genomic regions. RESULTS Using recently available PacBio long-read sequencing data from the C57BL/6 strain, we find that contrary to the previous reports of their homogeneous nature, both centromeric minor satellites and pericentromeric major satellites exhibit a high degree of variation in sequence and organization within and between arrays. While most arrays are continuous, a significant fraction is interspersed with non-satellite sequences, including transposable elements. Using chromatin immunoprecipitation sequencing (ChIP-seq), we find that the occupancy of CENP-A and H3K9me3 chromatin at centromeric and pericentric regions, respectively, is associated with increased sequence enrichment and homogeneity at these regions. The transposable elements at centromeric regions are not part of functional centromeres as they lack significant CENP-A enrichment. Furthermore, both CENP-A and H3K9me3 nucleosomes occupy minor and major satellites spanning centromeric-pericentric junctions and a low yet significant amount of CENP-A spreads locally at centromere junctions on both pericentric and telocentric sides. Finally, while H3K9me3 nucleosomes display a well-phased organization on major satellite arrays, CENP-A nucleosomes on minor satellite arrays are poorly phased. Interestingly, the homogeneous class of major satellites also phase CENP-A and H3K27me3 nucleosomes, indicating that the nucleosome phasing is an inherent property of homogeneous major satellites. CONCLUSIONS Our findings reveal that mouse centromeres and pericentromeres display a high diversity in satellite sequence, organization, and chromatin structure.
Collapse
Affiliation(s)
- Jenika Packiaraj
- Department of Biology, Emory University, 1510 Clifton Rd, Atlanta, GA, 30322, USA
| | - Jitendra Thakur
- Department of Biology, Emory University, 1510 Clifton Rd, Atlanta, GA, 30322, USA.
| |
Collapse
|
3
|
Filliaux S, Bertelsen C, Baughman H, Komives E, Lyubchenko YL. The Interaction of NF-κB Transcription Factor with Centromeric Chromatin. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.13.580208. [PMID: 38405937 PMCID: PMC10888803 DOI: 10.1101/2024.02.13.580208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Centromeric chromatin is a subset of chromatin structure and governs chromosome segregation. The centromere is composed of both CENP-A nucleosomes (CENP-A nuc ) and H3 nucleosomes (H3 nuc ) and is enriched with alpha-satellite (α-sat) DNA repeats. These CENP-A nuc have a different structure than H3 nuc , decreasing the base pairs (bp) of wrapped DNA from 147 bp for H3 nuc to 121 bp for CENP-A nuc . All these factors can contribute to centromere function. We investigated the interaction of H3 nuc and CENP-A nuc with NF-κB, a crucial transcription factor in regulating immune response and inflammation. We utilized Atomic Force Microscopy (AFM) to characterize complexes of both types of nucleosomes with NF-κB. We found that NF-κB unravels H3 nuc , removing more than 20 bp of DNA, and that NF-κB binds to the nucleosomal core. Similar results were obtained for the truncated variant of NF-κB comprised only of the Rel Homology domain and missing the transcription activation domain (TAD), suggesting the RelA TAD is not critical in unraveling H3 nuc . By contrast, NF-κB did not bind to or unravel CENP- A nuc . These findings with different affinities for two types of nucleosomes to NF-κB may have implications for understanding the mechanisms of gene expression in bulk and centromere chromatin.
Collapse
|
4
|
Glunčić M, Vlahović I, Rosandić M, Paar V. Tandem NBPF 3mer HORs (Olduvai triplets) in Neanderthal and two novel HOR tandem arrays in human chromosome 1 T2T-CHM13 assembly. Sci Rep 2023; 13:14420. [PMID: 37660151 PMCID: PMC10475015 DOI: 10.1038/s41598-023-41517-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 08/28/2023] [Indexed: 09/04/2023] Open
Abstract
It is known that the ~ 1.6 kb Neuroblastoma BreakPoint Family (NBPF) repeats are human specific and contributing to cognitive capabilities, with increasing frequency in higher order repeat 3mer HORs (Olduvai triplets). From chimpanzee to modern human there is a discontinuous jump from 0 to ~ 50 tandemly organized 3mer HORs. Here we investigate the structure of NBPF 3mer HORs in the Neanderthal genome assembly of Pääbo et al., comparing it to the results obtained for human hg38.p14 chromosome 1. Our findings reveal corresponding NBPF 3mer HOR arrays in Neanderthals with slightly different monomer structures and numbers of HOR copies compared to humans. Additionally, we compute the NBPF 3mer HOR pattern for the complete telomere-to-telomere human genome assembly (T2T-CHM13) by Miga et al., identifying two novel tandem arrays of NBPF 3mer HOR repeats with 5 and 9 NBPF 3mer HOR copies. We hypothesize that these arrays correspond to novel NBPF genes (here referred to as NBPFA1 and NBPFA2). Further improving the quality of the Neanderthal genome using T2T-CHM13 as a reference would be of great interest in determining the presence of such distant novel NBPF genes in the Neanderthal genome and enhancing our understanding of human evolution.
Collapse
Affiliation(s)
- Matko Glunčić
- Faculty of Science, University of Zagreb, 10000, Zagreb, Croatia.
| | | | - Marija Rosandić
- University Hospital Centre Zagreb (Ret.), 10000, Zagreb, Croatia
- Croatian Academy of Sciences and Arts, 10000, Zagreb, Croatia
| | - Vladimir Paar
- Faculty of Science, University of Zagreb, 10000, Zagreb, Croatia
- Croatian Academy of Sciences and Arts, 10000, Zagreb, Croatia
| |
Collapse
|
5
|
Despot-Slade E, Širca S, Mravinac B, Castagnone-Sereno P, Plohl M, Meštrović N. Satellitome analyses in nematodes illuminate complex species history and show conserved features in satellite DNAs. BMC Biol 2022; 20:259. [PMCID: PMC9673304 DOI: 10.1186/s12915-022-01460-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 11/07/2022] [Indexed: 11/19/2022] Open
Abstract
Abstract
Background
Satellite DNAs (satDNAs) are tandemly repeated non-coding DNA sequences that belong to the most abundant and the fastest evolving parts of the eukaryotic genome. A satellitome represents the collection of different satDNAs in a genome. Due to extreme diversity and methodological difficulties to characterize and compare satDNA collection in complex genomes, knowledge on their putative functional constraints and capacity to participate in genome evolution remains rather elusive. SatDNA transcripts have been detected in many species, however comparative studies of satDNA transcriptome between species are extremely rare.
Results
We conducted a genome-wide survey and comparative analyses of satellitomes among different closely related Meloidogyne spp. nematodes. The evolutionary trends of satDNAs suggest that each round of proposed polyploidization in the evolutionary history is concomitant with the addition of a new set of satDNAs in the satellitome of any particular Meloidogyne species. Successive incorporation of new sets of satDNAs in the genome along the process of polyploidization supports multiple hybridization events as the main factor responsible for the formation of these species. Through comparative analyses of 83 distinct satDNAs, we found a CENP-B box-like sequence motif conserved among 11 divergent satDNAs (similarity ranges from 36 to 74%). We also found satDNAs that harbor a splice leader (SL) sequence which, in spite of overall divergence, shows conservation across species in two putative functional regions, the 25-nt SL exon and the Sm binding site. Intra- and interspecific comparative expression analyses of the complete satDNA set in the analyzed Meloidogyne species revealed transcription profiles including a subset of 14 actively transcribed satDNAs. Among those, 9 show active transcription in every species where they are found in the genome and throughout developmental stages.
Conclusions
Our results demonstrate the feasibility and power of comparative analysis of the non-coding repetitive genome for elucidation of the origin of species with a complex history. Although satDNAs generally evolve extremely quickly, the comparative analyses of 83 satDNAs detected in the analyzed Meloidogyne species revealed conserved sequence features in some satDNAs suggesting sequence evolution under selective pressure. SatDNAs that are actively transcribed in related genomes and throughout nematode development support the view that their expression is not stochastic.
Collapse
|
6
|
Altemose N, Glennis A, Bzikadze AV, Sidhwani P, Langley SA, Caldas GV, Hoyt SJ, Uralsky L, Ryabov FD, Shew CJ, Sauria MEG, Borchers M, Gershman A, Mikheenko A, Shepelev VA, Dvorkina T, Kunyavskaya O, Vollger MR, Rhie A, McCartney AM, Asri M, Lorig-Roach R, Shafin K, Aganezov S, Olson D, de Lima LG, Potapova T, Hartley GA, Haukness M, Kerpedjiev P, Gusev F, Tigyi K, Brooks S, Young A, Nurk S, Koren S, Salama SR, Paten B, Rogaev EI, Streets A, Karpen GH, Dernburg AF, Sullivan BA, Straight AF, Wheeler TJ, Gerton JL, Eichler EE, Phillippy AM, Timp W, Dennis MY, O'Neill RJ, Zook JM, Schatz MC, Pevzner PA, Diekhans M, Langley CH, Alexandrov IA, Miga KH. Complete genomic and epigenetic maps of human centromeres. Science 2022; 376:eabl4178. [PMID: 35357911 PMCID: PMC9233505 DOI: 10.1126/science.abl4178] [Citation(s) in RCA: 203] [Impact Index Per Article: 101.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Existing human genome assemblies have almost entirely excluded repetitive sequences within and near centromeres, limiting our understanding of their organization, evolution, and functions, which include facilitating proper chromosome segregation. Now, a complete, telomere-to-telomere human genome assembly (T2T-CHM13) has enabled us to comprehensively characterize pericentromeric and centromeric repeats, which constitute 6.2% of the genome (189.9 megabases). Detailed maps of these regions revealed multimegabase structural rearrangements, including in active centromeric repeat arrays. Analysis of centromere-associated sequences uncovered a strong relationship between the position of the centromere and the evolution of the surrounding DNA through layered repeat expansions. Furthermore, comparisons of chromosome X centromeres across a diverse panel of individuals illuminated high degrees of structural, epigenetic, and sequence variation in these complex and rapidly evolving regions.
Collapse
Affiliation(s)
- Nicolas Altemose
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - A. Glennis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Andrey V. Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California San Diego, La Jolla, CA, USA
| | - Pragya Sidhwani
- Department of Biochemistry, Stanford University, Stanford, CA, USA
| | - Sasha A. Langley
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Gina V. Caldas
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Savannah J. Hoyt
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Lev Uralsky
- Sirius University of Science and Technology, Sochi, Russia
- Vavilov Institute of General Genetics, Moscow, Russia
| | | | - Colin J. Shew
- Genome Center, MIND Institute, and Department of Biochemistry and Molecular Medicine, School of Medicine, University of California, Davis, Davis, CA, USA
| | | | | | - Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
| | - Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | | | - Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | - Olga Kunyavskaya
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ann M. McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mobin Asri
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Ryan Lorig-Roach
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Kishwar Shafin
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Sergey Aganezov
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Daniel Olson
- Department of Computer Science, University of Montana, Missoula, MT. USA
| | | | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Gabrielle A. Hartley
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Marina Haukness
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Fedor Gusev
- Vavilov Institute of General Genetics, Moscow, Russia
| | - Kristof Tigyi
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Shelise Brooks
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alice Young
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sofie R. Salama
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA
| | - Evgeny I. Rogaev
- Sirius University of Science and Technology, Sochi, Russia
- Vavilov Institute of General Genetics, Moscow, Russia
- Department of Psychiatry, University of Massachusetts Medical School, Worcester, MA, USA
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Aaron Streets
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Gary H. Karpen
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- BioEngineering and BioMedical Sciences Department, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Abby F. Dernburg
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, USA
| | - Beth A. Sullivan
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC, USA
| | | | - Travis J. Wheeler
- Department of Computer Science, University of Montana, Missoula, MT. USA
| | - Jennifer L. Gerton
- Stowers Institute for Medical Research, Kansas City, MO, USA
- University of Kansas Medical School, Department of Biochemistry and Molecular Biology and Cancer Center, University of Kansas, Kansas City, KS, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, and Department of Biochemistry and Molecular Medicine, School of Medicine, University of California, Davis, Davis, CA, USA
| | - Rachel J. O'Neill
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Justin M. Zook
- Biosystems and Biomaterials Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California at San Diego, San Diego, CA, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Charles H. Langley
- Department of Evolution and Ecology, University of California Davis, Davis, CA, USA
| | - Ivan A. Alexandrov
- Vavilov Institute of General Genetics, Moscow, Russia
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
- Research Center of Biotechnology of the Russian Academy of Sciences, Moscow, Russia
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA
| |
Collapse
|
7
|
Abstract
We are entering a new era in genomics where entire centromeric regions are accurately represented in human reference assemblies. Access to these high-resolution maps will enable new surveys of sequence and epigenetic variation in the population and offer new insight into satellite array genomics and centromere function. Here, we focus on the sequence organization and evolution of alpha satellites, which are credited as the genetic and genomic definition of human centromeres due to their interaction with inner kinetochore proteins and their importance in the development of human artificial chromosome assays. We provide an overview of alpha satellite repeat structure and array organization in the context of these high-quality reference data sets; discuss the emergence of variation-based surveys; and provide perspective on the role of this new source of genetic and epigenetic variation in the context of chromosome biology, genome instability, and human disease.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California 95064, USA; .,Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA
| | - Ivan A Alexandrov
- Department of Genomics and Human Genetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia; .,Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199004, Russia.,Research Center of Biotechnology of the Russian Academy of Sciences, Moscow 119071, Russia
| |
Collapse
|
8
|
Ishikura S, Yoshida K, Hashimoto S, Nakabayashi K, Tsunoda T, Shirasawa S. CENP-B promotes the centromeric localization of ZFAT to control transcription of noncoding RNA. J Biol Chem 2021; 297:101213. [PMID: 34547289 PMCID: PMC8496178 DOI: 10.1016/j.jbc.2021.101213] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 09/13/2021] [Accepted: 09/16/2021] [Indexed: 11/28/2022] Open
Abstract
The centromere is a chromosomal locus that is essential for the accurate segregation of chromosomes during cell division. Transcription of noncoding RNA (ncRNA) at the centromere plays a crucial role in centromere function. The zinc-finger transcriptional regulator ZFAT binds to a specific 8-bp DNA sequence at the centromere, named the ZFAT box, to control ncRNA transcription. However, the precise molecular mechanisms by which ZFAT localizes to the centromere remain elusive. Here we show that the centromeric protein CENP-B is required for the centromeric localization of ZFAT to regulate ncRNA transcription. The ectopic expression of CENP-B induces the accumulation of both endogenous and ectopically expressed ZFAT protein at the centromere in human cells, suggesting that the centromeric localization of ZFAT requires the presence of CENP-B. Coimmunoprecipitation analysis reveals that ZFAT interacts with the acidic domain of CENP-B, and depletion of endogenous CENP-B reduces the centromeric levels of ZFAT protein, further supporting that CENP-B is required for the centromeric localization of ZFAT. In addition, knockdown of CENP-B significantly decreased the expression levels of ncRNA at the centromere where ZFAT regulates the transcription, suggesting that CENP-B is involved in the ZFAT-regulated centromeric ncRNA transcription. Thus, we concluded that CENP-B contributes to the establishment of the centromeric localization of ZFAT to regulate ncRNA transcription.
Collapse
Affiliation(s)
- Shuhei Ishikura
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka, Japan; Center for Advanced Molecular Medicine, Fukuoka University, Fukuoka, Japan
| | - Kazumasa Yoshida
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka, Japan; Center for Advanced Molecular Medicine, Fukuoka University, Fukuoka, Japan
| | - Sayuri Hashimoto
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka, Japan
| | - Kazuhiko Nakabayashi
- Department of Maternal-Fetal Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Toshiyuki Tsunoda
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka, Japan; Center for Advanced Molecular Medicine, Fukuoka University, Fukuoka, Japan
| | - Senji Shirasawa
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka, Japan; Center for Advanced Molecular Medicine, Fukuoka University, Fukuoka, Japan.
| |
Collapse
|
9
|
Thakur J, Packiaraj J, Henikoff S. Sequence, Chromatin and Evolution of Satellite DNA. Int J Mol Sci 2021; 22:ijms22094309. [PMID: 33919233 PMCID: PMC8122249 DOI: 10.3390/ijms22094309] [Citation(s) in RCA: 92] [Impact Index Per Article: 30.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 04/16/2021] [Accepted: 04/17/2021] [Indexed: 12/15/2022] Open
Abstract
Satellite DNA consists of abundant tandem repeats that play important roles in cellular processes, including chromosome segregation, genome organization and chromosome end protection. Most satellite DNA repeat units are either of nucleosomal length or 5–10 bp long and occupy centromeric, pericentromeric or telomeric regions. Due to high repetitiveness, satellite DNA sequences have largely been absent from genome assemblies. Although few conserved satellite-specific sequence motifs have been identified, DNA curvature, dyad symmetries and inverted repeats are features of various satellite DNAs in several organisms. Satellite DNA sequences are either embedded in highly compact gene-poor heterochromatin or specialized chromatin that is distinct from euchromatin. Nevertheless, some satellite DNAs are transcribed into non-coding RNAs that may play important roles in satellite DNA function. Intriguingly, satellite DNAs are among the most rapidly evolving genomic elements, such that a large fraction is species-specific in most organisms. Here we describe the different classes of satellite DNA sequences, their satellite-specific chromatin features, and how these features may contribute to satellite DNA biology and evolution. We also discuss how the evolution of functional satellite DNA classes may contribute to speciation in plants and animals.
Collapse
Affiliation(s)
- Jitendra Thakur
- Department of Biology, Emory University, Atlanta, GA 30322, USA;
- Correspondence:
| | - Jenika Packiaraj
- Department of Biology, Emory University, Atlanta, GA 30322, USA;
| | - Steven Henikoff
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA;
- Fred Hutchinson Cancer Research Center, Howard Hughes Medical Institute, Seattle, WA 98109, USA
| |
Collapse
|
10
|
Ishikura S, Nakabayashi K, Nagai M, Tsunoda T, Shirasawa S. ZFAT binds to centromeres to control noncoding RNA transcription through the KAT2B-H4K8ac-BRD4 axis. Nucleic Acids Res 2020; 48:10848-10866. [PMID: 32997115 PMCID: PMC7641738 DOI: 10.1093/nar/gkaa815] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 09/09/2020] [Accepted: 09/17/2020] [Indexed: 02/07/2023] Open
Abstract
Centromeres are genomic regions essential for faithful chromosome segregation. Transcription of noncoding RNA (ncRNA) at centromeres is important for their formation and functions. Here, we report the molecular mechanism by which the transcriptional regulator ZFAT controls the centromeric ncRNA transcription in human and mouse cells. Chromatin immunoprecipitation with high-throughput sequencing analysis shows that ZFAT binds to centromere regions at every chromosome. We find a specific 8-bp DNA sequence for the ZFAT-binding motif that is highly conserved and widely distributed at whole centromere regions of every chromosome. Overexpression of ZFAT increases the centromeric ncRNA levels at specific chromosomes, whereas its silencing reduces them, indicating crucial roles of ZFAT in centromeric transcription. Overexpression of ZFAT increases the centromeric levels of both the histone acetyltransferase KAT2B and the acetylation at the lysine 8 in histone H4 (H4K8ac). siRNA-mediated knockdown of KAT2B inhibits the overexpressed ZFAT-induced increase in centromeric H4K8ac levels, suggesting that ZFAT recruits KAT2B to centromeres to induce H4K8ac. Furthermore, overexpressed ZFAT recruits the bromodomain-containing protein BRD4 to centromeres through KAT2B-mediated H4K8ac, leading to RNA polymerase II-dependent ncRNA transcription. Thus, ZFAT binds to centromeres to control ncRNA transcription through the KAT2B-H4K8ac-BRD4 axis.
Collapse
Affiliation(s)
- Shuhei Ishikura
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka 814-0180, Japan.,Central Research Institute for Advanced Molecular Medicine, Fukuoka University, Fukuoka 814-0180, Japan
| | - Kazuhiko Nakabayashi
- Department of Maternal-Fetal Biology, National Research Institute for Child Health and Development, Tokyo 157-8535, Japan
| | - Masayoshi Nagai
- Central Research Institute for Advanced Molecular Medicine, Fukuoka University, Fukuoka 814-0180, Japan
| | - Toshiyuki Tsunoda
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka 814-0180, Japan.,Central Research Institute for Advanced Molecular Medicine, Fukuoka University, Fukuoka 814-0180, Japan
| | - Senji Shirasawa
- Department of Cell Biology, Faculty of Medicine, Fukuoka University, Fukuoka 814-0180, Japan.,Central Research Institute for Advanced Molecular Medicine, Fukuoka University, Fukuoka 814-0180, Japan
| |
Collapse
|
11
|
Gamba R, Fachinetti D. From evolution to function: Two sides of the same CENP-B coin? Exp Cell Res 2020; 390:111959. [DOI: 10.1016/j.yexcr.2020.111959] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 03/07/2020] [Accepted: 03/12/2020] [Indexed: 10/24/2022]
|
12
|
Centromere Repeats: Hidden Gems of the Genome. Genes (Basel) 2019; 10:genes10030223. [PMID: 30884847 PMCID: PMC6471113 DOI: 10.3390/genes10030223] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 03/07/2019] [Accepted: 03/11/2019] [Indexed: 01/08/2023] Open
Abstract
Satellite DNAs are now regarded as powerful and active contributors to genomic and chromosomal evolution. Paired with mobile transposable elements, these repetitive sequences provide a dynamic mechanism through which novel karyotypic modifications and chromosomal rearrangements may occur. In this review, we discuss the regulatory activity of satellite DNA and their neighboring transposable elements in a chromosomal context with a particular emphasis on the integral role of both in centromere function. In addition, we discuss the varied mechanisms by which centromeric repeats have endured evolutionary processes, producing a novel, species-specific centromeric landscape despite sharing a ubiquitously conserved function. Finally, we highlight the role these repetitive elements play in the establishment and functionality of de novo centromeres and chromosomal breakpoints that underpin karyotypic variation. By emphasizing these unique activities of satellite DNAs and transposable elements, we hope to disparage the conventional exemplification of repetitive DNA in the historically-associated context of ‘junk’.
Collapse
|
13
|
Kyriacou E, Heun P. High-resolution mapping of centromeric protein association using APEX-chromatin fibers. Epigenetics Chromatin 2018; 11:68. [PMID: 30445992 PMCID: PMC6238281 DOI: 10.1186/s13072-018-0237-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 10/26/2018] [Indexed: 08/30/2023] Open
Abstract
BACKGROUND The centromere is a specialized chromosomal locus that forms the basis for the assembly of a multi-protein complex, the kinetochore and ensures faithful chromosome segregation during every cell division. The repetitive nature of the underlying centromeric sequence represents a major obstacle for high-resolution mapping of protein binding using methods that rely on annotated genomes. Here, we present a novel microscopy-based approach called "APEX-chromatin fibers" for localizing protein binding over the repetitive centromeric sequences at kilobase resolution. RESULTS By fusing centromere factors of interest to ascorbate peroxidase, we were able to label their binding profiles on extended chromatin fibers with biotin marks. We applied APEX-chromatin fibers to at least one member of each CCAN complex, most of which show a localization pattern different from CENP-A but within the CENP-A delineated centromeric domain. Interestingly, we describe here a novel characteristic of CENP-I and CENP-B that display extended localization beyond the CENP-A boundaries. CONCLUSIONS Our approach was successfully applied for mapping protein association over centromeric chromatin, revealing previously undescribed localization patterns. In this study, we focused on centromeric factors, but we believe that this approach could be useful for mapping protein binding patterns in other repetitive regions.
Collapse
Affiliation(s)
- Eftychia Kyriacou
- Max-Planck Institute of Immunobiology and Epigenetics, Freiburg im Breisgau, Germany.,Faculty of Biology, Albert-Ludwigs Universität Freiburg, Freiburg im Breisgau, Germany.,Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Patrick Heun
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
14
|
McNulty SM, Sullivan BA. Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Res 2018; 26:115-138. [PMID: 29974361 DOI: 10.1007/s10577-018-9582-3] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 06/14/2018] [Indexed: 02/05/2023]
Abstract
Repetitive DNA, formerly referred to by the misnomer "junk DNA," comprises a majority of the human genome. One class of this DNA, alpha satellite, comprises up to 10% of the genome. Alpha satellite is enriched at all human centromere regions and is competent for de novo centromere assembly. Because of the highly repetitive nature of alpha satellite, it has been difficult to achieve genome assemblies at centromeres using traditional next-generation sequencing approaches, and thus, centromeres represent gaps in the current human genome assembly. Moreover, alpha satellite DNA is transcribed into repetitive noncoding RNA and contributes to a large portion of the transcriptome. Recent efforts to characterize these transcripts and their function have uncovered pivotal roles for satellite RNA in genome stability, including silencing "selfish" DNA elements and recruiting centromere and kinetochore proteins. This review will describe the genomic and epigenetic features of alpha satellite DNA, discuss recent findings of noncoding transcripts produced from distinct alpha satellite arrays, and address current progress in the functional understanding of this oft-neglected repetitive sequence. We will discuss unique challenges of studying human satellite DNAs and RNAs and point toward new technologies that will continue to advance our understanding of this largely untapped portion of the genome.
Collapse
Affiliation(s)
- Shannon M McNulty
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA. .,Division of Human Genetics, Duke University Medical Center, Durham, NC, 27710, USA.
| |
Collapse
|
15
|
Cacheux L, Ponger L, Gerbault-Seureau M, Loll F, Gey D, Richard FA, Escudé C. The Targeted Sequencing of Alpha Satellite DNA in Cercopithecus pogonias Provides New Insight Into the Diversity and Dynamics of Centromeric Repeats in Old World Monkeys. Genome Biol Evol 2018; 10:1837-1851. [PMID: 29860303 PMCID: PMC6061836 DOI: 10.1093/gbe/evy109] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/29/2018] [Indexed: 02/06/2023] Open
Abstract
Alpha satellite is the major repeated DNA element of primate centromeres. Specific evolutionary mechanisms have led to a great diversity of sequence families with peculiar genomic organization and distribution, which have till now been studied mostly in great apes. Using high throughput sequencing of alpha satellite monomers obtained by enzymatic digestion followed by computational and cytogenetic analysis, we compare here the diversity and genomic distribution of alpha satellite DNA in two related Old World monkey species, Cercopithecus pogonias and Cercopithecus solatus, which are known to have diverged about 7 Ma. Two main families of monomers, called C1 and C2, are found in both species. A detailed analysis of our data sets revealed the existence of numerous subfamilies within the centromeric C1 family. Although the most abundant subfamily is conserved between both species, our fluorescence in situ hybridization (FISH) experiments clearly show that some subfamilies are specific for each species and that their distribution is restricted to a subset of chromosomes, thereby pointing to the existence of recurrent amplification/homogenization events. The pericentromeric C2 family is very abundant on the short arm of all acrocentric chromosomes in both species, pointing to specific mechanisms that lead to this distribution. Results obtained using two different restriction enzymes are fully consistent with a predominant monomeric organization of alpha satellite DNA that coexists with higher order organization patterns in the C. pogonias genome. Our study suggests a high dynamics of alpha satellite DNA in Cercopithecini, with recurrent apparition of new sequence variants and interchromosomal sequence transfer.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Loïc Ponger
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - François Loll
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Delphine Gey
- Service de Systématique Moléculaire, UMS 2700 CNRS, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Florence Anne Richard
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
- Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| |
Collapse
|
16
|
Vlahovic I, Gluncic M, Rosandic M, Ugarkovic Ð, Paar V. Regular Higher Order Repeat Structures in Beetle Tribolium castaneum Genome. Genome Biol Evol 2018; 9:2668-2680. [PMID: 27492235 PMCID: PMC5737470 DOI: 10.1093/gbe/evw174] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/21/2016] [Indexed: 02/07/2023] Open
Abstract
Higher order repeats (HORs) containing tandems of primary and secondary repeat units (head-to-tail “tandem within tandem pattern”), referred to as regular HORs, are typical for primate alpha satellite DNAs and most pronounced in human genome. Regular HORs are known to be a result of recent evolutionary processes. In non-primate genomes mostly so called complex HORs have been found, without head to tail tandem of primary repeat units. In beetle Tribolium castaneum, considered as a model case for genome studies, large tandem repeats have been identified, but no HORs have been reported. Here, using our novel robust repeat finding algorithm Global Repeat Map, we discover two regular and six complex HORs in T. castaneum. In organizational pattern, the integrity and homogeneity of regular HORs in T. castaneum resemble human regular HORs (with T. castaneum monomers different from human alpha satellite monomers), involving a wider range of monomer lengths than in human HORs. Similar regular higher order repeat structures have previously not been found in insects. Some of these novel HORs in T. castaneum appear as most regular among known HORs in non-primate genomes, although with substantial riddling. This is intriguing, in particular from the point of view of role of non-coding repeats in modulation of gene expression.
Collapse
Affiliation(s)
- Ines Vlahovic
- Faculty of Science, University of Zagreb, Zagreb, Croatia
| | - Matko Gluncic
- Faculty of Science, University of Zagreb, Zagreb, Croatia
| | | | | | - Vladimir Paar
- Faculty of Science, University of Zagreb, Zagreb, Croatia.,Croatian Academy of Sciences and Arts, Zagreb, Croatia
| |
Collapse
|
17
|
Contreras-Galindo R, Fischer S, Saha AK, Lundy JD, Cervantes PW, Mourad M, Wang C, Qian B, Dai M, Meng F, Chinnaiyan A, Omenn GS, Kaplan MH, Markovitz DM. Rapid molecular assays to study human centromere genomics. Genome Res 2017; 27:2040-2049. [PMID: 29141960 PMCID: PMC5741061 DOI: 10.1101/gr.219709.116] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 10/27/2017] [Indexed: 01/16/2023]
Abstract
The centromere is the structural unit responsible for the faithful segregation of chromosomes. Although regulation of centromeric function by epigenetic factors has been well-studied, the contributions of the underlying DNA sequences have been much less well defined, and existing methodologies for studying centromere genomics in biology are laborious. We have identified specific markers in the centromere of 23 of the 24 human chromosomes that allow for rapid PCR assays capable of capturing the genomic landscape of human centromeres at a given time. Use of this genetic strategy can also delineate which specific centromere arrays in each chromosome drive the recruitment of epigenetic modulators. We further show that, surprisingly, loss and rearrangement of DNA in centromere 21 is associated with trisomy 21. This new approach can thus be used to rapidly take a snapshot of the genetics and epigenetics of each specific human centromere in nondisjunction disorders and other biological settings.
Collapse
Affiliation(s)
| | - Sabrina Fischer
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Laboratory of Molecular Virology, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay 11400
| | - Anjan K Saha
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Medical Scientist Training Program, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cancer Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - John D Lundy
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Patrick W Cervantes
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Mohamad Mourad
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Claire Wang
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Brian Qian
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Manhong Dai
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Fan Meng
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, Michigan 48109, USA.,Department of Psychiatry, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Arul Chinnaiyan
- Michigan Center for Translational Pathology and Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA.,Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA
| | - Gilbert S Omenn
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Department of Human Genetics.,Departments of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Mark H Kaplan
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - David M Markovitz
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cancer Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Immunology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cellular and Molecular Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
18
|
Dumont M, Fachinetti D. DNA Sequences in Centromere Formation and Function. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2017; 56:305-336. [PMID: 28840243 DOI: 10.1007/978-3-319-58592-5_13] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Faithful chromosome segregation during cell division depends on the centromere, a complex DNA/protein structure that links chromosomes to spindle microtubules. This chromosomal domain has to be marked throughout cell division and its chromosomal localization preserved across cell generations. From fission yeast to human, centromeres are established on a series of repetitive DNA sequences and on specialized centromeric chromatin. This chromatin is enriched with the histone H3 variant, named CENP-A, that was demonstrated to be the epigenetic mark that maintains centromere identity and function indefinitely. Although centromere identity is thought to be exclusively epigenetic, the presence of specific DNA sequences in the majority of eukaryotes and of the centromeric protein CENP-B that binds to these sequences, suggests the existence of a genetic component as well. In this review, we will highlight the importance of centromeric sequences for centromere formation and function, and discuss the centromere DNA sequence/CENP-B paradox.
Collapse
Affiliation(s)
- M Dumont
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005, Paris, France
| | - D Fachinetti
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005, Paris, France.
| |
Collapse
|
19
|
Cacheux L, Ponger L, Gerbault-Seureau M, Richard FA, Escudé C. Diversity and distribution of alpha satellite DNA in the genome of an Old World monkey: Cercopithecus solatus. BMC Genomics 2016; 17:916. [PMID: 27842493 PMCID: PMC5109768 DOI: 10.1186/s12864-016-3246-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2016] [Accepted: 11/02/2016] [Indexed: 11/10/2022] Open
Abstract
Background Alpha satellite is the major repeated DNA element of primate centromeres. Evolution of these tandemly repeated sequences has led to the existence of numerous families of monomers exhibiting specific organizational patterns. The limited amount of information available in non-human primates is a restriction to the understanding of the evolutionary dynamics of alpha satellite DNA. Results We carried out the targeted high-throughput sequencing of alpha satellite monomers and dimers from the Cercopithecus solatus genome, an Old World monkey from the Cercopithecini tribe. Computational approaches were used to infer the existence of sequence families and to study how these families are organized with respect to each other. While previous studies had suggested that alpha satellites in Old World monkeys were poorly diversified, our analysis provides evidence for the existence of at least four distinct families of sequences within the studied species and of higher order organizational patterns. Fluorescence in situ hybridization using oligonucleotide probes that are able to target each family in a specific way showed that the different families had distinct distributions on chromosomes and were not homogeneously distributed between chromosomes. Conclusions Our new approach provides an unprecedented and comprehensive view of the diversity and organization of alpha satellites in a species outside the hominoid group. We consider these data with respect to previously known alpha satellite families and to potential mechanisms for satellite DNA evolution. Applying this approach to other species will open new perspectives regarding the integration of satellite DNA into comparative genomic and cytogenetic studies. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3246-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Loïc Ponger
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Florence Anne Richard
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.
| |
Collapse
|
20
|
Clusters of alpha satellite on human chromosome 21 are dispersed far onto the short arm and lack ancient layers. Chromosome Res 2016; 24:421-36. [PMID: 27430641 DOI: 10.1007/s10577-016-9530-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Accepted: 06/03/2016] [Indexed: 10/21/2022]
Abstract
Human alpha satellite (AS) sequence domains that currently function as centromeres are typically flanked by layers of evolutionarily older AS that presumably represent the remnants of earlier primate centromeres. Studies on several human chromosomes reveal that these older AS arrays are arranged in an age gradient, with the oldest arrays farthest from the functional centromere and arrays progressively closer to the centromere being progressively younger. The organization of AS on human chromosome 21 (HC21) has not been well-characterized. We have used newly available HC21 sequence data and an HC21p YAC map to determine the size, organization, and location of the AS arrays, and compared them to AS arrays found on other chromosomes. We find that the majority of the HC21 AS sequences are present on the p-arm of the chromosome and are organized into at least five distinct isolated clusters which are distributed over a larger distance from the functional centromere than that typically seen for AS on other chromosomes. Using both phylogenetic and L1 element age estimations, we found that all of the HC21 AS clusters outside the functional centromere are of a similar relatively recent evolutionary origin. HC21 contains none of the ancient AS layers associated with early primate evolution which is present on other chromosomes, possibly due to the fact that the p-arm of HC21 and the other acrocentric chromosomes underwent substantial reorganization about 20 million years ago.
Collapse
|
21
|
Kugou K, Hirai H, Masumoto H, Koga A. Formation of functional CENP-B boxes at diverse locations in repeat units of centromeric DNA in New World monkeys. Sci Rep 2016; 6:27833. [PMID: 27292628 PMCID: PMC4904201 DOI: 10.1038/srep27833] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 05/25/2016] [Indexed: 12/17/2022] Open
Abstract
Centromere protein B, which is involved in centromere formation, binds to centromeric repetitive DNA by recognizing a nucleotide motif called the CENP-B box. Humans have large numbers of CENP-B boxes in the centromeric repetitive DNA of their autosomes and X chromosome. The current understanding is that these CENP-B boxes are located at identical positions in the repeat units of centromeric DNA. Great apes also have CENP-B boxes in locations that are identical to humans. The purpose of the present study was to examine the location of CENP-B box in New World monkeys. We recently identified CENP-B box in one species of New World monkeys (marmosets). In this study, we found functional CENP-B boxes in CENP-A-assembled repeat units of centromeric DNA in 2 additional New World monkeys (squirrel monkeys and tamarins) by immunostaining and ChIP-qPCR analyses. The locations of the 3 CENP-B boxes in the repeat units differed from one another. The repeat unit size of centromeric DNA of New World monkeys (340–350 bp) is approximately twice that of humans and great apes (171 bp). This might be, associated with higher-order repeat structures of centromeric DNA, a factor for the observed variation in the CENP-B box location in New World monkeys.
Collapse
Affiliation(s)
- Kazuto Kugou
- Department of Frontier Research, Kazusa DNA Research Institute, Kisarazu 292-0818, Japan
| | - Hirohisa Hirai
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan
| | - Hiroshi Masumoto
- Department of Frontier Research, Kazusa DNA Research Institute, Kisarazu 292-0818, Japan
| | - Akihiko Koga
- Primate Research Institute, Kyoto University, Inuyama 484-8506, Japan
| |
Collapse
|
22
|
AlQuraishi M, Tang S, Xia X. An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system. BMC Bioinformatics 2015; 16:390. [PMID: 26586237 PMCID: PMC4653904 DOI: 10.1186/s12859-015-0819-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 11/11/2015] [Indexed: 11/28/2022] Open
Abstract
Background Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. Description We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Conclusions This database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.
Collapse
Affiliation(s)
- Mohammed AlQuraishi
- Department of Systems Biology, Harvard Medical School, Boston, MA, 02115, USA. .,HMS Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Avenue, Boston, MA, 02115, USA.
| | - Shengdong Tang
- Department of Systems Biology, Harvard Medical School, Boston, MA, 02115, USA.,HMS Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Avenue, Boston, MA, 02115, USA
| | - Xide Xia
- Department of Systems Biology, Harvard Medical School, Boston, MA, 02115, USA.,HMS Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Avenue, Boston, MA, 02115, USA
| |
Collapse
|
23
|
Catacchio CR, Ragone R, Chiatante G, Ventura M. Organization and evolution of Gorilla centromeric DNA from old strategies to new approaches. Sci Rep 2015; 5:14189. [PMID: 26387916 PMCID: PMC4585704 DOI: 10.1038/srep14189] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 08/18/2015] [Indexed: 11/09/2022] Open
Abstract
The centromere/kinetochore interaction is responsible for the pairing and segregation of replicated chromosomes in eukaryotes. Centromere DNA is portrayed as scarcely conserved, repetitive in nature, quickly evolving and protein-binding competent. Among primates, the major class of centromeric DNA is the pancentromeric α-satellite, made of arrays of 171 bp monomers, repeated in a head-to-tail pattern. α-satellite sequences can either form tandem heterogeneous monomeric arrays or assemble in higher-order repeats (HORs). Gorilla centromere DNA has barely been characterized, and data are mainly based on hybridizations of human alphoid sequences. We isolated and finely characterized gorilla α-satellite sequences and revealed relevant structure and chromosomal distribution similarities with other great apes as well as gorilla-specific features, such as the uniquely octameric structure of the suprachromosomal family-2 (SF2). We demonstrated for the first time the orthologous localization of alphoid suprachromosomal families-1 and −2 (SF1 and SF2) between human and gorilla in contrast to chimpanzee centromeres. Finally, the discovery of a new 189 bp monomer type in gorilla centromeres unravels clues to the role of the centromere protein B, paving the way to solve the significance of the centromere DNA’s essential repetitive nature in association with its function and the peculiar evolution of the α-satellite sequence.
Collapse
Affiliation(s)
- C R Catacchio
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - R Ragone
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - G Chiatante
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - M Ventura
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| |
Collapse
|
24
|
Shepelev VA, Uralsky LI, Alexandrov AA, Yurov YB, Rogaev EI, Alexandrov IA. Annotation of suprachromosomal families reveals uncommon types of alpha satellite organization in pericentromeric regions of hg38 human genome assembly. GENOMICS DATA 2015; 5:139-146. [PMID: 26167452 PMCID: PMC4496801 DOI: 10.1016/j.gdata.2015.05.035] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- V A Shepelev
- Institute of Molecular Genetics, Russian Academy of Sciences, Kurchatov sq. 2, Moscow 123182, Russia ; Department of Genomics and Human Genetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia ; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - L I Uralsky
- Institute of Molecular Genetics, Russian Academy of Sciences, Kurchatov sq. 2, Moscow 123182, Russia ; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - A A Alexandrov
- Institute of Molecular Genetics, Russian Academy of Sciences, Kurchatov sq. 2, Moscow 123182, Russia
| | - Y B Yurov
- Research Center of Mental Health, Russian Academy of Medical Sciences, Zagorodnoe sh. 2, Moscow 113152, Russia
| | - E I Rogaev
- Department of Genomics and Human Genetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia ; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia ; Department of Psychiatry, Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, Worcester, MA 01604, USA ; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow 119234, Russia
| | - I A Alexandrov
- Research Center of Mental Health, Russian Academy of Medical Sciences, Zagorodnoe sh. 2, Moscow 113152, Russia
| |
Collapse
|
25
|
Sujiwattanarat P, Thapana W, Srikulnath K, Hirai Y, Hirai H, Koga A. Higher-order repeat structure in alpha satellite DNA occurs in New World monkeys and is not confined to hominoids. Sci Rep 2015; 5:10315. [PMID: 25974220 PMCID: PMC4431391 DOI: 10.1038/srep10315] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2014] [Accepted: 03/25/2015] [Indexed: 11/17/2022] Open
Abstract
Centromeres usually contain large amounts of tandem repeat DNA. Alpha satellite DNA (AS) is the most abundant tandem repeat DNA found in the centromeres of simian primates. The AS of humans contains sequences organized into higher-order repeat (HOR) structures, which are tandem arrays of larger repeat units consisting of multiple basic repeat units. HOR-carrying AS also occurs in other hominoids, but results reported to date for phylogenetically more remote taxa have been negative. Here we show direct evidence for clear HOR structures in AS of the owl monkey and common marmoset. These monkeys are New World monkey species that are located phylogenetically outside of hominoids. It is currently postulated that the presence of HOR structures in AS is unique to hominoids. Our results suggest that this view must be modified. A plausible explanation is that generation of HOR structures is a general event that occurs occasionally or frequently in primate centromeres, and that, in humans, HOR-carrying AS became predominant in the central region of the centromere. It is often difficult to assemble sequence reads of tandem repeat DNAs into accurate contig sequences; our careful sequencing strategy allowed us to overcome this problem.
Collapse
Affiliation(s)
- Penporn Sujiwattanarat
- 1] Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan [2] Faculty of Science, Kasetsart University, Bangkok 10900, Thailand
| | - Watcharaporn Thapana
- 1] Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan [2] Faculty of Science, Kasetsart University, Bangkok 10900, Thailand
| | | | - Yuriko Hirai
- Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan
| | - Hirohisa Hirai
- Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan
| | - Akihiko Koga
- Primate Research Institute, Kyoto University, Inuyama City 484-8506, Japan
| |
Collapse
|
26
|
Kononenko AV, Bansal R, Lee NCO, Grimes BR, Masumoto H, Earnshaw WC, Larionov V, Kouprina N. A portable BRCA1-HAC (human artificial chromosome) module for analysis of BRCA1 tumor suppressor function. Nucleic Acids Res 2014; 42:gku870. [PMID: 25260588 PMCID: PMC4245969 DOI: 10.1093/nar/gku870] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
BRCA1 is involved in many disparate cellular functions, including DNA damage repair, cell-cycle checkpoint activation, gene transcriptional regulation, DNA replication, centrosome function and others. The majority of evidence strongly favors the maintenance of genomic integrity as a principal tumor suppressor activity of BRCA1. At the same time some functional aspects of BRCA1 are not fully understood. Here, a HAC (human artificial chromosome) module with a regulated centromere was constructed for delivery and expression of the 90 kb genomic copy of the BRCA1 gene into BRCA1-deficient human cells. A battery of functional tests was carried out to demonstrate functionality of the exogenous BRCA1. In separate experiments, we investigated the role of BRCA1 in maintenance of heterochromatin integrity within a human functional kinetochore. We demonstrated that BRCA1 deficiency results in a specific activation of transcription of higher-order alpha-satellite repeats (HORs) assembled into heterochromatin domains flanking the kinetochore. At the same time no detectable elevation of transcription was observed within HORs assembled into centrochromatin domains. Thus, we demonstrated a link between BRCA1 deficiency and kinetochore dysfunction and extended previous observations that BRCA1 is required to silence transcription in heterochromatin in specific genomic loci. This supports the hypothesis that epigenetic alterations of the kinetochore initiated in the absence of BRCA1 may contribute to cellular transformation.
Collapse
Affiliation(s)
- Artem V Kononenko
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ruchi Bansal
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indiana University Melvin and Bren Simon Cancer Center, Indianapolis, IN 46202, USA
| | - Nicholas C O Lee
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
| | - Brenda R Grimes
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indiana University Melvin and Bren Simon Cancer Center, Indianapolis, IN 46202, USA
| | - Hiroshi Masumoto
- Laboratory of Cell Engineering, Department of Frontier Research, Kazusa DNA, Research Institute, 2-6-7 Kazusa-Kamatari, Kisarazu, Chiba 292-0818, Japan
| | - William C Earnshaw
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3JR, Scotland
| | - Vladimir Larionov
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
| | - Natalay Kouprina
- Developmental Therapeutics Branch, National Cancer Institute, Bethesda, MD 20892, USA
| |
Collapse
|
27
|
Takeiri A, Motoyama S, Matsuzaki K, Harada A, Taketo J, Katoh C, Tanaka K, Mishima M. New DNA probes to detect aneugenicity in rat bone marrow micronucleated cells by a pan-centromeric FISH analysis. MUTATION RESEARCH-GENETIC TOXICOLOGY AND ENVIRONMENTAL MUTAGENESIS 2013; 755:73-80. [DOI: 10.1016/j.mrgentox.2013.05.011] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Revised: 04/27/2013] [Accepted: 05/14/2013] [Indexed: 11/24/2022]
|
28
|
AlQuraishi M, McAdams HH. Three enhancements to the inference of statistical protein-DNA potentials. Proteins 2012; 81:426-42. [PMID: 23042633 DOI: 10.1002/prot.24201] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Revised: 09/17/2012] [Accepted: 10/02/2012] [Indexed: 12/28/2022]
Abstract
The energetics of protein-DNA interactions are often modeled using so-called statistical potentials, that is, energy models derived from the atomic structures of protein-DNA complexes. Many statistical protein-DNA potentials based on differing theoretical assumptions have been investigated, but little attention has been paid to the types of data and the parameter estimation process used in deriving the statistical potentials. We describe three enhancements to statistical potential inference that significantly improve the accuracy of predicted protein-DNA interactions: (i) incorporation of binding energy data of protein-DNA complexes, in conjunction with their X-ray crystal structures, (ii) use of spatially-aware parameter fitting, and (iii) use of ensemble-based parameter fitting. We apply these enhancements to three widely-used statistical potentials and use the resulting enhanced potentials in a structure-based prediction of the DNA binding sites of proteins. These enhancements are directly applicable to all statistical potentials used in protein-DNA modeling, and we show that they can improve the accuracy of predicted DNA binding sites by up to 21%.
Collapse
Affiliation(s)
- Mohammed AlQuraishi
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, California 94305, USA
| | | |
Collapse
|
29
|
Rosandić M, Glunčić M, Paar V. Start/stop codon like trinucleotides extensions in primate alpha satellites. J Theor Biol 2012; 317:301-9. [PMID: 23026763 DOI: 10.1016/j.jtbi.2012.09.022] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2012] [Revised: 09/07/2012] [Accepted: 09/19/2012] [Indexed: 11/28/2022]
Abstract
The centromeres remain "the final frontier" in unexplored segments of genome landscape in primate genomes, characterized by 2-5 Mb arrays of evolutionary rapidly evolving alpha satellite (AS) higher order repeats (HORs). Alpha satellites as specific noncoding sequences may be also significant in light of regulatory role of noncoding sequences. Using the Global Repeat Map (GRM) algorithm we identify in NCBI assemblies of chromosome 5 the species-specific alpha satellite HORs: 13mer in human, 5mer in chimpanzee, 14mer in orangutan and 3mers in macaque. The suprachromosomal family (SF) classification of alpha satellite HORs and surrounding monomeric alpha satellites is performed and specific segmental structure was found for major alpha satellite arrays in chromosome 5 of primates. In the framework of our novel concept of start/stop Codon Like Trinucleotides (CLTs) as a "new DNA language in noncoding sequences", we find characteristics and differences of these species in CLT extensions, in particular the extensions of stop-TGA CLT. We hypothesize that these are regulators in noncoding sequences, acting at a distance, and that they can amplify or weaken the activity of start/stop codons in coding sequences in protein genesis, increasing the richness of regulatory phenomena.
Collapse
Affiliation(s)
- Marija Rosandić
- Faculty of Science, University of Zagreb, 10000 Zagreb, Croatia.
| | | | | |
Collapse
|
30
|
Glunčić M, Paar V. Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm. Nucleic Acids Res 2012; 41:e17. [PMID: 22977183 PMCID: PMC3592446 DOI: 10.1093/nar/gks721] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
The main feature of global repeat map (GRM) algorithm (www.hazu.hr/grm/software/win/grm2012.exe) is its ability to identify a broad variety of repeats of unbounded length that can be arbitrarily distant in sequences as large as human chromosomes. The efficacy is due to the use of complete set of a K-string ensemble which enables a new method of direct mapping of symbolic DNA sequence into frequency domain, with straightforward identification of repeats as peaks in GRM diagram. In this way, we obtain very fast, efficient and highly automatized repeat finding tool. The method is robust to substitutions and insertions/deletions, as well as to various complexities of the sequence pattern. We present several case studies of GRM use, in order to illustrate its capabilities: identification of α-satellite tandem repeats and higher order repeats (HORs), identification of Alu dispersed repeats and of Alu tandems, identification of Period 3 pattern in exons, implementation of ‘magnifying glass’ effect, identification of complex HOR pattern, identification of inter-tandem transitional dispersed repeat sequences and identification of long segmental duplications. GRM algorithm is convenient for use, in particular, in cases of large repeat units, of highly mutated and/or complex repeats, and of global repeat maps for large genomic sequences (chromosomes and genomes).
Collapse
Affiliation(s)
- Matko Glunčić
- Faculty of Science, University of Zagreb, Bijenička 32 and Croatian Academy of Sciences and Arts, Zrinski trg 11, 10000 Zagreb, Croatia.
| | | |
Collapse
|
31
|
Slee RB, Steiner CM, Herbert BS, Vance GH, Hickey RJ, Schwarz T, Christan S, Radovich M, Schneider BP, Schindelhauer D, Grimes BR. Cancer-associated alteration of pericentromeric heterochromatin may contribute to chromosome instability. Oncogene 2011; 31:3244-53. [PMID: 22081068 DOI: 10.1038/onc.2011.502] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Many tumors exhibit elevated chromosome mis-segregation termed chromosome instability (CIN), which is likely to be a potent driver of tumor progression and drug resistance. Causes of CIN are poorly understood but probably include prior genome tetraploidization, centrosome amplification and mitotic checkpoint defects. This study identifies epigenetic alteration of the centromere as a potential contributor to the CIN phenotype. The centromere controls chromosome segregation and consists of higher-order repeat (HOR) alpha-satellite DNA packaged into two chromatin domains: the kinetochore, harboring the centromere-specific H3 variant centromere protein A (CENP-A), and the pericentromeric heterochromatin, considered important for cohesion. Perturbation of centromeric chromatin in model systems causes CIN. As cancer cells exhibit widespread chromatin changes, we hypothesized that pericentromeric chromatin structure could also be affected, contributing to CIN. Cytological and chromatin immunoprecipitation and PCR (ChIP-PCR)-based analyses of HT1080 cancer cells showed that only one of the two HORs on chromosomes 5 and 7 incorporate CENP-A, an organization conserved in all normal and cancer-derived cells examined. Contrastingly, the heterochromatin marker H3K9me3 (trimethylation of H3 lysine 9) mapped to all four HORs and ChIP-PCR showed an altered pattern of H3K9me3 in cancer cell lines and breast tumors, consistent with a reduction on the kinetochore-forming HORs. The JMJD2B demethylase is overexpressed in breast tumors with a CIN phenotype, and overexpression of exogenous JMJD2B in cultured breast epithelial cells caused loss of centromere-associated H3K9me3 and increased CIN. These findings suggest that impaired maintenance of pericentromeric heterochromatin may contribute to CIN in cancer and be a novel therapeutic target.
Collapse
Affiliation(s)
- R B Slee
- Department of Medical and Molecular Genetics, Indiana University School of Medicine (IUSM), Indianapolis, IN 46202, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Paar V, Gluncic M, Rosandic M, Basar I, Vlahovic I. Intragene Higher Order Repeats in Neuroblastoma BreakPoint Family Genes Distinguish Humans from Chimpanzees. Mol Biol Evol 2011; 28:1877-92. [DOI: 10.1093/molbev/msr009] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
33
|
Paar V, Glunčić M, Basar I, Rosandić M, Paar P, Cvitković M. Large Tandem, Higher Order Repeats and Regularly Dispersed Repeat Units Contribute Substantially to Divergence Between Human and Chimpanzee Y Chromosomes. J Mol Evol 2010; 72:34-55. [DOI: 10.1007/s00239-010-9401-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2010] [Accepted: 10/25/2010] [Indexed: 10/18/2022]
|
34
|
Paar V, Pavin N, Basar I, Rosandić M, Gluncić M, Paar N. Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats. BMC Bioinformatics 2008; 9:466. [PMID: 18980673 PMCID: PMC2661002 DOI: 10.1186/1471-2105-9-466] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2008] [Accepted: 11/03/2008] [Indexed: 11/28/2022] Open
Abstract
Background Identification of approximate tandem repeats is an important task of broad significance and still remains a challenging problem of computational genomics. Often there is no single best approach to periodicity detection and a combination of different methods may improve the prediction accuracy. Discrete Fourier transform (DFT) has been extensively used to study primary periodicities in DNA sequences. Here we investigate the application of DFT method to identify and study alphoid higher order repeats. Results We used method based on DFT with mapping of symbolic into numerical sequence to identify and study alphoid higher order repeats (HOR). For HORs the power spectrum shows equidistant frequency pattern, with characteristic two-level hierarchical organization as signature of HOR. Our case study was the 16 mer HOR tandem in AC017075.8 from human chromosome 7. Very long array of equidistant peaks at multiple frequencies (more than a thousand higher harmonics) is based on fundamental frequency of 16 mer HOR. Pronounced subset of equidistant peaks is based on multiples of the fundamental HOR frequency (multiplication factor n for nmer) and higher harmonics. In general, nmer HOR-pattern contains equidistant secondary periodicity peaks, having a pronounced subset of equidistant primary periodicity peaks. This hierarchical pattern as signature for HOR detection is robust with respect to monomer insertions and deletions, random sequence insertions etc. For a monomeric alphoid sequence only primary periodicity peaks are present. The 1/fβ – noise and periodicity three pattern are missing from power spectra in alphoid regions, in accordance with expectations. Conclusion DFT provides a robust detection method for higher order periodicity. Easily recognizable HOR power spectrum is characterized by hierarchical two-level equidistant pattern: higher harmonics of the fundamental HOR-frequency (secondary periodicity) and a subset of pronounced peaks corresponding to constituent monomers (primary periodicity). The number of lower frequency peaks (secondary periodicity) below the frequency of the first primary periodicity peak reveals the size of nmer HOR, i.e., the number n of monomers contained in consensus HOR.
Collapse
Affiliation(s)
- Vladimir Paar
- Faculty of Science, University of Zagreb, Bijenicka 32, Zagreb, Croatia.
| | | | | | | | | | | |
Collapse
|
35
|
Rosandić M, Glunčić M, Paar V, Basar I. The role of alphoid higher order repeats (HORs) in the centromere folding. J Theor Biol 2008; 254:555-60. [DOI: 10.1016/j.jtbi.2008.06.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2007] [Revised: 05/13/2008] [Accepted: 06/06/2008] [Indexed: 10/21/2022]
|
36
|
Paar V, Basar I, Rosandić M, Glunčić M. Consensus higher order repeats and frequency of string distributions in human genome. Curr Genomics 2007; 8:93-111. [PMID: 18660848 PMCID: PMC2435359 DOI: 10.2174/138920207780368169] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2007] [Revised: 01/26/2007] [Accepted: 01/30/2007] [Indexed: 02/01/2023] Open
Abstract
Key string algorithm (KSA) could be viewed as robust computational generalization of restriction enzyme method. KSA enables robust and effective identification and structural analyzes of any given genomic sequences, like in the case of NCBI assembly for human genome. We have developed a method, using total frequency distribution of all r-bp key strings in dependence on the fragment length l, to determine the exact size of all repeats within the given genomic sequence, both of monomeric and HOR type. Subsequently, for particular fragment lengths equal to each of these repeat sizes we compute the partial frequency distribution of r-bp key strings; the key string with highest frequency is a dominant key string, optimal for segmentation of a given genomic sequence into repeat units. We illustrate how a wide class of 3-bp key strings leads to a key-string-dependent periodic cell which enables a simple identification and consensus length determinations of HORs, or any other highly convergent repeat of monomeric or HOR type, both tandem or dispersed. We illustrated KSA application for HORs in human genome and determined consensus HORs in the Build 35.1 assembly. In the next step we compute suprachromosomal family classification and CENP-B box / pJalpha distributions for HORs. In the case of less convergent repeats, like for example monomeric alpha satellite (20-40% divergence), we searched for optimal compact key string using frequency method and developed a concept of composite key string (GAAAC--CTTTG) or flexible relaxation (28 bp key string) which provides both monomeric alpha satellites as well as alpha monomer segmentation of internal HOR structure. This method is convenient also for study of R-strand (direct) / S-strand (reverse complement) alpha monomer alternations. Using KSA we identified 16 alternating regions of R-strand and S-strand monomers in one contig in choromosome 7. Use of CENP-B box and/or pJalpha motif as key string is suitable both for identification of HORs and monomeric pattern as well as for studies of CENP-B box / pJalpha distribution. As an example of application of KSA to sequences outside of HOR regions we present our finding of a tandem with highly convergent 3434-bp Long monomer in chromosome 5 (divergence less then 0.3%).
Collapse
Affiliation(s)
- Vladimir Paar
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | - Ivan Basar
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | - Marija Rosandić
- Department of Internal Medicine,
University Hospital Rebro, Kišpatićeva 12, 10000 Zagreb, Croatia
| | - Matko Glunčić
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| |
Collapse
|