1
|
Kara Öztürk SD, Tek AL. Novel centromeric repetitive DNA elements reveal karyotype dynamics in polyploid sainfoin (Onobrychis viciifolia). Chromosoma 2024:10.1007/s00412-024-00824-y. [PMID: 39269484 DOI: 10.1007/s00412-024-00824-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 08/22/2024] [Accepted: 08/29/2024] [Indexed: 09/15/2024]
Abstract
Polyploidy is a common feature in eukaryotes with one of paramount consequences leading to better environmental adaptation. Heterochromatin is often located at telomeres and centromeres and contains repetitive DNA sequences. Sainfoin (Onobrychis viciifolia) is an important perennial forage legume for sustainable agriculture. However, there are only a few studies on the sainfoin genome and chromosomes. In this study, novel tandem repetitive DNA sequences of the sainfoin genome (OnVi180, OnVi169, OnVi176 and OnVidimer) were characterized using bioinformatics, molecular and cytogenetic approaches. The OnVi180 and OnVi169 elements colocalized within functional centromeres. The OnVi176 and OnVidimer elements were localized in centromeric, subtelomeric and interstitial regions. We constructed a sainfoin karyotype that distinguishes the seven basic chromosome groups. Our study provides the first detailed description of heterochromatin and chromosome structure of sainfoin and proposes an origin of heterozygous ancestral genomes, possibly from the same ancestral diploid species, not necessarily from different species, or for chromosome rearrangements after polyploidy. Overall, we discuss our novel and complementary findings in a polyploid crop with unique and complex chromosomal features.
Collapse
Affiliation(s)
- Sevim D Kara Öztürk
- Department of Agricultural Genetic Engineering, Ayhan Şahenk Faculty of Agricultural Sciences and Technologies, Niğde Ömer Halisdemir University, Niğde, Turkey
| | - Ahmet L Tek
- Department of Agricultural Genetic Engineering, Ayhan Şahenk Faculty of Agricultural Sciences and Technologies, Niğde Ömer Halisdemir University, Niğde, Turkey.
| |
Collapse
|
2
|
Gao S, Jia Y, Guo H, Xu T, Wang B, Bush SJ, Wan S, Zhang Y, Yang X, Ye K. The centromere landscapes of four karyotypically diverse Papaver species provide insights into chromosome evolution and speciation. CELL GENOMICS 2024; 4:100626. [PMID: 39084227 PMCID: PMC11406182 DOI: 10.1016/j.xgen.2024.100626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 04/16/2024] [Accepted: 07/09/2024] [Indexed: 08/02/2024]
Abstract
Understanding the roles played by centromeres in chromosome evolution and speciation is complicated by the fact that centromeres comprise large arrays of tandemly repeated satellite DNA, which hinders high-quality assembly. Here, we used long-read sequencing to generate nearly complete genome assemblies for four karyotypically diverse Papaver species, P. setigerum (2n = 44), P. somniferum (2n = 22), P. rhoeas (2n = 14), and P. bracteatum (2n = 14), collectively representing 45 gapless centromeres. We identified four centromere satellite (cenSat) families and experimentally validated two representatives. For the two allopolyploid genomes (P. somniferum and P. setigerum), we characterized the subgenomic distribution of each satellite and identified a "homogenizing" phase of centromere evolution in the aftermath of hybridization. An interspecies comparison of the peri-centromeric regions further revealed extensive centromere-mediated chromosome rearrangements. Taking these results together, we propose a model for studying cenSat competition after hybridization and shed further light on the complex role of the centromere in speciation.
Collapse
Affiliation(s)
- Shenghan Gao
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Yanyan Jia
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Hongtao Guo
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Tun Xu
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Bo Wang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Stephen J Bush
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Shijie Wan
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Yimeng Zhang
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Xiaofei Yang
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.
| | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; Center for Mathematical Medical, The First Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, China; Genome Institute, The First Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi 710061, China; School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China; Faculty of Science, Leiden University, Leiden 2311EZ, the Netherlands.
| |
Collapse
|
3
|
Arora UP, Dumont BL. Molecular evolution of the mammalian kinetochore complex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.600994. [PMID: 38979348 PMCID: PMC11230421 DOI: 10.1101/2024.06.27.600994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Mammalian centromeres are satellite-rich chromatin domains that serve as sites for kinetochore complex assembly. Centromeres are highly variable in sequence and satellite organization across species, but the processes that govern the co-evolutionary dynamics between rapidly evolving centromeres and their associated kinetochore proteins remain poorly understood. Here, we pursue a course of phylogenetic analyses to investigate the molecular evolution of the complete kinetochore complex across primate and rodent species with divergent centromere repeat sequences and features. We show that many protein components of the core centromere associated network (CCAN) harbor signals of adaptive evolution, consistent with their intimate association with centromere satellite DNA and roles in the stability and recruitment of additional kinetochore proteins. Surprisingly, CCAN and outer kinetochore proteins exhibit comparable rates of adaptive divergence, suggesting that changes in centromere DNA can ripple across the kinetochore to drive adaptive protein evolution within distant domains of the complex. Our work further identifies kinetochore proteins subject to lineage-specific adaptive evolution, including rapidly evolving proteins in species with centromere satellites characterized by higher-order repeat structure and lacking CENP-B boxes. Thus, features of centromeric chromatin beyond the linear DNA sequence may drive selection on kinetochore proteins. Overall, our work spotlights adaptively evolving proteins with diverse centromere-associated functions, including centromere chromatin structure, kinetochore protein assembly, kinetochore-microtubule association, cohesion maintenance, and DNA damage response pathways. These adaptively evolving kinetochore protein candidates present compelling opportunities for future functional investigations exploring how their concerted changes with centromere DNA ensure the maintenance of genome stability.
Collapse
Affiliation(s)
- Uma P Arora
- The Jackson Laboratory, 600 Main Street, Bar Harbor ME 04609
- Tufts University, Graduate School of Biomedical Sciences, 136 Harrison Ave, Boston MA 02111
| | - Beth L Dumont
- The Jackson Laboratory, 600 Main Street, Bar Harbor ME 04609
- Tufts University, Graduate School of Biomedical Sciences, 136 Harrison Ave, Boston MA 02111
- Graduate School of Biomedical Science and Engineering, The University of Maine, Orono, Maine, 04469
| |
Collapse
|
4
|
Flynn JM, Yamashita YM. The implications of satellite DNA instability on cellular function and evolution. Semin Cell Dev Biol 2024; 156:152-159. [PMID: 37852904 DOI: 10.1016/j.semcdb.2023.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/21/2023] [Accepted: 10/11/2023] [Indexed: 10/20/2023]
Abstract
Abundant tandemly repeated satellite DNA is present in most eukaryotic genomes. Previous limitations including a pervasive view that it was uninteresting junk DNA, combined with challenges in studying it, are starting to dissolve - and recent studies have found important functions for satellite DNAs. The observed rapid evolution and implied instability of satellite DNA now has important significance for their functions and maintenance within the genome. In this review, we discuss the processes that lead to satellite DNA copy number instability, and the importance of mechanisms to manage the potential negative effects of instability. Satellite DNA is vulnerable to challenges during replication and repair, since it forms difficult-to-process secondary structures and its homology within tandem arrays can result in various types of recombination. Satellite DNA instability may be managed by DNA or chromatin-binding proteins ensuring proper nuclear localization and repair, or by proteins that process aberrant structures that satellite DNAs tend to form. We also discuss the pattern of satellite DNA mutations from recent mutation accumulation (MA) studies that have tracked changes in satellite DNA for up to 1000 generations with minimal selection. Finally, we highlight examples of satellite evolution from studies that have characterized satellites across millions of years of Drosophila fruit fly evolution, and discuss possible ways that selection might act on the satellite DNA composition.
Collapse
Affiliation(s)
- Jullien M Flynn
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA.
| | - Yukiko M Yamashita
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA; Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
5
|
Graham E, Esashi F. DNA strand breaks at centromeres: Friend or foe? Semin Cell Dev Biol 2024; 156:141-151. [PMID: 37872040 DOI: 10.1016/j.semcdb.2023.10.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 09/22/2023] [Accepted: 10/11/2023] [Indexed: 10/25/2023]
Abstract
Centromeres are large structural regions in the genomic DNA, which are essential for accurately transmitting a complete set of chromosomes to daughter cells during cell division. In humans, centromeres consist of highly repetitive α-satellite DNA sequences and unique epigenetic components, forming large proteinaceous structures required for chromosome segregation. Despite their biological importance, there is a growing body of evidence for centromere breakage across the cell cycle, including periods of quiescence. In this review, we provide an up-to-date examination of the distinct centromere environments at different stages of the cell cycle, highlighting their plausible contribution to centromere breakage. Additionally, we explore the implications of these breaks on centromere function, both in terms of negative consequences and potential positive effects.
Collapse
Affiliation(s)
- Emily Graham
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Fumiko Esashi
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK.
| |
Collapse
|
6
|
Di Tommaso E, Giunta S. Dynamic interplay between human alpha-satellite DNA structure and centromere functions. Semin Cell Dev Biol 2024; 156:130-140. [PMID: 37926668 DOI: 10.1016/j.semcdb.2023.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 10/04/2023] [Accepted: 10/10/2023] [Indexed: 11/07/2023]
Abstract
Maintenance of genome stability relies on functional centromeres for correct chromosome segregation and faithful inheritance of the genetic information. The human centromere is the primary constriction within mitotic chromosomes made up of repetitive alpha-satellite DNA hierarchically organized in megabase-long arrays of near-identical higher order repeats (HORs). Centromeres are epigenetically specified by the presence of the centromere-specific histone H3 variant, CENP-A, which enables the assembly of the kinetochore for microtubule attachment. Notably, centromeric DNA is faithfully inherited as intact haplotypes from the parents to the offspring without intervening recombination, yet, outside of meiosis, centromeres are akin to common fragile sites (CFSs), manifesting crossing-overs and ongoing sequence instability. Consequences of DNA changes within the centromere are just starting to emerge, with unclear effects on intra- and inter-generational inheritance driven by centromere's essential role in kinetochore assembly. Here, we review evidence of meiotic selection operating to mitigate centromere drive, as well as recent reports on centromere damage, recombination and repair during the mitotic cell division. We propose an antagonistic pleiotropy interpretation to reconcile centromere DNA instability as both driver of aneuploidy that underlies degenerative diseases, while also potentially necessary for the maintenance of homogenized HORs for centromere function. We attempt to provide a framework for this conceptual leap taking into consideration the structural interface of centromere-kinetochore interaction and present case scenarios for its malfunctioning. Finally, we offer an integrated working model to connect DNA instability, chromatin, and structural changes with functional consequences on chromosome integrity.
Collapse
Affiliation(s)
- Elena Di Tommaso
- Laboratory of Genome Evolution, Department of Biology & Biotechnology Charles Darwin, Sapienza University of Rome, Rome 00185, Italy
| | - Simona Giunta
- Laboratory of Genome Evolution, Department of Biology & Biotechnology Charles Darwin, Sapienza University of Rome, Rome 00185, Italy.
| |
Collapse
|
7
|
Marcon F, Giunta S, Bignami M. Emerging roles of DNA repair factors in the stability of centromeres. Semin Cell Dev Biol 2024; 156:121-129. [PMID: 37852903 DOI: 10.1016/j.semcdb.2023.10.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 10/07/2023] [Accepted: 10/10/2023] [Indexed: 10/20/2023]
Abstract
Satellite DNA sequences are an integral part of centromeres, regions critical for faithful segregation of chromosomes during cell division. Because of their complex repetitive structure, satellite DNA may act as a barrier to DNA replication and other DNA based transactions ultimately resulting in chromosome breakage. Over the past two decades, several DNA repair proteins have been shown to bind and function at centromeres. While the importance of these repair factors is highlighted by various structural and numerical chromosome aberrations resulting from their inactivation, their roles in helping to maintain genome stability by solving the intrinsic difficulties of satellite DNA replication or promoting their repair are just starting to emerge. In this review, we summarize the current knowledge on the role of DNA repair and DNA damage response proteins in maintaining the structure and function of centromeres in different contexts. We also report the recent connection between the roles of specific DNA repair factors at these genomic loci with age-related increase of chromosomal instability under physiological and pathological conditions.
Collapse
Affiliation(s)
- Francesca Marcon
- Department of Environment and Health, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy.
| | - Simona Giunta
- Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, Sapienza University of Rome, 00185, Italy
| | - Margherita Bignami
- Department of Environment and Health, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| |
Collapse
|
8
|
Pi X, Zhang Q, Wang X, Jiang F. Retinoblastoma and polydactyly in a child with 46, XY, 15pstk+ karyotype-A case report and literature review. Mol Genet Genomic Med 2024; 12:e2414. [PMID: 38465842 PMCID: PMC10926652 DOI: 10.1002/mgg3.2414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 01/08/2024] [Accepted: 02/21/2024] [Indexed: 03/12/2024] Open
Abstract
BACKGROUND Retinoblastoma (Rb) is the most common intraocular malignancy in childhood, originating from primitive retinal stem cells or cone precursor cells. It can be triggered by mutations of the RB1 gene or amplification of the MYCN gene. Rb may rarely present with polydactyly. METHODS We conducted karyotype analysis, copy number variation sequencing, and whole-genome sequencing on the infant proband and his family. The clinical course and laboratory results of the proband's infant were documented and collected. We also reviewed the relevant literature. RESULTS A 68-day-old boy presented with preaxial polydactyly and corneal edema. His intraocular pressure (IOP) was 40/19 mmHg, and color Doppler imaging revealed vitreous solid mass-occupying lesions with calcification in the right eye. Ocular CT showed flaky high-density and calcification in the right eye. This was classified as an International Retinoblastoma Staging System group E retinoblastoma with an indication for enucleation. Enucleation and orbital implantation were performed on the child's right eye. Karyotype analysis revealed an abnormal 46, XY, 15pstk+ karyotype, and the mother exhibited diploidy of the short arm of chromosome 15. The Alx-4 development factor, 13q deletion syndrome, and the PAPA2 gene have been reported as potential mechanisms for Rb combined with polydactyly. CONCLUSION We report the case of a baby boy with Rb and polydactyly exhibiting a 46, XY, 15pstk+ Karyotype. We discuss potential genetic factors related to both Rb and polydactyly. Furthermore, there is a need for further exploration into the impact of chromosomal polymorphisms in Rb with polydactyly.
Collapse
Affiliation(s)
- Xiaohuan Pi
- Department of OphthalmologyThe Sixth Hospital of Wuhan, Affiliated Hospital of Jianghan UniversityWuhanChina
| | - Qiming Zhang
- Department of OphthalmologyThe Sixth Hospital of Wuhan, Affiliated Hospital of Jianghan UniversityWuhanChina
| | - Xinghua Wang
- Department of Ophthalmology, Union Hospital, Tongji Medical CollegeHuazhong University of Science and TechnologyWuhanChina
| | - Fagang Jiang
- Department of Ophthalmology, Union Hospital, Tongji Medical CollegeHuazhong University of Science and TechnologyWuhanChina
| |
Collapse
|
9
|
Pinto LM, Pailas A, Bondarchenko M, Sharma AB, Neumann K, Rizzo AJ, Jeanty C, Nicot N, Racca C, Graham MK, Naughton C, Liu Y, Chen CL, Meakin PJ, Gilbert N, Britton S, Meeker AK, Heaphy CM, Larminat F, Van Dyck E. DAXX promotes centromeric stability independently of ATRX by preventing the accumulation of R-loop-induced DNA double-stranded breaks. Nucleic Acids Res 2024; 52:1136-1155. [PMID: 38038252 PMCID: PMC10853780 DOI: 10.1093/nar/gkad1141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 11/08/2023] [Accepted: 11/13/2023] [Indexed: 12/02/2023] Open
Abstract
Maintaining chromatin integrity at the repetitive non-coding DNA sequences underlying centromeres is crucial to prevent replicative stress, DNA breaks and genomic instability. The concerted action of transcriptional repressors, chromatin remodelling complexes and epigenetic factors controls transcription and chromatin structure in these regions. The histone chaperone complex ATRX/DAXX is involved in the establishment and maintenance of centromeric chromatin through the deposition of the histone variant H3.3. ATRX and DAXX have also evolved mutually-independent functions in transcription and chromatin dynamics. Here, using paediatric glioma and pancreatic neuroendocrine tumor cell lines, we identify a novel ATRX-independent function for DAXX in promoting genome stability by preventing transcription-associated R-loop accumulation and DNA double-strand break formation at centromeres. This function of DAXX required its interaction with histone H3.3 but was independent of H3.3 deposition and did not reflect a role in the repression of centromeric transcription. DAXX depletion mobilized BRCA1 at centromeres, in line with BRCA1 role in counteracting centromeric R-loop accumulation. Our results provide novel insights into the mechanisms protecting the human genome from chromosomal instability, as well as potential perspectives in the treatment of cancers with DAXX alterations.
Collapse
Affiliation(s)
- Lia M Pinto
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
- Faculty of Science, Technology and Communication, University of Luxembourg, L-4365 Esch-sur-Alzette, Luxembourg
- Discovery & Translational Science Department, Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds LS2 9JT, UK
| | - Alexandros Pailas
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
- Faculty of Science, Technology and Communication, University of Luxembourg, L-4365 Esch-sur-Alzette, Luxembourg
| | - Max Bondarchenko
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
- Faculty of Science, Technology and Communication, University of Luxembourg, L-4365 Esch-sur-Alzette, Luxembourg
| | - Abhishek Bharadwaj Sharma
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
| | - Katrin Neumann
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
| | - Anthony J Rizzo
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21231, USA
| | - Céline Jeanty
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
| | - Nathalie Nicot
- Translational Medicine Operations Hub, Luxembourg Institute of Health (LIH), Luxembourg, Luxembourg
| | - Carine Racca
- Institut de Pharmacologie et Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III - Paul Sabatier (UT3), 31077 Toulouse Cedex 4, France
| | - Mindy K Graham
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21231, USA
| | - Catherine Naughton
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh EH4 1QY, UK
| | - Yaqun Liu
- Institut Curie, PSL Research University, CNRS UMR3244, Dynamics of Genetic Information, Sorbonne Université, 75248 Paris Cedex 05, France
| | - Chun-Long Chen
- Institut Curie, PSL Research University, CNRS UMR3244, Dynamics of Genetic Information, Sorbonne Université, 75248 Paris Cedex 05, France
| | - Paul J Meakin
- Discovery & Translational Science Department, Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds LS2 9JT, UK
| | - Nick Gilbert
- Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh EH4 1QY, UK
| | - Sébastien Britton
- Institut de Pharmacologie et Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III - Paul Sabatier (UT3), 31077 Toulouse Cedex 4, France
| | - Alan K Meeker
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21231, USA
| | - Christopher M Heaphy
- Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA
| | - Florence Larminat
- Institut de Pharmacologie et Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III - Paul Sabatier (UT3), 31077 Toulouse Cedex 4, France
| | - Eric Van Dyck
- DNA Repair and Chemoresistance Group, Department of Cancer Research, Luxembourg Institute of Health (LIH), L-1210 Luxembourg, Luxembourg
| |
Collapse
|
10
|
Ramirez P, Sun W, Kazempour Dehkordi S, Zare H, Fongang B, Bieniek KF, Frost B. Nanopore-based DNA long-read sequencing analysis of the aged human brain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.01.578450. [PMID: 38370753 PMCID: PMC10871260 DOI: 10.1101/2024.02.01.578450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Aging disrupts cellular processes such as DNA repair and epigenetic control, leading to a gradual buildup of genomic alterations that can have detrimental effects in post-mitotic cells. Genomic alterations in regions of the genome that are rich in repetitive sequences, often termed "dark loci," are difficult to resolve using traditional sequencing approaches. New long-read technologies offer promising avenues for exploration of previously inaccessible regions of the genome. Using nanopore-based long-read whole-genome sequencing of DNA extracted from aged 18 human brains, we identify previously unreported structural variants and methylation patterns within repetitive DNA, focusing on transposable elements ("jumping genes") as crucial sources of variation, particularly in dark loci. Our analyses reveal potential somatic insertion variants and provides DNA methylation frequencies for many retrotransposon families. We further demonstrate the utility of this technology for the study of these challenging genomic regions in brains affected by Alzheimer's disease and identify significant differences in DNA methylation in pathologically normal brains versus those affected by Alzheimer's disease. Highlighting the power of this approach, we discover specific polymorphic retrotransposons with altered DNA methylation patterns. These retrotransposon loci have the potential to contribute to pathology, warranting further investigation in Alzheimer's disease research. Taken together, our study provides the first long-read DNA sequencing-based analysis of retrotransposon sequences, structural variants, and DNA methylation in the aging brain affected with Alzheimer's disease neuropathology.
Collapse
Affiliation(s)
- Paulino Ramirez
- Barshop Institute for Longevity and Aging Studies, University of Texas Health San Antonio, San Antonio, Texas
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| | - Wenyan Sun
- Barshop Institute for Longevity and Aging Studies, University of Texas Health San Antonio, San Antonio, Texas
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
- School of Pharmacy, University of Missouri-Kansas City, Kansas City, Missouri
| | - Shiva Kazempour Dehkordi
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| | - Habil Zare
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| | - Bernard Fongang
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Biochemistry & Structural Biology, University of Texas Health San Antonio, San Antonio, Texas
| | - Kevin F. Bieniek
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Pathology, University of Texas Health San Antonio, San Antonio, Texas
| | - Bess Frost
- Barshop Institute for Longevity and Aging Studies, University of Texas Health San Antonio, San Antonio, Texas
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, University of Texas Health San Antonio, San Antonio, Texas
- Department of Cell Systems and Anatomy, University of Texas Health San Antonio, San Antonio, Texas
| |
Collapse
|
11
|
Volpe E, Corda L, Tommaso ED, Pelliccia F, Ottalevi R, Licastro D, Guarracino A, Capulli M, Formenti G, Tassone E, Giunta S. The complete diploid reference genome of RPE-1 identifies human phased epigenetic landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.01.565049. [PMID: 38168337 PMCID: PMC10760208 DOI: 10.1101/2023.11.01.565049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Comparative analysis of recent human genome assemblies highlights profound sequence divergence that peaks within polymorphic loci such as centromeres. This raises the question about the adequacy of relying on human reference genomes to accurately analyze sequencing data derived from experimental cell lines. Here, we generated the complete diploid genome assembly for the human retinal epithelial cells (RPE-1), a widely used non-cancer laboratory cell line with a stable karyotype, to use as matched reference for multi-omics sequencing data analysis. Our RPE1v1.0 assembly presents completely phased haplotypes and chromosome-level scaffolds that span centromeres with ultra-high base accuracy (>QV60). We mapped the haplotype-specific genomic variation specific to this cell line including t(Xq;10q), a stable 73.18 Mb duplication of chromosome 10 translocated onto the microdeleted chromosome X telomere t(Xq;10q). Polymorphisms between haplotypes of the same genome reveals genetic and epigenetic variation for all chromosomes, especially at centromeres. The RPE-1 assembly as matched reference genome improves mapping quality of multi-omics reads originating from RPE-1 cells with drastic reduction in alignments mismatches compared to using the most complete human reference to date (CHM13). Leveraging the accuracy achieved using a matched reference, we were able to identify the kinetochore sites at base pair resolution and show unprecedented variation between haplotypes. This work showcases the use of matched reference genomes for multiomics analyses and serves as the foundation for a call to comprehensively assemble experimentally relevant cell lines for widespread application.
Collapse
Affiliation(s)
- Emilia Volpe
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Luca Corda
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Elena Di Tommaso
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Franca Pelliccia
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Riccardo Ottalevi
- Department of Bioinformatic, Dante Genomics Corp Inc., 667 Madison Avenue, New York, NY 10065 USA and S.s.17, 67100, L’Aquila, Italy
| | | | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Mattia Capulli
- Department of Biotechnological and Applied Clinical Sciences, University of L’Aquila, L’Aquila, Italy
| | - Giulio Formenti
- The Rockefeller University, 1230 York Avenue, 10065 New York, USA
| | - Evelyne Tassone
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Simona Giunta
- Giunta Laboratory of Genome Evolution, Department of Biology and Biotechnologies Charles Darwin, University of Rome “Sapienza”, Piazzale Aldo Moro 5, 00185 Rome, Italy
| |
Collapse
|
12
|
Xu R, Pan Z, Nakagawa T. Gross Chromosomal Rearrangement at Centromeres. Biomolecules 2023; 14:28. [PMID: 38254628 PMCID: PMC10813616 DOI: 10.3390/biom14010028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 12/19/2023] [Accepted: 12/22/2023] [Indexed: 01/24/2024] Open
Abstract
Centromeres play essential roles in the faithful segregation of chromosomes. CENP-A, the centromere-specific histone H3 variant, and heterochromatin characterized by di- or tri-methylation of histone H3 9th lysine (H3K9) are the hallmarks of centromere chromatin. Contrary to the epigenetic marks, DNA sequences underlying the centromere region of chromosomes are not well conserved through evolution. However, centromeres consist of repetitive sequences in many eukaryotes, including animals, plants, and a subset of fungi, including fission yeast. Advances in long-read sequencing techniques have uncovered the complete sequence of human centromeres containing more than thousands of alpha satellite repeats and other types of repetitive sequences. Not only tandem but also inverted repeats are present at a centromere. DNA recombination between centromere repeats can result in gross chromosomal rearrangement (GCR), such as translocation and isochromosome formation. CENP-A chromatin and heterochromatin suppress the centromeric GCR. The key player of homologous recombination, Rad51, safeguards centromere integrity through conservative noncrossover recombination between centromere repeats. In contrast to Rad51-dependent recombination, Rad52-mediated single-strand annealing (SSA) and microhomology-mediated end-joining (MMEJ) lead to centromeric GCR. This review summarizes recent findings on the role of centromere and recombination proteins in maintaining centromere integrity and discusses how GCR occurs at centromeres.
Collapse
Affiliation(s)
- Ran Xu
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
| | - Ziyi Pan
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
| | - Takuro Nakagawa
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka 560-0043, Osaka, Japan
| |
Collapse
|
13
|
Bzikadze AV, Pevzner PA. UniAligner: a parameter-free framework for fast sequence alignment. Nat Methods 2023; 20:1346-1354. [PMID: 37580559 DOI: 10.1038/s41592-023-01970-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 07/05/2023] [Indexed: 08/16/2023]
Abstract
Even though the recent advances in 'complete genomics' revealed the previously inaccessible genomic regions, analysis of variations in centromeres and other extra-long tandem repeats (ETRs) faces an algorithmic challenge since there are currently no tools for accurate sequence comparison of ETRs. Counterintuitively, the classical alignment approaches, such as the Smith-Waterman algorithm, fail to construct biologically adequate alignments of ETRs. We present UniAligner-the parameter-free sequence alignment algorithm with sequence-dependent alignment scoring that automatically changes for any pair of compared sequences. UniAligner prioritizes matches of rare substrings that are more likely to be relevant to the evolutionary relationship between two sequences. We apply UniAligner to estimate the mutation rates in human centromeres, and quantify the extremely high rate of large duplications and deletions in centromeres. This high rate suggests that centromeres may represent some of the most rapidly evolving regions of the human genome with respect to their structural organization.
Collapse
Affiliation(s)
- Andrey V Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, La Jolla, CA, USA
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
14
|
Nassar R, Thompson L, Fouquerel E. Molecular mechanisms protecting centromeres from self-sabotage and implications for cancer therapy. NAR Cancer 2023; 5:zcad019. [PMID: 37180029 PMCID: PMC10167631 DOI: 10.1093/narcan/zcad019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 03/27/2023] [Accepted: 04/20/2023] [Indexed: 05/15/2023] Open
Abstract
Centromeres play a crucial role in DNA segregation by mediating the cohesion and separation of sister chromatids during cell division. Centromere dysfunction, breakage or compromised centromeric integrity can generate aneuploidies and chromosomal instability, which are cellular features associated with cancer initiation and progression. Maintaining centromere integrity is thus essential for genome stability. However, the centromere itself is prone to DNA breaks, likely due to its intrinsically fragile nature. Centromeres are complex genomic loci that are composed of highly repetitive DNA sequences and secondary structures and require the recruitment and homeostasis of a centromere-associated protein network. The molecular mechanisms engaged to preserve centromere inherent structure and respond to centromeric damage are not fully understood and remain a subject of ongoing research. In this article, we provide a review of the currently known factors that contribute to centromeric dysfunction and the molecular mechanisms that mitigate the impact of centromere damage on genome stability. Finally, we discuss the potential therapeutic strategies that could arise from a deeper understanding of the mechanisms preserving centromere integrity.
Collapse
Affiliation(s)
- Rim Nassar
- UPMC Hillman Cancer Center, Department of Pharmacology and Chemical Biology, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15232, USA
| | - Lily Thompson
- UPMC Hillman Cancer Center, Department of Pharmacology and Chemical Biology, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15232, USA
- Department of Biochemistry and Molecular Biology, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - Elise Fouquerel
- UPMC Hillman Cancer Center, Department of Pharmacology and Chemical Biology, University of Pittsburgh Cancer Institute, Pittsburgh, PA 15232, USA
| |
Collapse
|
15
|
Flynn JM, Hu KB, Clark AG. Three recent sex chromosome-to-autosome fusions in a Drosophila virilis strain with high satellite DNA content. Genetics 2023; 224:iyad062. [PMID: 37052958 PMCID: PMC10213488 DOI: 10.1093/genetics/iyad062] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 12/02/2022] [Accepted: 04/07/2023] [Indexed: 04/14/2023] Open
Abstract
The karyotype, or number and arrangement of chromosomes, has varying levels of stability across both evolution and disease. Karyotype changes often originate from DNA breaks near the centromeres of chromosomes, which generally contain long arrays of tandem repeats or satellite DNA. Drosophila virilis possesses among the highest relative satellite abundances of studied species, with almost half its genome composed of three related 7 bp satellites. We discovered a strain of D. virilis that we infer recently underwent three independent chromosome fusion events involving the X and Y chromosomes, in addition to one subsequent fission event. Here, we isolate and characterize the four different karyotypes we discovered in this strain which we believe demonstrates remarkable genome instability. We discovered that one of the substrains with an X-autosome fusion has an X-to-Y chromosome nondisjunction rate 20 × higher than the D. virilis reference strain (21% vs 1%). Finally, we found an overall higher rate of DNA breakage in the substrain with higher satellite DNA compared to a genetically similar substrain with less satellite DNA. This suggests that satellite DNA abundance may play a role in the risk of genome instability. Overall, we introduce a novel system consisting of a single strain with four different karyotypes, which we believe will be useful for future studies of genome instability, centromere function, and sex chromosome evolution.
Collapse
Affiliation(s)
- Jullien M Flynn
- Department of Molecular Biology and Genetics, Cornell University, Biotechnology Building Room 227, Ithaca, NY 14853, USA
| | - Kevin B Hu
- Department of Molecular Biology and Genetics, Cornell University, Biotechnology Building Room 227, Ithaca, NY 14853, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Biotechnology Building Room 227, Ithaca, NY 14853, USA
| |
Collapse
|
16
|
Mongia P, Toyofuku N, Pan Z, Xu R, Kinoshita Y, Oki K, Takahashi H, Ogura Y, Hayashi T, Nakagawa T. Fission yeast Srr1 and Skb1 promote isochromosome formation at the centromere. Commun Biol 2023; 6:551. [PMID: 37237082 DOI: 10.1038/s42003-023-04925-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 05/09/2023] [Indexed: 05/28/2023] Open
Abstract
Rad51 maintains genome integrity, whereas Rad52 causes non-canonical homologous recombination leading to gross chromosomal rearrangements (GCRs). Here we find that fission yeast Srr1/Ber1 and Skb1/PRMT5 promote GCRs at centromeres. Genetic and physical analyses show that srr1 and skb1 mutations reduce isochromosome formation mediated by centromere inverted repeats. srr1 increases DNA damage sensitivity in rad51 cells but does not abolish checkpoint response, suggesting that Srr1 promotes Rad51-independent DNA repair. srr1 and rad52 additively, while skb1 and rad52 epistatically reduce GCRs. Unlike srr1 or rad52, skb1 does not increase damage sensitivity. Skb1 regulates cell morphology and cell cycle with Slf1 and Pom1, respectively, but neither Slf1 nor Pom1 causes GCRs. Mutating conserved residues in the arginine methyltransferase domain of Skb1 greatly reduces GCRs. These results suggest that, through arginine methylation, Skb1 forms aberrant DNA structures leading to Rad52-dependent GCRs. This study has uncovered roles for Srr1 and Skb1 in GCRs at centromeres.
Collapse
Affiliation(s)
- Piyusha Mongia
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
| | - Naoko Toyofuku
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
| | - Ziyi Pan
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
| | - Ran Xu
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
| | - Yakumo Kinoshita
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
| | - Keitaro Oki
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan
| | - Hiroki Takahashi
- Medical Mycology Research Center, Chiba University, Chiba, 260-8673, Japan
| | - Yoshitoshi Ogura
- Division of Microbiology, Department of Infectious Medicine, Kurume University School of Medicine, Kurume, Fukuoka, 830-0011, Japan
| | - Tetsuya Hayashi
- Department of Bacteriology, Faculty of Medical Sciences, Kyushu University, Fukuoka, 812-8582, Japan
| | - Takuro Nakagawa
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan.
- Forefront Research Center, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka, 560-0043, Japan.
| |
Collapse
|
17
|
Carnie CJ, Armstrong L, Sebesta M, Ariza A, Wang X, Graham E, Zhu K, Ahel D. ERCC6L2 mitigates replication stress and promotes centromere stability. Cell Rep 2023; 42:112329. [PMID: 37014751 DOI: 10.1016/j.celrep.2023.112329] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 01/26/2023] [Accepted: 03/20/2023] [Indexed: 04/05/2023] Open
Abstract
Structurally complex genomic regions, such as centromeres, are inherently difficult to duplicate. The mechanism behind centromere inheritance is not well understood, and one of the key questions relates to the reassembly of centromeric chromatin following DNA replication. Here, we define ERCC6L2 as a key regulator of this process. ERCC6L2 accumulates at centromeres and promotes deposition of core centromeric factors. Interestingly, ERCC6L2-/- cells show unrestrained replication of centromeric DNA, likely caused by the erosion of centromeric chromatin. Beyond centromeres, ERCC6L2 facilitates replication at genomic repeats and non-canonical DNA structures. Notably, ERCC6L2 interacts with the DNA-clamp PCNA through an atypical peptide, presented here in a co-crystal structure. Finally, ERCC6L2 also restricts DNA end resection, acting independently of the 53BP1-REV7-Shieldin complex. We propose a mechanistic model, which reconciles seemingly distinct functions of ERCC6L2 in DNA repair and DNA replication. These findings provide a molecular context for studies linking ERCC6L2 to human disease.
Collapse
Affiliation(s)
| | - Lucy Armstrong
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Marek Sebesta
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Antonio Ariza
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Xiaomeng Wang
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Emily Graham
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Kang Zhu
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Dragana Ahel
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK.
| |
Collapse
|
18
|
Součková K, Jasík M, Sovadinová I, Sember A, Sychrová E, Konieczna A, Bystrý V, Dyková I, Blažek R, Lukšíková K, Pavlica T, Jankásek M, Altmanová M, Žák J, Zbončáková A, Reichard M, Slabý O. From fish to cells: Establishment of continuous cell lines from embryos of annual killifish Nothobranchius furzeri and N. kadleci. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023; 259:106517. [PMID: 37087860 DOI: 10.1016/j.aquatox.2023.106517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 03/24/2023] [Accepted: 03/30/2023] [Indexed: 05/03/2023]
Abstract
There is a growing need of alternative experimental models that avoid or minimize the use of animals due to ethical, economical, and scientific reasons. Surprisingly, the stable embryonic cell lines representing Nothobranchius spp., emerging vertebrate models in aging research, regenerative medicine, ecotoxicology, or genomics, have been not derived so far. This paper reports establishment and deep characterization of ten continuous cell lines from annual killifish embryos of N. furzeri and N. kadleci. The established cell lines exhibited mostly fibroblast- and epithelial-like morphology and steady growth rates with cell doubling time ranging from 27 to 40 h. All cell lines retained very similar characteristics even after continuous subcultivation (more than 100 passages) and extended storage in liquid nitrogen (∼3 years). The cytogenetic analysis of the cell lines revealed a diploid chromosome number mostly equal to 38 elements (i.e., the native chromosome count for both killifish species), with minor but diverse line/passage-specific karyotype changes compared to the patterns observed in non-cultured N. furzeri and N. kadleci somatic cells. Based on transcriptional analysis of marker genes, the cell lines displayed features of an undifferentiated state without signs of senescence even in advanced passages. We confirmed that the cell lines are transfectable and can form viable 3-D spheroids. The applicability of the cell lines for (eco)toxicological surveys was confirmed by assessing the effect of cytotoxic and growth inhibitory agents. Properties of established Nothobranchius embryonic cell lines open new possibilities for the application of this model in various fields of life sciences including molecular mechanisms of aging, karyotype (in)stability or differences in lifespan.
Collapse
Affiliation(s)
- Kamila Součková
- Ondřej Slabý Group, Molecular Medicine, Central European Institute of Technology, Masaryk University, Kamenice 5, Brno 625 00, Czech Republic.
| | - Matej Jasík
- Ondřej Slabý Group, Molecular Medicine, Central European Institute of Technology, Masaryk University, Kamenice 5, Brno 625 00, Czech Republic
| | - Iva Sovadinová
- RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno 602 00, Czech Republic
| | - Alexandr Sember
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov 277 21, Czech Republic
| | - Eliška Sychrová
- RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno 602 00, Czech Republic
| | - Anna Konieczna
- Ondřej Slabý Group, Molecular Medicine, Central European Institute of Technology, Masaryk University, Kamenice 5, Brno 625 00, Czech Republic
| | - Vojtěch Bystrý
- Ondřej Slabý Group, Molecular Medicine, Central European Institute of Technology, Masaryk University, Kamenice 5, Brno 625 00, Czech Republic
| | - Iva Dyková
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic
| | - Radim Blažek
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic; Institute of Vertebrate Biology, Czech Academy of Sciences, Brno 603 00, Czech Republic
| | - Karolína Lukšíková
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov 277 21, Czech Republic; Department of Genetics and Microbiology, Faculty of Science, Charles University, Prague 128 44, Czech Republic
| | - Tomáš Pavlica
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov 277 21, Czech Republic; Department of Zoology, Faculty of Science, Charles University, Prague 128 44, Czech Republic
| | - Marek Jankásek
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov 277 21, Czech Republic; Department of Zoology, Faculty of Science, Charles University, Prague 128 44, Czech Republic
| | - Marie Altmanová
- Institute of Animal Physiology and Genetics, Czech Academy of Sciences, Liběchov 277 21, Czech Republic; Department of Ecology, Faculty of Science, Charles University, Prague 128 44, Czech Republic
| | - Jakub Žák
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic; Institute of Vertebrate Biology, Czech Academy of Sciences, Brno 603 00, Czech Republic
| | - Adriana Zbončáková
- Ondřej Slabý Group, Molecular Medicine, Central European Institute of Technology, Masaryk University, Kamenice 5, Brno 625 00, Czech Republic
| | - Martin Reichard
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno 611 37, Czech Republic; Institute of Vertebrate Biology, Czech Academy of Sciences, Brno 603 00, Czech Republic; Department of Ecology and Vertebrate Zoology, University of Łódź, Łódź 90-237, Poland
| | - Ondřej Slabý
- Ondřej Slabý Group, Molecular Medicine, Central European Institute of Technology, Masaryk University, Kamenice 5, Brno 625 00, Czech Republic; Department of Biology, Faculty of Medicine, Masaryk University, Brno 625 00, Czech Republic
| |
Collapse
|
19
|
Lopes M, Louzada S, Ferreira D, Veríssimo G, Eleutério D, Gama-Carvalho M, Chaves R. Human Satellite 1A analysis provides evidence of pericentromeric transcription. BMC Biol 2023; 21:28. [PMID: 36755311 PMCID: PMC9909926 DOI: 10.1186/s12915-023-01521-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 01/19/2023] [Indexed: 02/10/2023] Open
Abstract
BACKGROUND Pericentromeric regions of human chromosomes are composed of tandem-repeated and highly organized sequences named satellite DNAs. Human classical satellite DNAs are classified into three families named HSat1, HSat2, and HSat3, which have historically posed a challenge for the assembly of the human reference genome where they are misrepresented due to their repetitive nature. Although being known for a long time as the most AT-rich fraction of the human genome, classical satellite HSat1A has been disregarded in genomic and transcriptional studies, falling behind other human satellites in terms of functional knowledge. Here, we aim to characterize and provide an understanding on the biological relevance of HSat1A. RESULTS The path followed herein trails with HSat1A isolation and cloning, followed by in silico analysis. Monomer copy number and expression data was obtained in a wide variety of human cell lines, with greatly varying profiles in tumoral/non-tumoral samples. HSat1A was mapped in human chromosomes and applied in in situ transcriptional assays. Additionally, it was possible to observe the nuclear organization of HSat1A transcripts and further characterize them by 3' RACE-Seq. Size-varying polyadenylated HSat1A transcripts were detected, which possibly accounts for the intricate regulation of alternative polyadenylation. CONCLUSION As far as we know, this work pioneers HSat1A transcription studies. With the emergence of new human genome assemblies, acrocentric pericentromeres are becoming relevant characters in disease and other biological contexts. HSat1A sequences and associated noncoding RNAs will most certainly prove significant in the future of HSat research.
Collapse
Affiliation(s)
- Mariana Lopes
- grid.12341.350000000121821287CytoGenomics Lab, Department of Genetics and Biotechnology (DGB), University of Trás-Os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal ,grid.9983.b0000 0001 2181 4263BioISI – Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Sandra Louzada
- grid.12341.350000000121821287CytoGenomics Lab, Department of Genetics and Biotechnology (DGB), University of Trás-Os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal ,grid.9983.b0000 0001 2181 4263BioISI – Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Daniela Ferreira
- grid.12341.350000000121821287CytoGenomics Lab, Department of Genetics and Biotechnology (DGB), University of Trás-Os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal ,grid.9983.b0000 0001 2181 4263BioISI – Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Gabriela Veríssimo
- grid.12341.350000000121821287CytoGenomics Lab, Department of Genetics and Biotechnology (DGB), University of Trás-Os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal ,grid.9983.b0000 0001 2181 4263BioISI – Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Daniel Eleutério
- grid.9983.b0000 0001 2181 4263BioISI – Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Margarida Gama-Carvalho
- grid.9983.b0000 0001 2181 4263BioISI – Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Raquel Chaves
- CytoGenomics Lab, Department of Genetics and Biotechnology (DGB), University of Trás-Os-Montes and Alto Douro (UTAD), 5000-801, Vila Real, Portugal. .,BioISI - Biosystems & Integrative Sciences Institute, Faculty of Sciences, University of Lisboa, 1749-016, Lisbon, Portugal.
| |
Collapse
|
20
|
Shukla S, Lazarchuk P, Pavlova MN, Sidorova JM. Genome-wide survey of D/E repeats in human proteins uncovers their instability and aids in identifying their role in the chromatin regulator ATAD2. iScience 2022; 25:105464. [PMCID: PMC9672403 DOI: 10.1016/j.isci.2022.105464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 08/03/2022] [Accepted: 10/26/2022] [Indexed: 11/15/2022] Open
Abstract
D/E repeats are stretches of aspartic and/or glutamic acid residues found in over 150 human proteins. We examined genomic stability of D/E repeats and functional characteristics of D/E repeat-containing proteins vis-à-vis the proteins with poly-Q or poly-A repeats, which are known to undergo pathologic expansions. Mining of tumor sequencing data revealed that D/E repeat-coding regions are similar to those coding poly-Qs and poly-As in increased incidence of trinucleotide insertions/deletions but differ in types and incidence of substitutions. D/E repeat-containing proteins preferentially function in chromatin metabolism and are the more likely to be nuclear and interact with core histones, the longer their repeats are. One of the longest D/E repeats of unknown function is in ATAD2, a bromodomain family ATPase frequently overexpressed in tumors. We demonstrate that D/E repeat deletion in ATAD2 suppresses its binding to nascent and mature chromatin and to the constitutive pericentromeric heterochromatin, where ATAD2 represses satellite transcription. Many human proteins contain runs of aspartic/glutamic acid residues (D/E repeats) D/E repeats show increased incidence of in-frame insertions/deletions in tumors Nuclear and histone-interacting proteins often have long D/E repeats D/E repeat of the oncogene ATAD2 controls its binding to pericentric chromatin
Collapse
Affiliation(s)
- Shalabh Shukla
- Department of Laboratory Medicine and Pathology, University of Washington, 1959 NE Pacific St., Box 357705, Seattle, WA 98195, USA
| | - Pavlo Lazarchuk
- Department of Laboratory Medicine and Pathology, University of Washington, 1959 NE Pacific St., Box 357705, Seattle, WA 98195, USA
| | - Maria N. Pavlova
- Department of Laboratory Medicine and Pathology, University of Washington, 1959 NE Pacific St., Box 357705, Seattle, WA 98195, USA
| | - Julia M. Sidorova
- Department of Laboratory Medicine and Pathology, University of Washington, 1959 NE Pacific St., Box 357705, Seattle, WA 98195, USA
- Corresponding author
| |
Collapse
|
21
|
Mirceta M, Shum N, Schmidt MHM, Pearson CE. Fragile sites, chromosomal lesions, tandem repeats, and disease. Front Genet 2022; 13:985975. [PMID: 36468036 PMCID: PMC9714581 DOI: 10.3389/fgene.2022.985975] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/02/2022] [Indexed: 09/16/2023] Open
Abstract
Expanded tandem repeat DNAs are associated with various unusual chromosomal lesions, despiralizations, multi-branched inter-chromosomal associations, and fragile sites. Fragile sites cytogenetically manifest as localized gaps or discontinuities in chromosome structure and are an important genetic, biological, and health-related phenomena. Common fragile sites (∼230), present in most individuals, are induced by aphidicolin and can be associated with cancer; of the 27 molecularly-mapped common sites, none are associated with a particular DNA sequence motif. Rare fragile sites ( ≳ 40 known), ≤ 5% of the population (may be as few as a single individual), can be associated with neurodevelopmental disease. All 10 molecularly-mapped folate-sensitive fragile sites, the largest category of rare fragile sites, are caused by gene-specific CGG/CCG tandem repeat expansions that are aberrantly CpG methylated and include FRAXA, FRAXE, FRAXF, FRA2A, FRA7A, FRA10A, FRA11A, FRA11B, FRA12A, and FRA16A. The minisatellite-associated rare fragile sites, FRA10B, FRA16B, can be induced by AT-rich DNA-ligands or nucleotide analogs. Despiralized lesions and multi-branched inter-chromosomal associations at the heterochromatic satellite repeats of chromosomes 1, 9, 16 are inducible by de-methylating agents like 5-azadeoxycytidine and can spontaneously arise in patients with ICF syndrome (Immunodeficiency Centromeric instability and Facial anomalies) with mutations in genes regulating DNA methylation. ICF individuals have hypomethylated satellites I-III, alpha-satellites, and subtelomeric repeats. Ribosomal repeats and subtelomeric D4Z4 megasatellites/macrosatellites, are associated with chromosome location, fragility, and disease. Telomere repeats can also assume fragile sites. Dietary deficiencies of folate or vitamin B12, or drug insults are associated with megaloblastic and/or pernicious anemia, that display chromosomes with fragile sites. The recent discovery of many new tandem repeat expansion loci, with varied repeat motifs, where motif lengths can range from mono-nucleotides to megabase units, could be the molecular cause of new fragile sites, or other chromosomal lesions. This review focuses on repeat-associated fragility, covering their induction, cytogenetics, epigenetics, cell type specificity, genetic instability (repeat instability, micronuclei, deletions/rearrangements, and sister chromatid exchange), unusual heritability, disease association, and penetrance. Understanding tandem repeat-associated chromosomal fragile sites provides insight to chromosome structure, genome packaging, genetic instability, and disease.
Collapse
Affiliation(s)
- Mila Mirceta
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Natalie Shum
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Monika H. M. Schmidt
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Christopher E. Pearson
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
22
|
The Chromatin Remodeler HELLS: A New Regulator in DNA Repair, Genome Maintenance, and Cancer. Int J Mol Sci 2022; 23:ijms23169313. [PMID: 36012581 PMCID: PMC9409174 DOI: 10.3390/ijms23169313] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 08/14/2022] [Accepted: 08/16/2022] [Indexed: 02/06/2023] Open
Abstract
Robust, tightly regulated DNA repair is critical to maintaining genome stability and preventing cancer. Eukaryotic DNA is packaged into chromatin, which has a profound, yet incompletely understood, regulatory influence on DNA repair and genome stability. The chromatin remodeler HELLS (helicase, lymphoid specific) has emerged as an important epigenetic regulator of DNA repair, genome stability, and multiple cancer-associated pathways. HELLS belongs to a subfamily of the conserved SNF2 ATP-dependent chromatin-remodeling complexes, which use energy from ATP hydrolysis to alter nucleosome structure and packaging of chromatin during the processes of DNA replication, transcription, and repair. The mouse homologue, LSH (lymphoid-specific helicase), plays an important role in the maintenance of heterochromatin and genome-wide DNA methylation, and is crucial in embryonic development, gametogenesis, and maturation of the immune system. Human HELLS is abundantly expressed in highly proliferating cells of the lymphoid tissue, skin, germ cells, and embryonic stem cells. Mutations in HELLS cause the human immunodeficiency syndrome ICF (Immunodeficiency, Centromeric instability, Facial anomalies). HELLS has been implicated in many types of cancer, including retinoblastoma, colorectal cancer, hepatocellular carcinoma, and glioblastoma. Here, we review and summarize accumulating evidence highlighting important roles for HELLS in DNA repair, genome maintenance, and key pathways relevant to cancer development, progression, and treatment.
Collapse
|
23
|
Repetitive Sequence Transcription in Breast Cancer. Cells 2022; 11:cells11162522. [PMID: 36010599 PMCID: PMC9406339 DOI: 10.3390/cells11162522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 08/05/2022] [Accepted: 08/12/2022] [Indexed: 11/17/2022] Open
Abstract
Repetitive sequences represent about half of the human genome. They are actively transcribed and play a role during development and in epigenetic regulation. The altered activity of repetitive sequences can lead to genomic instability and they can contribute to the establishment or the progression of degenerative diseases and cancer transformation. In this work, we analyzed the expression profiles of DNA repetitive sequences in the breast cancer specimens of the HMUCC cohort. Satellite expression is generally upregulated in breast cancers, with specific families upregulated per histotype: in HER2-enriched cancers, they are the human satellite II (HSATII), in luminal A and B, they are part of the ALR family and in triple-negative, they are part of SAR and GSAT families, together with a perturbation in the transcription from endogenous retroviruses and their LTR sequences. We report that the background expression of repetitive sequences in healthy tissues of cancer patients differs from the tissues of non-cancerous controls. To conclude, peculiar patterns of expression of repetitive sequences are reported in each specimen, especially in the case of transcripts arising from satellite repeats.
Collapse
|
24
|
Haig D. Paradox lost: Concerted evolution and centromeric instability: Centromeres are hospitable habitats for repeats that evolve adaptations for proliferation within the nucleus sometimes at organismal cost.: Centromeres are hospitable habitats for repeats that evolve adaptations for proliferation within the nucleus sometimes at organismal cost. Bioessays 2022; 44:e2200023. [PMID: 35748194 DOI: 10.1002/bies.202200023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 06/07/2022] [Accepted: 06/09/2022] [Indexed: 11/11/2022]
Abstract
Homologous centromeres compete for segregation to the secondary oocyte nucleus at female meiosis I. Centromeric repeats also compete with each other to populate centromeres in mitotic cells of the germline and have become adapted to use the recombinational machinery present at centromeres to promote their own propagation. Repeats are not needed at centromeres, rather centromeres appear to be hospitable habitats for the colonization and proliferation of repeats. This is probably an indirect consequence of two distinctive features of centromeric DNA. Centromeres are subject to breakage by the mechanical forces exerted by microtubules and meiotic crossing-over is suppressed. Centromeric proteins acting in trans are under selection to mitigate the costs of centromeric repeats acting in cis. Collateral costs of mitotic competition at centromeres may help to explain the high rates of aneuploidy observed in early human embryos.
Collapse
Affiliation(s)
- David Haig
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| |
Collapse
|
25
|
Kunyavskaya O, Dvorkina T, Bzikadze AV, Alexandrov I, Pevzner PA. Automated annotation of human centromeres with HORmon. Genome Res 2022; 32:1137-1151. [PMID: 35545449 DOI: 10.1101/gr.276362.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 05/06/2022] [Indexed: 11/24/2022]
Abstract
Recent advances in long-read sequencing opened a possibility to address the long-standing questions about the architecture and evolution of human centromeres. They also emphasized the need for centromere annotation (partitioning human centromeres into monomers and higher-order repeats (HORs)). Even though there was a half-century-long series of semi-manual studies of centromere architecture, a rigorous centromere annotation algorithm is still lacking. Moreover, an automated centromere annotation is a prerequisite for studies of genetic diseases associated with centromeres, and evolutionary studies of centromeres across multiple species. Although the monomer decomposition (transforming a centromere into a monocentromere written in the monomer alphabet) and the HOR decomposition (representing a monocentromere in the alphabet of HORs) are currently viewed as two separate problems, we demonstrate that they should be integrated into a single framework in such a way that HOR (monomer) inference affects monomer (HOR) inference. We thus developed the HORmon algorithm that integrates the monomer/HOR inference and automatically generates the human monomers/HORs that are largely consistent with the previous semi-manual inference.
Collapse
Affiliation(s)
- Olga Kunyavskaya
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University
| | - Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University
| | | | - Ivan Alexandrov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University
| | | |
Collapse
|
26
|
Saayman X, Esashi F. Breaking the paradigm: early insights from mammalian DNA breakomes. FEBS J 2022; 289:2409-2428. [PMID: 33792193 PMCID: PMC9451923 DOI: 10.1111/febs.15849] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 03/04/2021] [Accepted: 03/29/2021] [Indexed: 12/13/2022]
Abstract
DNA double-strand breaks (DSBs) can result from both exogenous and endogenous sources and are potentially toxic lesions to the human genome. If improperly repaired, DSBs can threaten genome integrity and contribute to premature ageing, neurodegenerative disorders and carcinogenesis. Through decades of work on genome stability, it has become evident that certain regions of the genome are inherently more prone to breakage than others, known as genome instability hotspots. Recent advancements in sequencing-based technologies now enable the profiling of genome-wide distributions of DSBs, also known as breakomes, to systematically map these instability hotspots. Here, we review the application of these technologies and their implications for our current understanding of the genomic regions most likely to drive genome instability. These breakomes ultimately highlight both new and established breakage hotspots including actively transcribed regions, loop boundaries and early-replicating regions of the genome. Further, these breakomes challenge the paradigm that DNA breakage primarily occurs in hard-to-replicate regions. With these advancements, we begin to gain insights into the biological mechanisms both invoking and protecting against genome instability.
Collapse
Affiliation(s)
- Xanita Saayman
- Sir William Dunn School of Pathology, University of Oxford, UK
| | - Fumiko Esashi
- Sir William Dunn School of Pathology, University of Oxford, UK
| |
Collapse
|
27
|
Altemose N, Glennis A, Bzikadze AV, Sidhwani P, Langley SA, Caldas GV, Hoyt SJ, Uralsky L, Ryabov FD, Shew CJ, Sauria MEG, Borchers M, Gershman A, Mikheenko A, Shepelev VA, Dvorkina T, Kunyavskaya O, Vollger MR, Rhie A, McCartney AM, Asri M, Lorig-Roach R, Shafin K, Aganezov S, Olson D, de Lima LG, Potapova T, Hartley GA, Haukness M, Kerpedjiev P, Gusev F, Tigyi K, Brooks S, Young A, Nurk S, Koren S, Salama SR, Paten B, Rogaev EI, Streets A, Karpen GH, Dernburg AF, Sullivan BA, Straight AF, Wheeler TJ, Gerton JL, Eichler EE, Phillippy AM, Timp W, Dennis MY, O'Neill RJ, Zook JM, Schatz MC, Pevzner PA, Diekhans M, Langley CH, Alexandrov IA, Miga KH. Complete genomic and epigenetic maps of human centromeres. Science 2022; 376:eabl4178. [PMID: 35357911 PMCID: PMC9233505 DOI: 10.1126/science.abl4178] [Citation(s) in RCA: 203] [Impact Index Per Article: 101.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Existing human genome assemblies have almost entirely excluded repetitive sequences within and near centromeres, limiting our understanding of their organization, evolution, and functions, which include facilitating proper chromosome segregation. Now, a complete, telomere-to-telomere human genome assembly (T2T-CHM13) has enabled us to comprehensively characterize pericentromeric and centromeric repeats, which constitute 6.2% of the genome (189.9 megabases). Detailed maps of these regions revealed multimegabase structural rearrangements, including in active centromeric repeat arrays. Analysis of centromere-associated sequences uncovered a strong relationship between the position of the centromere and the evolution of the surrounding DNA through layered repeat expansions. Furthermore, comparisons of chromosome X centromeres across a diverse panel of individuals illuminated high degrees of structural, epigenetic, and sequence variation in these complex and rapidly evolving regions.
Collapse
Affiliation(s)
- Nicolas Altemose
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - A. Glennis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Andrey V. Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California San Diego, La Jolla, CA, USA
| | - Pragya Sidhwani
- Department of Biochemistry, Stanford University, Stanford, CA, USA
| | - Sasha A. Langley
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Gina V. Caldas
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Savannah J. Hoyt
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Lev Uralsky
- Sirius University of Science and Technology, Sochi, Russia
- Vavilov Institute of General Genetics, Moscow, Russia
| | | | - Colin J. Shew
- Genome Center, MIND Institute, and Department of Biochemistry and Molecular Medicine, School of Medicine, University of California, Davis, Davis, CA, USA
| | | | | | - Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
| | - Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | | | - Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | - Olga Kunyavskaya
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
| | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ann M. McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Mobin Asri
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Ryan Lorig-Roach
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Kishwar Shafin
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Sergey Aganezov
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Daniel Olson
- Department of Computer Science, University of Montana, Missoula, MT. USA
| | | | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Gabrielle A. Hartley
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Marina Haukness
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | | | - Fedor Gusev
- Vavilov Institute of General Genetics, Moscow, Russia
| | - Kristof Tigyi
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Shelise Brooks
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alice Young
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sofie R. Salama
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA
| | - Evgeny I. Rogaev
- Sirius University of Science and Technology, Sochi, Russia
- Vavilov Institute of General Genetics, Moscow, Russia
- Department of Psychiatry, University of Massachusetts Medical School, Worcester, MA, USA
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Aaron Streets
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Gary H. Karpen
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- BioEngineering and BioMedical Sciences Department, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Abby F. Dernburg
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, USA
| | - Beth A. Sullivan
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC, USA
| | | | - Travis J. Wheeler
- Department of Computer Science, University of Montana, Missoula, MT. USA
| | - Jennifer L. Gerton
- Stowers Institute for Medical Research, Kansas City, MO, USA
- University of Kansas Medical School, Department of Biochemistry and Molecular Biology and Cancer Center, University of Kansas, Kansas City, KS, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, and Department of Biochemistry and Molecular Medicine, School of Medicine, University of California, Davis, Davis, CA, USA
| | - Rachel J. O'Neill
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Justin M. Zook
- Biosystems and Biomaterials Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California at San Diego, San Diego, CA, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Charles H. Langley
- Department of Evolution and Ecology, University of California Davis, Davis, CA, USA
| | - Ivan A. Alexandrov
- Vavilov Institute of General Genetics, Moscow, Russia
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg, Russia
- Research Center of Biotechnology of the Russian Academy of Sciences, Moscow, Russia
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA
| |
Collapse
|
28
|
Chardon F, Japaridze A, Witt H, Velikovsky L, Chakraborty C, Wilhelm T, Dumont M, Yang W, Kikuti C, Gangnard S, Mace AS, Wuite G, Dekker C, Fachinetti D. CENP-B-mediated DNA loops regulate activity and stability of human centromeres. Mol Cell 2022; 82:1751-1767.e8. [PMID: 35320753 DOI: 10.1016/j.molcel.2022.02.032] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 02/22/2022] [Accepted: 02/23/2022] [Indexed: 12/25/2022]
Abstract
Chromosome inheritance depends on centromeres, epigenetically specified regions of chromosomes. While conventional human centromeres are known to be built of long tandem DNA repeats, much of their architecture remains unknown. Using single-molecule techniques such as AFM, nanopores, and optical tweezers, we find that human centromeric DNA exhibits complex DNA folds such as local hairpins. Upon binding to a specific sequence within centromeric regions, the DNA-binding protein CENP-B compacts centromeres by forming pronounced DNA loops between the repeats, which favor inter-chromosomal centromere compaction and clustering. This DNA-loop-mediated organization of centromeric chromatin participates in maintaining centromere position and integrity upon microtubule pulling during mitosis. Our findings emphasize the importance of DNA topology in centromeric regulation and stability.
Collapse
Affiliation(s)
- Florian Chardon
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Aleksandre Japaridze
- Department of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, the Netherlands
| | - Hannes Witt
- Department of Physics and Astronomy, LaserLaB Amsterdam, Vrije Universiteit Amsterdam, De Boelelaan 1081, 1081 HV Amsterdam, the Netherlands
| | - Leonid Velikovsky
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Camellia Chakraborty
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Therese Wilhelm
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Marie Dumont
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Wayne Yang
- Department of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, the Netherlands
| | - Carlos Kikuti
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Stephane Gangnard
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Anne-Sophie Mace
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France
| | - Gijs Wuite
- Department of Physics and Astronomy, LaserLaB Amsterdam, Vrije Universiteit Amsterdam, De Boelelaan 1081, 1081 HV Amsterdam, the Netherlands
| | - Cees Dekker
- Department of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, the Netherlands
| | - Daniele Fachinetti
- Institut Curie, PSL Research University, CNRS, UMR 144, 26 rue d'Ulm, 75005 Paris, France.
| |
Collapse
|
29
|
The Psychoemotional Stress-Induced Changes in the Abundance of SatIII (1q12) and Telomere Repeats, but Not Ribosomal DNA, in Human Leukocytes. Genes (Basel) 2022; 13:genes13020343. [PMID: 35205387 PMCID: PMC8872136 DOI: 10.3390/genes13020343] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 02/01/2022] [Accepted: 02/11/2022] [Indexed: 02/04/2023] Open
Abstract
INTRODUCTION. As shown earlier, copy number variations (CNV) in the human satellite III (1q12) fragment (f-SatIII) and the telomere repeat (TR) reflects the cell’s response to oxidative stress. The contents of f-SatIII and TR in schizophrenic (SZ) patients were found to be lower than in healthy controls (HC) in previous studies. The major question of this study was: ‘What are the f-SatIII and TR CNV dynamic changes in human leukocytes, depending on psychoemotional stress?’ MATERIALS AND METHODS. We chose a model of psychoemotional stress experienced by second-year medical students during their exams. Blood samples were taken in stressful conditions (exams) and in a control non-stressful period. Biotinylated probes were used for f-SatIII, rDNA, and TR quantitation in leukocyte DNA by non-radioactive quantitative hybridization in SZ patients (n = 97), HC (n = 97), and medical students (n = 17, n = 42). A flow cytometry analysis was used for the oxidative stress marker (NOX4, 8-oxodG, and γH2AX) detection in the lymphocytes of the three groups. RESULTS. Oxidative stress markers increased significantly in the students’ lymphocytes during psychoemotional stress. The TR and f-SatIII, but not the rDNA, contents significantly changed in the DNA isolated from human blood leukocytes. After a restoration period (post-examinational vacations), the f-SatIII content decreased, and the TR content increased. Changes in the blood cells of students during examinational stress were similar to those in SZ patients during an exacerbation of the disease. CONCLUSIONS. Psychoemotional stress in students during exams triggers a universal mechanism of oxidative stress. The oxidative stress causes significant changes in the f-SatIII and TR contents, while the ribosomal repeat content remains stable. A hypothesis is proposed to explain the quantitative polymorphisms of f-SatIII and TR contents under transient (e.g., students’ exams) or chronic (in SZ patients) stress. The changes in the f-SatIII and TR copy numbers are non-specific events, irrespective of the source of stress. Thus, our findings suggest that the psychoemotional stress, common in SZ patients and healthy students during exams, but not in a schizophrenia-specific event, was responsible for the changes in the repeat contents that we observed earlier in SZ patients.
Collapse
|
30
|
Pan W, Ruan J. En Route to Completion: What Is An Ideal Reference Genome? GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:1-3. [PMID: 34509700 PMCID: PMC9510861 DOI: 10.1016/j.gpb.2021.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 09/02/2021] [Indexed: 11/12/2022]
Affiliation(s)
- Weihua Pan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China.
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China.
| |
Collapse
|
31
|
Vojvoda Zeljko T, Ugarković Đ, Pezer Ž. Differential enrichment of H3K9me3 at annotated satellite DNA repeats in human cell lines and during fetal development in mouse. Epigenetics Chromatin 2021; 14:47. [PMID: 34663449 PMCID: PMC8524813 DOI: 10.1186/s13072-021-00423-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 10/05/2021] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Trimethylation of histone H3 on lysine 9 (H3K9me3) at satellite DNA sequences has been primarily studied at (peri)centromeric regions, where its level shows differences associated with various processes such as development and malignant transformation. However, the dynamics of H3K9me3 at distal satellite DNA repeats has not been thoroughly investigated. RESULTS We exploit the sets of publicly available data derived from chromatin immunoprecipitation combined with massively parallel DNA sequencing (ChIP-Seq), produced by the The Encyclopedia of DNA Elements (ENCODE) project, to analyze H3K9me3 at assembled satellite DNA repeats in genomes of human cell lines and during mouse fetal development. We show that annotated satellite elements are generally enriched for H3K9me3, but its level in cancer cell lines is on average lower than in normal cell lines. We find 407 satellite DNA instances with differential H3K9me3 enrichment between cancer and normal cells including a large 115-kb cluster of GSATII elements on chromosome 12. Differentially enriched regions are not limited to satellite DNA instances, but instead encompass a wider region of flanking sequences. We found no correlation between the levels of H3K9me3 and noncoding RNA at corresponding satellite DNA loci. The analysis of data derived from multiple tissues identified 864 instances of satellite DNA sequences in the mouse reference genome that are differentially enriched between fetal developmental stages. CONCLUSIONS Our study reveals significant differences in H3K9me3 level at a subset of satellite repeats between biological states and as such contributes to understanding of the role of satellite DNA repeats in epigenetic regulation during development and carcinogenesis.
Collapse
Affiliation(s)
| | | | - Željka Pezer
- Ruđer Bošković Institute, Bijenička 54, 10000, Zagreb, Croatia.
| |
Collapse
|
32
|
Suzuki Y, Morishita S. The time is ripe to investigate human centromeres by long-read sequencing†. DNA Res 2021; 28:6381569. [PMID: 34609504 PMCID: PMC8502840 DOI: 10.1093/dnares/dsab021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 09/28/2021] [Indexed: 01/05/2023] Open
Abstract
The complete sequencing of human centromeres, which are filled with highly repetitive elements, has long been challenging. In human centromeres, α-satellite monomers of about 171 bp in length are the basic repeating units, but α-satellite monomers constitute the higher-order repeat (HOR) units, and thousands of copies of highly homologous HOR units form large arrays, which have hampered sequence assembly of human centromeres. Because most HOR unit occurrences are covered by long reads of about 10 kb, the recent availability of much longer reads is expected to enable observation of individual HOR occurrences in terms of their single-nucleotide or structural variants. The time has come to examine the complete sequence of human centromeres.
Collapse
Affiliation(s)
- Yuta Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8568, Japan
| | - Shinichi Morishita
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8568, Japan
| |
Collapse
|
33
|
The Role of Human Satellite III (1q12) Copy Number Variation in the Adaptive Response during Aging, Stress, and Pathology: A Pendulum Model. Genes (Basel) 2021; 12:genes12101524. [PMID: 34680920 PMCID: PMC8535310 DOI: 10.3390/genes12101524] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Revised: 09/25/2021] [Accepted: 09/26/2021] [Indexed: 12/18/2022] Open
Abstract
The pericentric satellite III (SatIII or Sat3) and II tandem repeats recently appeared to be transcribed under stress conditions, and the transcripts were shown to play an essential role in the universal stress response. In this paper, we review the role of human-specific SatIII copy number variation (CNV) in normal stress response, aging and pathology, with a focus on 1q12 loci. We postulate a close link between transcription of SatII/III repeats and their CNV. The accrued body of data suggests a hypothetical universal mechanism, which provides for SatIII copy gain during the stress response, alongside with another, more hypothetical reverse mechanism that might reduce the mean SatIII copy number, likely via the selection of cells with excessively large 1q12 loci. Both mechanisms, working alternatively like swings of the pendulum, may ensure the balance of SatIII copy numbers and optimum stress resistance. This model is verified on the most recent data on SatIII CNV in pathology and therapy, aging, senescence and response to genotoxic stress in vitro.
Collapse
|
34
|
Flynn JM, Brown EJ, Clark AG. Copy number evolution in simple and complex tandem repeats across the C57BL/6 and C57BL/10 inbred mouse lines. G3 GENES|GENOMES|GENETICS 2021; 11:6287064. [PMID: 34849804 PMCID: PMC8496272 DOI: 10.1093/g3journal/jkab184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 05/25/2021] [Indexed: 12/01/2022]
Abstract
Simple sequence tandem repeats are among the most rapidly evolving compartments of the genome. Some repeat expansions are associated with mammalian disease or meiotic segregation distortion, yet the rates of copy number change across generations are not well known. Here, we use 14 distinct sublineages of the C57BL/6 and C57BL/10 inbred mouse strains, which have been evolving independently over about 300 generations, to estimate the rates of copy number changes in genome-wide tandem repeats. Rates of change varied across repeats and across lines. Notably, CAG, whose expansions in coding regions are associated with many neurological and genetic disorders, was highly stable in copy number, likely indicating stabilizing selection. Rates of change were positively correlated with copy number, but the direction and magnitude of changes varied across lines. Some mouse lines experienced consistent losses or gains across most simple repeats, but this did not correlate with copy number changes in complex repeats. Rates of copy number change were similar between simple repeats and the more abundant complex repeats after normalization by copy number. Finally, the Y-specific centromeric repeat had a fourfold higher rate of change than the homologous centromeric repeat on other chromosomes. Structural differences in satellite complexity, or restriction to the Y chromosome and elevated mutation rates of the male germline, may explain the higher rate of change. Overall, our work underscores the mutational fluidity of long tandem arrays of repeats, and the correlations and constraints between genome-wide tandem repeats, which suggest that turnover is not a completely neutral process.
Collapse
Affiliation(s)
- Jullien M Flynn
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Emily J Brown
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
35
|
Dvorkina T, Kunyavskaya O, Bzikadze AV, Alexandrov I, Pevzner PA. CentromereArchitect: inference and analysis of the architecture of centromeres. Bioinformatics 2021; 37:i196-i204. [PMID: 34252949 PMCID: PMC8336445 DOI: 10.1093/bioinformatics/btab265] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Motivation Recent advances in long-read sequencing technologies led to rapid progress in centromere assembly in the last year and, for the first time, opened a possibility to address the long-standing questions about the architecture and evolution of human centromeres. However, since these advances have not been yet accompanied by the development of the centromere-specific bioinformatics algorithms, even the fundamental questions (e.g. centromere annotation by deriving the complete set of human monomers and high-order repeats), let alone more complex questions (e.g. explaining how monomers and high-order repeats evolved) about human centromeres remain open. Moreover, even though there was a four-decade-long series of studies aimed at cataloging all human monomers and high-order repeats, the rigorous algorithmic definitions of these concepts are still lacking. Thus, the development of a centromere annotation tool is a prerequisite for follow-up personalized biomedical studies of centromeres across the human population and evolutionary studies of centromeres across various species. Results We describe the CentromereArchitect, the first tool for the centromere annotation in a newly sequenced genome, apply it to the recently generated complete assembly of a human genome by the Telomere-to-Telomere consortium, generate the complete set of human monomers and high-order repeats for ‘live’ centromeres, and reveal a vast set of hybrid monomers that may represent the focal points of centromere evolution. Availability and implementation CentromereArchitect is publicly available on https://github.com/ablab/stringdecomposer/tree/ismb2021 Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Olga Kunyavskaya
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Andrey V Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, CA 92093, USA
| | - Ivan Alexandrov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, CA 92093, USA
| |
Collapse
|
36
|
Pappalardo XG, Barra V. Losing DNA methylation at repetitive elements and breaking bad. Epigenetics Chromatin 2021; 14:25. [PMID: 34082816 PMCID: PMC8173753 DOI: 10.1186/s13072-021-00400-z] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Accepted: 05/21/2021] [Indexed: 02/08/2023] Open
Abstract
Background DNA methylation is an epigenetic chromatin mark that allows heterochromatin formation and gene silencing. It has a fundamental role in preserving genome stability (including chromosome stability) by controlling both gene expression and chromatin structure. Therefore, the onset of an incorrect pattern of DNA methylation is potentially dangerous for the cells. This is particularly important with respect to repetitive elements, which constitute the third of the human genome. Main body Repetitive sequences are involved in several cell processes, however, due to their intrinsic nature, they can be a source of genome instability. Thus, most repetitive elements are usually methylated to maintain a heterochromatic, repressed state. Notably, there is increasing evidence showing that repetitive elements (satellites, long interspersed nuclear elements (LINEs), Alus) are frequently hypomethylated in various of human pathologies, from cancer to psychiatric disorders. Repetitive sequences’ hypomethylation correlates with chromatin relaxation and unscheduled transcription. If these alterations are directly involved in human diseases aetiology and how, is still under investigation. Conclusions Hypomethylation of different families of repetitive sequences is recurrent in many different human diseases, suggesting that the methylation status of these elements can be involved in preservation of human health. This provides a promising point of view towards the research of therapeutic strategies focused on specifically tuning DNA methylation of DNA repeats.
Collapse
Affiliation(s)
- Xena Giada Pappalardo
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), University of Catania, 95125, Catania, Italy.,National Council of Research, Institute for Biomedical Research and Innovation (IRIB), Unit of Catania, 95125, Catania, Italy
| | - Viviana Barra
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, 90128, Palermo, Italy.
| |
Collapse
|
37
|
Ishimoto R, Tsuzuki Y, Matsumura T, Kurashige S, Enokitani K, Narimatsu K, Higa M, Sugimoto N, Yoshida K, Fujita M. SLX4-XPF mediates DNA damage responses to replication stress induced by DNA-protein interactions. J Cell Biol 2021; 220:211628. [PMID: 33347546 PMCID: PMC7754685 DOI: 10.1083/jcb.202003148] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Revised: 10/05/2020] [Accepted: 11/13/2020] [Indexed: 12/20/2022] Open
Abstract
The DNA damage response (DDR) has a critical role in the maintenance of genomic integrity during chromosome replication. However, responses to replication stress evoked by tight DNA–protein complexes have not been fully elucidated. Here, we used bacterial LacI protein binding to lacO arrays to make site-specific replication fork barriers on the human chromosome. These barriers induced the accumulation of single-stranded DNA (ssDNA) and various DDR proteins at the lacO site. SLX4–XPF functioned as an upstream factor for the accumulation of DDR proteins, and consequently, ATR and FANCD2 were interdependently recruited. Moreover, LacI binding in S phase caused underreplication and abnormal mitotic segregation of the lacO arrays. Finally, we show that the SLX4–ATR axis represses the anaphase abnormality induced by LacI binding. Our results outline a long-term process by which human cells manage nucleoprotein obstacles ahead of the replication fork to prevent chromosomal instability.
Collapse
Affiliation(s)
- Riko Ishimoto
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Yota Tsuzuki
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Tomoki Matsumura
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Seiichiro Kurashige
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Kouki Enokitani
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Koki Narimatsu
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Mitsunori Higa
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Nozomi Sugimoto
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Kazumasa Yoshida
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| | - Masatoshi Fujita
- Department of Cellular Biochemistry, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka, Japan
| |
Collapse
|
38
|
Replication initiation: Implications in genome integrity. DNA Repair (Amst) 2021; 103:103131. [PMID: 33992866 DOI: 10.1016/j.dnarep.2021.103131] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Revised: 05/07/2021] [Accepted: 05/07/2021] [Indexed: 02/01/2023]
Abstract
In every cell cycle, billions of nucleotides need to be duplicated within hours, with extraordinary precision and accuracy. The molecular mechanism by which cells regulate the replication event is very complicated, and the entire process begins way before the onset of S phase. During the G1 phase of the cell cycle, cells prepare by assembling essential replication factors to establish the pre-replicative complex at origins, sites that dictate where replication would initiate during S phase. During S phase, the replication process is tightly coupled with the DNA repair system to ensure the fidelity of replication. Defects in replication and any error must be recognized by DNA damage response and checkpoint signaling pathways in order to halt the cell cycle before cells are allowed to divide. The coordination of these processes throughout the cell cycle is therefore critical to achieve genomic integrity and prevent diseases. In this review, we focus on the current understanding of how the replication initiation events are regulated to achieve genome stability.
Collapse
|
39
|
Lopes M, Louzada S, Gama-Carvalho M, Chaves R. Genomic Tackling of Human Satellite DNA: Breaking Barriers through Time. Int J Mol Sci 2021; 22:4707. [PMID: 33946766 PMCID: PMC8125562 DOI: 10.3390/ijms22094707] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 04/24/2021] [Accepted: 04/27/2021] [Indexed: 12/12/2022] Open
Abstract
(Peri)centromeric repetitive sequences and, more specifically, satellite DNA (satDNA) sequences, constitute a major human genomic component. SatDNA sequences can vary on a large number of features, including nucleotide composition, complexity, and abundance. Several satDNA families have been identified and characterized in the human genome through time, albeit at different speeds. Human satDNA families present a high degree of sub-variability, leading to the definition of various subfamilies with different organization and clustered localization. Evolution of satDNA analysis has enabled the progressive characterization of satDNA features. Despite recent advances in the sequencing of centromeric arrays, comprehensive genomic studies to assess their variability are still required to provide accurate and proportional representation of satDNA (peri)centromeric/acrocentric short arm sequences. Approaches combining multiple techniques have been successfully applied and seem to be the path to follow for generating integrated knowledge in the promising field of human satDNA biology.
Collapse
Affiliation(s)
- Mariana Lopes
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Sandra Louzada
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Margarida Gama-Carvalho
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Raquel Chaves
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| |
Collapse
|
40
|
Giunta S, Hervé S, White RR, Wilhelm T, Dumont M, Scelfo A, Gamba R, Wong CK, Rancati G, Smogorzewska A, Funabiki H, Fachinetti D. CENP-A chromatin prevents replication stress at centromeres to avoid structural aneuploidy. Proc Natl Acad Sci U S A 2021; 118:e2015634118. [PMID: 33653953 PMCID: PMC7958389 DOI: 10.1073/pnas.2015634118] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Chromosome segregation relies on centromeres, yet their repetitive DNA is often prone to aberrant rearrangements under pathological conditions. Factors that maintain centromere integrity to prevent centromere-associated chromosome translocations are unknown. Here, we demonstrate the importance of the centromere-specific histone H3 variant CENP-A in safeguarding DNA replication of alpha-satellite repeats to prevent structural aneuploidy. Rapid removal of CENP-A in S phase, but not other cell-cycle stages, caused accumulation of R loops with increased centromeric transcripts, and interfered with replication fork progression. Replication without CENP-A causes recombination at alpha-satellites in an R loop-dependent manner, unfinished replication, and anaphase bridges. In turn, chromosome breakage and translocations arise specifically at centromeric regions. Our findings provide insights into how specialized centromeric chromatin maintains the integrity of transcribed noncoding repetitive DNA during S phase.
Collapse
Affiliation(s)
- Simona Giunta
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, New York, NY 10065;
| | - Solène Hervé
- Institut Curie, PSL Research University, CNRS, UMR 144, F-75005 Paris, France
| | - Ryan R White
- Laboratory of Genome Maintenance, The Rockefeller University, New York, NY 10065
| | - Therese Wilhelm
- Institut Curie, PSL Research University, CNRS, UMR 144, F-75005 Paris, France
| | - Marie Dumont
- Institut Curie, PSL Research University, CNRS, UMR 144, F-75005 Paris, France
| | - Andrea Scelfo
- Institut Curie, PSL Research University, CNRS, UMR 144, F-75005 Paris, France
| | - Riccardo Gamba
- Institut Curie, PSL Research University, CNRS, UMR 144, F-75005 Paris, France
| | - Cheng Kit Wong
- Institute of Medical Biology, Agency for Science, Technology and Research (A*STAR), 138648 Singapore, Singapore
| | - Giulia Rancati
- Institute of Medical Biology, Agency for Science, Technology and Research (A*STAR), 138648 Singapore, Singapore
| | - Agata Smogorzewska
- Laboratory of Genome Maintenance, The Rockefeller University, New York, NY 10065
| | - Hironori Funabiki
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, New York, NY 10065;
| | - Daniele Fachinetti
- Institut Curie, PSL Research University, CNRS, UMR 144, F-75005 Paris, France;
| |
Collapse
|
41
|
Dvorkina T, Bzikadze AV, Pevzner PA. The string decomposition problem and its applications to centromere analysis and assembly. Bioinformatics 2021; 36:i93-i101. [PMID: 32657390 PMCID: PMC7428072 DOI: 10.1093/bioinformatics/btaa454] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Motivation Recent attempts to assemble extra-long tandem repeats (such as centromeres) faced the challenge of translating long error-prone reads from the nucleotide alphabet into the alphabet of repeat units. Human centromeres represent a particularly complex type of high-order repeats (HORs) formed by chromosome-specific monomers. Given a set of all human monomers, translating a read from a centromere into the monomer alphabet is modeled as the String Decomposition Problem. The accurate translation of reads into the monomer alphabet turns the notoriously difficult problem of assembling centromeres from reads (in the nucleotide alphabet) into a more tractable problem of assembling centromeres from translated reads. Results We describe a StringDecomposer (SD) algorithm for solving this problem, benchmark it on the set of long error-prone Oxford Nanopore reads generated by the Telomere-to-Telomere consortium and identify a novel (rare) monomer that extends the set of known X-chromosome specific monomers. Our identification of a novel monomer emphasizes the importance of identification of all (even rare) monomers for future centromere assembly efforts and evolutionary studies. To further analyze novel monomers, we applied SD to the set of recently generated long accurate Pacific Biosciences HiFi reads. This analysis revealed that the set of known human monomers and HORs remains incomplete. SD opens a possibility to generate a complete set of human monomers and HORs for using in the ongoing efforts to generate the complete assembly of the human genome. Availability and implementation StringDecomposer is publicly available on https://github.com/ablab/stringdecomposer. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Andrey V Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, CA 92093, USA
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, CA 92093, USA
| |
Collapse
|
42
|
Suzuki Y, Myers EW, Morishita S. Rapid and ongoing evolution of repetitive sequence structures in human centromeres. SCIENCE ADVANCES 2020; 6:6/50/eabd9230. [PMID: 33310858 PMCID: PMC7732198 DOI: 10.1126/sciadv.abd9230] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 10/30/2020] [Indexed: 06/12/2023]
Abstract
Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.
Collapse
Affiliation(s)
- Yuta Suzuki
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Shinichi Morishita
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| |
Collapse
|
43
|
Bzikadze AV, Pevzner PA. Automated assembly of centromeres from ultra-long error-prone reads. Nat Biotechnol 2020; 38:1309-1316. [PMID: 32665660 PMCID: PMC10718184 DOI: 10.1038/s41587-020-0582-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2019] [Accepted: 05/29/2020] [Indexed: 12/12/2022]
Abstract
Centromeric variation has been linked to cancer and infertility, but centromere sequences contain multiple tandem repeats and can only be assembled manually from long error-prone reads. Here we describe the centroFlye algorithm for centromere assembly using long error-prone reads, and apply it to assemble human centromeres on chromosomes 6 and X. Our analyses reveal putative breakpoints in the manual reconstruction of the human X centromere, demonstrate that human X chromosome is partitioned into repeat subfamilies and provide initial insights into centromere evolution. We anticipate that centroFlye could be applied to automatically close remaining multimegabase gaps in the reference human genome.
Collapse
Affiliation(s)
- Andrey V Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California San Diego, La Jolla, CA, USA
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
44
|
Balzano E, Pelliccia F, Giunta S. Genome (in)stability at tandem repeats. Semin Cell Dev Biol 2020; 113:97-112. [PMID: 33109442 DOI: 10.1016/j.semcdb.2020.10.003] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 09/26/2020] [Accepted: 10/10/2020] [Indexed: 12/12/2022]
Abstract
Repeat sequences account for over half of the human genome and represent a significant source of variation that underlies physiological and pathological states. Yet, their study has been hindered due to limitations in short-reads sequencing technology and difficulties in assembly. A important category of repetitive DNA in the human genome is comprised of tandem repeats (TRs), where repetitive units are arranged in a head-to-tail pattern. Compared to other regions of the genome, TRs carry between 10 and 10,000 fold higher mutation rate. There are several mutagenic mechanisms that can give rise to this propensity toward instability, but their precise contribution remains speculative. Given the high degree of homology between these sequences and their arrangement in tandem, once damaged, TRs have an intrinsic propensity to undergo aberrant recombination with non-allelic exchange and generate harmful rearrangements that may undermine the stability of the entire genome. The dynamic mutagenesis at TRs has been found to underlie individual polymorphism associated with neurodegenerative and neuromuscular disorders, as well as complex genetic diseases like cancer and diabetes. Here, we review our current understanding of the surveillance and repair mechanisms operating within these regions, and we describe how alterations in these protective processes can readily trigger mutational signatures found at TRs, ultimately resulting in the pathological correlation between TRs instability and human diseases. Finally, we provide a viewpoint to counter the detrimental effects that TRs pose in light of their selection and conservation, as important drivers of human evolution.
Collapse
Affiliation(s)
- Elisa Balzano
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy
| | - Franca Pelliccia
- Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy
| | - Simona Giunta
- The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA; Dipartimento di Biologia e Biotecnologie "Charles Darwin", Sapienza Università di Roma, 00185 Roma, Italy.
| |
Collapse
|
45
|
Ershova ES, Malinovskaya EM, Golimbet VE, Lezheiko TV, Zakharova NV, Shmarina GV, Veiko RV, Umriukhin PE, Kostyuk GP, Kutsev SI, Izhevskaya VL, Veiko NN, Kostyuk SV. Copy number variations of satellite III (1q12) and ribosomal repeats in health and schizophrenia. Schizophr Res 2020; 223:199-212. [PMID: 32773342 DOI: 10.1016/j.schres.2020.07.022] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Revised: 06/16/2020] [Accepted: 07/26/2020] [Indexed: 12/30/2022]
Abstract
OBJECTIVE Earlier we studied the copy number variations (CNVs) of ribosomal repeat (rDNA) and the satellite III fragment (1q12) (f-SatIII) in the cells of schizophrenia patients (SZ) and healthy controls (HC). In the present study we pursued two main objectives: (1) to confirm the increased rDNA and decreased f-SatIII content in the genomes of enlarged SZ and HC samples and (2) to compare the rDNA and f-SatIII content in the same DNA samples of SZ and HC individuals. METHODS We determined the rDNA CN and f-SatIII content in the genomes of leukocytes of 1770 subjects [HC (N = 814) and SZ (N = 956)]. Non-radioactive quantitative hybridization method (NQH) was applied for analysis of the various combinations of the two repeats sizes in SZ and HC groups. RESULTS f-SatIII in human leukocytes (N = 1556) varies between 5.7 and 44.7 pg/ng DNA. RDNA CN varies between 200 and 896 (N = 1770). SZ group significantly differ from the HC group by lower f-SatIII content and by rDNA abundance. The f-SatIII and rDNA CN are not randomly combined in the genome. Higher rDNA CN values are associated with higher f-SatIII index values in SZ and HC. The f-SatIII variation interval in SZ group increases significantly in the subgroup with the high rDNA CN index values (>300 copies). CONCLUSION Schizophrenia patients' genomes contain low number of f-SatIII copies corresponding with a large ribosomal repeats CN. A scheme is proposed to explain the low f-SatIII content in SZ group against the background of high rDNA CN.
Collapse
Affiliation(s)
- E S Ershova
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia; I.M. Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| | - E M Malinovskaya
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia
| | - V E Golimbet
- Mental Health Research Center, Department of Clinical Genetics, Moscow, Russia
| | - T V Lezheiko
- Mental Health Research Center, Department of Clinical Genetics, Moscow, Russia
| | - N V Zakharova
- N. A. Alexeev Clinical Psychiatric Hospital №1, Moscow Healthcare Department, Moscow, Russia
| | - G V Shmarina
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia; I.M. Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| | - R V Veiko
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia
| | - P E Umriukhin
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia; I.M. Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia; P.K. Anokhin Institute of Normal Physiology, Moscow, Russia.
| | - G P Kostyuk
- N. A. Alexeev Clinical Psychiatric Hospital №1, Moscow Healthcare Department, Moscow, Russia
| | - S I Kutsev
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia
| | - V L Izhevskaya
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia
| | - N N Veiko
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia
| | - S V Kostyuk
- Research Centre for Medical Genetics, Department of Molecular Biology, Moscow, Russia; I.M. Sechenov First Moscow State Medical University (Sechenov University), Moscow, Russia
| |
Collapse
|
46
|
Balzano E, Giunta S. Centromeres under Pressure: Evolutionary Innovation in Conflict with Conserved Function. Genes (Basel) 2020; 11:E912. [PMID: 32784998 PMCID: PMC7463522 DOI: 10.3390/genes11080912] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 08/04/2020] [Accepted: 08/04/2020] [Indexed: 12/22/2022] Open
Abstract
Centromeres are essential genetic elements that enable spindle microtubule attachment for chromosome segregation during mitosis and meiosis. While this function is preserved across species, centromeres display an array of dynamic features, including: (1) rapidly evolving DNA; (2) wide evolutionary diversity in size, shape and organization; (3) evidence of mutational processes to generate homogenized repetitive arrays that characterize centromeres in several species; (4) tolerance to changes in position, as in the case of neocentromeres; and (5) intrinsic fragility derived by sequence composition and secondary DNA structures. Centromere drive underlies rapid centromere DNA evolution due to the "selfish" pursuit to bias meiotic transmission and promote the propagation of stronger centromeres. Yet, the origins of other dynamic features of centromeres remain unclear. Here, we review our current understanding of centromere evolution and plasticity. We also detail the mutagenic processes proposed to shape the divergent genetic nature of centromeres. Changes to centromeres are not simply evolutionary relics, but ongoing shifts that on one side promote centromere flexibility, but on the other can undermine centromere integrity and function with potential pathological implications such as genome instability.
Collapse
Affiliation(s)
- Elisa Balzano
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, 00185 Roma, Italy;
| | - Simona Giunta
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
| |
Collapse
|
47
|
Leo L, Marchetti M, Giunta S, Fanti L. Epigenetics as an Evolutionary Tool for Centromere Flexibility. Genes (Basel) 2020; 11:genes11070809. [PMID: 32708654 PMCID: PMC7397245 DOI: 10.3390/genes11070809] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 07/11/2020] [Accepted: 07/13/2020] [Indexed: 12/31/2022] Open
Abstract
Centromeres are the complex structures responsible for the proper segregation of chromosomes during cell division. Structural or functional alterations of the centromere cause aneuploidies and other chromosomal aberrations that can induce cell death with consequences on health and survival of the organism as a whole. Because of their essential function in the cell, centromeres have evolved high flexibility and mechanisms of tolerance to preserve their function following stress, whether it is originating from within or outside the cell. Here, we review the main epigenetic mechanisms of centromeres’ adaptability to preserve their functional stability, with particular reference to neocentromeres and holocentromeres. The centromere position can shift in response to altered chromosome structures, but how and why neocentromeres appear in a given chromosome region are still open questions. Models of neocentromere formation developed during the last few years will be hereby discussed. Moreover, we will discuss the evolutionary significance of diffuse centromeres (holocentromeres) in organisms such as nematodes. Despite the differences in DNA sequences, protein composition and centromere size, all of these diverse centromere structures promote efficient chromosome segregation, balancing genome stability and adaptability, and ensuring faithful genome inheritance at each cellular generation.
Collapse
Affiliation(s)
- Laura Leo
- Istituto Pasteur Italia, Dipartimento di Biologia e Biotecnologie “Charles Darwin”, “Sapienza” University of Rome, 00185 Rome, Italy; (L.L.); (M.M.); (S.G.)
| | - Marcella Marchetti
- Istituto Pasteur Italia, Dipartimento di Biologia e Biotecnologie “Charles Darwin”, “Sapienza” University of Rome, 00185 Rome, Italy; (L.L.); (M.M.); (S.G.)
| | - Simona Giunta
- Istituto Pasteur Italia, Dipartimento di Biologia e Biotecnologie “Charles Darwin”, “Sapienza” University of Rome, 00185 Rome, Italy; (L.L.); (M.M.); (S.G.)
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, New York, NY 10065, USA
| | - Laura Fanti
- Istituto Pasteur Italia, Dipartimento di Biologia e Biotecnologie “Charles Darwin”, “Sapienza” University of Rome, 00185 Rome, Italy; (L.L.); (M.M.); (S.G.)
- Correspondence:
| |
Collapse
|
48
|
Mikheenko A, Bzikadze AV, Gurevich A, Miga KH, Pevzner PA. TandemTools: mapping long reads and assessing/improving assembly quality in extra-long tandem repeats. Bioinformatics 2020; 36:i75-i83. [PMID: 32657355 PMCID: PMC7355294 DOI: 10.1093/bioinformatics/btaa440] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION Extra-long tandem repeats (ETRs) are widespread in eukaryotic genomes and play an important role in fundamental cellular processes, such as chromosome segregation. Although emerging long-read technologies have enabled ETR assemblies, the accuracy of such assemblies is difficult to evaluate since there are no tools for their quality assessment. Moreover, since the mapping of error-prone reads to ETRs remains an open problem, it is not clear how to polish draft ETR assemblies. RESULTS To address these problems, we developed the TandemTools software that includes the TandemMapper tool for mapping reads to ETRs and the TandemQUAST tool for polishing ETR assemblies and their quality assessment. We demonstrate that TandemTools not only reveals errors in ETR assemblies but also improves the recently generated assemblies of human centromeres. AVAILABILITY AND IMPLEMENTATION https://github.com/ablab/TandemTools. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Andrey V Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego, CA 92093, USA
| | - Alexey Gurevich
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199034, Russia
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, CA 92093, USA
| |
Collapse
|
49
|
Konkova MS, Ershova ES, Savinova EA, Malinovskaya EM, Shmarina GV, Martynov AV, Veiko RV, Zakharova NV, Umriukhin P, Kostyuk GP, Izhevskaya VL, Kutsev SI, Veiko NN, Kostyuk SV. 1Q12 Loci Movement in the Interphase Nucleus Under the Action of ROS Is an Important Component of the Mechanism That Determines Copy Number Variation of Satellite III (1q12) in Health and Schizophrenia. Front Cell Dev Biol 2020; 8:386. [PMID: 32714923 PMCID: PMC7346584 DOI: 10.3389/fcell.2020.00386] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 04/29/2020] [Indexed: 12/30/2022] Open
Abstract
Introduction: Genome repeat cluster sizes can affect the chromatin spatial configuration and function. Low-dose ionizing radiation (IR) induces an adaptive response (AR) in human cells. AR includes the change in chromatin spatial configuration that is necessary to change the expression profile of the genome in response to stress. The 1q12 heterochromatin loci movement from the periphery to the center of the nucleus is a marker of the chromatin configuration change. We hypothesized that a large 1q12 domain could affect chromatin movement, thereby inhibiting the AR. Materials and Methods: 2D fluorescent in situ hybridization (FISH) method was used for the satellite III fragment from the 1q12 region (f-SatIII) localization analysis in the interphase nuclei of healthy control (HC) lymphocytes, schizophrenia (SZ) patients, and in cultured mesenchymal stem cells (MSCs). The localization of the nucleolus was analyzed by the nucleolus Ag staining. The non-radioactive quantitative hybridization (NQH) technique was used for the f-SatIII fragment content in DNA analysis. Satellite III fragments transcription was analyzed by reverse transcriptase quantitative PCR (RT-qPCR). Results: Low-dose IR induces the small-area 1q12 domains movement from the periphery to the central regions of the nucleus in HC lymphocytes and MSCs. Simultaneously, nucleolus moves from the nucleus center toward the nuclear envelope. The nucleolus in that period increases. The distance between the 1q12 domain and the nucleolus in irradiated cells is significantly reduced. The large-area 1q12 domains do not move in response to stress. During prolonged cultivation, the irradiated cells with a large f-SatIII amount die, and the population is enriched with the cells with low f-SatIII content. IR induces satellite III transcription in HC lymphocytes. Intact SZ patients' lymphocytes have the same signs of nuclei activation as irradiated HC cells. Conclusion: When a cell population responds to stress, cells are selected according to the size of the 1q12 domain (the f-SatIII content). The low content of the f-SatIII repeat in SZ patients may be a consequence of the chronic oxidative stress and of a large copies number of the ribosomal repeats.
Collapse
Affiliation(s)
- Marina Sergeevna Konkova
- Federal State Budgetary Scientific Institution, Research Centre for Medical Genetics, Moscow, Russia
| | | | | | | | | | | | - Roman Vladimirovich Veiko
- Federal State Budgetary Scientific Institution, Research Centre for Medical Genetics, Moscow, Russia
| | | | - Pavel Umriukhin
- Federal State Budgetary Scientific Institution, Research Centre for Medical Genetics, Moscow, Russia
- I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation, Moscow, Russia
- P.K. Anokhin Institute of Normal Physiology, Moscow, Russia
| | | | | | - Sergey Ivanovich Kutsev
- Federal State Budgetary Scientific Institution, Research Centre for Medical Genetics, Moscow, Russia
| | - Natalia Nikolaevna Veiko
- Federal State Budgetary Scientific Institution, Research Centre for Medical Genetics, Moscow, Russia
| | | |
Collapse
|
50
|
Miga KH. Centromere studies in the era of 'telomere-to-telomere' genomics. Exp Cell Res 2020; 394:112127. [PMID: 32504677 DOI: 10.1016/j.yexcr.2020.112127] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 05/23/2020] [Accepted: 05/30/2020] [Indexed: 12/17/2022]
Abstract
We are entering into an exciting era of genomics where truly complete, high-quality assemblies of human chromosomes are available end-to-end, or from 'telomere-to-telomere' (T2T). This technological advance offers a new opportunity to include endogenous human centromeric regions in high-resolution, sequence-based studies. These emerging reference maps are expected to reveal a new functional landscape in the human genome, where centromere proteins, transcriptional regulation, and spatial organization can be examined with base-level resolution across different stages of development and disease. Such studies will depend on innovative assembly methods of extremely long tandem repeats (ETRs), or satellite DNAs, paired with the development of new, orthogonal validation methods to ensure accuracy and completeness. This review reflects the progress in centromere genomics, credited by recent advancements in long-read sequencing and assembly methods. In doing so, I will discuss the challenges that remain and the promise for a new period of scientific discovery for satellite DNA biology and centromere function.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, CA, 95064, USA.
| |
Collapse
|