1
|
Linear interaction between replication and transcription shapes DNA break dynamics at recurrent DNA break Clusters. Nat Commun 2024; 15:3594. [PMID: 38678011 PMCID: PMC11055891 DOI: 10.1038/s41467-024-47934-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 04/12/2024] [Indexed: 04/29/2024] Open
Abstract
Recurrent DNA break clusters (RDCs) are replication-transcription collision hotspots; many are unique to neural progenitor cells. Through high-resolution replication sequencing and a capture-ligation assay in mouse neural progenitor cells experiencing replication stress, we unravel the replication features dictating RDC location and orientation. Most RDCs occur at the replication forks traversing timing transition regions (TTRs), where sparse replication origins connect unidirectional forks. Leftward-moving forks generate telomere-connected DNA double-strand breaks (DSBs), while rightward-moving forks lead to centromere-connected DSBs. Strand-specific mapping for DNA-bound RNA reveals co-transcriptional dual-strand DNA:RNA hybrids present at a higher density in RDC than in other actively transcribed long genes. In addition, mapping RNA polymerase activity uncovers that head-to-head interactions between replication and transcription machinery result in 60% DSB contribution to the head-on compared to 40% for co-directional. Taken together we reveal TTR as a fragile class and show how the linear interaction between transcription and replication impacts genome stability.
Collapse
|
2
|
Beyond A and B Compartments: how major nuclear locales define nuclear genome organization and function. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.23.590809. [PMID: 38712201 PMCID: PMC11071382 DOI: 10.1101/2024.04.23.590809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Models of nuclear genome organization often propose a binary division into active versus inactive compartments, yet they overlook nuclear bodies. Here we integrated analysis of sequencing and image-based data to compare genome organization in four human cell types relative to three different nuclear locales: the nuclear lamina, nuclear speckles, and nucleoli. Whereas gene expression correlates mostly with nuclear speckle proximity, DNA replication timing correlates with proximity to multiple nuclear locales. Speckle attachment regions emerge as DNA replication initiation zones whose replication timing and gene composition vary with their attachment frequency. Most facultative LADs retain a partially repressed state as iLADs, despite their positioning in the nuclear interior. Knock out of two lamina proteins, Lamin A and LBR, causes a shift of H3K9me3-enriched LADs from lamina to nucleolus, and a reciprocal relocation of H3K27me3-enriched partially repressed iLADs from nucleolus to lamina. Thus, these partially repressed iLADs appear to compete with LADs for nuclear lamina attachment with consequences for replication timing. The nuclear organization in adherent cells is polarized with nuclear bodies and genomic regions segregating both radially and relative to the equatorial plane. Together, our results underscore the importance of considering genome organization relative to nuclear locales for a more complete understanding of the spatial and functional organization of the human genome.
Collapse
|
3
|
The genetic regulatory architecture and epigenomic basis for age-related changes in rattlesnake venom. Proc Natl Acad Sci U S A 2024; 121:e2313440121. [PMID: 38578985 PMCID: PMC11032440 DOI: 10.1073/pnas.2313440121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 03/13/2024] [Indexed: 04/07/2024] Open
Abstract
Developmental phenotypic changes can evolve under selection imposed by age- and size-related ecological differences. Many of these changes occur through programmed alterations to gene expression patterns, but the molecular mechanisms and gene-regulatory networks underlying these adaptive changes remain poorly understood. Many venomous snakes, including the eastern diamondback rattlesnake (Crotalus adamanteus), undergo correlated changes in diet and venom expression as snakes grow larger with age, providing models for identifying mechanisms of timed expression changes that underlie adaptive life history traits. By combining a highly contiguous, chromosome-level genome assembly with measures of expression, chromatin accessibility, and histone modifications, we identified cis-regulatory elements and trans-regulatory factors controlling venom ontogeny in the venom glands of C. adamanteus. Ontogenetic expression changes were significantly correlated with epigenomic changes within genes, immediately adjacent to genes (e.g., promoters), and more distant from genes (e.g., enhancers). We identified 37 candidate transcription factors (TFs), with the vast majority being up-regulated in adults. The ontogenetic change is largely driven by an increase in the expression of TFs associated with growth signaling, transcriptional activation, and circadian rhythm/biological timing systems in adults with corresponding epigenomic changes near the differentially expressed venom genes. However, both expression activation and repression contributed to the composition of both adult and juvenile venoms, demonstrating the complexity and potential evolvability of gene regulation for this trait. Overall, given that age-based trait variation is common across the tree of life, we provide a framework for understanding gene-regulatory-network-driven life-history evolution more broadly.
Collapse
|
4
|
Linear Interaction Between Replication and Transcription Shapes DNA Break Dynamics at Recurrent DNA Break Clusters. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.22.554340. [PMID: 37662334 PMCID: PMC10473677 DOI: 10.1101/2023.08.22.554340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Recurrent DNA break clusters (RDCs) are replication-transcription collision hotspots; many are unique to neural progenitor cells. Through high-resolution replication sequencing and a capture-ligation assay in mouse neural progenitor cells experiencing replication stress, we unraveled the replication features dictating RDC location and orientation. Most RDCs occur at the replication forks traversing timing transition regions (TTRs), where sparse replication origins connect unidirectional forks. Leftward-moving forks generate telomere-connected DNA double-strand breaks (DSBs), while rightward-moving forks lead to centromere-connected DSBs. Strand-specific mapping for DNA-bound RNA revealed co-transcriptional dual-strand DNA:RNA hybrids present at a higher density in RDC than in other actively transcribed long genes. In addition, mapping RNA polymerase activity revealed that head-to-head interactions between replication and transcription machinery resulted in 60% DSB contribution to the head-on compared to 40% for co-directional. Our findings revealed TTR as a novel fragile class and highlighted how the linear interaction between transcription and replication impacts genome stability.
Collapse
|
5
|
Emergence of replication timing during early mammalian development. Nature 2024; 625:401-409. [PMID: 38123678 PMCID: PMC10781638 DOI: 10.1038/s41586-023-06872-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 11/16/2023] [Indexed: 12/23/2023]
Abstract
DNA replication enables genetic inheritance across the kingdoms of life. Replication occurs with a defined temporal order known as the replication timing (RT) programme, leading to organization of the genome into early- or late-replicating regions. RT is cell-type specific, is tightly linked to the three-dimensional nuclear organization of the genome1,2 and is considered an epigenetic fingerprint3. In spite of its importance in maintaining the epigenome4, the developmental regulation of RT in mammals in vivo has not been explored. Here, using single-cell Repli-seq5, we generated genome-wide RT maps of mouse embryos from the zygote to the blastocyst stage. Our data show that RT is initially not well defined but becomes defined progressively from the 4-cell stage, coinciding with strengthening of the A and B compartments. We show that transcription contributes to the precision of the RT programme and that the difference in RT between the A and B compartments depends on RNA polymerase II at zygotic genome activation. Our data indicate that the establishment of nuclear organization precedes the acquisition of defined RT features and primes the partitioning of the genome into early- and late-replicating domains. Our work sheds light on the establishment of the epigenome at the beginning of mammalian development and reveals the organizing principles of genome organization.
Collapse
|
6
|
RIF1 regulates early replication timing in murine B cells. Nat Commun 2023; 14:8049. [PMID: 38081811 PMCID: PMC10713614 DOI: 10.1038/s41467-023-43778-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 11/20/2023] [Indexed: 12/18/2023] Open
Abstract
The mammalian DNA replication timing (RT) program is crucial for the proper functioning and integrity of the genome. The best-known mechanism for controlling RT is the suppression of late origins of replication in heterochromatin by RIF1. Here, we report that in antigen-activated, hypermutating murine B lymphocytes, RIF1 binds predominantly to early-replicating active chromatin and promotes early replication, but plays a minor role in regulating replication origin activity, gene expression and genome organization in B cells. Furthermore, we find that RIF1 functions in a complementary and non-epistatic manner with minichromosome maintenance (MCM) proteins to establish early RT signatures genome-wide and, specifically, to ensure the early replication of highly transcribed genes. These findings reveal additional layers of regulation within the B cell RT program, driven by the coordinated activity of RIF1 and MCM proteins.
Collapse
|
7
|
Brd2 is dispensable for genome compartmentalization and replication timing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.17.567572. [PMID: 38249518 PMCID: PMC10798648 DOI: 10.1101/2023.11.17.567572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2024]
Abstract
Replication Timing (RT) refers to the temporal order in which the genome is replicated during S phase. Early replicating regions correlate with the transcriptionally active, accessible euchromatin (A) compartment, while late replicating regions correlate with the heterochromatin (B) compartment and repressive histone marks. Previously, widespread A/B genome compartmentalization changes were reported following Brd2 depletion. Since RT and A/B compartmentalization are two of the most highly correlated chromosome properties, we evaluated the effects of Brd2 depletion on RT. We performed E/L Repli-Seq following Brd2 depletion in the previously described Brd2 conditional degron cell line and found no significant alterations in RT after Brd2 KD. This finding prompted us to re-analyze the Micro-C data from the previous publication. We report that we were unable to detect any compartmentalization changes in Brd2 depleted cells compared to DMSO control using the same data. Taken together, our findings demonstrate that Brd2 depletion alone does not affect A/B compartmentalization or RT in mouse embryonic stem cells.
Collapse
|
8
|
Nucleolus and centromere TSA-Seq reveals variable localization of heterochromatin in different cell types. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.29.564613. [PMID: 37961445 PMCID: PMC10634939 DOI: 10.1101/2023.10.29.564613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Genome differential positioning within interphase nuclei remains poorly explored. We extended and validated TSA-seq to map genomic regions near nucleoli and pericentric heterochromatin in four human cell lines. Our study confirmed that smaller chromosomes localize closer to nucleoli but further deconvolved this by revealing a preference for chromosome arms below 36-46 Mbp in length. We identified two lamina associated domain subsets through their differential nuclear lamina versus nucleolar positioning in different cell lines which showed distinctive patterns of DNA replication timing and gene expression across all cell lines. Unexpectedly, active, nuclear speckle-associated genomic regions were found near typically repressive nuclear compartments, which is attributable to the close proximity of nuclear speckles and nucleoli in some cell types, and association of centromeres with nuclear speckles in hESCs. Our study points to a more complex and variable nuclear genome organization than suggested by current models, as revealed by our TSA-seq methodology.
Collapse
|
9
|
SHIELD: a platform for high-throughput screening of barrier-type DNA elements in human cells. Nat Commun 2023; 14:5616. [PMID: 37699958 PMCID: PMC10497619 DOI: 10.1038/s41467-023-41468-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 09/04/2023] [Indexed: 09/14/2023] Open
Abstract
Chromatin boundary elements contribute to the partitioning of mammalian genomes into topological domains to regulate gene expression. Certain boundary elements are adopted as DNA insulators for safe and stable transgene expression in mammalian cells. These elements, however, are ill-defined and less characterized in the non-coding genome, partially due to the lack of a platform to readily evaluate boundary-associated activities of putative DNA sequences. Here we report SHIELD (Site-specific Heterochromatin Insertion of Elements at Lamina-associated Domains), a platform tailored for the high-throughput screening of barrier-type DNA elements in human cells. SHIELD takes advantage of the high specificity of serine integrase at heterochromatin, and exploits the natural heterochromatin spreading inside lamina-associated domains (LADs) for the discovery of potent barrier elements. We adopt SHIELD to evaluate the barrier activity of 1000 DNA elements in a high-throughput manner and identify 8 candidates with barrier activities comparable to the core region of cHS4 element in human HCT116 cells. We anticipate SHIELD could facilitate the discovery of novel barrier DNA elements from the non-coding genome in human cells.
Collapse
|
10
|
The location and development of Replicon Cluster Domains in early replicating DNA. Wellcome Open Res 2023; 8:158. [PMID: 37766844 PMCID: PMC10521077 DOI: 10.12688/wellcomeopenres.18742.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/18/2023] [Indexed: 09/29/2023] Open
Abstract
Background: It has been known for many years that in metazoan cells, replication origins are organised into clusters where origins within each cluster fire near-synchronously. Despite clusters being a fundamental organising principle of metazoan DNA replication, the genomic location of origin clusters has not been documented. Methods: We synchronised human U2OS by thymidine block and release followed by L-mimosine block and release to create a population of cells progressing into S phase with a high degree of synchrony. At different times after release into S phase, cells were pulsed with EdU; the EdU-labelled DNA was then pulled down, sequenced and mapped onto the human genome. Results: The early replicating DNA showed features at a range of scales. Wavelet analysis showed that the major feature of the early replicating DNA was at a size of 500 kb, consistent with clusters of replication origins. Over the first two hours of S phase, these Replicon Cluster Domains broadened in width, consistent with their being enlarged by the progression of replication forks at their outer boundaries. The total replication signal associated with each Replicon Cluster Domain varied considerably, and this variation was reproducible and conserved over time. We provide evidence that this variability in replication signal was at least in part caused by Replicon Cluster Domains being activated at different times in different cells in the population. We also provide evidence that adjacent clusters had a statistical preference for being activated in sequence across a group, consistent with the 'domino' model of replication focus activation order observed by microscopy. Conclusions: We show that early replicating DNA is organised into Replicon Cluster Domains that behave as expected of replicon clusters observed by DNA fibre analysis. The coordinated activation of different Replicon Cluster Domains can generate the replication timing programme by which the genome is duplicated.
Collapse
|
11
|
Spatial and temporal organization of the genome: Current state and future aims of the 4D nucleome project. Mol Cell 2023; 83:2624-2640. [PMID: 37419111 PMCID: PMC10528254 DOI: 10.1016/j.molcel.2023.06.018] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/10/2023] [Accepted: 06/12/2023] [Indexed: 07/09/2023]
Abstract
The four-dimensional nucleome (4DN) consortium studies the architecture of the genome and the nucleus in space and time. We summarize progress by the consortium and highlight the development of technologies for (1) mapping genome folding and identifying roles of nuclear components and bodies, proteins, and RNA, (2) characterizing nuclear organization with time or single-cell resolution, and (3) imaging of nuclear organization. With these tools, the consortium has provided over 2,000 public datasets. Integrative computational models based on these data are starting to reveal connections between genome structure and function. We then present a forward-looking perspective and outline current aims to (1) delineate dynamics of nuclear architecture at different timescales, from minutes to weeks as cells differentiate, in populations and in single cells, (2) characterize cis-determinants and trans-modulators of genome organization, (3) test functional consequences of changes in cis- and trans-regulators, and (4) develop predictive models of genome structure and function.
Collapse
|
12
|
Transposase expression, element abundance, element size, and DNA repair determine the mobility and heritability of PIF/ Pong/ Harbinger transposable elements. Front Cell Dev Biol 2023; 11:1184046. [PMID: 37363729 PMCID: PMC10288884 DOI: 10.3389/fcell.2023.1184046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 05/22/2023] [Indexed: 06/28/2023] Open
Abstract
Introduction: Class II DNA transposable elements account for significant portions of eukaryotic genomes and contribute to genome evolution through their mobilization. To escape inactivating mutations and persist in the host genome over evolutionary time, these elements must be mobilized enough to result in additional copies. These elements utilize a "cut and paste" transposition mechanism that does not intrinsically include replication. However, elements such as the rice derived mPing element have been observed to increase in copy number over time. Methods: We used yeast transposition assays to test several parameters that could affect the excision and insertion of mPing and its related elements. This included development of novel strategies for measuring element insertion and sequencing insertion sites. Results: Increased transposase protein expression increased the mobilization frequency of a small (430 bp) element, while overexpression inhibition was observed for a larger (7,126 bp) element. Smaller element size increased both the frequency of excision and insertion of these elements. The effect of yeast ploidy on element excision, insertion, and copy number provided evidence that homology dependent repair allows for replicative transposition. These elements were found to preferentially insert into yeast rDNA repeat sequences. Discussion: Identifying the parameters that influence transposition of these elements will facilitate their use for gene discovery and genome editing. These insights in to the behavior of these elements also provide important clues into how class II transposable elements have shaped eukaryotic genomes.
Collapse
|
13
|
Replication licensing during S phase: breaking the law to prevent breaking DNA. Nat Struct Mol Biol 2023; 30:406-408. [PMID: 37041325 DOI: 10.1038/s41594-023-00962-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
|
14
|
Replication timing and transcriptional control: beyond cause and effect - part IV. Curr Opin Genet Dev 2023; 79:102031. [PMID: 36905782 PMCID: PMC10035587 DOI: 10.1016/j.gde.2023.102031] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 02/07/2023] [Accepted: 02/11/2023] [Indexed: 03/11/2023]
Abstract
Decades of work on the spatiotemporal organization of mammalian DNA replication timing (RT) continues to unveil novel correlations with aspects of transcription and chromatin organization but, until recently, mechanisms regulating RT and the biological significance of the RT program had been indistinct. We now know that the RT program is both influenced by and necessary to maintain chromatin structure, forming an epigenetic positive feedback loop. Moreover, the discovery of specific cis-acting elements regulating mammalian RT at both the domain and the whole-chromosome level has revealed multiple cell-type-specific and developmentally regulated mechanisms of RT control. We review recent evidence for diverse mechanisms employed by different cell types to regulate their RT programs and the biological significance of RT regulation during development.
Collapse
|
15
|
Abstract
Some viruses restructure host chromatin, influencing gene expression, with implications for disease outcome. Whether this occurs for SARS-CoV-2, the virus causing COVID-19, is largely unknown. Here we characterized the 3D genome and epigenome of human cells after SARS-CoV-2 infection, finding widespread host chromatin restructuring that features widespread compartment A weakening, A-B mixing, reduced intra-TAD contacts and decreased H3K27ac euchromatin modification levels. Such changes were not found following common-cold-virus HCoV-OC43 infection. Intriguingly, the cohesin complex was notably depleted from intra-TAD regions, indicating that SARS-CoV-2 disrupts cohesin loop extrusion. These altered 3D genome/epigenome structures correlated with transcriptional suppression of interferon response genes by the virus, while increased H3K4me3 was found in the promoters of pro-inflammatory genes highly induced during severe COVID-19. These findings show that SARS-CoV-2 acutely rewires host chromatin, facilitating future studies of the long-term epigenomic impacts of its infection.
Collapse
|
16
|
Epigenetic control of chromosome-associated lncRNA genes essential for replication and stability. Nat Commun 2022; 13:6301. [PMID: 36273230 PMCID: PMC9588035 DOI: 10.1038/s41467-022-34099-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 10/13/2022] [Indexed: 01/18/2023] Open
Abstract
ASARs are long noncoding RNA genes that control replication timing of entire human chromosomes in cis. The three known ASAR genes are located on human chromosomes 6 and 15, and are essential for chromosome integrity. To identify ASARs on all human chromosomes we utilize a set of distinctive ASAR characteristics that allow for the identification of hundreds of autosomal loci with epigenetically controlled, allele-restricted behavior in expression and replication timing of coding and noncoding genes, and is distinct from genomic imprinting. Disruption of noncoding RNA genes at five of five tested loci result in chromosome-wide delayed replication and chromosomal instability, validating their ASAR activity. In addition to the three known essential cis-acting chromosomal loci, origins, centromeres, and telomeres, we propose that all mammalian chromosomes also contain "Inactivation/Stability Centers" that display allele-restricted epigenetic regulation of protein coding and noncoding ASAR genes that are essential for replication and stability of each chromosome.
Collapse
|
17
|
Dynamic chromosomal interactions and control of heterochromatin positioning by Ki-67. EMBO Rep 2022; 23:e55782. [PMID: 36245428 PMCID: PMC9724667 DOI: 10.15252/embr.202255782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 09/26/2022] [Accepted: 09/29/2022] [Indexed: 11/05/2022] Open
Abstract
Ki-67 is a chromatin-associated protein with a dynamic distribution pattern throughout the cell cycle and is thought to be involved in chromatin organization. The lack of genomic interaction maps has hampered a detailed understanding of its roles, particularly during interphase. By pA-DamID mapping in human cell lines, we find that Ki-67 associates with large genomic domains that overlap mostly with late-replicating regions. Early in interphase, when Ki-67 is present in pre-nucleolar bodies, it interacts with these domains on all chromosomes. However, later in interphase, when Ki-67 is confined to nucleoli, it shows a striking shift toward small chromosomes. Nucleolar perturbations indicate that these cell cycle dynamics correspond to nucleolar maturation during interphase, and suggest that nucleolar sequestration of Ki-67 limits its interactions with larger chromosomes. Furthermore, we demonstrate that Ki-67 does not detectably control chromatin-chromatin interactions during interphase, but it competes with the nuclear lamina for interaction with late-replicating DNA, and it controls replication timing of (peri)centromeric regions. Together, these results reveal a highly dynamic choreography of genome interactions and roles for Ki-67 in heterochromatin organization.
Collapse
|
18
|
Abstract
DNA replication occurs through an intricately regulated series of molecular events and is fundamental for genome stability1,2. At present, it is unknown how the locations of replication origins are determined in the human genome. Here we dissect the role of topologically associating domains (TADs)3-6, subTADs7 and loops8 in the positioning of replication initiation zones (IZs). We stratify TADs and subTADs by the presence of corner-dots indicative of loops and the orientation of CTCF motifs. We find that high-efficiency, early replicating IZs localize to boundaries between adjacent corner-dot TADs anchored by high-density arrays of divergently and convergently oriented CTCF motifs. By contrast, low-efficiency IZs localize to weaker dotless boundaries. Following ablation of cohesin-mediated loop extrusion during G1, high-efficiency IZs become diffuse and delocalized at boundaries with complex CTCF motif orientations. Moreover, G1 knockdown of the cohesin unloading factor WAPL results in gained long-range loops and narrowed localization of IZs at the same boundaries. Finally, targeted deletion or insertion of specific boundaries causes local replication timing shifts consistent with IZ loss or gain, respectively. Our data support a model in which cohesin-mediated loop extrusion and stalling at a subset of genetically encoded TAD and subTAD boundaries is an essential determinant of the locations of replication origins in human S phase.
Collapse
|
19
|
Author Correction: Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 2022; 605:E3. [PMID: 35474001 PMCID: PMC9095460 DOI: 10.1038/s41586-021-04226-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
20
|
Abstract
Replication timing (RT) is the temporal order in which genomic DNA is replicated during S phase. Early and late replication correlate with transcriptionally active and inactive chromatin compartments, but mechanistic links between large-scale chromosome structure, transcription, and replication are still enigmatic. A proper RT program is necessary to maintain the global epigenome that defines cell identity, suggesting that RT is critical for epigenome integrity by facilitating the assembly of different types of chromatin at different times during S phase. RT is regulated during development and has been found to be altered in disease. Thus, RT can identify stable epigenetic differences distinguishing cell types, and can be used to help stratify patient outcomes and identify markers of disease. Most methods to profile RT require thousands of S-phase cells. In cases where cells are rare (e.g., early-stage embryos or rare primary cell types) or consist of a heterogeneous mixture of cell states (e.g., differentiation intermediates), or when the interest is in determining the degree of stable epigenetic heterogeneity within a population of cells, single-cell measurements of RT are necessary. We have previously developed single cell Repli-seq, a method to measure replication timing in single cells using DNA copy number quantification. To date, however, single-cell Repli-seq suffers from relatively low throughput and high costs. Here, we describe an improved single-cell Repli-seq protocol that uses degenerate oligonucleotide-primed PCR (DOP-PCR) for uniform whole-genome amplification and uniquely barcoded primers that permit early pooling of single-cell samples into a single library preparation. We also provide a bioinformatics platform for analysis of the data. The improved throughput and decreased costs of this method relative to previously published single-cell Repli-seq protocols should make it considerably more accessible to a broad range of investigators. © 2022 Wiley Periodicals LLC. Basic Protocol 1: Whole Genome Amplification (WGA) of single cells and sequence library construction. Basic Protocol 2: Deriving and displaying single-cell replication timing data from whole genome sequencing.
Collapse
|
21
|
High-throughput single-cell epigenomic profiling by targeted insertion of promoters (TIP-seq). J Cell Biol 2021; 220:e202103078. [PMID: 34783858 PMCID: PMC8600797 DOI: 10.1083/jcb.202103078] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2021] [Revised: 09/15/2021] [Accepted: 10/27/2021] [Indexed: 12/16/2022] Open
Abstract
Chromatin profiling in single cells has been extremely challenging and almost exclusively limited to histone proteins. In cases where single-cell methods have shown promise, many require highly specialized equipment or cell type-specific protocols and are relatively low throughput. Here, we combine the advantages of tagmentation, linear amplification, and combinatorial indexing to produce a high-throughput single-cell DNA binding site mapping method that is simple, inexpensive, and capable of multiplexing several independent samples per experiment. Targeted insertion of promoters sequencing (TIP-seq) uses Tn5 fused to proteinA to insert a T7 RNA polymerase promoter adjacent to a chromatin protein of interest. Linear amplification of flanking DNA with T7 polymerase before sequencing library preparation provides ∼10-fold higher unique reads per single cell compared with other methods. We applied TIP-seq to map histone modifications, RNA polymerase II (RNAPII), and transcription factor CTCF binding sites in single human and mouse cells.
Collapse
|
22
|
Abstract
Immediately following the discovery of the structure of DNA and the semi-conservative replication of the parental DNA sequence into two new DNA strands, it became apparent that DNA replication is organized in a temporal and spatial fashion during the S phase of the cell cycle, correlated with the large-scale organization of chromatin in the nucleus. After many decades of limited progress, technological advances in genomics, genome engineering, and imaging have finally positioned the field to tackle mechanisms underpinning the temporal and spatial regulation of DNA replication and the causal relationships between DNA replication and other features of large-scale chromosome structure and function. In this review, we discuss these major recent discoveries as well as expectations for the coming decade.
Collapse
|
23
|
Genome-wide mapping of human DNA replication by optical replication mapping supports a stochastic model of eukaryotic replication. Mol Cell 2021; 81:2975-2988.e6. [PMID: 34157308 DOI: 10.1016/j.molcel.2021.05.024] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 03/08/2021] [Accepted: 05/20/2021] [Indexed: 12/27/2022]
Abstract
The heterogeneous nature of eukaryotic replication kinetics and the low efficiency of individual initiation sites make mapping the location and timing of replication initiation in human cells difficult. To address this challenge, we have developed optical replication mapping (ORM), a high-throughput single-molecule approach, and used it to map early-initiation events in human cells. The single-molecule nature of our data and a total of >2,500-fold coverage of the human genome on 27 million fibers averaging ∼300 kb in length allow us to identify initiation sites and their firing probability with high confidence. We find that the distribution of human replication initiation is consistent with inefficient, stochastic activation of heterogeneously distributed potential initiation complexes enriched in accessible chromatin. These observations are consistent with stochastic models of initiation-timing regulation and suggest that stochastic regulation of replication kinetics is a fundamental feature of eukaryotic replication, conserved from yeast to humans.
Collapse
|
24
|
Nuclear organisation and replication timing are coupled through RIF1-PP1 interaction. Nat Commun 2021; 12:2910. [PMID: 34006872 PMCID: PMC8131703 DOI: 10.1038/s41467-021-22899-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 03/30/2021] [Indexed: 02/07/2023] Open
Abstract
Three-dimensional genome organisation and replication timing are known to be correlated, however, it remains unknown whether nuclear architecture overall plays an instructive role in the replication-timing programme and, if so, how. Here we demonstrate that RIF1 is a molecular hub that co-regulates both processes. Both nuclear organisation and replication timing depend upon the interaction between RIF1 and PP1. However, whereas nuclear architecture requires the full complement of RIF1 and its interaction with PP1, replication timing is not sensitive to RIF1 dosage. The role of RIF1 in replication timing also extends beyond its interaction with PP1. Availing of this separation-of-function approach, we have therefore identified in RIF1 dual function the molecular bases of the co-dependency of the replication-timing programme and nuclear architecture.
Collapse
|
25
|
Replication timing maintains the global epigenetic state in human cells. Science 2021; 372:371-378. [PMID: 33888635 PMCID: PMC8173839 DOI: 10.1126/science.aba5545] [Citation(s) in RCA: 71] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 11/01/2020] [Accepted: 03/19/2021] [Indexed: 12/12/2022]
Abstract
The temporal order of DNA replication [replication timing (RT)] is correlated with chromatin modifications and three-dimensional genome architecture; however, causal links have not been established, largely because of an inability to manipulate the global RT program. We show that loss of RIF1 causes near-complete elimination of the RT program by increasing heterogeneity between individual cells. RT changes are coupled with widespread alterations in chromatin modifications and genome compartmentalization. Conditional depletion of RIF1 causes replication-dependent disruption of histone modifications and alterations in genome architecture. These effects were magnified with successive cycles of altered RT. These results support models in which the timing of chromatin replication and thus assembly plays a key role in maintaining the global epigenetic state.
Collapse
|
26
|
STATISTICAL COMPARISONS OF CHROMOSOMAL SHAPE POPULATIONS. PROCEEDINGS. IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING 2021; 2021:788-791. [PMID: 35165532 PMCID: PMC8840943 DOI: 10.1109/isbi48211.2021.9433812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
This paper develops statistical tools for testing differences in shapes of chromosomes resulting from certain gene knockouts (KO), specifically RIF1 gene KO (RKO) and the cohesin subunit RAD21 gene KO (CKO). It utilizes a two-sample test for comparing shapes of KO chromosomes with wild type (WT) at two levels: (1) Coarse shape analysis, where one compares shapes of full or large parts of chromosomes, and (2) Fine shape analysis, where chromosomes are first segmented into (TAD-based) pieces and then the corresponding pieces are compared across populations. The shape comparisons - coarse and fine - are based on an elastic shape metric for comparing shapes of 3D curves. The experiments show that the KO populations, RKO and CKO, have statistically significant differences from WT at both coarse and fine levels. Furthermore, this framework highlights local regions where these differences are most prominent.
Collapse
|
27
|
Abstract
Guided by the extensive knowledge of CRISPR-Cas9 molecular mechanisms, protein engineering can be an effective method in improving CRISPR-Cas9 toward desired traits different from those of their natural forms. Here, we describe a directed protein evolution method that enables selection of catalytically enhanced CRISPR-Cas9 variants (CECas9) by targeting a shortened protospacer within a toxic gene. We demonstrate the effectiveness of this method with a previously characterized Type II-C Cas9 from Acidothermus cellulolyticus (AceCas9) and show by enzyme kinetics an up to fourfold improvement of the in vitro catalytic efficiency by AceCECas9. We further evolved the more widely used Streptococcus pyogenes Cas9 (SpyCas9) and demonstrated a noticeable improvement in the SpyCECas9-facilitated homology directed repair-based gene insertion in human colon cancer cells.
Collapse
|
28
|
The Tiger Rattlesnake genome reveals a complex genotype underlying a simple venom phenotype. Proc Natl Acad Sci U S A 2021; 118:e2014634118. [PMID: 33468678 PMCID: PMC7848695 DOI: 10.1073/pnas.2014634118] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Variation in gene regulation is ubiquitous, yet identifying the mechanisms producing such variation, especially for complex traits, is challenging. Snake venoms provide a model system for studying the phenotypic impacts of regulatory variation in complex traits because of their genetic tractability. Here, we sequence the genome of the Tiger Rattlesnake, which possesses the simplest and most toxic venom of any rattlesnake species, to determine whether the simple venom phenotype is the result of a simple genotype through gene loss or a complex genotype mediated through regulatory mechanisms. We generate the most contiguous snake-genome assembly to date and use this genome to show that gene loss, chromatin accessibility, and methylation levels all contribute to the production of the simplest, most toxic rattlesnake venom. We provide the most complete characterization of the venom gene-regulatory network to date and identify key mechanisms mediating phenotypic variation across a polygenic regulatory network.
Collapse
|
29
|
Abstract
We report SPIN, an integrative computational method to reveal genome-wide intranuclear chromosome positioning and nuclear compartmentalization relative to multiple nuclear structures, which are pivotal for modulating genome function. As a proof-of-principle, we use SPIN to integrate nuclear compartment mapping (TSA-seq and DamID) and chromatin interaction data (Hi-C) from K562 cells to identify 10 spatial compartmentalization states genome-wide relative to nuclear speckles, lamina, and putative associations with nucleoli. These SPIN states show novel patterns of genome spatial organization and their relation to other 3D genome features and genome function (transcription and replication timing). SPIN provides critical insights into nuclear spatial and functional compartmentalization.
Collapse
|
30
|
Cohesin depleted cells rebuild functional nuclear compartments after endomitosis. Nat Commun 2020; 11:6146. [PMID: 33262376 PMCID: PMC7708632 DOI: 10.1038/s41467-020-19876-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 10/28/2020] [Indexed: 01/05/2023] Open
Abstract
Cohesin plays an essential role in chromatin loop extrusion, but its impact on a compartmentalized nuclear architecture, linked to nuclear functions, is less well understood. Using live-cell and super-resolved 3D microscopy, here we find that cohesin depletion in a human colon cancer derived cell line results in endomitosis and a single multilobulated nucleus with chromosome territories pervaded by interchromatin channels. Chromosome territories contain chromatin domain clusters with a zonal organization of repressed chromatin domains in the interior and transcriptionally competent domains located at the periphery. These clusters form microscopically defined, active and inactive compartments, which likely correspond to A/B compartments, which are detected with ensemble Hi-C. Splicing speckles are observed nearby within the lining channel system. We further observe that the multilobulated nuclei, despite continuous absence of cohesin, pass through S-phase with typical spatio-temporal patterns of replication domains. Evidence for structural changes of these domains compared to controls suggests that cohesin is required for their full integrity.
Collapse
|
31
|
Abstract
ENCODE comprises thousands of functional genomics datasets, and the encyclopedia covers hundreds of cell types, providing a universal annotation for genome interpretation. However, for particular applications, it may be advantageous to use a customized annotation. Here, we develop such a custom annotation by leveraging advanced assays, such as eCLIP, Hi-C, and whole-genome STARR-seq on a number of data-rich ENCODE cell types. A key aspect of this annotation is comprehensive and experimentally derived networks of both transcription factors and RNA-binding proteins (TFs and RBPs). Cancer, a disease of system-wide dysregulation, is an ideal application for such a network-based annotation. Specifically, for cancer-associated cell types, we put regulators into hierarchies and measure their network change (rewiring) during oncogenesis. We also extensively survey TF-RBP crosstalk, highlighting how SUB1, a previously uncharacterized RBP, drives aberrant tumor expression and amplifies the effect of MYC, a well-known oncogenic TF. Furthermore, we show how our annotation allows us to place oncogenic transformations in the context of a broad cell space; here, many normal-to-tumor transitions move towards a stem-like state, while oncogene knockdowns show an opposing trend. Finally, we organize the resource into a coherent workflow to prioritize key elements and variants, in addition to regulators. We showcase the application of this prioritization to somatic burdening, cancer differential expression and GWAS. Targeted validations of the prioritized regulators, elements and variants using siRNA knockdowns, CRISPR-based editing, and luciferase assays demonstrate the value of the ENCODE resource.
Collapse
|
32
|
Abstract
The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.
Collapse
|
33
|
High-resolution Repli-Seq defines the temporal choreography of initiation, elongation and termination of replication in mammalian cells. Genome Biol 2020; 21:76. [PMID: 32209126 PMCID: PMC7092589 DOI: 10.1186/s13059-020-01983-8] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 03/04/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND DNA replication in mammalian cells occurs in a defined temporal order during S phase, known as the replication timing (RT) programme. Replication timing is developmentally regulated and correlated with chromatin conformation and local transcriptional potential. Here, we present RT profiles of unprecedented temporal resolution in two human embryonic stem cell lines, human colon carcinoma line HCT116, and mouse embryonic stem cells and their neural progenitor derivatives. RESULTS Fine temporal windows revealed a remarkable degree of cell-to-cell conservation in RT, particularly at the very beginning and ends of S phase, and identified 5 temporal patterns of replication in all cell types, consistent with varying degrees of initiation efficiency. Zones of replication initiation (IZs) were detected throughout S phase and interacted in 3D space preferentially with other IZs of similar firing time. Temporal transition regions were resolved into segments of uni-directional replication punctuated at specific sites by small, inefficient IZs. Sites of convergent replication were divided into sites of termination or large constant timing regions consisting of many synchronous IZs in tandem. Developmental transitions in RT occured mainly by activating or inactivating individual IZs or occasionally by altering IZ firing time, demonstrating that IZs, rather than individual origins, are the units of developmental regulation. Finally, haplotype phasing revealed numerous regions of allele-specific and allele-independent asynchronous replication. Allele-independent asynchronous replication was correlated with the presence of previously mapped common fragile sites. CONCLUSIONS Altogether, these data provide a detailed temporal choreography of DNA replication in mammalian cells.
Collapse
|
34
|
Local rewiring of genome-nuclear lamina interactions by transcription. EMBO J 2020; 39:e103159. [PMID: 32080885 PMCID: PMC7073462 DOI: 10.15252/embj.2019103159] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 01/23/2020] [Accepted: 01/28/2020] [Indexed: 12/20/2022] Open
Abstract
Transcriptionally inactive genes are often positioned at the nuclear lamina (NL), as part of large lamina‐associated domains (LADs). Activation of such genes is often accompanied by repositioning toward the nuclear interior. How this process works and how it impacts flanking chromosomal regions are poorly understood. We addressed these questions by systematic activation or inactivation of individual genes, followed by detailed genome‐wide analysis of NL interactions, replication timing, and transcription patterns. Gene activation inside LADs typically causes NL detachment of the entire transcription unit, but rarely more than 50–100 kb of flanking DNA, even when multiple neighboring genes are activated. The degree of detachment depends on the expression level and the length of the activated gene. Loss of NL interactions coincides with a switch from late to early replication timing, but the latter can involve longer stretches of DNA. Inactivation of active genes can lead to increased NL contacts. These extensive datasets are a resource for the analysis of LAD rewiring by transcription and reveal a remarkable flexibility of interphase chromosomes.
Collapse
|
35
|
Replication timing networks reveal a link between transcription regulatory circuits and replication timing control. Genome Res 2019; 29:1415-1428. [PMID: 31434679 PMCID: PMC6724675 DOI: 10.1101/gr.247049.118] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Accepted: 08/05/2019] [Indexed: 12/11/2022]
Abstract
DNA replication occurs in a defined temporal order known as the replication timing (RT) program and is regulated during development, coordinated with 3D genome organization and transcriptional activity. However, transcription and RT are not sufficiently coordinated to predict each other, suggesting an indirect relationship. Here, we exploit genome-wide RT profiles from 15 human cell types and intermediate differentiation stages derived from human embryonic stem cells to construct different types of RT regulatory networks. First, we constructed networks based on the coordinated RT changes during cell fate commitment to create highly complex RT networks composed of thousands of interactions that form specific functional subnetwork communities. We also constructed directional regulatory networks based on the order of RT changes within cell lineages, and identified master regulators of differentiation pathways. Finally, we explored relationships between RT networks and transcriptional regulatory networks (TRNs) by combining them into more complex circuitries of composite and bipartite networks. Results identified novel trans interactions linking transcription factors that are core to the regulatory circuitry of each cell type to RT changes occurring in those cell types. These core transcription factors were found to bind cooperatively to sites in the affected replication domains, providing provocative evidence that they constitute biologically significant directional interactions. Our findings suggest a regulatory link between the establishment of cell-type-specific TRNs and RT control during lineage specification.
Collapse
|
36
|
RT States: systematic annotation of the human genome using cell type-specific replication timing programs. Bioinformatics 2019; 35:2167-2176. [PMID: 30475980 PMCID: PMC6681175 DOI: 10.1093/bioinformatics/bty957] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 11/05/2018] [Accepted: 11/21/2018] [Indexed: 12/20/2022] Open
Abstract
MOTIVATION The replication timing (RT) program has been linked to many key biological processes including cell fate commitment, 3D chromatin organization and transcription regulation. Significant technology progress now allows to characterize the RT program in the entire human genome in a high-throughput and high-resolution fashion. These experiments suggest that RT changes dynamically during development in coordination with gene activity. Since RT is such a fundamental biological process, we believe that an effective quantitative profile of the local RT program from a diverse set of cell types in various developmental stages and lineages can provide crucial biological insights for a genomic locus. RESULTS In this study, we explored recurrent and spatially coherent combinatorial profiles from 42 RT programs collected from multiple lineages at diverse differentiation states. We found that a Hidden Markov Model with 15 hidden states provide a good model to describe these genome-wide RT profiling data. Each of the hidden state represents a unique combination of RT profiles across different cell types which we refer to as 'RT states'. To understand the biological properties of these RT states, we inspected their relationship with chromatin states, gene expression, functional annotation and 3D chromosomal organization. We found that the newly defined RT states possess interesting genome-wide functional properties that add complementary information to the existing annotation of the human genome. AVAILABILITY AND IMPLEMENTATION R scripts for inferring HMM models and Perl scripts for further analysis are available https://github.com/PouletAxel/script_HMM_Replication_timing. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
37
|
Rapid Irreversible Transcriptional Reprogramming in Human Stem Cells Accompanied by Discordance between Replication Timing and Chromatin Compartment. Stem Cell Reports 2019; 13:193-206. [PMID: 31231024 PMCID: PMC6627004 DOI: 10.1016/j.stemcr.2019.05.021] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 05/20/2019] [Accepted: 05/20/2019] [Indexed: 02/02/2023] Open
Abstract
The temporal order of DNA replication is regulated during development and is highly correlated with gene expression, histone modifications and 3D genome architecture. We tracked changes in replication timing, gene expression, and chromatin conformation capture (Hi-C) A/B compartments over the first two cell cycles during differentiation of human embryonic stem cells to definitive endoderm. Remarkably, transcriptional programs were irreversibly reprogrammed within the first cell cycle and were largely but not universally coordinated with replication timing changes. Moreover, changes in A/B compartment and several histone modifications that normally correlate strongly with replication timing showed weak correlation during the early cell cycles of differentiation but showed increased alignment in later differentiation stages and in terminally differentiated cell lines. Thus, epigenetic cell fate transitions during early differentiation can occur despite dynamic and discordant changes in otherwise highly correlated genomic properties.
Collapse
|
38
|
Identifying cis Elements for Spatiotemporal Control of Mammalian DNA Replication. Cell 2019; 176:816-830.e18. [PMID: 30595451 PMCID: PMC6546437 DOI: 10.1016/j.cell.2018.11.036] [Citation(s) in RCA: 106] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Revised: 10/01/2018] [Accepted: 11/21/2018] [Indexed: 01/09/2023]
Abstract
The temporal order of DNA replication (replication timing [RT]) is highly coupled with genome architecture, but cis-elements regulating either remain elusive. We created a series of CRISPR-mediated deletions and inversions of a pluripotency-associated topologically associating domain (TAD) in mouse ESCs. CTCF-associated domain boundaries were dispensable for RT. CTCF protein depletion weakened most TAD boundaries but had no effect on RT or A/B compartmentalization genome-wide. By contrast, deletion of three intra-TAD CTCF-independent 3D contact sites caused a domain-wide early-to-late RT shift, an A-to-B compartment switch, weakening of TAD architecture, and loss of transcription. The dispensability of TAD boundaries and the necessity of these "early replication control elements" (ERCEs) was validated by deletions and inversions at additional domains. Our results demonstrate that discrete cis-regulatory elements orchestrate domain-wide RT, A/B compartmentalization, TAD architecture, and transcription, revealing fundamental principles linking genome structure and function.
Collapse
|
39
|
Integrative detection and analysis of structural variation in cancer genomes. Nat Genet 2018; 50:1388-1398. [PMID: 30202056 PMCID: PMC6301019 DOI: 10.1038/s41588-018-0195-8] [Citation(s) in RCA: 205] [Impact Index Per Article: 34.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Accepted: 07/16/2018] [Indexed: 01/19/2023]
Abstract
Structural variants (SVs) can contribute to oncogenesis through a variety of mechanisms. Despite their importance, the identification of SVs in cancer genomes remains challenging. Here, we present a framework that integrates optical mapping, high-throughput chromosome conformation capture (Hi-C), and whole-genome sequencing to systematically detect SVs in a variety of normal or cancer samples and cell lines. We identify the unique strengths of each method and demonstrate that only integrative approaches can comprehensively identify SVs in the genome. By combining Hi-C and optical mapping, we resolve complex SVs and phase multiple SV events to a single haplotype. Furthermore, we observe widespread structural variation events affecting the functions of noncoding sequences, including the deletion of distal regulatory sequences, alteration of DNA replication timing, and the creation of novel three-dimensional chromatin structural domains. Our results indicate that noncoding SVs may be underappreciated mutational drivers in cancer genomes.
Collapse
|
40
|
Continuous-Trait Probabilistic Model for Comparing Multi-species Functional Genomic Data. Cell Syst 2018; 7:208-218.e11. [PMID: 29936186 PMCID: PMC6107375 DOI: 10.1016/j.cels.2018.05.022] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2018] [Revised: 05/17/2018] [Accepted: 05/29/2018] [Indexed: 01/22/2023]
Abstract
A large amount of multi-species functional genomic data from high-throughput assays are becoming available to help understand the molecular mechanisms for phenotypic diversity across species. However, continuous-trait probabilistic models, which are key to such comparative analysis, remain under-explored. Here we develop a new model, called phylogenetic hidden Markov Gaussian processes (Phylo-HMGP), to simultaneously infer heterogeneous evolutionary states of functional genomic features in a genome-wide manner. Both simulation studies and real data application demonstrate the effectiveness of Phylo-HMGP. Importantly, we applied Phylo-HMGP to analyze a new cross-species DNA replication timing (RT) dataset from the same cell type in five primate species (human, chimpanzee, orangutan, gibbon, and green monkey). We demonstrate that our Phylo-HMGP model enables discovery of genomic regions with distinct evolutionary patterns of RT. Our method provides a generic framework for comparative analysis of multi-species continuous functional genomic signals to help reveal regions with conserved or lineage-specific regulatory roles.
Collapse
|
41
|
Cellular senescence induces replication stress with almost no affect on DNA replication timing. Cell Cycle 2018; 17:1667-1681. [PMID: 29963964 DOI: 10.1080/15384101.2018.1491235] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Organismal aging entails a gradual decline of normal physiological functions and a major contributor to this decline is withdrawal of the cell cycle, known as senescence. Senescence can result from telomere diminution leading to a finite number of population doublings, known as replicative senescence (RS), or from oncogene overexpression, as a protective mechanism against cancer. Senescence is associated with large-scale chromatin re-organization and changes in gene expression. Replication stress is a complex phenomenon, defined as the slowing or stalling of replication fork progression and/or DNA synthesis, which has serious implications for genome stability, and consequently in human diseases. Aberrant replication fork structures activate the replication stress response leading to the activation of dormant origins, which is thought to be a safeguard mechanism to complete DNA replication on time. However, the relationship between replicative stress and the changes in the spatiotemporal program of DNA replication in senescence progression remains unclear. Here, we studied the DNA replication program during senescence progression in proliferative and pre-senescent cells from donors of various ages by single DNA fiber combing of replicated DNA, origin mapping by sequencing short nascent strands and genome-wide profiling of replication timing (TRT). We demonstrate that, progression into RS leads to reduced replication fork rates and activation of dormant origins, which are the hallmarks of replication stress. However, with the exception of a delay in RT of the CREB5 gene in all pre-senescent cells, RT was globally unaffected by replication stress during entry into either oncogene-induced or RS. Consequently, we conclude that RT alterations associated with physiological and accelerated aging, do not result from senescence progression. Our results clarify the interplay between senescence, aging and replication programs and demonstrate that RT is largely resistant to replication stress.
Collapse
|
42
|
Allele-specific control of replication timing and genome organization during development. Genome Res 2018; 28:800-811. [PMID: 29735606 PMCID: PMC5991511 DOI: 10.1101/gr.232561.117] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 04/26/2018] [Indexed: 12/14/2022]
Abstract
DNA replication occurs in a defined temporal order known as the replication-timing (RT) program. RT is regulated during development in discrete chromosomal units, coordinated with transcriptional activity and 3D genome organization. Here, we derived distinct cell types from F1 hybrid musculus × castaneus mouse crosses and exploited the high single-nucleotide polymorphism (SNP) density to characterize allelic differences in RT (Repli-seq), genome organization (Hi-C and promoter-capture Hi-C), gene expression (total nuclear RNA-seq), and chromatin accessibility (ATAC-seq). We also present HARP, a new computational tool for sorting SNPs in phased genomes to efficiently measure allele-specific genome-wide data. Analysis of six different hybrid mESC clones with different genomes (C57BL/6, 129/sv, and CAST/Ei), parental configurations, and gender revealed significant RT asynchrony between alleles across ∼12% of the autosomal genome linked to subspecies genomes but not to parental origin, growth conditions, or gender. RT asynchrony in mESCs strongly correlated with changes in Hi-C compartments between alleles but not as strongly with SNP density, gene expression, imprinting, or chromatin accessibility. We then tracked mESC RT asynchronous regions during development by analyzing differentiated cell types, including extraembryonic endoderm stem (XEN) cells, four male and female primary mouse embryonic fibroblasts (MEFs), and neural precursor cells (NPCs) differentiated in vitro from mESCs with opposite parental configurations. We found that RT asynchrony and allelic discordance in Hi-C compartments seen in mESCs were largely lost in all differentiated cell types, accompanied by novel sites of allelic asynchrony at a considerably smaller proportion of the genome, suggesting that genome organization of homologs converges to similar folding patterns during cell fate commitment.
Collapse
|
43
|
Bacterial artificial chromosomes establish replication timing and sub-nuclear compartment de novo as extra-chromosomal vectors. Nucleic Acids Res 2018; 46:1810-1820. [PMID: 29294101 PMCID: PMC5829748 DOI: 10.1093/nar/gkx1265] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2017] [Revised: 11/27/2017] [Accepted: 12/06/2017] [Indexed: 12/11/2022] Open
Abstract
The role of DNA sequence in determining replication timing (RT) and chromatin higher order organization remains elusive. To address this question, we have developed an extra-chromosomal replication system (E-BACs) consisting of ∼200 kb human bacterial artificial chromosomes (BACs) modified with Epstein-Barr virus (EBV) stable segregation elements. E-BACs were stably maintained as autonomous mini-chromosomes in EBNA1-expressing HeLa or human induced pluripotent stem cells (hiPSCs) and established distinct RT patterns. An E-BAC harboring an early replicating chromosomal region replicated early during S phase, while E-BACs derived from RT transition regions (TTRs) and late replicating regions replicated in mid to late S phase. Analysis of E-BAC interactions with cellular chromatin (4C-seq) revealed that the early replicating E-BAC interacted broadly throughout the genome and preferentially with the early replicating compartment of the nucleus. In contrast, mid- to late-replicating E-BACs interacted with more specific late replicating chromosomal segments, some of which were shared between different E-BACs. Together, we describe a versatile system in which to study the structure and function of chromosomal segments that are stably maintained separately from the influence of cellular chromosome context.
Collapse
|
44
|
Single-cell replication profiling to measure stochastic variation in mammalian replication timing. Nat Commun 2018; 9:427. [PMID: 29382831 PMCID: PMC5789892 DOI: 10.1038/s41467-017-02800-w] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Accepted: 12/27/2017] [Indexed: 01/17/2023] Open
Abstract
Mammalian DNA replication is regulated via multi-replicon segments that replicate in a defined temporal order during S-phase. Further, early/late replication of RDs corresponds to active/inactive chromatin interaction compartments. Although replication origins are selected stochastically, variation in replication timing is poorly understood. Here we devise a strategy to measure variation in replication timing using DNA copy number in single mouse embryonic stem cells. We find that borders between replicated and unreplicated DNA are highly conserved between cells, demarcating active and inactive compartments of the nucleus. Fifty percent of replication events deviated from their average replication time by ± 15% of S phase. This degree of variation is similar between cells, between homologs within cells and between all domains genomewide, regardless of their replication timing. These results demonstrate that stochastic variation in replication timing is independent of elements that dictate timing or extrinsic environmental variation.
Collapse
|
45
|
Replication Domains: Genome Compartmentalization into Functional Replication Units. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1042:229-257. [DOI: 10.1007/978-981-10-6955-0_11] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
46
|
Abstract
Complete duplication of large metazoan chromosomes requires thousands of potential initiation sites, only a small fraction of which are selected in each cell cycle. Assembly of the replication machinery is highly conserved and tightly regulated during the cell cycle, but the sites of initiation are highly flexible, and their temporal order of firing is regulated at the level of large-scale multi-replicon domains. Importantly, the number of replication forks must be quickly adjusted in response to replication stress to prevent genome instability. Here we argue that large genomes are divided into domains for exactly this reason. Once established, domain structure abrogates the need for precise initiation sites and creates a scaffold for the evolution of other chromosome functions.
Collapse
|
47
|
Stability of patient-specific features of altered DNA replication timing in xenografts of primary human acute lymphoblastic leukemia. Exp Hematol 2017; 51:71-82.e3. [PMID: 28433605 DOI: 10.1016/j.exphem.2017.04.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Revised: 03/25/2017] [Accepted: 04/08/2017] [Indexed: 01/10/2023]
Abstract
Genome-wide DNA replication timing (RT) profiles reflect the global three-dimensional chromosome architecture of cells. They also provide a comprehensive and unique megabase-scale picture of cellular epigenetic state. Thus, normal differentiation involves reproducible changes in RT, and transformation generally perturbs these, although the potential effects of altered RT on the properties of transformed cells remain largely unknown. A major challenge to interrogating these issues in human acute lymphoid leukemia (ALL) is the low proliferative activity of most of the cells, which may be further reduced in cryopreserved samples and difficult to overcome in vitro. In contrast, the ability of many human ALL cell populations to expand when transplanted into highly immunodeficient mice is well documented. To examine the stability of DNA RT profiles of serially passaged xenografts of primary human B- and T-ALL cells, we first devised a method that circumvents the need for bromodeoxyuridine incorporation to distinguish early versus late S-phase cells. Using this and more standard protocols, we found consistently strong retention in xenografts of the original patient-specific RT features. Moreover, in a case in which genomic analyses indicated changing subclonal dynamics in serial passages, the RT profiles tracked concordantly. These results indicate that DNA RT is a relatively stable feature of human ALLs propagated in immunodeficient mice. In addition, they suggest the power of this approach for future interrogation of the origin and consequences of altered DNA RT in ALL.
Collapse
|
48
|
Spatio-temporal re-organization of replication foci accompanies replication domain consolidation during human pluripotent stem cell lineage specification. Cell Cycle 2016; 15:2464-75. [PMID: 27433885 DOI: 10.1080/15384101.2016.1203492] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
Lineage specification of both mouse and human pluripotent stem cells (PSCs) is accompanied by spatial consolidation of chromosome domains and temporal consolidation of their replication timing. Replication timing and chromatin organization are both established during G1 phase at the timing decision point (TDP). Here, we have developed live cell imaging tools to track spatio-temporal replication domain consolidation during differentiation. First, we demonstrate that the fluorescence ubiquitination cell cycle indicator (Fucci) system is incapable of demarcating G1/S or G2/M cell cycle transitions. Instead, we employ a combination of fluorescent PCNA to monitor S phase progression, cytokinesis to demarcate mitosis, and fluorescent nucleotides to label early and late replication foci and track their 3D organization into sub-nuclear chromatin compartments throughout all cell cycle transitions. We find that, as human PSCs differentiate, the length of S phase devoted to replication of spatially clustered replication foci increases, coincident with global compartmentalization of domains into temporally clustered blocks of chromatin. Importantly, re-localization and anchorage of domains was completed prior to the onset of S phase, even in the context of an abbreviated PSC G1 phase. This approach can also be employed to investigate cell fate transitions in single PSCs, which could be seen to differentiate preferentially from G1 phase. Together, our results establish real-time, live-cell imaging methods for tracking cell cycle transitions during human PSC differentiation that can be applied to study chromosome domain consolidation and other aspects of lineage specification.
Collapse
|
49
|
Replication timing and transcriptional control: beyond cause and effect-part III. Curr Opin Cell Biol 2016; 40:168-178. [PMID: 27115331 DOI: 10.1016/j.ceb.2016.03.022] [Citation(s) in RCA: 72] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Revised: 03/24/2016] [Accepted: 03/29/2016] [Indexed: 11/17/2022]
Abstract
DNA replication is essential for faithful transmission of genetic information and is intimately tied to chromosome structure and function. Genome duplication occurs in a defined temporal order known as the replication-timing (RT) program, which is regulated during the cell cycle and development in discrete units referred to as replication domains (RDs). RDs correspond to topologically-associating domains (TADs) and are spatio-temporally compartmentalized in the nucleus. While improvements in experimental tools have begun to reveal glimpses of causality, they have also unveiled complex context-dependent relationships that challenge long recognized correlations of RT to chromatin organization and gene regulation. In particular, RDs/TADs that switch RT during development march to the beat of a different drummer.
Collapse
|
50
|
Influence of ATM-Mediated DNA Damage Response on Genomic Variation in Human Induced Pluripotent Stem Cells. Stem Cells Dev 2016; 25:740-7. [PMID: 26935587 DOI: 10.1089/scd.2015.0393] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genome instability is a potential limitation to the research and therapeutic application of induced pluripotent stem cells (iPSCs). Observed genomic variations reflect the combined activities of DNA damage, cellular DNA damage response (DDR), and selection pressure in culture. To understand the contribution of DDR on the distribution of copy number variations (CNVs) in iPSCs, we mapped CNVs of iPSCs with mutations in the central DDR gene ATM onto genome organization landscapes defined by genome-wide replication timing profiles. We show that following reprogramming the early and late replicating genome is differentially affected by CNVs in ATM-deficient iPSCs relative to wild-type iPSCs. Specifically, the early replicating regions had increased CNV losses during retroviral (RV) reprogramming. This differential CNV distribution was not present after later passage or after episomal reprogramming. Comparison of different reprogramming methods in the setting of defective DDR reveals unique vulnerability of early replicating open chromatin to RV vectors.
Collapse
|