1
|
Saettone A, Ponce M, Nabeel-Shah S, Fillingham J. RACS: rapid analysis of ChIP-Seq data for contig based genomes. BMC Bioinformatics 2019; 20:533. [PMID: 31664892 PMCID: PMC6819487 DOI: 10.1186/s12859-019-3100-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 09/13/2019] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Chromatin immunoprecipitation coupled to next generation sequencing (ChIP-Seq) is a widely-used molecular method to investigate the function of chromatin-related proteins by identifying their associated DNA sequences on a genomic scale. ChIP-Seq generates large quantities of data that is difficult to process and analyze, particularly for organisms with a contig-based sequenced genomes that typically have minimal annotation on their associated set of genes other than their associated coordinates primarily predicted by gene finding programs. Poorly annotated genome sequence makes comprehensive analysis of ChIP-Seq data difficult and as such standardized analysis pipelines are lacking. RESULTS We present a one-stop computational pipeline, "Rapid Analysis of ChIP-Seq data" (RACS), that utilizes traditional High-Performance Computing (HPC) techniques in association with open source tools for processing and analyzing raw ChIP-Seq data. RACS is an open source computational pipeline available from any of the following repositories https://bitbucket.org/mjponce/RACS or https://gitrepos.scinet.utoronto.ca/public/?a=summary&p=RACS . RACS is particularly useful for ChIP-Seq in organisms with contig-based genomes that have poor gene annotation to aid protein function discovery.To test the performance and efficiency of RACS, we analyzed ChIP-Seq data previously published in a model organism Tetrahymena thermophila which has a contig-based genome. We assessed the generality of RACS by analyzing a previously published data set generated using the model organism Oxytricha trifallax, whose genome sequence is also contig-based with poor annotation. CONCLUSIONS The RACS computational pipeline presented in this report is an efficient and reliable tool to analyze genome-wide raw ChIP-Seq data generated in model organisms with poorly annotated contig-based genome sequence. Because RACS segregates the found read accumulations between genic and intergenic regions, it is particularly efficient for rapid downstream analyses of proteins involved in gene expression.
Collapse
Affiliation(s)
- Alejandro Saettone
- Department of Chemistry and Biology, Ryerson University, 350 Victoria St, Toronto, M5B 2K3 Canada
| | - Marcelo Ponce
- SciNet High Performance Computing Consortium, University of Toronto, 661 University Ave, Toronto, M5G 1M1 Canada
| | - Syed Nabeel-Shah
- Department of Molecular Genetics, University of Toronto, 1 King’s College Cir, Toronto, M5S 1A8 Canada
| | - Jeffrey Fillingham
- Department of Chemistry and Biology, Ryerson University, 350 Victoria St, Toronto, M5B 2K3 Canada
| |
Collapse
|
2
|
Abstract
The machinery required for the replication of eukaryotic chromosomal DNA is made up of proteins whose function, structure and main interaction partners are evolutionarily conserved. Several new cases have been reported recently, however, in which non-coding RNAs play additional and specialised roles in the initiation of eukaryotic DNA replication in different classes of organisms. These non-coding RNAs include Y RNAs in vertebrate somatic cells, 26T RNA in somatic macronuclei of the ciliate Tetrahymena, and G-rich RNA in the Epstein-Barr DNA tumour virus and its human host cells. Here, I will give an overview of the experimental evidence in favour of roles for these non-coding RNAs in the regulation of eukaryotic DNA replication, and compare and contrast their biosynthesis and mechanisms of action.
Collapse
|
3
|
Donti TR, Datta S, Sandoval PY, Kapler GM. Differential targeting of Tetrahymena ORC to ribosomal DNA and non-rDNA replication origins. EMBO J 2009; 28:223-33. [PMID: 19153611 DOI: 10.1038/emboj.2008.282] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2008] [Accepted: 12/02/2008] [Indexed: 11/09/2022] Open
Abstract
The Tetrahymena thermophila origin recognition complex (ORC) contains an integral RNA subunit, 26T RNA, which confers specificity to the amplified ribosomal DNA (rDNA) origin by base pairing with an essential cis-acting replication determinant--the type I element. Using a plasmid maintenance assay, we identified a 6.7 kb non-rDNA fragment containing two closely associated replicators, ARS1-A (0.8 kb) and ARS1-B (1.2 kb). Both replicators lack type I elements and hence complementarity to 26T RNA, suggesting that ORC is recruited to these sites by an RNA-independent mechanism. Consistent with this prediction, although ORC associated exclusively with origin sequences in the 21 kb rDNA minichromosome, the interaction between ORC and the non-rDNA ARS1 chromosome changed across the cell cycle. In G(2) phase, ORC bound to all tested sequences in a 60 kb interval spanning ARS1-A/B. Remarkably, ORC and Mcm6 associated with just the ARS1-A replicator in G(1) phase when pre-replicative complexes assemble. We propose that ORC is stochastically deposited onto newly replicated non-rDNA chromosomes and subsequently targeted to preferred initiation sites prior to the next S phase.
Collapse
Affiliation(s)
- Taraka R Donti
- Department of Molecular and Cellular Medicine, Texas A&M Health Science Center, College Station, TX, USA
| | | | | | | |
Collapse
|
4
|
Yakisich JS, Kapler GM. Deletion of the Tetrahymena thermophila rDNA replication fork barrier region disrupts macronuclear rDNA excision and creates a fragile site in the micronuclear genome. Nucleic Acids Res 2006; 34:620-34. [PMID: 16449202 PMCID: PMC1356531 DOI: 10.1093/nar/gkj466] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
During macronuclear development the Tetrahymena thermophila ribosomal RNA gene is excised from micronuclear chromosome 1 by site-specific cleavage at chromosome breakage sequence (Cbs) elements, rearranged into a ‘palindromic’ 21 kb minichromosome and extensively amplified. Gene amplification initiates from origins in the 5′ non-transcribed spacer, and forks moving toward the center of the palindrome arrest at a developmentally regulated replication fork barrier (RFB). The RFB is inactive during vegetative cell divisions, suggesting a role in the formation or amplification of macronuclear rDNA. Using micronuclear (germline) transformation, we show that the RFB region facilitates Cbs-mediated excision. Deletion of the RFB inhibits chromosome breakage in a sub-population of developing macronuclei and promotes alternative processing by a Cbs-independent mechanism. Remarkably, the RFB region prevents spontaneous breakage of chromosome 1 in the diploid micronucleus. Strains heterozygous for ΔRFB and wild-type rDNA lose the ΔRFB allele and distal left arm of chromosome 1 during vegetative propagation. The wild-type chromosome is subsequently fragmented near the rDNA locus, and both homologs are progressively eroded, suggesting that broken micronuclear chromosomes are not ‘healed’ by telomerase. Deletion of this 363 bp segment effectively creates a fragile site in the micronuclear genome, providing the first evidence for a non-telomere cis-acting determinant that functions to maintain the structural integrity of a mitotic eukaryotic chromosome.
Collapse
Affiliation(s)
| | - G. M. Kapler
- To whom correspondence should be addressed. Tel: +1 979 847 8690; Fax: +1 979 847 9481;
| |
Collapse
|
5
|
Morrison TL, Yakisich JS, Cassidy-Hanley D, Kapler GM. TIF1 Represses rDNA replication initiation, but promotes normal S phase progression and chromosome transmission in Tetrahymena. Mol Biol Cell 2005; 16:2624-35. [PMID: 15772155 PMCID: PMC1142411 DOI: 10.1091/mbc.e05-02-0107] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2005] [Accepted: 03/07/2005] [Indexed: 02/03/2023] Open
Abstract
The non-ORC protein, TIF1, recognizes sequences in the Tetrahymena thermophila ribosomal DNA (rDNA) minichromosome that are required for origin activation. We show here that TIF1 represses rDNA origin firing, but is required for proper macronuclear S phase progression and division. TIF1 mutants exhibit an elongated macronuclear S phase and diminished rate of DNA replication. Despite this, replication of the rDNA minichromosome initiates precociously. Because rDNA copy number is unaffected in the polyploid macronucleus, mechanisms that prevent reinitiation appear intact. Although mutants exit macronuclear S with a wild-type DNA content, division of the amitotic macronucleus is both delayed and abnormal. Nuclear defects are also observed in the diploid mitotic micronucleus, as TIF1 mutants lose a significant fraction of their micronuclear DNA. Hence, TIF1 is required for the propagation and subsequent transmission of germline chromosomes. The broad phenotypes associated with a TIF1-deficiency suggest that this origin binding protein is required globally for the proper execution and/or monitoring of key chromosomal events during S phase and possibly at later stages of the cell cycle. We propose that micro- and macronuclear defects result from exiting the respective nuclear S phases with physically compromised chromosomes.
Collapse
Affiliation(s)
- Tara L Morrison
- Department of Medical Biochemistry and Genetics, Texas A&M University System Health Science Center, College Station, TX 77843-1114, USA
| | | | | | | |
Collapse
|
6
|
Abstract
Developmentally regulated gene amplification serves to increase the number of templates for transcription, yielding greatly increased protein and/or RNA product for gene(s) at the amplified loci. It is observed with genes that are very actively transcribed and during narrow windows of developmental time where copious amounts of those particular gene products are required. Amplification results from repeated firing of origins at a few genomic loci, while the rest of the genome either does not replicate, or replicates to a lesser extent. As such, amplification is a striking exception to the once-and-only-once rule of DNA replication and may be informative as to that mechanism. Drosophila amplifies eggshell (chorion) genes in the follicle cells of the ovary to allow for rapid eggshell synthesis. Sciara amplifies multiple genes in larval salivary gland cells that encode proteins secreted in the saliva for the pupal case. Finally, Tetrahymena amplifies its rRNA genes several thousand-fold in the creation of the transcriptionally active macronucleus. Due to the ease of molecular and genetic analysis with these systems, the study of origin regulation has advanced rapidly. Comparisons reveal an evolutionarily conserved trans-regulatory apparatus and a similar organization of sequence-specific cis-regulatory replicator and origin elements. The studies indicate a regulatory role for chromatin structure and transcriptionally active genes near the origins.
Collapse
Affiliation(s)
- John Tower
- Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089-1340, USA.
| |
Collapse
|
7
|
Wickstead B, Ersfeld K, Gull K. The small chromosomes of Trypanosoma brucei involved in antigenic variation are constructed around repetitive palindromes. Genome Res 2004; 14:1014-24. [PMID: 15173109 PMCID: PMC419779 DOI: 10.1101/gr.2227704] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2003] [Accepted: 02/12/2004] [Indexed: 01/09/2023]
Abstract
Most eukaryotic genomes contain large regions of satellite DNA. These arrays are often associated with essential chromosomal functions, but remain largely absent from genome projects because of difficulties in cloning and sequence assembly. The numerous small chromosomes of the parasite Trypanosoma brucei fall into this category, yet are critical to understanding the genome because of their role in antigenic variation. Their relatively small size, however, makes them particularly amenable to physical mapping. We have produced fine-resolution maps of 17 complete minichromosomes and partial maps of two larger intermediate-sized chromosomes. This revealed a canonical structure shared by both chromosomal classes based around a large central core of 177-bp repeats. Around the core are variable-length genic regions, the lengths of which define chromosomal class. We show the core region to be a repetitive palindrome with a single inversion point common to all the chromosomes of both classes, suggesting a mechanism of genesis for these chromosomes. Moreover, palindromy appears to be a feature of (peri)centromeres in other species that can be easily overlooked. We propose that sequence inversion is one of the higher-order sequence motifs that confer chromosomal stability.
Collapse
Affiliation(s)
- Bill Wickstead
- Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, United Kingdom
| | | | | |
Collapse
|
8
|
Mohammad M, York RD, Hommel J, Kapler GM. Characterization of a novel origin recognition complex-like complex: implications for DNA recognition, cell cycle control, and locus-specific gene amplification. Mol Cell Biol 2003; 23:5005-17. [PMID: 12832485 PMCID: PMC162205 DOI: 10.1128/mcb.23.14.5005-5017.2003] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2003] [Revised: 03/14/2003] [Accepted: 04/17/2003] [Indexed: 11/20/2022] Open
Abstract
The origin recognition complex (ORC) plays a central role in eukaryotic DNA replication. Here we describe a unique ORC-like complex in Tetrahymena thermophila, TIF4, which bound in an ATP-dependent manner to sequences required for cell cycle-controlled replication and gene amplification (ribosomal DNA [rDNA] type I elements). TIF4's mode of DNA recognition was distinct from that of other characterized ORCs, as it bound exclusively to single-stranded DNA. In contrast to yeast ORCs, TIF4 DNA binding activity was cell cycle regulated and peaked during S phase, coincident with the redistribution of the Orc2-related subunit, p69, from the cytoplasm to the macronucleus. Origin-binding activity and nuclear p69 immunoreactivity were further regulated during development, where they distinguished replicating from nonreplicating nuclei. Both activities were lost from germ line micronuclei following the programmed arrest of micronuclear replication. Replicating macronuclei stained with Orc2 antibodies throughout development in wild-type cells but failed to do so in the amplification-defective rmm11 mutant. Collectively, these findings indicate that the regulation of TIF4 is intimately tied to the cell cycle and developmentally programmed replication cycles. They further implicate TIF4 in rDNA gene amplification. As type I elements interact with other sequence-specific single-strand breaks (in vitro and in vivo), the dynamic interplay of Orc-like (TIF4) and non-ORC-like proteins with this replication determinant may provide a novel mechanism for regulation.
Collapse
Affiliation(s)
- Mohammad Mohammad
- Department of Medical Biochemistry and Genetics, Texas A&M Health Science Center, College Station, Texas 77843-1114, USA
| | | | | | | |
Collapse
|
9
|
Abstract
Over the past decade, researchers have manipulated the unique biology of Tetrahymena thermophila to generate a premier experimental organism for functional genomic analysis. A diverse array of DNA transformation methods have spearheaded in vivo strategies for discovering and dissecting universal eukaryotic processes, such as telomere addition and chromatin remodeling. Compartmentalization of this protist's genome into two functionally distinct nuclei - the silent 'germline' micronucleus and the transcriptionally active macronucleus - provides a powerful means for controlling the expression of transgenes. Heterokaryons that silently harbor homozygous recessive mutations (including lethal ones) in the germline have been exploited. The coupling of forward and reverse genetic approaches with genomics-based methods for gene discovery presents a bright future for research in this rising model eukaryote.
Collapse
Affiliation(s)
- Aaron P Turkewitz
- Department of Molecular Genetics and Cell Biology, The University of Chicago, 920 E. 58th Street, Chicago, IL 60637, USA.
| | | | | |
Collapse
|
10
|
Saha S, Nicholson A, Kapler GM. Cloning and biochemical analysis of the tetrahymena origin binding protein TIF1: competitive DNA binding in vitro and in vivo to critical rDNA replication determinants. J Biol Chem 2001; 276:45417-26. [PMID: 11577092 DOI: 10.1074/jbc.m106162200] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Cis-acting type I elements regulate the initiation of DNA replication, replication fork movement, and transcription of the Tetrahymena thermophila rDNA minichromosome and are required for cell cycle-controlled replication and developmentally programmed gene amplification. Previous studies identified three in vitro single-stranded type I element binding activities that were proposed to play distinct roles in replication control. Here we describe the cloning of one of these genes, TIF1, and we provide evidence for its association with type I elements in vivo. Furthermore, we show that TIF1 interacts (in vitro and in vivo) with pause site elements (PSE), which co-localize with replication initiation and fork arrest sites, and are shown to be essential. The in vivo accessibility of PSE and type I elements to potassium permanganate suggests that origin regions are frequently unwound in native chromatin. TIF1 contains sequence similarity to the Solanum tuberosum single strand-specific transcription factor, p24, and a related Arabidopsis protein. Antisense inhibition studies suggest that TIF1 competes with other proteins for PSE and type I element binding. TIF1 displays a marked strand bias in vivo, discriminating between origin- and promoter-proximal type I elements. We propose that this bias selectively modulates the binding of a different subset of proteins to the respective regulatory elements.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Binding, Competitive
- Chromatin/chemistry
- Chromatin/metabolism
- Cloning, Molecular
- DNA/metabolism
- DNA, Complementary/metabolism
- DNA, Ribosomal/metabolism
- DNA-Binding Proteins/chemistry
- DNA-Binding Proteins/genetics
- Electrophoresis, Gel, Two-Dimensional
- Gene Deletion
- Mice
- Mice, Knockout
- Models, Genetic
- Molecular Sequence Data
- Oligonucleotides, Antisense/pharmacology
- Plasmids/metabolism
- Potassium Permanganate/pharmacology
- Promoter Regions, Genetic
- Protein Structure, Tertiary
- Protozoan Proteins
- Replication Origin
- Ribosomes/metabolism
- S100 Proteins/chemistry
- Sequence Homology, Amino Acid
- Tetrahymena/genetics
- Tetrahymena/metabolism
- Transcription, Genetic
- Ultraviolet Rays
Collapse
Affiliation(s)
- S Saha
- Department of Medical Biochemistry and Genetics, Texas A & M Health Science Center, College Station, Texas 77843-1114, USA
| | | | | |
Collapse
|
11
|
Brassinga AK, Marczynski GT. Replication intermediate analysis confirms that chromosomal replication origin initiates from an unusual intergenic region in Caulobacter crescentus. Nucleic Acids Res 2001; 29:4441-51. [PMID: 11691932 PMCID: PMC60194 DOI: 10.1093/nar/29.21.4441] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The alpha-proteobacterium Caulobacter crescentus possesses a developmental cell cycle that restricts chromosome replication to a stalked cell type. The proposed C.crescentus chromosome replication origin (Cori) lies between hemE and RP001, an unusual intergenic region not previously associated with bacterial replication origins, although a similar genomic arrangement is also present at the putative replication origin in the related bacterium Rickettsia prowazekii. The cloned Cori supports autonomous plasmid replication selectively in the stalked cell type implying that replication of the entire chromosome also initiates between hemE and RP001. To confirm this location, we applied the 2-D (N/N) agarose gel electrophoresis technique to resolve and identify chromosome replication intermediates throughout a 30 kb region spanning Cori. Replication initiation in Cori was uniquely characterized by an 'origin bubble and Y-arc' pattern and this observation was supported by simple replication fork 'Y-arc' patterns that characterized the regions flanking Cori. These replication forks originated bi-directionally from within Cori as determined by the fork direction assay. Therefore, chromosomal replication initiates from the unusual hemE/RP001 intergenic region that we propose represents a new class of replication origins.
Collapse
Affiliation(s)
- A K Brassinga
- Department of Microbiology and Immunology, Lyman-Duff Building, Room 506, McGill University, 3775 University Street, Montreal, Quebec H3A 2B4, Canada
| | | |
Collapse
|
12
|
Abstract
The mechanism for initiation of eukaryotic DNA replication is highly conserved: the proteins required to initiate replication, the sequence of events leading to initiation, and the regulation of initiation are remarkably similar throughout the eukaryotic kingdom. Nevertheless, there is a liberal attitude when it comes to selecting initiation sites. Differences appear to exist in the composition of replication origins and in the way proteins recognize these origins. In fact, some multicellular eukaryotes (the metazoans) can change the number and locations of initiation sites during animal development, revealing that selection of initiation sites depends on epigenetic as well as genetic parameters. Here we have attempted to summarize our understanding of this process, to identify the similarities and differences between single cell and multicellular eukaryotes, and to examine the extent to which origin recognition proteins and replication origins have been conserved among eukaryotes. Published 2000 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- J A Bogan
- National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | | | |
Collapse
|
13
|
Mohammad M, Saha S, Kapler GM. Three different proteins recognize a multifunctional determinant that controls replication initiation, fork arrest and transcription in Tetrahymena. Nucleic Acids Res 2000; 28:843-51. [PMID: 10637338 PMCID: PMC102555 DOI: 10.1093/nar/28.3.843] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Type I elements regulate the initiation of DNA replication, elongation of replication forks and transcription of the Tetrahymena thermophila rDNA minichromosome. Previous studies identified a 24 kDa protein, ssA-TIBF, which binds the A-rich strand of type I elements. Here we describe two additional type I element binding activities (native mol. wt approximately 65 and approximately 250 kDa) that interact with DNA via previously unidentified 32 and 110 kDa polypeptides. The 65 kDa activity was purified to homogeneity and consists of a homodimer of a 32 kDa polypeptide. In contrast to the other type I element binding factors, the 65 kDa activity partitions preferentially to the nuclear fraction during isolation. Levels of the 65 kDa activity increase dramatically in starved cells, raising the possibility that it might negatively regulate replication or transcription. By comparison, the other two binding activities were elevated slightly during macronuclear development, when the rDNA was undergoing DNA replication. Previous studies indicate that the initiation of rDNA replication is regulated by long range interactions between dispersed type I elements. Competitive DNA binding or cooperative protein-protein interactions between the factors described here may play a regulatory role in replication or expression of the rDNA minichromosome.
Collapse
Affiliation(s)
- M Mohammad
- Department of Medical Biochemistry and Genetics, Texas A&M Health Science Center, College Station, TX 77843-1114, USA
| | | | | |
Collapse
|