1
|
Syeda AH, Dimude JU, Skovgaard O, Rudolph CJ. Too Much of a Good Thing: How Ectopic DNA Replication Affects Bacterial Replication Dynamics. Front Microbiol 2020; 11:534. [PMID: 32351461 PMCID: PMC7174701 DOI: 10.3389/fmicb.2020.00534] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Accepted: 03/12/2020] [Indexed: 12/15/2022] Open
Abstract
Each cell division requires the complete and accurate duplication of the entire genome. In bacteria, the duplication process of the often-circular chromosomes is initiated at a single origin per chromosome, resulting in two replication forks that traverse the chromosome in opposite directions. DNA synthesis is completed once the two forks fuse in a region diametrically opposite the origin. In some bacteria, such as Escherichia coli, the region where forks fuse forms a specialized termination area. Polar replication fork pause sites flanking this area can pause the progression of replication forks, thereby allowing forks to enter but not to leave. Transcription of all required genes has to take place simultaneously with genome duplication. As both of these genome trafficking processes share the same template, conflicts are unavoidable. In this review, we focus on recent attempts to add additional origins into various ectopic chromosomal locations of the E. coli chromosome. As ectopic origins disturb the native replichore arrangements, the problems resulting from such perturbations can give important insights into how genome trafficking processes are coordinated and the problems that arise if this coordination is disturbed. The data from these studies highlight that head-on replication–transcription conflicts are indeed highly problematic and multiple repair pathways are required to restart replication forks arrested at obstacles. In addition, the existing data also demonstrate that the replication fork trap in E. coli imposes significant constraints to genome duplication if ectopic origins are active. We describe the current models of how replication fork fusion events can cause serious problems for genome duplication, as well as models of how such problems might be alleviated both by a number of repair pathways as well as the replication fork trap system. Considering the problems associated both with head-on replication-transcription conflicts as well as head-on replication fork fusion events might provide clues of how these genome trafficking issues have contributed to shape the distinct architecture of bacterial chromosomes.
Collapse
Affiliation(s)
- Aisha H Syeda
- Department of Biology, University of York, York, United Kingdom
| | - Juachi U Dimude
- Division of Biosciences, College of Health and Life Sciences, Brunel University London, Uxbridge, United Kingdom
| | - Ole Skovgaard
- Department of Science and Environment, Roskilde University, Roskilde, Denmark
| | - Christian J Rudolph
- Division of Biosciences, College of Health and Life Sciences, Brunel University London, Uxbridge, United Kingdom
| |
Collapse
|
2
|
Jin L, Gao H, Cao X, Han S, Xu L, Ma Z, Shang Y, Ma XX. Significance and roles of synonymous codon usage in the evolutionary process of Proteus. J Basic Microbiol 2020; 60:424-434. [PMID: 32162710 DOI: 10.1002/jobm.201900647] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 02/02/2020] [Accepted: 02/15/2020] [Indexed: 12/21/2022]
Abstract
Proteus spp. bacteria frequently serve as opportunistic pathogens that can infect many animals and show positive survival and existence in various natural environments. The evolutionary pattern of Proteus spp. is an unknown topic, which benefits understanding the different evolutionary dynamics for excellent bacterial adaptation to various environments. Here, the eight whole genomes of different Proteus species were analyzed for the interplay between nucleotide usage and synonymous codon usage. Although the orthologous average nucleotide identity and average nucleotide identity display the genetic diversity of these Proteus species at the genome level, the principal component analysis further shows that these species sustain the specific genetic niche at the aspect of synonymous codon usage patterns. Interestingly, although these Proteus species have A/T rich genes with underrepresented G (guanine) or C (cytosine) at the third codon positions and overrepresented A or T at these positions, some synonymous codons with A or T end are obviously suppressed in usage. The overall codon usage pattern reflected by the effective number of codons (ENC) has a significantly positive correlation with GC3 content (GC content at the third codon position), and ENC has a significantly negative correlation with the adaptation index for these species. These results suggest that the mutation pressure caused by nucleotide composition constraint serves as a dominant evolutionary dynamic driving evolutionary trend of Proteus spp., along with other selections related to natural selection, replication and fine-tune translation, and so on. Taken together, the analyses help to understand the evolutionary interplay between nucleotide and codon usage at the gene level of Proteus.
Collapse
Affiliation(s)
- Li Jin
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China.,State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, China
| | - Han Gao
- Department of College of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Xiaoan Cao
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, China
| | - Shengyi Han
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, China.,College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Long Xu
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, China.,College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, China
| | - Zhongren Ma
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
| | - Youjun Shang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, China
| | - Xiao-Xia Ma
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
| |
Collapse
|
3
|
Silva E, Ideker T. Transcriptional responses to DNA damage. DNA Repair (Amst) 2019; 79:40-49. [PMID: 31102970 PMCID: PMC6570417 DOI: 10.1016/j.dnarep.2019.05.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 03/20/2019] [Accepted: 05/04/2019] [Indexed: 12/31/2022]
Abstract
In response to the threat of DNA damage, cells exhibit a dramatic and multi-factorial response spanning from transcriptional changes to protein modifications, collectively known as the DNA damage response (DDR). Here, we review the literature surrounding the transcriptional response to DNA damage. We review differences in observed transcriptional responses as a function of cell cycle stage and emphasize the importance of experimental design in these transcriptional response studies. We additionally consider topics including structural challenges in the transcriptional response to DNA damage as well as the connection between transcription and protein abundance.
Collapse
Affiliation(s)
- Erica Silva
- Department of Medicine, University of California San Diego, La Jolla, California, USA; Biomedical Sciences Program, University of California San Diego, La Jolla, California, USA.
| | - Trey Ideker
- Department of Medicine, University of California San Diego, La Jolla, California, USA; Biomedical Sciences Program, University of California San Diego, La Jolla, California, USA; Program in Bioinformatics, University of California San Diego, La Jolla, California, USA; Department of Bioengineering, University of California San Diego, La Jolla, California, USA.
| |
Collapse
|
4
|
Iuchi S, Paulo JA. Lysine-specific demethylase 2A enhances binding of various nuclear factors to CpG-rich genomic DNAs by action of its CXXC-PHD domain. Sci Rep 2019; 9:5496. [PMID: 30940825 PMCID: PMC6445129 DOI: 10.1038/s41598-019-41896-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Accepted: 03/19/2019] [Indexed: 02/08/2023] Open
Abstract
The lysine-specific demethylase 2A gene (KDM2A) is ubiquitously expressed and its transcripts consist of several alternatively spliced forms, including KDM2A and the shorter form N782 that lacks the 3' end encoding F-box and LRR. KDM2A binds to numerous CpG-rich genomic loci and regulates various cellular activities; however, the mechanism of the pleiotropic function is unknown. Here, we identify the mechanism of KDM2A played by its CXXC-PHD domain. KDM2A is necessary for a rapid proliferation of post-natal keratinocytes while its 3' end eclipses the stimulatory effect. EGFP-N782 binds to chromatin together with the XRCC5/6 complex, and the CXXC-PHD domain regulates the CpG-rich IGFBPL1 promoter. In vitro, CXXC-PHD enhances binding of nuclear extract ORC3 to the CpG-rich promoter, but not to the AT-rich DIP2B promoter to which ORC3 binds constitutively. Furthermore, CXXC-PHD recruits 94 nuclear factors involved in replication, ribosome synthesis, and mitosis, including POLR1A to the IGFBPL1 promoter. This recruitment is unprecedented; however, the result suggests that these nuclear factors bind to their cognate loci, as substantiated by the result that CXXC-PHD recruits POLR1A to the rDNA promoter. We propose that CXXC-PHD promotes permissiveness for nuclear factors to interact, but involvement of the XRCC5/6 complex in the recruitment is undetermined.
Collapse
Affiliation(s)
- Shiro Iuchi
- Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA, 20115, USA.
| | - Joao A Paulo
- Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, MA, 20115, USA
| |
Collapse
|
5
|
Dao FY, Lv H, Wang F, Ding H. Recent Advances on the Machine Learning Methods in Identifying DNA Replication Origins in Eukaryotic Genomics. Front Genet 2018; 9:613. [PMID: 30619452 PMCID: PMC6295579 DOI: 10.3389/fgene.2018.00613] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2018] [Accepted: 11/21/2018] [Indexed: 01/01/2023] Open
Abstract
The initiate site of DNA replication is called origins of replication (ORI) which is regulated by a set of regulatory proteins and plays important roles in the basic biochemical process during cell growth and division in all living organisms. Therefore, the study of ORIs is essential for understanding the cell-division cycle and gene expression regulation so that scholars can develop a new strategy against genetic diseases by using the knowledge of DNA replication. Thus, the accurate identification of ORIs will provide key clues for DNA replication research and clinical medicine. Although, the conventional experiments could provide accurate results, they are time-consuming and cost ineffective. On the contrary, bioinformatics-based methods can overcome these shortcomings. Especially, with the emergence of DNA sequences in the post-genomic era, it is highly expected to develop high throughput tools to identify ORIs based on sequence information. In this review, we will summarize the current progress in computational prediction of eukaryotic ORIs including the collection of benchmark dataset, the application of machine learning-based techniques, the results obtained by these methods, and the construction of web servers. Finally, we gave the future perspectives on ORIs prediction. The review provided readers with a whole background of ORIs prediction based on machine learning methods, which will be helpful for researchers to study DNA replication in-depth and drug therapy of genetic defect.
Collapse
Affiliation(s)
- Fu-Ying Dao
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hao Lv
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Fang Wang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hui Ding
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
6
|
Manzo SG, Hartono SR, Sanz LA, Marinello J, De Biasi S, Cossarizza A, Capranico G, Chedin F. DNA Topoisomerase I differentially modulates R-loops across the human genome. Genome Biol 2018; 19:100. [PMID: 30060749 PMCID: PMC6066927 DOI: 10.1186/s13059-018-1478-1] [Citation(s) in RCA: 99] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Accepted: 07/10/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Co-transcriptional R-loops are abundant non-B DNA structures in mammalian genomes. DNA Topoisomerase I (Top1) is often thought to regulate R-loop formation owing to its ability to resolve both positive and negative supercoils. How Top1 regulates R-loop structures at a global level is unknown. RESULTS Here, we perform high-resolution strand-specific R-loop mapping in human cells depleted for Top1 and find that Top1 depletion results in both R-loop gains and losses at thousands of transcribed loci, delineating two distinct gene classes. R-loop gains are characteristic for long, highly transcribed, genes located in gene-poor regions anchored to Lamin B1 domains and in proximity to H3K9me3-marked heterochromatic patches. R-loop losses, by contrast, occur in gene-rich regions overlapping H3K27me3-marked active replication initiation regions. Interestingly, Top1 depletion coincides with a block of the cell cycle in G0/G1 phase and a trend towards replication delay. CONCLUSIONS Our findings reveal new properties of Top1 in regulating R-loop homeostasis in a context-dependent manner and suggest a potential role for Top1 in modulating the replication process via R-loop formation.
Collapse
Affiliation(s)
- Stefano G Manzo
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
- Present address: Division of Gene Regulation, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Stella R Hartono
- Department of Molecular and Cellular Biology and Genome Center, University of California, Davis, USA
| | - Lionel A Sanz
- Department of Molecular and Cellular Biology and Genome Center, University of California, Davis, USA
| | - Jessica Marinello
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Sara De Biasi
- Department of Medical and Surgical Sciences for Children and Adults, University of Modena and Reggio Emilia, Modena, Italy
| | - Andrea Cossarizza
- Department of Medical and Surgical Sciences for Children and Adults, University of Modena and Reggio Emilia, Modena, Italy
| | - Giovanni Capranico
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy.
| | - Frederic Chedin
- Department of Molecular and Cellular Biology and Genome Center, University of California, Davis, USA.
| |
Collapse
|
7
|
Singh VK, Krishnamachari A. Context based computational analysis and characterization of ARS consensus sequences (ACS) of Saccharomyces cerevisiae genome. GENOMICS DATA 2016; 9:130-6. [PMID: 27508123 PMCID: PMC4971157 DOI: 10.1016/j.gdata.2016.07.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2016] [Revised: 06/27/2016] [Accepted: 07/06/2016] [Indexed: 01/08/2023]
Abstract
Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC) denoted as ORC-ACS and non-replicating ACS sequences (nrACS), that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme.
Collapse
|
8
|
Chargaff’s Cluster Rule. Evol Bioinform Online 2016. [DOI: 10.1007/978-3-319-28755-3_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
9
|
Macheret M, Halazonetis TD. DNA replication stress as a hallmark of cancer. ANNUAL REVIEW OF PATHOLOGY-MECHANISMS OF DISEASE 2015; 10:425-48. [PMID: 25621662 DOI: 10.1146/annurev-pathol-012414-040424] [Citation(s) in RCA: 510] [Impact Index Per Article: 56.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Human cancers share properties referred to as hallmarks, among which sustained proliferation, escape from apoptosis, and genomic instability are the most pervasive. The sustained proliferation hallmark can be explained by mutations in oncogenes and tumor suppressors that regulate cell growth, whereas the escape from apoptosis hallmark can be explained by mutations in the TP53, ATM, or MDM2 genes. A model to explain the presence of the three hallmarks listed above, as well as the patterns of genomic instability observed in human cancers, proposes that the genes driving cell proliferation induce DNA replication stress, which, in turn, generates genomic instability and selects for escape from apoptosis. Here, we review the data that support this model, as well as the mechanisms by which oncogenes induce replication stress. Further, we argue that DNA replication stress should be considered as a hallmark of cancer because it likely drives cancer development and is very prevalent.
Collapse
Affiliation(s)
- Morgane Macheret
- Department of Molecular Biology, University of Geneva, 1205 Geneva, Switzerland;
| | | |
Collapse
|
10
|
Tombácz D, Csabai Z, Oláh P, Havelda Z, Sharon D, Snyder M, Boldogkői Z. Characterization of novel transcripts in pseudorabies virus. Viruses 2015; 7:2727-44. [PMID: 26008709 PMCID: PMC4452928 DOI: 10.3390/v7052727] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Revised: 05/14/2015] [Accepted: 05/18/2015] [Indexed: 01/20/2023] Open
Abstract
In this study we identified two 3'-coterminal RNA molecules in the pseudorabies virus. The highly abundant short transcript (CTO-S) proved to be encoded between the ul21 and ul22 genes in close vicinity of the replication origin (OriL) of the virus. The less abundant long RNA molecule (CTO-L) is a transcriptional readthrough product of the ul21 gene and overlaps OriL. These polyadenylated RNAs were characterized by ascertaining their nucleotide sequences with the Illumina HiScanSQ and Pacific Biosciences Real-Time (PacBio RSII) sequencing platforms and by analyzing their transcription kinetics through use of multi-time-point Real-Time RT-PCR and the PacBio RSII system. It emerged that transcription of the CTOs is fully dependent on the viral transactivator protein IE180 and CTO-S is not a microRNA precursor. We propose an interaction between the transcription and replication machineries at this genomic location, which might play an important role in the regulation of DNA synthesis.
Collapse
Affiliation(s)
- Dóra Tombácz
- These authors contributed equally to this work..
| | - Zsolt Csabai
- These authors contributed equally to this work..
| | - Péter Oláh
- These authors contributed equally to this work..
| | - Zoltán Havelda
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B. u. 4., Szeged H-6720, Hungary.
| | - Donald Sharon
- Agricultural Biotechnology Center, Institute for Plant Biotechnology, Plant Developmental Biology Group, Szent-Györgyi A. u. 4, Gödöllő H-2100, Hungary.
| | - Michael Snyder
- Agricultural Biotechnology Center, Institute for Plant Biotechnology, Plant Developmental Biology Group, Szent-Györgyi A. u. 4, Gödöllő H-2100, Hungary.
| | | |
Collapse
|
11
|
Scala G, Affinito O, Miele G, Monticelli A, Cocozza S. Evidence for evolutionary and nonevolutionary forces shaping the distribution of human genetic variants near transcription start sites. PLoS One 2014; 9:e114432. [PMID: 25474578 PMCID: PMC4256220 DOI: 10.1371/journal.pone.0114432] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Accepted: 11/09/2014] [Indexed: 11/19/2022] Open
Abstract
The regions surrounding transcription start sites (TSSs) of genes play a critical role in the regulation of gene expression. At the same time, current evidence indicates that these regions are particularly stressed by transcription-related mutagenic phenomena. In this work we performed a genome-wide analysis of the distribution of single nucleotide polymorphisms (SNPs) inside the 10 kb region flanking human TSSs by dividing SNPs into four classes according to their frequency (rare, two intermediate classes, and common). We found that, in this 10 kb region, the distribution of variants depends on their frequency and on their localization relative to the TSS. We found that the distribution of variants is generally different for TSSs located inside or outside of CpG islands. We found a significant relationship between the distribution of rare variants and nucleosome occupancy scores. Furthermore, our analysis suggests that evolutionary (purifying selection) and nonevolutionary (biased gene conversion) forces both play a role in determining the relative SNP frequency around TSSs. Finally, we analyzed the potential pathogenicity of each class of variant using the Combined Annotation Dependent Depletion score. In conclusion, this study provides a novel and detailed view of the distribution of genomic variants around TSSs, providing insight into the forces that instigate and maintain variability in such critical regions.
Collapse
Affiliation(s)
- Giovanni Scala
- Gruppo Interdipartimentale di Bioinformatica e Biologia Computazionale, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Dipartimento di Fisica, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Napoli, Naples, Italy
- * E-mail:
| | - Ornella Affinito
- Gruppo Interdipartimentale di Bioinformatica e Biologia Computazionale, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Istituto di Endocrinologia ed Oncologia Sperimentale (IEOS), CNR, Naples, Italy
| | - Gennaro Miele
- Gruppo Interdipartimentale di Bioinformatica e Biologia Computazionale, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Dipartimento di Fisica, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Napoli, Naples, Italy
| | - Antonella Monticelli
- Gruppo Interdipartimentale di Bioinformatica e Biologia Computazionale, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Istituto di Endocrinologia ed Oncologia Sperimentale (IEOS), CNR, Naples, Italy
| | - Sergio Cocozza
- Gruppo Interdipartimentale di Bioinformatica e Biologia Computazionale, Università degli Studi di Napoli “Federico II”, Naples, Italy
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università degli Studi di Napoli “Federico II”, Naples, Italy
| |
Collapse
|
12
|
Li WC, Zhong ZJ, Zhu PP, Deng EZ, Ding H, Chen W, Lin H. Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes. Front Microbiol 2014; 5:574. [PMID: 25477864 PMCID: PMC4235382 DOI: 10.3389/fmicb.2014.00574] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2014] [Accepted: 10/11/2014] [Indexed: 12/26/2022] Open
Abstract
DNA replication is a highly precise process that is initiated from origins of replication (ORIs) and is regulated by a set of regulatory proteins. The mining of DNA sequence information will be not only beneficial for understanding the regulatory mechanism of replication initiation but also for accurately identifying ORIs. In this study, the GC profile and GC skew were calculated to analyze the compositional bias in the Saccharomyces cerevisiae genome. We found that the GC profile in the region of ORIs is significantly lower than that in the flanking regions. By calculating the information redundancy, an estimation of the correlation of nucleotides, we found that the intensity of adjoining correlation in ORIs is dramatically higher than that in flanking regions. Furthermore, the relationships between ORIs and nucleosomes as well as transcription start sites were investigated. Results showed that ORIs are usually not occupied by nucleosomes. Finally, we calculated the distribution of ORIs in yeast chromosomes and found that most ORIs are in transcription terminal regions. We hope that these results will contribute to the identification of ORIs and the study of DNA replication mechanisms.
Collapse
Affiliation(s)
- Wen-Chao Li
- Key Laboratory for Neuro-Information of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China Chengdu, China
| | - Zhe-Jin Zhong
- Key Laboratory for Neuro-Information of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China Chengdu, China
| | - Pan-Pan Zhu
- Key Laboratory for Neuro-Information of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China Chengdu, China
| | - En-Ze Deng
- Key Laboratory for Neuro-Information of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China Chengdu, China
| | - Hui Ding
- Key Laboratory for Neuro-Information of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China Chengdu, China
| | - Wei Chen
- Department of Physics, School of Sciences and Center for Genomics and Computational Biology, Hebei United University Tangshan, China
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China Chengdu, China
| |
Collapse
|
13
|
Abstract
The MYC oncogene is a multifunctional protein that is aberrantly expressed in a significant fraction of tumors from diverse tissue origins. Because of its multifunctional nature, it has been difficult to delineate the exact contributions of MYC's diverse roles to tumorigenesis. Here, we review the normal role of MYC in regulating DNA replication as well as its ability to generate DNA replication stress when overexpressed. Finally, we discuss the possible mechanisms by which replication stress induced by aberrant MYC expression could contribute to genomic instability and cancer.
Collapse
Affiliation(s)
| | - Jean Gautier
- Institute for Cancer Genetics, Columbia University, New York, New York 10032 Department of Genetics and Development, Columbia University, New York, New York 10032
| |
Collapse
|
14
|
Temporal and spatial regulation of eukaryotic DNA replication: From regulated initiation to genome-scale timing program. Semin Cell Dev Biol 2014; 30:110-20. [DOI: 10.1016/j.semcdb.2014.04.014] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2014] [Accepted: 04/04/2014] [Indexed: 11/23/2022]
|
15
|
The spatiotemporal program of DNA replication is associated with specific combinations of chromatin marks in human cells. PLoS Genet 2014; 10:e1004282. [PMID: 24785686 PMCID: PMC4006723 DOI: 10.1371/journal.pgen.1004282] [Citation(s) in RCA: 95] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2013] [Accepted: 02/18/2014] [Indexed: 11/19/2022] Open
Abstract
The duplication of mammalian genomes is under the control of a spatiotemporal program that orchestrates the positioning and the timing of firing of replication origins. The molecular mechanisms coordinating the activation of about predicted origins remain poorly understood, partly due to the intrinsic rarity of replication bubbles, making it difficult to purify short nascent strands (SNS). The precise identification of origins based on the high-throughput sequencing of SNS constitutes a new methodological challenge. We propose a new statistical method with a controlled resolution, adapted to the detection of replication origins from SNS data. We detected an average of 80,000 replication origins in different cell lines. To evaluate the consistency between different protocols, we compared SNS detections with bubble trapping detections. This comparison demonstrated a good agreement between genome-wide methods, with 65% of SNS-detected origins validated by bubble trapping, and 44% of bubble trapping origins validated by SNS origins, when compared at the same resolution. We investigated the interplay between the spatial and the temporal programs of replication at fine scales. We show that most of the origins detected in regions replicated in early S phase are shared by all the cell lines investigated whereas cell-type-specific origins tend to be replicated in late S phase. We shed a new light on the key role of CpG islands, by showing that 80% of the origins associated with CGIs are constitutive. Our results further show that at least 76% of CGIs are origins of replication. The analysis of associations with chromatin marks at different timing of cell division revealed new potential epigenetic regulators driving the spatiotemporal activity of replication origins. We highlight the potential role of H4K20me1 and H3K27me3, the coupling of which is correlated with increased efficiency of replication origins, clearly identifying those marks as potential key regulators of replication origins. Replication is the mechanism by which genomes are duplicated into two exact copies. Genomic stability is under the control of a spatiotemporal program that orchestrates both the positioning and the timing of firing of about 50,000 replication starting points, also called replication origins. Replication bubbles found at origins have been very difficult to map due to their short lifespan. Moreover, with the flood of data characterizing new sequencing technologies, the precise statistical analysis of replication data has become an additional challenge. We propose a new method to map replication origins on the human genome, and we assess the reliability of our finding using experimental validation and comparison with origins maps obtained by bubble trapping. This fine mapping then allowed us to identify potential regulators of the replication dynamics. Our study highlights the key role of CpG Islands and identifies new potential epigenetic regulators (methylation of lysine 4 on histone H4, and tri-methylation of lysine 27 on histone H3) whose coupling is correlated with an increase in the efficiency of replication origins, suggesting those marks as potential key regulators of replication. Overall, our study defines new potentially important pathways that might regulate the sequential firing of origins during genome duplication.
Collapse
|
16
|
Abstract
Evolutionary selection for optimal genome preservation, replication, and expression should yield similar chromosome organizations in any type of cells. And yet, the chromosome organization is surprisingly different between eukaryotes and prokaryotes. The nuclear versus cytoplasmic accommodation of genetic material accounts for the distinct eukaryotic and prokaryotic modes of genome evolution, but it falls short of explaining the differences in the chromosome organization. I propose that the two distinct ways to organize chromosomes are driven by the differences between the global-consecutive chromosome cycle of eukaryotes and the local-concurrent chromosome cycle of prokaryotes. Specifically, progressive chromosome segregation in prokaryotes demands a single duplicon per chromosome, while other "precarious" features of the prokaryotic chromosomes can be viewed as compensations for this severe restriction.
Collapse
|
17
|
Evertts AG, Coller HA. Back to the origin: reconsidering replication, transcription, epigenetics, and cell cycle control. Genes Cancer 2013; 3:678-96. [PMID: 23634256 DOI: 10.1177/1947601912474891] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
In bacteria, replication is a carefully orchestrated event that unfolds the same way for each bacterium and each cell division. The process of DNA replication in bacteria optimizes cell growth and coordinates high levels of simultaneous replication and transcription. In metazoans, the organization of replication is more enigmatic. The lack of a specific sequence that defines origins of replication has, until recently, severely limited our ability to define the organizing principles of DNA replication. This question is of particular importance as emerging data suggest that replication stress is an important contributor to inherited genetic damage and the genomic instability in tumors. We consider here the replication program in several different organisms including recent genome-wide analyses of replication origins in humans. We review recent studies on the role of cytosine methylation in replication origins, the role of transcriptional looping and gene gating in DNA replication, and the role of chromatin's 3-dimensional structure in DNA replication. We use these new findings to consider several questions surrounding DNA replication in metazoans: How are origins selected? What is the relationship between replication and transcription? How do checkpoints inhibit origin firing? Why are there early and late firing origins? We then discuss whether oncogenes promote cancer through a role in DNA replication and whether errors in DNA replication are important contributors to the genomic alterations and gene fusion events observed in cancer. We conclude with some important areas for future experimentation.
Collapse
|
18
|
Takács IF, Tombácz D, Berta B, Prazsák I, Póka N, Boldogkői Z. The ICP22 protein selectively modifies the transcription of different kinetic classes of pseudorabies virus genes. BMC Mol Biol 2013; 14:2. [PMID: 23360468 PMCID: PMC3599583 DOI: 10.1186/1471-2199-14-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2012] [Accepted: 01/24/2013] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Pseudorabies virus (PRV), an alpha-herpesvirus of swine, is a widely used model organism in investigations of the molecular pathomechanisms of the herpesviruses. This work is the continuation of our earlier studies, in which we investigated the effect of the abrogation of gene function on the viral transcriptome by knocking out PRV genes playing roles in the coordination of global gene expression of the virus. In this study, we deleted the us1 gene encoding the ICP22, an important viral regulatory protein, and analyzed the changes in the expression of other PRV genes. RESULTS A multi-timepoint real-time RT-PCR technique was applied to evaluate the impact of deletion of the PRV us1 gene on the overall transcription kinetics of viral genes. The mutation proved to exert a differential effect on the distinct kinetic classes of PRV genes at the various stages of lytic infection. In the us1 gene-deleted virus, all the kinetic classes of the genes were significantly down-regulated in the first hour of infection. After 2 to 6 h of infection, the late genes were severely suppressed, whereas the early genes were unaffected. In the late stage of infection, the early genes were selectively up-regulated. In the mutant virus, the transcription of the ie180 gene, the major coordinator of PRV gene expression, correlated closely with the transcription of other viral genes, a situation which was not found in the wild-type (wt) virus. A 4-h delay was observed in the commencement of DNA replication in the mutant virus as compared with the wt virus. The rate of transcription from a gene normalized to the relative copy number of the viral genome was observed to decline drastically following the initiation of DNA replication in both the wt and mutant backgrounds. Finally, the switch between the expressions of the early and late genes was demonstrated not to be controlled by DNA replication, as is widely believed, since the switch preceded the DNA replication. CONCLUSIONS Our results show a strong dependence of PRV gene expression on the presence of functional us1 gene. ICP22 is shown to exert a differential effect on the distinct kinetic classes of PRV genes and to disrupt the close correlation between the transcription kinetics of ie180 and other PRV transcripts. Furthermore, DNA replication exerts a severe constraint on the viral transcription.
Collapse
Affiliation(s)
- Irma F Takács
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Somogyi B, st, 4, Szeged, H-6720, Hungary
| | | | | | | | | | | |
Collapse
|
19
|
Arakawa K, Tomita M. Measures of compositional strand bias related to replication machinery and its applications. Curr Genomics 2012; 13:4-15. [PMID: 22942671 PMCID: PMC3269016 DOI: 10.2174/138920212799034749] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2011] [Revised: 09/10/2011] [Accepted: 09/20/2011] [Indexed: 11/22/2022] Open
Abstract
The compositional asymmetry of complementary bases in nucleotide sequences implies the existence of a mutational or selectional bias in the two strands of the DNA duplex, which is commonly shaped by strand-specific mechanisms in transcription or replication. Such strand bias in genomes, frequently visualized by GC skew graphs, is used for the computational prediction of transcription start sites and replication origins, as well as for comparative evolutionary genomics studies. The use of measures of compositional strand bias in order to quantify the degree of strand asymmetry is crucial, as it is the basis for determining the applicability of compositional analysis and comparing the strength of the mutational bias in different biological machineries in various species. Here, we review the measures of strand bias that have been proposed to date, including the ∆GC skew, the B1 index, the predictability score of linear discriminant analysis for gene orientation, the signal-to-noise ratio of the oligonucleotide bias, and the GC skew index. These measures have been predominantly designed for and applied to the analysis of replication-related mutational processes in prokaryotes, but we also give research examples in eukaryotes.
Collapse
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Fujisawa 252-8520, Japan
| | | |
Collapse
|
20
|
Lin YL, Pasero P. Interference between DNA replication and transcription as a cause of genomic instability. Curr Genomics 2012; 13:65-73. [PMID: 22942676 PMCID: PMC3269018 DOI: 10.2174/138920212799034767] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2011] [Revised: 10/03/2011] [Accepted: 10/06/2011] [Indexed: 11/22/2022] Open
Abstract
Replication and transcription are key aspects of DNA metabolism that take place on the same template and potentially interfere with each other. Conflicts between these two activities include head-on or co-directional collisions between DNA and RNA polymerases, which can lead to the formation of DNA breaks and chromosome rearrangements. To avoid these deleterious consequences and prevent genomic instability, cells have evolved multiple mechanisms preventing replication forks from colliding with the transcription machinery. Yet, recent reports indicate that interference between replication and transcription is not limited to physical interactions between polymerases and that other cotranscriptional processes can interfere with DNA replication. These include DNA-RNA hybrids that assemble behind elongating RNA polymerases, impede fork progression and promote homologous recombination. Here, we discuss recent evidence indicating that R-loops represent a major source of genomic instability in all organisms, from bacteria to human, and are potentially implicated in cancer development.
Collapse
Affiliation(s)
- Yea-Lih Lin
- Institute of Human Genetics, CNRS-UPR1142, Montpellier, France
| | | |
Collapse
|
21
|
Baker A, Julienne H, Chen CL, Audit B, d'Aubenton-Carafa Y, Thermes C, Arneodo A. Linking the DNA strand asymmetry to the spatio-temporal replication program. I. About the role of the replication fork polarity in genome evolution. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2012; 35:92. [PMID: 23001787 DOI: 10.1140/epje/i2012-12092-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2012] [Revised: 08/08/2012] [Accepted: 08/21/2012] [Indexed: 06/01/2023]
Abstract
Two key cellular processes, namely transcription and replication, require the opening of the DNA double helix and act differently on the two DNA strands, generating different mutational patterns (mutational asymmetry) that may result, after long evolutionary time, in different nucleotide compositions on the two DNA strands (compositional asymmetry). We elaborate on the simplest model of neutral substitution rates that takes into account the strand asymmetries generated by the transcription and replication processes. Using perturbation theory, we then solve the time evolution of the DNA composition under strand-asymmetric substitution rates. In our minimal model, the compositional and substitutional asymmetries are predicted to decompose into a transcription- and a replication-associated components. The transcription-associated asymmetry increases in magnitude with transcription rate and changes sign with gene orientation while the replication-associated asymmetry is proportional to the replication fork polarity. These results are confirmed experimentally in the human genome, using substitution rates obtained by aligning the human and chimpanzee genomes using macaca and orangutan as outgroups, and replication fork polarity determined in the HeLa cell line as estimated from the derivative of the mean replication timing. When further investigating the dynamics of compositional skew evolution, we show that it is not at equilibrium yet and that its evolution is an extremely slow process with characteristic time scales of several hundred Myrs.
Collapse
Affiliation(s)
- A Baker
- Université de Lyon, Lyon, France
| | | | | | | | | | | | | |
Collapse
|
22
|
Boldogköi Z. Transcriptional interference networks coordinate the expression of functionally related genes clustered in the same genomic loci. Front Genet 2012; 3:122. [PMID: 22783276 PMCID: PMC3389743 DOI: 10.3389/fgene.2012.00122] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2012] [Accepted: 06/15/2012] [Indexed: 11/25/2022] Open
Abstract
The regulation of gene expression is essential for normal functioning of biological systems in every form of life. Gene expression is primarily controlled at the level of transcription, especially at the phase of initiation. Non-coding RNAs are one of the major players at every level of genetic regulation, including the control of chromatin organization, transcription, various post-transcriptional processes, and translation. In this study, the Transcriptional Interference Network (TIN) hypothesis was put forward in an attempt to explain the global expression of antisense RNAs and the overall occurrence of tandem gene clusters in the genomes of various biological systems ranging from viruses to mammalian cells. The TIN hypothesis suggests the existence of a novel layer of genetic regulation, based on the interactions between the transcriptional machineries of neighboring genes at their overlapping regions, which are assumed to play a fundamental role in coordinating gene expression within a cluster of functionally linked genes. It is claimed that the transcriptional overlaps between adjacent genes are much more widespread in genomes than is thought today. The Waterfall model of the TIN hypothesis postulates a unidirectional effect of upstream genes on the transcription of downstream genes within a cluster of tandemly arrayed genes, while the Seesaw model proposes a mutual interdependence of gene expression between the oppositely oriented genes. The TIN represents an auto-regulatory system with an exquisitely timed and highly synchronized cascade of gene expression in functionally linked genes located in close physical proximity to each other. In this study, we focused on herpesviruses. The reason for this lies in the compressed nature of viral genes, which allows a tight regulation and an easier investigation of the transcriptional interactions between genes. However, I believe that the same or similar principles can be applied to cellular organisms too.
Collapse
Affiliation(s)
- Zsolt Boldogköi
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, Hungary
| |
Collapse
|
23
|
Chen W, Feng P, Lin H. Prediction of replication origins by calculating DNA structural properties. FEBS Lett 2012; 586:934-8. [PMID: 22449982 DOI: 10.1016/j.febslet.2012.02.034] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2011] [Revised: 02/21/2012] [Accepted: 02/21/2012] [Indexed: 11/17/2022]
Abstract
In this study, we introduced two DNA structural characteristics, namely, bendability and hydroxyl radical cleavage intensity to analyze origin of replication (ORI) in the Saccharomyces cerevisiae genome. We found that both DNA bendability and cleavage intensity in core replication regions were significantly lower than in the linker regions. By using these two DNA structural characteristics, we developed a computational model for ORI prediction and evaluated the model in a benchmark dataset. The predictive performance of the jackknife cross-validation indicates that DNA bendability and cleavage intensity have the ability to describe core replication regions and our model is effective in ORI prediction.
Collapse
Affiliation(s)
- Wei Chen
- Department of Physics, School of Sciences, Hebei United University, Tangshan, China.
| | | | | |
Collapse
|
24
|
Koester B, Rea TJ, Templeton AR, Szalay AS, Sing CF. Long-range autocorrelations of CpG islands in the human genome. PLoS One 2012; 7:e29889. [PMID: 22253817 PMCID: PMC3256200 DOI: 10.1371/journal.pone.0029889] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2011] [Accepted: 12/07/2011] [Indexed: 01/24/2023] Open
Abstract
In this paper, we use a statistical estimator developed in astrophysics to study the distribution and organization of features of the human genome. Using the human reference sequence we quantify the global distribution of CpG islands (CGI) in each chromosome and demonstrate that the organization of the CGI across a chromosome is non-random, exhibits surprisingly long range correlations (10 Mb) and varies significantly among chromosomes. These correlations of CGI summarize functional properties of the genome that are not captured when considering variation in any particular separate (and local) feature. The demonstration of the proposed methods to quantify the organization of CGI in the human genome forms the basis of future studies. The most illuminating of these will assess the potential impact on phenotypic variation of inter-individual variation in the organization of the functional features of the genome within and among chromosomes, and among individuals for particular chromosomes.
Collapse
Affiliation(s)
- Benjamin Koester
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Thomas J. Rea
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Alan R. Templeton
- Department of Biology, Washington University, St Louis, Missouri, United States of America
| | - Alexander S. Szalay
- Department of Physics and Astronomy, Center for Astrophysical Sciences, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Charles F. Sing
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
25
|
Marsolier-Kergoat MC, Goldar A. DNA replication induces compositional biases in yeast. Mol Biol Evol 2011; 29:893-904. [PMID: 21948086 DOI: 10.1093/molbev/msr240] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Asymmetries intrinsic to the process of DNA replication are expected to cause differences in the substitution patterns of the leading and the lagging strands and to induce compositional biases. These biases have been detected in the majority of eubacterial genomes but rarely in eukaryotes. Only in the human genome, the activity of a minority of replication origins seems to generate compositional biases. In this work, we provide evidence for replication-associated GC and TA skews in the genomes of two yeast species, Saccharomyces cerevisiae and Kluyveromyces lactis, whereas the data for the Schizosaccharomyces pombe genome are less conclusive. In contrast with the genomes of Homo sapiens and of the majority of eubacteria, the leading strand is enriched in cytosine and adenine in both S. cerevisiae and K. lactis. We observed significant variations across the interorigin intervals of several substitution rates in the S. cerevisiae lineage since its divergence from S. paradoxus. We also found that the S. cerevisiae genome is far from compositional equilibrium and that its present compositional biases are due to substitution rates operating before its divergence from S. paradoxus. Finally, we observed that replication and transcription tend to be cooriented in the S. cerevisiae genome, especially for genes encoding subunits of protein complexes. Taken together, our results suggest that replication-related compositional biases may be a feature of many eukaryotic genomes despite the stochastic nature of the firing of replication origins in these genomes.
Collapse
|
26
|
Chen CL, Duquenne L, Audit B, Guilbaud G, Rappailles A, Baker A, Huvet M, d'Aubenton-Carafa Y, Hyrien O, Arneodo A, Thermes C. Replication-associated mutational asymmetry in the human genome. Mol Biol Evol 2011; 28:2327-37. [PMID: 21368316 DOI: 10.1093/molbev/msr056] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
During evolution, mutations occur at rates that can differ between the two DNA strands. In the human genome, nucleotide substitutions occur at different rates on the transcribed and non-transcribed strands that may result from transcription-coupled repair. These mutational asymmetries generate transcription-associated compositional skews. To date, the existence of such asymmetries associated with replication has not yet been established. Here, we compute the nucleotide substitution matrices around replication initiation zones identified as sharp peaks in replication timing profiles and associated with abrupt jumps in the compositional skew profile. We show that the substitution matrices computed in these regions fully explain the jumps in the compositional skew profile when crossing initiation zones. In intergenic regions, we observe mutational asymmetries measured as differences between complementary substitution rates; their sign changes when crossing initiation zones. These mutational asymmetries are unlikely to result from cryptic transcription but can be explained by a model based on replication errors and strand-biased repair. In transcribed regions, mutational asymmetries associated with replication superimpose on the previously described mutational asymmetries associated with transcription. We separate the substitution asymmetries associated with both mechanisms, which allows us to determine for the first time in eukaryotes, the mutational asymmetries associated with replication and to reevaluate those associated with transcription. Replication-associated mutational asymmetry may result from unequal rates of complementary base misincorporation by the DNA polymerases coupled with DNA mismatch repair (MMR) acting with different efficiencies on the leading and lagging strands. Replication, acting in germ line cells during long evolutionary times, contributed equally with transcription to produce the present abrupt jumps in the compositional skew. These results demonstrate that DNA replication is one of the major processes that shape human genome composition.
Collapse
Affiliation(s)
- Chun-Long Chen
- Centre de Génétique Moléculaire, Centre National de la Recherche Scientifique (CNRS), Gif-sur-Yvette, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Weber CC, Hurst LD. Intronic AT skew is a defendable proxy for germline transcription but does not predict crossing-over or protein evolution rates in Drosophila melanogaster. J Mol Evol 2010; 71:415-26. [PMID: 20938653 DOI: 10.1007/s00239-010-9395-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2010] [Accepted: 09/17/2010] [Indexed: 01/28/2023]
Abstract
Recent evidence suggests that germline transcription may affect both protein evolutionary rates, possibly mediated by repair processes, and recombination rates, possibly mediated by chromatin and epigenetic modification. Here, we test these propositions in Drosophila melanogaster. The challenge for such analyses is to provide defendable measures of germline gene expression. Intronic AT skew is a good candidate measure as it is thought to be a consequence, at least in part, of transcription-coupled repair. Prior evidence suggests that intronic AT skew in D. melanogaster is not affected by proximity to intron extremities and differs between transcribed DNA and flanking sequence. We now also establish that intronic AT skew is a defendable proxy for germline expression as (a) it is more similar than expected by chance between introns of the same gene (which is not accounted for by physical proximity), (b) is correlated with male germline expression, and (c) is more pronounced in broadly expressed genes. Furthermore, (d) a trend for intronic skew to differ between 3' and 5' ends of genes is particular to broadly expressed genes. Finally, (e) controlling for physical distance, introns of proximate genes are most different in skew if they have different tissue specificity. We find that intronic AT skew, employed as a proxy for germline transcription, correlates neither with recombination rates nor with the rate of protein evolution. We conclude that there is no prima facie evidence that germline expression modulates recombination rates or monotonically affects protein evolution rates in D. melanogaster.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Bath, UK
| | | |
Collapse
|
28
|
Eukaryotic DNA replication origins: many choices for appropriate answers. Nat Rev Mol Cell Biol 2010; 11:728-38. [DOI: 10.1038/nrm2976] [Citation(s) in RCA: 310] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
29
|
Abstract
Mechanisms regulating where and when eukaryotic DNA replication initiates remain a mystery. Recently, genome-scale methods have been brought to bear on this problem. The identification of replication origins and their associated proteins in yeasts is a well-integrated investigative tool, but corresponding data sets from multicellular organisms are scarce. By contrast, standardized protocols for evaluating replication timing have generated informative data sets for most eukaryotic systems. Here, I summarize the genome-scale methods that are most frequently used to analyse replication in eukaryotes, the kinds of questions each method can address and the technical hurdles that must be overcome to gain a complete understanding of the nature of eukaryotic replication origins.
Collapse
|
30
|
Abstract
Transcribed regions in the human genome differ from adjacent intergenic regions in transposable element density, crossover rates, and asymmetric substitution and sequence composition patterns. We tested whether these differences reflect selection or are instead a byproduct of germline transcription, using publicly available gene expression data from a variety of germline and somatic tissues. Crossover rate shows a strong negative correlation with gene expression in meiotic tissues, suggesting that crossover is inhibited by transcription. Strand-biased composition (G+T content) and A → G versus T → C substitution asymmetry are both positively correlated with germline gene expression. We find no evidence for a strand bias in allele frequency data, implying that the substitution asymmetry reflects a mutation rather than a fixation bias. The density of transposable elements is positively correlated with germline expression, suggesting that such elements preferentially insert into regions that are actively transcribed. For each of the features examined, our analyses favor a nonselective explanation for the observed trends and point to the role of germline gene expression in shaping the mammalian genome.
Collapse
Affiliation(s)
- Graham McVicker
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | | |
Collapse
|
31
|
Maric C, Prioleau MN. Interplay between DNA replication and gene expression: a harmonious coexistence. Curr Opin Cell Biol 2010; 22:277-83. [PMID: 20363609 DOI: 10.1016/j.ceb.2010.03.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2010] [Revised: 03/11/2010] [Accepted: 03/12/2010] [Indexed: 01/01/2023]
Abstract
Multicellular organisms have evolved highly sophisticated machinery to that their genomes are accurately duplicated and that the various gene expression programs are established correctly. Recent large-scale studies have shed light on how these fundamental processes interact. Although the machinery mediating these processes share similar cis-regulatory elements, they are not strictly coregulated. Furthermore, studies of the replisome show that highly transcribed genes present a major obstacle to its operation. Further studies will be needed to identify key regulators of the spatio-temporal program of DNA replication, for the elucidation of the complex interplay between replication and transcription.
Collapse
Affiliation(s)
- Chrystelle Maric
- Institut Jacques Monod, Centre National de la Recherche Scientifique, Université Paris 7, Paris, France
| | | |
Collapse
|
32
|
Abstract
The discovery of the DNA double helix structure half a century ago immediately suggested a mechanism for its duplication by semi-conservative copying of the nucleotide sequence into two DNA daughter strands. Shortly after, a second fundamental step toward the elucidation of the mechanism of DNA replication was taken with the isolation of the first enzyme able to polymerize DNA from a template. In the subsequent years, the basic mechanism of DNA replication and its enzymatic machinery components were elucidated, mostly through genetic approaches and in vitro biochemistry. Most recently, the spatial and temporal organization of the DNA replication process in vivo within the context of chromatin and inside the intact cell are finally beginning to be elucidated. On the one hand, recent advances in genome-wide high throughput techniques are providing a new wave of information on the progression of genome replication at high spatial resolution. On the other hand, novel super-resolution microscopy techniques are just starting to give us the first glimpses of how DNA replication is organized within the context of single intact cells with high spatial resolution. The integration of these data with time lapse microscopy analysis will give us the ability to film and dissect the replication of the genome in situ and in real time.
Collapse
Affiliation(s)
- Vadim O Chagin
- Department of Biology, Technische Universität Darmstadt, Germany
| | | | | |
Collapse
|
33
|
Cadoret JC, Prioleau MN. Genome-wide approaches to determining origin distribution. Chromosome Res 2009; 18:79-89. [DOI: 10.1007/s10577-009-9094-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
34
|
Increased rate of human mutations where DNA and RNA polymerases collide. Trends Genet 2009; 25:523-7. [PMID: 19853958 DOI: 10.1016/j.tig.2009.10.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2009] [Revised: 10/05/2009] [Accepted: 10/05/2009] [Indexed: 12/27/2022]
Abstract
Gene density and orientation of genes in eukaryotes seem to be correlated with the replication origin and the mutation rate is greater in late replicating regions; however, the reason for these patterns is unknown. Here, we investigate predicted replication origins in the human genome and find that levels of polymorphism as well as divergence from the chimpanzee genome are greater in genes transcribed on the lagging strand than those on the leading strand. This might be caused by interference between RNA and DNA polymerases, and avoidance of collisions between these enzymes might be an evolutionary force shaping gene orientation and density surrounding replication start sites. Physical constraints might have a larger influence on genome evolution than previously thought.
Collapse
|
35
|
Audit B, Zaghloul L, Vaillant C, Chevereau G, d'Aubenton-Carafa Y, Thermes C, Arneodo A. Open chromatin encoded in DNA sequence is the signature of 'master' replication origins in human cells. Nucleic Acids Res 2009; 37:6064-75. [PMID: 19671527 PMCID: PMC2764438 DOI: 10.1093/nar/gkp631] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
For years, progress in elucidating the mechanisms underlying replication initiation and its coupling to transcriptional activities and to local chromatin structure has been hampered by the small number (approximately 30) of well-established origins in the human genome and more generally in mammalian genomes. Recent in silico studies of compositional strand asymmetries revealed a high level of organization of human genes around 1000 putative replication origins. Here, by comparing with recently experimentally identified replication origins, we provide further support that these putative origins are active in vivo. We show that regions approximately 300-kb wide surrounding most of these putative replication origins that replicate early in the S phase are hypersensitive to DNase I cleavage, hypomethylated and present a significant enrichment in genomic energy barriers that impair nucleosome formation (nucleosome-free regions). This suggests that these putative replication origins are specified by an open chromatin structure favored by the DNA sequence. We discuss how this distinctive attribute makes these origins, further qualified as 'master' replication origins, priviledged loci for future research to decipher the human spatio-temporal replication program. Finally, we argue that these 'master' origins are likely to play a key role in genome dynamics during evolution and in pathological situations.
Collapse
|
36
|
Polak P, Arndt PF. Long-range bidirectional strand asymmetries originate at CpG islands in the human genome. Genome Biol Evol 2009; 1:189-97. [PMID: 20333189 PMCID: PMC2817419 DOI: 10.1093/gbe/evp024] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/22/2009] [Indexed: 12/24/2022] Open
Abstract
In the human genome, CpG islands (CGIs), which are GC- and CpG-rich sequences, are associated with transcription starting sites (TSSs); in addition, there is evidence that CGIs harbor origins of bidirectional replication (OBRs) and are preferred sites for heteroduplex formation during recombination. Transcription, replication, and recombination processes are known to induce specific mutational patterns in various genomes, and therefore, these patterns are expected to be found around CGIs. We use triple alignments of human, chimp, and macaque to compute the rates of nucleotide substitutions in up to 1 Mbps long intergenic regions on both sides of CGIs. Our analysis revealed that around a CGI there is an asymmetry between complementary substitution rates that is similar to the one that found around the OBR in bacteria. We hypothesize that these asymmetries are induced by differences in the replication of the leading and lagging strand and that a significant number of CGIs overlap OBRs. Within CGIs, we observed a mutational signature of GC-biased gene conversion that is associated with recombination. We suggest that recombination has played a major role in the creation of CGIs.
Collapse
Affiliation(s)
- Paz Polak
- Max Planck Institute for Molecular Genetics, Berlin, Germany.
| | | |
Collapse
|
37
|
Yin S, Deng W, Hu L, Kong X. The impact of nucleosome positioning on the organization of replication origins in eukaryotes. Biochem Biophys Res Commun 2009; 385:363-8. [PMID: 19463783 DOI: 10.1016/j.bbrc.2009.05.072] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2009] [Accepted: 05/13/2009] [Indexed: 01/01/2023]
Abstract
The distribution of DNA replication origins (ORIs) on eukaryotic chromosomes is nonrandom, but the reasons behind this are not well understood. Previous studies have suggested a prominent role of transcriptional activity in determining the ORI organization. Here, we identify nucleosome occupancy as a likely candidate to set up ORI distribution. Combining genome-wide data on nucleosome positioning and ORI organization in yeast and humans, we demonstrate that open chromatin domains, characterized by nucleosome depletion, are preferentially permissive for replication. However, contrary to priori claims, the impact of transcriptional activity is considerably weaker than previously proposed and could partly be explained by our nucleosome exclusion model. We propose that the ORI organization imposed by nucleosome positioning is phylogenetically widespread in eukaryotes.
Collapse
Affiliation(s)
- Shanye Yin
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS) & Shanghai Jiao Tong University School of Medicine (SJTUSM), Shanghai 200025, People's Republic of China
| | | | | | | |
Collapse
|
38
|
Affiliation(s)
- Marie-Noëlle Prioleau
- Institut Jacques Monod, CNRS, Université Paris 7, Université Pierre et Marie Curie, Paris, France.
| |
Collapse
|