1
|
Ngo TTM, Liu B, Wang F, Basu A, Wu C, Ha T. Dependence of nucleosome mechanical stability on DNA mismatches. eLife 2024; 13:RP95514. [PMID: 38656237 DOI: 10.7554/elife.95514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2024] Open
Abstract
The organization of nucleosomes into chromatin and their accessibility are shaped by local DNA mechanics. Conversely, nucleosome positions shape genetic variations, which may originate from mismatches during replication and chemical modification of DNA. To investigate how DNA mismatches affect the mechanical stability and the exposure of nucleosomal DNA, we used an optical trap combined with single-molecule FRET and a single-molecule FRET cyclization assay. We found that a single base-pair C-C mismatch enhances DNA bendability and nucleosome mechanical stability for the 601-nucleosome positioning sequence. An increase in force required for DNA unwrapping from the histone core is observed for single base-pair C-C mismatches placed at three tested positions: at the inner turn, at the outer turn, or at the junction of the inner and outer turn of the nucleosome. The results support a model where nucleosomal DNA accessibility is reduced by mismatches, potentially explaining the preferred accumulation of single-nucleotide substitutions in the nucleosome core and serving as the source of genetic variation during evolution and cancer progression. Mechanical stability of an intact nucleosome, that is mismatch-free, is also dependent on the species as we find that yeast nucleosomes are mechanically less stable and more symmetrical in the outer turn unwrapping compared to Xenopus nucleosomes.
Collapse
Affiliation(s)
- Thuy T M Ngo
- Department of Physics, Center for Physics in Living Cells University of Illinois Urbana-Champaign, Urbana, United States
- Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, United States
- Cancer Early Detection Advanced Research Center (CEDAR), Knight Cancer Institute, Oregon Health and Science University, Portland, United States
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, United States
- Division of Oncological Sciences, Oregon Health and Science University, Portland, United States
| | - Bailey Liu
- Department of Biophysics, Johns Hopkins University, Baltimore, United States
| | - Feng Wang
- Laboratory of Biochemistry and Molecular Biology, Center for Cancer Research, National Cancer Institute, Bethesda, United States
| | - Aakash Basu
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University, Baltimore, United States
- Department of Biosciences, Durham University, Durham, United Kingdom
| | - Carl Wu
- Department of Biology, Johns Hopkins University, Baltimore, United States
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, United States
| | - Taekjip Ha
- Department of Physics, Center for Physics in Living Cells University of Illinois Urbana-Champaign, Urbana, United States
- Department of Biophysics, Johns Hopkins University, Baltimore, United States
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University, Baltimore, United States
- Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, United States
- Department of Pediatrics, Harvard Medical School, Boston, United States
- Howard Hughes Medical Institute, Boston, United States
| |
Collapse
|
2
|
García A, Durán L, Sánchez M, González S, Santamaría R, Antequera F. Asymmetrical nucleosomal DNA signatures regulate transcriptional directionality. Cell Rep 2024; 43:113605. [PMID: 38127622 DOI: 10.1016/j.celrep.2023.113605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 10/03/2023] [Accepted: 12/05/2023] [Indexed: 12/23/2023] Open
Abstract
Despite the symmetrical structure of nucleosomes, in vitro studies have shown that transcription proceeds with different efficiency depending on the orientation of the DNA sequence around them. However, it is unclear whether this functional asymmetry is present in vivo and whether it could regulate transcriptional directionality. Here, we report that the proximal and distal halves of nucleosomal DNA contribute differentially to nucleosome stability in the genome. In +1 nucleosomes, this asymmetry facilitates or hinders transcription depending on the orientation of its underlying DNA, and this difference is associated with an asymmetrical interaction between DNA and histones. These properties are encoded in the DNA signature of +1 nucleosomes, since its incorporation in the two orientations into downstream nucleosomes renders them asymmetrically accessible to MNase and inverts the balance between sense and antisense transcription. Altogether, our results show that nucleosomal DNA endows nucleosomes with asymmetrical properties that modulate the directionality of transcription.
Collapse
Affiliation(s)
- Alicia García
- Instituto de Biología Funcional y Genómica (IBFG), CSIC-Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Laura Durán
- Instituto de Biología Funcional y Genómica (IBFG), CSIC-Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Mar Sánchez
- Instituto de Biología Funcional y Genómica (IBFG), CSIC-Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Sara González
- Instituto de Biología Funcional y Genómica (IBFG), CSIC-Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Rodrigo Santamaría
- Departamento de Informática y Automática, Universidad de Salamanca/Facultad de Ciencias, Plaza de los Caídos s/n, 37007 Salamanca, Spain
| | - Francisco Antequera
- Instituto de Biología Funcional y Genómica (IBFG), CSIC-Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain.
| |
Collapse
|
3
|
Abstract
Today massive amounts of sequenced metagenomic and metatranscriptomic data from different ecological niches and environmental locations are available. Scientific progress depends critically on methods that allow extracting useful information from the various types of sequence data. Here, we will first discuss types of information contained in the various flavours of biological sequence data, and how this information can be interpreted to increase our scientific knowledge and understanding. We argue that a mechanistic understanding of biological systems analysed from different perspectives is required to consistently interpret experimental observations, and that this understanding is greatly facilitated by the generation and analysis of dynamic mathematical models. We conclude that, in order to construct mathematical models and to test mechanistic hypotheses, time-series data are of critical importance. We review diverse techniques to analyse time-series data and discuss various approaches by which time-series of biological sequence data have been successfully used to derive and test mechanistic hypotheses. Analysing the bottlenecks of current strategies in the extraction of knowledge and understanding from data, we conclude that combined experimental and theoretical efforts should be implemented as early as possible during the planning phase of individual experiments and scientific research projects. This article is part of the theme issue ‘Integrative research perspectives on marine conservation’.
Collapse
Affiliation(s)
- Ovidiu Popa
- Institute of Quantitative and Theoretical Biology, CEPLAS, Heinrich-Heine University Düsseldorf, Germany
| | - Ellen Oldenburg
- Institute of Quantitative and Theoretical Biology, CEPLAS, Heinrich-Heine University Düsseldorf, Germany
| | - Oliver Ebenhöh
- Institute of Quantitative and Theoretical Biology, CEPLAS, Heinrich-Heine University Düsseldorf, Germany.,Cluster of Excellence on Plant Sciences, CEPLAS, Heinrich-Heine University Düsseldorf, Germany
| |
Collapse
|
4
|
Chromatin Structure and Drug Resistance in Candida spp. J Fungi (Basel) 2020; 6:jof6030121. [PMID: 32751495 PMCID: PMC7559719 DOI: 10.3390/jof6030121] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 07/21/2020] [Accepted: 07/25/2020] [Indexed: 12/14/2022] Open
Abstract
Anti-microbial resistance (AMR) is currently one of the most serious threats to global human health and, appropriately, research to tackle AMR garnishes significant investment and extensive attention from the scientific community. However, most of this effort focuses on antibiotics, and research into anti-fungal resistance (AFR) is vastly under-represented in comparison. Given the growing number of vulnerable, immunocompromised individuals, as well as the positive impact global warming has on fungal growth, there is an immediate urgency to tackle fungal disease, and the disturbing rise in AFR. Chromatin structure and gene expression regulation play pivotal roles in the adaptation of fungal species to anti-fungal stress, suggesting a potential therapeutic avenue to tackle AFR. In this review we discuss both the genetic and epigenetic mechanisms by which chromatin structure can dictate AFR mechanisms and will present evidence of how pathogenic yeast, specifically from the Candida genus, modify chromatin structure to promote survival in the presence of anti-fungal drugs. We also discuss the mechanisms by which anti-chromatin therapy, specifically lysine deacetylase inhibitors, influence the acquisition and phenotypic expression of AFR in Candida spp. and their potential as effective adjuvants to mitigate against AFR.
Collapse
|
5
|
Zhao Y, Wang J, Liang F, Liu Y, Wang Q, Zhang H, Jiang M, Zhang Z, Zhao W, Bao Y, Zhang Z, Wu J, Asmann YW, Li R, Xiao J. NucMap: a database of genome-wide nucleosome positioning map across species. Nucleic Acids Res 2020; 47:D163-D169. [PMID: 30335176 PMCID: PMC6323900 DOI: 10.1093/nar/gky980] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 10/10/2018] [Indexed: 12/16/2022] Open
Abstract
Dynamics of nucleosome positioning affects chromatin state, transcription and all other biological processes occurring on genomic DNA. While MNase-Seq has been used to depict nucleosome positioning map in eukaryote in the past years, nucleosome positioning data is increasing dramatically. To facilitate the usage of published data across studies, we developed a database named nucleosome positioning map (NucMap, http://bigd.big.ac.cn/nucmap). NucMap includes 798 experimental data from 477 samples across 15 species. With a series of functional modules, users can search profile of nucleosome positioning at the promoter region of each gene across all samples and make enrichment analysis on nucleosome positioning data in all genomic regions. Nucleosome browser was built to visualize the profiles of nucleosome positioning. Users can also visualize multiple sources of omics data with the nucleosome browser and make side-by-side comparisons. All processed data in the database are freely available. NucMap is the first comprehensive nucleosome positioning platform and it will serve as an important resource to facilitate the understanding of chromatin regulation.
Collapse
Affiliation(s)
- Yongbing Zhao
- Department of Health Sciences Research, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Jinyue Wang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Fang Liang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yanxia Liu
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qi Wang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Hao Zhang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Meiye Jiang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhewen Zhang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Wenming Zhao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yiming Bao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhang Zhang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200438, China
| | - Jiayan Wu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yan W Asmann
- Department of Health Sciences Research, Mayo Clinic, Jacksonville, FL 32224, USA
| | - Rujiao Li
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jingfa Xiao
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai 200438, China
| |
Collapse
|
6
|
Feng JX, Riddle NC. Epigenetics and genome stability. Mamm Genome 2020; 31:181-195. [PMID: 32296924 DOI: 10.1007/s00335-020-09836-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Accepted: 04/07/2020] [Indexed: 12/19/2022]
Abstract
Maintaining genome stability is essential to an organism's health and survival. Breakdown of the mechanisms protecting the genome and the resulting genome instability are an important aspect of the aging process and have been linked to diseases such as cancer. Thus, a large network of interconnected pathways is responsible for ensuring genome integrity in the face of the continuous challenges that induce DNA damage. While these pathways are diverse, epigenetic mechanisms play a central role in many of them. DNA modifications, histone variants and modifications, chromatin structure, and non-coding RNAs all carry out a variety of functions to ensure that genome stability is maintained. Epigenetic mechanisms ensure the functions of centromeres and telomeres that are essential for genome stability. Epigenetic mechanisms also protect the genome from the invasion by transposable elements and contribute to various DNA repair pathways. In this review, we highlight the integral role of epigenetic mechanisms in the maintenance of genome stability and draw attention to issues in need of further study.
Collapse
Affiliation(s)
- Justina X Feng
- Department of Biology, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Nicole C Riddle
- Department of Biology, The University of Alabama at Birmingham, Birmingham, AL, USA.
| |
Collapse
|
7
|
Lu J, Cao X, Zhong S. A likelihood approach to testing hypotheses on the co-evolution of epigenome and genome. PLoS Comput Biol 2018; 14:e1006673. [PMID: 30586383 PMCID: PMC6324829 DOI: 10.1371/journal.pcbi.1006673] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Revised: 01/08/2019] [Accepted: 11/26/2018] [Indexed: 01/03/2023] Open
Abstract
Central questions to epigenome evolution include whether interspecies changes of histone modifications are independent of evolutionary changes of DNA, and if there is dependence whether they depend on any specific types of DNA sequence changes. Here, we present a likelihood approach for testing hypotheses on the co-evolution of genome and histone modifications. The gist of this approach is to convert evolutionary biology hypotheses into probabilistic forms, by explicitly expressing the joint probability of multispecies DNA sequences and histone modifications, which we refer to as a class of Joint Evolutionary Model for the Genome and the Epigenome (JEMGE). JEMGE can be summarized as a mixture model of four components representing four evolutionary hypotheses, namely dependence and independence of interspecies epigenomic variations to underlying sequence substitutions and to underlying sequence insertions and deletions (indels). We implemented a maximum likelihood method to fit the models to the data. Based on comparison of likelihoods, we inferred whether interspecies epigenomic variations depended on substitution or indels in local genomic sequences based on DNase hypersensitivity and spermatid H3K4me3 ChIP-seq data from human and rhesus macaque. Approximately 5.5% of homologous regions in the genomes exhibited H3K4me3 modification in either species, among which approximately 67% homologous regions exhibited local-sequence-dependent interspecies H3K4me3 variations. Substitutions accounted for less local-sequence-dependent H3K4me3 variations than indels. Among transposon-mediated indels, ERV1 insertions and L1 insertions were most strongly associated with H3K4me3 gains and losses, respectively. By initiating probabilistic formulation on the co-evolution of genomes and epigenomes, JEMGE helps to bring evolutionary biology principles to comparative epigenomic studies. Epigenetic modifications play a significant role in gene regulations and thus heavily influence phenotypic outcomes. Whereas cross-species epigenomic comparisons have been fruitful in revealing the function of epigenetic modifications, it still remains unclear how the epigenome changes across species. A central question in epigenome evolution studies is whether interspecies epigenomic variations rely on genomic changes in cis and, if partially yes, whether different genomic changes have distinct impacts. To tackle this question, we initiated a likelihood-based approach, in which different hypotheses related to the co-evolution of the genome and the epigenome could be converted into probabilistic models. By fitting the models to actual data, each model yielded a likelihood, and the hypothesis corresponded to the largest likelihood was selected as most supported by observed data. In this work, we focused on the influence of two types of underlying sequence changes: substitutions, and insertions and deletions (indels). We quantitatively assessed the dependence of H3K4me3 variations on substitutions and indels between human and rhesus, and separated their relative impacts within each genomic region with H3K4me3. The methodology presented here provides a framework for modeling the epigenome together with the genome and a quantitative approach to test different evolutionary hypotheses.
Collapse
Affiliation(s)
- Jia Lu
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Xiaoyi Cao
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Sheng Zhong
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- * E-mail:
| |
Collapse
|
8
|
Brunet FG, Audit B, Drillon G, Argoul F, Volff JN, Arneodo A. Evidence for DNA Sequence Encoding of an Accessible Nucleosomal Array across Vertebrates. Biophys J 2018; 114:2308-2316. [PMID: 29580552 PMCID: PMC6028776 DOI: 10.1016/j.bpj.2018.02.025] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 02/07/2018] [Accepted: 02/20/2018] [Indexed: 12/15/2022] Open
Abstract
Nucleosome-depleted regions around which nucleosomes order following the "statistical" positioning scenario were recently shown to be encoded in the DNA sequence in human. This intrinsic nucleosomal ordering strongly correlates with oscillations in the local GC content as well as with the interspecies and intraspecies mutation profiles, revealing the existence of both positive and negative selection. In this letter, we show that these predicted nucleosome inhibitory energy barriers (NIEBs) with compacted neighboring nucleosomes are indeed ubiquitous to all vertebrates tested. These 1 kb-sized chromatin patterns are widely distributed along vertebrate chromosomes, overall covering more than a third of the genome. We have previously observed in human deviations from neutral evolution at these genome-wide distributed regions, which we interpreted as a possible indication of the selection of an open, accessible, and dynamic nucleosomal array to constitutively facilitate the epigenetic regulation of nuclear functions in a cell-type-specific manner. As a first, very appealing observation supporting this hypothesis, we report evidence of a strong association between NIEB borders and the poly(A) tails of Alu sequences in human. These results suggest that NIEBs provide adequate chromatin patterns favorable to the integration of Alu retrotransposons and, more generally to various transposable elements in the genomes of primates and other vertebrates.
Collapse
Affiliation(s)
- Frédéric G Brunet
- Institut de Génomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Univ Claude Bernard Lyon 1, Lyon, France
| | - Benjamin Audit
- Univ Lyon, ENS de Lyon, Univ Claude Bernard Lyon 1, CNRS Laboratoire de Physique, Lyon, France
| | - Guénola Drillon
- Univ Lyon, ENS de Lyon, Univ Claude Bernard Lyon 1, CNRS Laboratoire de Physique, Lyon, France
| | - Françoise Argoul
- Univ Lyon, ENS de Lyon, Univ Claude Bernard Lyon 1, CNRS Laboratoire de Physique, Lyon, France; LOMA, Université de Bordeaux, CNRS UMR 5798, Talence, France
| | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Univ Claude Bernard Lyon 1, Lyon, France
| | - Alain Arneodo
- Univ Lyon, ENS de Lyon, Univ Claude Bernard Lyon 1, CNRS Laboratoire de Physique, Lyon, France; LOMA, Université de Bordeaux, CNRS UMR 5798, Talence, France.
| |
Collapse
|
9
|
García A, González S, Antequera F. Nucleosomal organization and DNA base composition patterns. Nucleus 2017. [PMID: 28635365 PMCID: PMC5703254 DOI: 10.1080/19491034.2017.1337611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Nucleosomes are the basic units of chromatin. They compact the genome inside the nucleus and regulate the access of proteins to DNA. In the yeast genome, most nucleosomes occupy well-defined positions, which are maintained under many different physiological situations and genetic backgrounds. Although several short sequence elements have been described that favor or reduce the affinity between histones and DNA, the extent to which the DNA sequence affects nucleosome positioning in the genomic context remains unclear. Recent analyses indicate that the base composition pattern of mononucleosomal DNA differs among species, and that the same sequence elements have a different impact on nucleosome positioning in different genomes despite the high level of phylogenetic conservation of histones. These studies have also shown that the DNA sequence contributes to nucleosome positioning to the point that it is possible to design synthetic DNA molecules capable of generating regular and species-specific nucleosomal patterns in vivo.
Collapse
Affiliation(s)
- Alicia García
- a Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca , Salamanca , Spain
| | - Sara González
- a Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca , Salamanca , Spain
| | - Francisco Antequera
- a Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca , Salamanca , Spain
| |
Collapse
|
10
|
Hettiarachchi N, Saitou N. GC Content Heterogeneity Transition of Conserved Noncoding Sequences Occurred at the Emergence of Vertebrates. Genome Biol Evol 2016; 8:3377-3392. [PMID: 28040773 PMCID: PMC5203776 DOI: 10.1093/gbe/evw231] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Conserved non-coding sequences (CNSs) of Eukaryotes are known to be significantly enriched in regulatory sequences. CNSs of diverse lineages follow different patterns in abundance, sequence composition, and location. Here, we report a thorough analysis of CNSs in diverse groups of Eukaryotes with respect to GC content heterogeneity. We examined 24 fungi, 19 invertebrates, and 12 non-mammalian vertebrates so as to find lineage specific features of CNSs. We found that fungi and invertebrate CNSs are predominantly GC rich as in plants we previously observed, whereas vertebrate CNSs are GC poor. This result suggests that the CNS GC content transition occurred from the ancestral GC rich state of Eukaryotes to GC poor in the vertebrate lineage due to the enrollment of GC poor transcription factor binding sites that are lineage specific. CNS GC content is closely linked with the nucleosome occupancy that determines the location and structural architecture of DNAs.
Collapse
Affiliation(s)
- Nilmini Hettiarachchi
- Department of Genetics, School of Life Science, Graduate University for Advanced Studies (SOKENDAI), Mishima, Japan.,Division of Population Genetics, National institute of Genetics, Mishima, Japan
| | - Naruya Saitou
- Department of Genetics, School of Life Science, Graduate University for Advanced Studies (SOKENDAI), Mishima, Japan .,Division of Population Genetics, National institute of Genetics, Mishima, Japan
| |
Collapse
|
11
|
González S, García A, Vázquez E, Serrano R, Sánchez M, Quintales L, Antequera F. Nucleosomal signatures impose nucleosome positioning in coding and noncoding sequences in the genome. Genome Res 2016; 26:1532-1543. [PMID: 27662899 PMCID: PMC5088595 DOI: 10.1101/gr.207241.116] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 09/19/2016] [Indexed: 12/18/2022]
Abstract
In the yeast genome, a large proportion of nucleosomes occupy well-defined and stable positions. While the contribution of chromatin remodelers and DNA binding proteins to maintain this organization is well established, the relevance of the DNA sequence to nucleosome positioning in the genome remains controversial. Through quantitative analysis of nucleosome positioning, we show that sequence changes distort the nucleosomal pattern at the level of individual nucleosomes in three species of Schizosaccharomyces and in Saccharomyces cerevisiae. This effect is equally detected in transcribed and nontranscribed regions, suggesting the existence of sequence elements that contribute to positioning. To identify such elements, we incorporated information from nucleosomal signatures into artificial synthetic DNA molecules and found that they generated regular nucleosomal arrays indistinguishable from those of endogenous sequences. Strikingly, this information is species-specific and can be combined with coding information through the use of synonymous codons such that genes from one species can be engineered to adopt the nucleosomal organization of another. These findings open the possibility of designing coding and noncoding DNA molecules capable of directing their own nucleosomal organization.
Collapse
Affiliation(s)
- Sara González
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain
| | - Alicia García
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain
| | - Enrique Vázquez
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain
| | - Rebeca Serrano
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain
| | - Mar Sánchez
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain
| | - Luis Quintales
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain.,Departamento de Informática y Automática, Universidad de Salamanca/Facultad de Ciencias, 37007 Salamanca, Spain
| | - Francisco Antequera
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, 37007 Salamanca, Spain
| |
Collapse
|
12
|
Drillon G, Audit B, Argoul F, Arneodo A. Evidence of selection for an accessible nucleosomal array in human. BMC Genomics 2016; 17:526. [PMID: 27472913 PMCID: PMC4966569 DOI: 10.1186/s12864-016-2880-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2015] [Accepted: 07/04/2016] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Recently, a physical model of nucleosome formation based on sequence-dependent bending properties of the DNA double-helix has been used to reveal some enrichment of nucleosome-inhibiting energy barriers (NIEBs) nearby ubiquitous human "master" replication origins. Here we use this model to predict the existence of about 1.6 millions NIEBs over the 22 human autosomes. RESULTS We show that these high energy barriers of mean size 153 bp correspond to nucleosome-depleted regions (NDRs) in vitro, as expected, but also in vivo. On either side of these NIEBs, we observe, in vivo and in vitro, a similar compacted nucleosome ordering, suggesting an absence of chromatin remodeling. This nucleosomal ordering strongly correlates with oscillations of the GC content as well as with the interspecies and intraspecies mutation profiles along these regions. Comparison of these divergence rates reveals the existence of both positive and negative selections linked to nucleosome positioning around these intrinsic NDRs. Overall, these NIEBs and neighboring nucleosomes cover 37.5 % of the human genome where nucleosome occupancy is stably encoded in the DNA sequence. These 1 kb-sized regions of intrinsic nucleosome positioning are equally found in GC-rich and GC-poor isochores, in early and late replicating regions, in intergenic and genic regions but not at gene promoters. CONCLUSION The source of selection pressure on the NIEBs has yet to be resolved in future work. One possible scenario is that these widely distributed chromatin patterns have been selected in human to impair the condensation of the nucleosomal array into the 30 nm chromatin fiber, so as to facilitate the epigenetic regulation of nuclear functions in a cell-type-specific manner.
Collapse
Affiliation(s)
- Guénola Drillon
- Univ Lyon, Ens de Lyon, Univ Claude Bernard Lyon 1, CNRS, Laboratoire de Physique, Lyon, F-69342 France
| | - Benjamin Audit
- Univ Lyon, Ens de Lyon, Univ Claude Bernard Lyon 1, CNRS, Laboratoire de Physique, Lyon, F-69342 France
| | - Françoise Argoul
- Univ Lyon, Ens de Lyon, Univ Claude Bernard Lyon 1, CNRS, Laboratoire de Physique, Lyon, F-69342 France
- LOMA, Université de Bordeaux, CNRS, UMR 5798, 51 Cours de le Libération, Talence, F-33405 France
| | - Alain Arneodo
- Univ Lyon, Ens de Lyon, Univ Claude Bernard Lyon 1, CNRS, Laboratoire de Physique, Lyon, F-69342 France
- LOMA, Université de Bordeaux, CNRS, UMR 5798, 51 Cours de le Libération, Talence, F-33405 France
| |
Collapse
|
13
|
Liu G, Xing Y, Zhao H, Wang J, Shang Y, Cai L. A deformation energy-based model for predicting nucleosome dyads and occupancy. Sci Rep 2016; 6:24133. [PMID: 27053067 PMCID: PMC4823781 DOI: 10.1038/srep24133] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 03/21/2016] [Indexed: 12/14/2022] Open
Abstract
Nucleosome plays an essential role in various cellular processes, such as DNA replication, recombination, and transcription. Hence, it is important to decode the mechanism of nucleosome positioning and identify nucleosome positions in the genome. In this paper, we present a model for predicting nucleosome positioning based on DNA deformation, in which both bending and shearing of the nucleosomal DNA are considered. The model successfully predicted the dyad positions of nucleosomes assembled in vitro and the in vitro map of nucleosomes in Saccharomyces cerevisiae. Applying the model to Caenorhabditis elegans and Drosophila melanogaster, we achieved satisfactory results. Our data also show that shearing energy of nucleosomal DNA outperforms bending energy in nucleosome occupancy prediction and the ability to predict nucleosome dyad positions is attributed to bending energy that is associated with rotational positioning of nucleosomes.
Collapse
Affiliation(s)
- Guoqing Liu
- The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou, 014010, China.,Computational Systems Biology Lab, Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Yongqiang Xing
- The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou, 014010, China
| | - Hongyu Zhao
- The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou, 014010, China
| | - Jianying Wang
- The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou, 014010, China.,State Key Laboratory for Utilization of Bayan Obo Multi-Metallic Resources, Inner Mongolia University of Science and Technology, Baotou, 014010, China
| | - Yu Shang
- Computational Systems Biology Lab, Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.,College of Computer Science and Technology, Jilin University, Changchun, Jilin 130021, China
| | - Lu Cai
- The Institute of Bioengineering and Technology, Inner Mongolia University of Science and Technology, Baotou, 014010, China
| |
Collapse
|
14
|
Gouda N, Shiwa Y, Akashi M, Yoshikawa H, Kasahara K, Furusawa M. Distribution of human single-nucleotide polymorphisms is approximated by the power law and represents a fractal structure. Genes Cells 2016; 21:396-407. [PMID: 27030000 DOI: 10.1111/gtc.12344] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Accepted: 12/12/2015] [Indexed: 01/31/2023]
Abstract
Single-nucleotide polymorphisms (SNPs) are one of the main causes of evolution. The distribution of human SNPs, which were examined in detail genomewide, was analyzed. Three discrete databases of human SNPs were used for this analysis, and similar results were obtained from these databases. It was found that the distribution of the distance between SNPs was approximated by the power law, and the shape of the regions including SNPs had the so-called fractal structure. Although the reason why the distribution of SNPs obeys such a certain law of physics is unclear, a speculation was attempted in connection with the three-dimensional structure of human chromatin which has a fractal structure.
Collapse
Affiliation(s)
- Norio Gouda
- Department of Systems Medicine, Sakaguchi Laboratory, Keio University School of Medicine, 35 Shinanomachi, Shinjuku, Tokyo, 160-8582, Japan
| | - Yuh Shiwa
- Genome Research Center, NODAI Research Institute, Tokyo University of Agriculture, 1-1-1 Sakuragaoka, Setagaya-ku, Tokyo, 156-8502, Japan
| | - Motohiro Akashi
- Department of Bioscience, Tokyo University of Agriculture, 1-1-1 Sakuragaoka, Setagaya-ku, Tokyo, 156-8502, Japan
| | - Hirofumi Yoshikawa
- Genome Research Center, NODAI Research Institute, Tokyo University of Agriculture, 1-1-1 Sakuragaoka, Setagaya-ku, Tokyo, 156-8502, Japan.,Department of Bioscience, Tokyo University of Agriculture, 1-1-1 Sakuragaoka, Setagaya-ku, Tokyo, 156-8502, Japan
| | - Ken Kasahara
- Chitose Laboratory Corp., Biotechnology Research Center, 907 Nogawa, Miyamae-ku, Kawasaki, 216-0001, Japan
| | - Mitsuru Furusawa
- Chitose Laboratory Corp., Biotechnology Research Center, 907 Nogawa, Miyamae-ku, Kawasaki, 216-0001, Japan
| |
Collapse
|
15
|
Quintales L, Soriano I, Vázquez E, Segurado M, Antequera F. A species-specific nucleosomal signature defines a periodic distribution of amino acids in proteins. Open Biol 2016; 5:140218. [PMID: 25854683 PMCID: PMC4422121 DOI: 10.1098/rsob.140218] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Nucleosomes are the basic structural units of chromatin. Most of the yeast genome is organized in a pattern of positioned nucleosomes that is stably maintained under a wide range of physiological conditions. In this work, we have searched for sequence determinants associated with positioned nucleosomes in four species of fission and budding yeasts. We show that mononucleosomal DNA follows a highly structured base composition pattern, which differs among species despite the high degree of histone conservation. These nucleosomal signatures are present in transcribed and non-transcribed regions across the genome. In the case of open reading frames, they correctly predict the relative distribution of codons on mononucleosomal DNA, and they also determine a periodicity in the average distribution of amino acids along the proteins. These results establish a direct and species-specific connection between the position of each codon around the histone octamer and protein composition.
Collapse
Affiliation(s)
- Luis Quintales
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Ignacio Soriano
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Enrique Vázquez
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Mónica Segurado
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| | - Francisco Antequera
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas (CSIC)/Universidad de Salamanca, Campus Miguel de Unamuno, 37007 Salamanca, Spain
| |
Collapse
|
16
|
Abstract
DNA damage is a constant threat to cells, causing cytotoxicity as well as inducing genetic alterations. The steady-state abundance of DNA lesions in a cell is minimized by a variety of DNA repair mechanisms, including DNA strand break repair, mismatch repair, nucleotide excision repair, base excision repair, and ribonucleotide excision repair. The efficiencies and mechanisms by which these pathways remove damage from chromosomes have been primarily characterized by investigating the processing of lesions at defined genomic loci, among bulk genomic DNA, on episomal DNA constructs, or using in vitro substrates. However, the structure of a chromosome is heterogeneous, consisting of heavily protein-bound heterochromatic regions, open regulatory regions, actively transcribed genes, and even areas of transient single stranded DNA. Consequently, DNA repair pathways function in a much more diverse set of chromosomal contexts than can be readily assessed using previous methods. Recent efforts to develop whole genome maps of DNA damage, repair processes, and even mutations promise to greatly expand our understanding of DNA repair and mutagenesis. Here we review the current efforts to utilize whole genome maps of DNA damage and mutation to understand how different chromosomal contexts affect DNA excision repair pathways.
Collapse
Affiliation(s)
- John J Wyrick
- School of Molecular Biosciences, Washington State University, Pullman, WA 99164, USA; Center for Reproductive Biology, Washington State University, Pullman, WA 99164, USA.
| | - Steven A Roberts
- School of Molecular Biosciences, Washington State University, Pullman, WA 99164, USA.
| |
Collapse
|
17
|
Vázquez E, Antequera F. Replication dynamics in fission and budding yeasts through DNA polymerase tracking. Bioessays 2015; 37:1067-73. [PMID: 26293347 PMCID: PMC5054902 DOI: 10.1002/bies.201500072] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The dynamics of eukaryotic DNA polymerases has been difficult to establish because of the difficulty of tracking them along the chromosomes during DNA replication. Recent work has addressed this problem in the yeasts Schizosaccharomyces pombe and Saccharomyces cerevisiae through the engineering of replicative polymerases to render them prone to incorporating ribonucleotides at high rates. Their use as tracers of the passage of each polymerase has provided a picture of unprecedented resolution of the organization of replicons and replication origins in the two yeasts and has uncovered important differences between them. Additional studies have found an overlapping distribution of DNA polymorphisms and the junctions of Okazaki fragments along mononucleosomal DNA. This sequence instability is caused by the premature release of polymerase δ and the retention of non proof‐read DNA tracts replicated by polymerase α. The possible implementation of these new experimental approaches in multicellular organisms opens the door to the analysis of replication dynamics under a broad range of genetic backgrounds and physiological or pathological conditions.
Collapse
Affiliation(s)
- Enrique Vázquez
- Instituto de Biología, Funcional y Genómica (IBFG), Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Campus Miguel de Unamuno, Salamanca, Spain
| | - Francisco Antequera
- Instituto de Biología, Funcional y Genómica (IBFG), Consejo Superior de Investigaciones Científicas (CSIC), Universidad de Salamanca, Campus Miguel de Unamuno, Salamanca, Spain
| |
Collapse
|
18
|
Nucleosome Organization around Pseudogenes in the Human Genome. BIOMED RESEARCH INTERNATIONAL 2015; 2015:821596. [PMID: 26064955 PMCID: PMC4434184 DOI: 10.1155/2015/821596] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Accepted: 12/17/2014] [Indexed: 12/02/2022]
Abstract
Pseudogene, disabled copy of functional gene, plays a subtle role in gene
expression and genome evolution. The first step in deciphering RNA-level regulation
of pseudogenes is to understand their transcriptional activity. So far, there has been no
report on possible roles of nucleosome organization in pseudogene transcription. In
this paper, we investigated the effect of nucleosome positioning on pseudogene
transcription. For transcribed pseudogenes, the experimental nucleosome occupancy
shows a prominent depletion at the regions both upstream of pseudogene start
positions and downstream of pseudogene end positions. Intriguingly, the same
depletion is also observed for nontranscribed pseudogenes, which is unexpected
since nucleosome depletion in those regions is thought to be unnecessary in light of the
nontranscriptional property of those pseudogenes. The sequence-dependent
prediction of nucleosome occupancy shows a consistent pattern with the experimental
data-based analysis. Our results indicate that nucleosome positioning may play
important roles in both the transcription initiation and termination of pseudogenes.
Collapse
|
19
|
Abstract
Species survival depends on the faithful replication of genetic information, which is continually monitored and maintained by DNA repair pathways that correct replication errors and the thousands of lesions that arise daily from the inherent chemical lability of DNA and the effects of genotoxic agents. Nonetheless, neutrally evolving DNA (not under purifying selection) accumulates base substitutions with time (the neutral mutation rate). Thus, repair processes are not 100% efficient. The neutral mutation rate varies both between and within chromosomes. For example it is 10-50 fold higher at CpGs than at non-CpG positions. Interestingly, the neutral mutation rate at non-CpG sites is positively correlated with CpG content. Although the basis of this correlation was not immediately apparent, some bioinformatic results were consistent with the induction of non-CpG mutations by DNA repair at flanking CpG sites. Recent studies with a model system showed that in vivo repair of preformed lesions (mismatches, abasic sites, single stranded nicks) can in fact induce mutations in flanking DNA. Mismatch repair (MMR) is an essential component for repair-induced mutations, which can occur as distant as 5 kb from the introduced lesions. Most, but not all, mutations involved the C of TpCpN (G of NpGpA) which is the target sequence of the C-preferring single-stranded DNA specific APOBEC deaminases. APOBEC-mediated mutations are not limited to our model system: Recent studies by others showed that some tumors harbor mutations with the same signature, as can intermediates in RNA-guided endonuclease-mediated genome editing. APOBEC deaminases participate in normal physiological functions such as generating mutations that inactivate viruses or endogenous retrotransposons, or that enhance immunoglobulin diversity in B cells. The recruitment of normally physiological error-prone processes during DNA repair would have important implications for disease, aging and evolution. This perspective briefly reviews both the bioinformatic and biochemical literature relevant to repair-induced mutagenesis and discusses future directions required to understand the mechanistic basis of this process.
Collapse
Affiliation(s)
- Jia Chen
- School of Life Science and Technology, ShanghaiTech University, Building 8, 319 Yueyang Road, Shanghai 200031, China
| | - Anthony V Furano
- Section on Genomic Structure and Function, Laboratory of Cell and Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Building 8, Room 203, 8 Center Drive, MSC 0830, Bethesda, MD 20892-0830, USA.
| |
Collapse
|
20
|
Makova KD, Hardison RC. The effects of chromatin organization on variation in mutation rates in the genome. Nat Rev Genet 2015; 16:213-23. [PMID: 25732611 PMCID: PMC4500049 DOI: 10.1038/nrg3890] [Citation(s) in RCA: 145] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The variation in local rates of mutations can affect both the evolution of genes and their function in normal and cancer cells. Deciphering the molecular determinants of this variation will be aided by the elucidation of distinct types of mutations, as they differ in regional preferences and in associations with genomic features. Chromatin organization contributes to regional variation in mutation rates, but its contribution differs among mutation types. In both germline and somatic mutations, base substitutions are more abundant in regions of closed chromatin, perhaps reflecting error accumulation late in replication. By contrast, a distinctive mutational state with very high levels of insertions and deletions (indels) and substitutions is enriched in regions of open chromatin. These associations indicate an intricate interplay between the nucleotide sequence of DNA and its dynamic packaging into chromatin, and have important implications for current biomedical research. This Review focuses on recent studies showing associations between chromatin state and mutation rates, including pairwise and multivariate investigations of germline and somatic (particularly cancer) mutations.
Collapse
Affiliation(s)
- Kateryna D Makova
- Department of Biology, Huck Institute for Genome Sciences, The Pennsylvania State University, University Park, State College, Pennsylvania 16802, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, Huck Institute for Genome Sciences, The Pennsylvania State University, University Park, State College, Pennsylvania 16802, USA
| |
Collapse
|
21
|
Lagging-strand replication shapes the mutational landscape of the genome. Nature 2015; 518:502-506. [PMID: 25624100 PMCID: PMC4374164 DOI: 10.1038/nature14183] [Citation(s) in RCA: 168] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 01/05/2015] [Indexed: 12/21/2022]
Abstract
The origin of mutations is central to understanding evolution and of key relevance to health. Variation occurs non-randomly across the genome, and mechanisms for this remain to be defined. Here, we report that the 5′-ends of Okazaki fragments have significantly elevated levels of nucleotide substitution, indicating a replicative origin for such mutations. With a novel method, emRiboSeq, we map the genome-wide contribution of polymerases, and show that despite Okazaki fragment processing, DNA synthesised by error-prone Pol-α is retained in vivo, comprising ~1.5% of the mature genome. We propose that DNA-binding proteins that rapidly re-associate post-replication act as partial barriers to Pol-δ mediated displacement of Pol-α synthesised DNA, resulting in incorporation of such Pol-α tracts and elevated mutation rates at specific sites. We observe a mutational cost to chromatin and regulatory protein binding, resulting in mutation hotspots at regulatory elements, with signatures of this process detectable in both yeast and humans.
Collapse
|
22
|
Abstract
Mutational heterogeneity must be taken into account when reconstructing evolutionary histories, calibrating molecular clocks, and predicting links between genes and disease. Selective pressures and various DNA transactions have been invoked to explain the heterogeneous distribution of genetic variation between species, within populations, and in tissue-specific tumors. To examine relationships between such heterogeneity and variations in leading- and lagging-strand replication fidelity and mismatch repair, we accumulated 40,000 spontaneous mutations in eight diploid yeast strains in the absence of selective pressure. We found that replicase error rates vary by fork direction, coding state, nucleosome proximity, and sequence context. Further, error rates and DNA mismatch repair efficiency both vary by mismatch type, responsible polymerase, replication time, and replication origin proximity. Mutation patterns implicate replication infidelity as one driver of variation in somatic and germline evolution, suggest mechanisms of mutual modulation of genome stability and composition, and predict future observations in specific cancers.
Collapse
|
23
|
Nucleosomes shape DNA polymorphism and divergence. PLoS Genet 2014; 10:e1004457. [PMID: 24991813 PMCID: PMC4081404 DOI: 10.1371/journal.pgen.1004457] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 05/12/2014] [Indexed: 11/30/2022] Open
Abstract
An estimated 80% of genomic DNA in eukaryotes is packaged as nucleosomes, which, together with the remaining interstitial linker regions, generate higher order chromatin structures [1]. Nucleosome sequences isolated from diverse organisms exhibit ∼10 bp periodic variations in AA, TT and GC dinucleotide frequencies. These sequence elements generate intrinsically curved DNA and help establish the histone-DNA interface. We investigated an important unanswered question concerning the interplay between chromatin organization and genome evolution: do the DNA sequence preferences inherent to the highly conserved histone core exert detectable natural selection on genomic divergence and polymorphism? To address this hypothesis, we isolated nucleosomal DNA sequences from Drosophila melanogaster embryos and examined the underlying genomic variation within and between species. We found that divergence along the D. melanogaster lineage is periodic across nucleosome regions with base changes following preferred nucleotides, providing new evidence for systematic evolutionary forces in the generation and maintenance of nucleosome-associated dinucleotide periodicities. Further, Single Nucleotide Polymorphism (SNP) frequency spectra show striking periodicities across nucleosomal regions, paralleling divergence patterns. Preferred alleles occur at higher frequencies in natural populations, consistent with a central role for natural selection. These patterns are stronger for nucleosomes in introns than in intergenic regions, suggesting selection is stronger in transcribed regions where nucleosomes undergo more displacement, remodeling and functional modification. In addition, we observe a large-scale (∼180 bp) periodic enrichment of AA/TT dinucleotides associated with nucleosome occupancy, while GC dinucleotide frequency peaks in linker regions. Divergence and polymorphism data also support a role for natural selection in the generation and maintenance of these super-nucleosomal patterns. Our results demonstrate that nucleosome-associated sequence periodicities are under selective pressure, implying that structural interactions between nucleosomes and DNA sequence shape sequence evolution, particularly in introns. In eukaryotic cells, the majority of DNA is packaged in nucleosomes comprised of ∼147 bp of DNA wound tightly around the highly conserved histone octamer. Nucleosomal DNA from diverse organisms shows an anti-correlated ∼10 bp periodicity of AT-rich and GC-rich dinucleotides. These sequence features influence DNA bending and shape, facilitating structural interactions. We asked whether natural selection mediated through the periodic sequence preferences of nucleosomes shapes the evolution of non-protein-coding regions of D. melanogaster by examining the inter- and intra-species genomic variation relative to these fundamental chromatin building blocks. The sequence changes across nucleosome-bound regions on the melanogaster lineage mirror the observed nucleosome dinucleotide periodicities. Importantly, we show that the frequencies of polymorphisms in natural populations vary across these regions, paralleling divergence, with higher frequencies of preferred alleles. These patterns are most evident for intronic regions and indicate that non-protein coding regions are evolving toward sequences that facilitate the canonical association with the histone core. This result is consistent with the hypothesis that interactions between DNA and the core have systematic impacts on function that are subject to natural selection and are not solely due to mutational bias. These ubiquitous interactions with the histone core partially account for the evolutionary constraint observed in unannotated genomic regions, and may drive broad changes in base composition.
Collapse
|
24
|
Warnecke T, Becker EA, Facciotti MT, Nislow C, Lehner B. Conserved substitution patterns around nucleosome footprints in eukaryotes and Archaea derive from frequent nucleosome repositioning through evolution. PLoS Comput Biol 2013; 9:e1003373. [PMID: 24278010 PMCID: PMC3836710 DOI: 10.1371/journal.pcbi.1003373] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 10/13/2013] [Indexed: 11/21/2022] Open
Abstract
Nucleosomes, the basic repeat units of eukaryotic chromatin, have been suggested to influence the evolution of eukaryotic genomes, both by altering the propensity of DNA to mutate and by selection acting to maintain or exclude nucleosomes in particular locations. Contrary to the popular idea that nucleosomes are unique to eukaryotes, histone proteins have also been discovered in some archaeal genomes. Archaeal nucleosomes, however, are quite unlike their eukaryotic counterparts in many respects, including their assembly into tetramers (rather than octamers) from histone proteins that lack N- and C-terminal tails. Here, we show that despite these fundamental differences the association between nucleosome footprints and sequence evolution is strikingly conserved between humans and the model archaeon Haloferax volcanii. In light of this finding we examine whether selection or mutation can explain concordant substitution patterns in the two kingdoms. Unexpectedly, we find that neither the mutation nor the selection model are sufficient to explain the observed association between nucleosomes and sequence divergence. Instead, we demonstrate that nucleosome-associated substitution patterns are more consistent with a third model where sequence divergence results in frequent repositioning of nucleosomes during evolution. Indeed, we show that nucleosome repositioning is both necessary and largely sufficient to explain the association between current nucleosome positions and biased substitution patterns. This finding highlights the importance of considering the direction of causality between genetic and epigenetic change. Genome sequences as well as epigenetic states, such as DNA methylation or nucleosome binding patterns, change during evolution. But what is the causal relationship between the two? We already know that nucleotide variation within and between species is distributed unevenly around nucleosome footprints, but does this mean that sequence evolution follows a biased course because the presence of nucleosomes affects mutation and DNA repair dynamics? Or is it, in fact, the other way around, i.e. changes happen at the DNA level and prompt shifts in nucleosome positioning? To investigate the direction of causality in genetic versus epigenetic evolution, we analyze substitutions patterns in eukaryotes as well as the archaeon Haloferax volcanii in the context of genome-wide nucleosome binding maps. We demonstrate that the relationship between nucleosome positions and between-species divergence patterns, strikingly similar in eukaryotes and archaea, can be explained in large parts by nucleosomes shifting positions in response to substitution, although both mutation and selection biases might still exist. Our results illustrate that it is important to consider the direction of causality between epigenetic and genetic change when analyzing patterns of sequence divergence and using sequence conservation to infer selection on epigenetic states.
Collapse
Affiliation(s)
- Tobias Warnecke
- Bioinformatics and Genomics Program, Centre for Genomic Regulation (CRG) and UPF, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- * E-mail:
| | - Erin A. Becker
- Microbiology Graduate Group, University of California, Davis, Davis, California, United States of America
| | - Marc T. Facciotti
- Microbiology Graduate Group, University of California, Davis, Davis, California, United States of America
- Department of Biomedical Engineering, University of California, Davis, Davis, California, United States of America
- Genome Center, University of California, Davis, Davis, California, United States of America
| | - Corey Nislow
- Department of Pharmaceutical Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Ben Lehner
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- EMBL-CRG Systems Biology Unit, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats, Centre for Genomic Regulation (CRG) and UPF, Barcelona, Spain
| |
Collapse
|
25
|
Abstract
Histone-DNA complexes, so-called nucleosomes, are the building blocks of DNA packaging in eukaryotic cells. The histone-binding affinity of a local DNA segment depends on its elastic properties and determines its accessibility within the nucleus, which plays an important role in the regulation of gene expression. Here, we derive a fitness landscape for intergenic DNA segments in yeast as a function of two molecular phenotypes: their elasticity-dependent histone affinity and their coverage with transcription factor binding sites. This landscape reveals substantial selection against nucleosome formation over a wide range of both phenotypes. We use it as the core component of a quantitative evolutionary model for intergenic DNA segments. This model consistently predicts the observed diversity of histone affinities within wild Saccharomyces paradoxus populations, as well as the affinity divergence between neighboring Saccharomyces species. Our analysis establishes histone binding and transcription factor binding as two separable modes of sequence evolution, each of which is a direct target of natural selection.
Collapse
|
26
|
Kenigsberg E, Tanay A. Drosophila functional elements are embedded in structurally constrained sequences. PLoS Genet 2013; 9:e1003512. [PMID: 23750124 PMCID: PMC3671938 DOI: 10.1371/journal.pgen.1003512] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Accepted: 03/04/2013] [Indexed: 12/22/2022] Open
Abstract
Modern functional genomics uncovered numerous functional elements in metazoan genomes. Nevertheless, only a small fraction of the typical non-exonic genome contains elements that code for function directly. On the other hand, a much larger fraction of the genome is associated with significant evolutionary constraints, suggesting that much of the non-exonic genome is weakly functional. Here we show that in flies, local (30–70 bp) conserved sequence elements that are associated with multiple regulatory functions serve as focal points to a pattern of punctuated regional increase in G/C nucleotide frequencies. We show that this pattern, which covers a region tenfold larger than the conserved elements themselves, is an evolutionary consequence of a shift in the balance between gain and loss of G/C nucleotides and that it is correlated with nucleosome occupancy across multiple classes of epigenetic state. Evidence for compensatory evolution and analysis of SNP allele frequencies show that the evolutionary regime underlying this balance shift is likely to be non-neutral. These data suggest that current gaps in our understanding of genome function and evolutionary dynamics are explicable by a model of sparse sequence elements directly encoding for function, embedded into structural sequences that help to define the local and global epigenomic context of such functional elements. A key challenge in functional genomics is to predict evolutionary dynamics from functional annotation of the genome and vice versa. Modern epigenomic studies helped assign function to numerous new sequence elements, but left most of the genome essentially uncharacterized. Evolutionary genomics, on the other hand, consistently suggests that a much larger fraction of the un-annotated genome evolves under selective pressure. We hypothesize that this function-selection gap can be attributed to sequences that facilitate the physical organization of functional elements, such as transcription factor binding sites, within chromosomes. We exemplify this by studying in detail the sequences embedding small conserved elements (CEs) in Drosophila. We show that, while CEs have typically high AT content, high GC content levels around them are maintained by a non-neutral evolutionary balance between gain and loss of GC nucleotides. This non-uniform pattern is highly correlated with nucleosome organization around CEs, potentially imposing an evolutionary constraint on as much as one quarter of the genome. We suggest this can at least partly explain the above function-selection gap. Weak evolutionary constraints on “structural” sequences (at scales ranging from one nucleosome to recently described multi-megabase topological domains) may affect genome evolution just like structural motifs shape protein evolution.
Collapse
Affiliation(s)
- Ephraim Kenigsberg
- Department of Computer Science and Applied Mathematics and Department of Biological Regulation, Weizmann Institute, Rehovot, Israel
| | - Amos Tanay
- Department of Computer Science and Applied Mathematics and Department of Biological Regulation, Weizmann Institute, Rehovot, Israel
- * E-mail:
| |
Collapse
|
27
|
H2A.Z nucleosome positioning has no impact on genetic variation in Drosophila genome. PLoS One 2013; 8:e58295. [PMID: 23472174 PMCID: PMC3589275 DOI: 10.1371/journal.pone.0058295] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 02/01/2013] [Indexed: 11/20/2022] Open
Abstract
Nucleosome occupancy results in complex sequence variation rate heterogeneity by either increasing mutation rate or inhibiting DNA repair in yeast, fish, and human. H2A.Z nucleosome is extensively involved in gene transcription activation and regulation. To test whether H2A.Z nucleosome has the similar impact on sequence variability in the Drosophila genome, we profiled the H2A.Z nucleosome occupancy and sequence variation rate at gene ends and splicing sites. Consistent with previous studies, H2A.Z nucleosome positioning helps to demarcate the borders of exons. Nucleosome occupancy is anticorrelated with sequence divergence rate in the regions flanking transcription start sites and splicing sites. However, there is no rate heterogeneity between the linker DNA and H2A.Z nucleosomal DNA regardless of nucleosome occupancy, fuzziness, positioning in promoter, coding, and intergenic regions, young or old genes. But the rate at intergenic nucleosomes and the flanking linker regions is higher than that at the genic counterparts. Further analyses found that the high sequence divergence rate in the promoter regions that are usually nucleosome depleted regions may be likely resulted from the high mutation rate in the enriched tandem repeats. Interestingly, within nucleosomes spanning splicing sites, sequence variability of nucleosomal DNA significantly increases from the end within exons to the other end protruding into introns. The relaxed functional constraint in introns contributes to the high rate of nucleosomal DNA residing in introns while the strict functional constraint in exons maintains the low rate of nucleosomal DNA residing in exons. Taken together, H2A.Z nucleosome occupancy has no effect on sequence variability of Drosophila genome, which is likely determined by local sequence composition and the concomitant selection pressure.
Collapse
|
28
|
Tsai ZTY, Tsai HK, Cheng JH, Lin CH, Tsai YF, Wang D. Evolution of cis-regulatory elements in yeast de novo and duplicated new genes. BMC Genomics 2012; 13:717. [PMID: 23256513 PMCID: PMC3553024 DOI: 10.1186/1471-2164-13-717] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Accepted: 12/18/2012] [Indexed: 12/22/2022] Open
Abstract
Background New genes that originate from non-coding DNA rather than being duplicated from parent genes are called de novo genes. Their short evolution time and lack of parent genes provide a chance to study the evolution of cis-regulatory elements in the initial stage of gene emergence. Although a few reports have discussed cis-regulatory elements in new genes, knowledge of the characteristics of these elements in de novo genes is lacking. Here, we conducted a comprehensive investigation to depict the emergence and establishment of cis-regulatory elements in de novo yeast genes. Results In a genome-wide investigation, we found that the number of transcription factor binding sites (TFBSs) in de novo genes of S. cerevisiae increased rapidly and quickly became comparable to the number of TFBSs in established genes. This phenomenon might have resulted from certain characteristics of de novo genes; namely, a relatively frequent gain of TFBSs, an unexpectedly high number of preexisting TFBSs, or lower selection pressure in the promoter regions of the de novo genes. Furthermore, we identified differences in the promoter architecture between de novo genes and duplicated new genes, suggesting that distinct regulatory strategies might be employed by genes of different origin. Finally, our functional analyses of the yeast de novo genes revealed that they might be related to reproduction. Conclusions Our observations showed that de novo genes and duplicated new genes possess mutually distinct regulatory characteristics, implying that these two types of genes might have different roles in evolution.
Collapse
|
29
|
Warnecke T, Supek F, Lehner B. Nucleoid-associated proteins affect mutation dynamics in E. coli in a growth phase-specific manner. PLoS Comput Biol 2012; 8:e1002846. [PMID: 23284284 PMCID: PMC3527292 DOI: 10.1371/journal.pcbi.1002846] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2012] [Accepted: 11/03/2012] [Indexed: 02/06/2023] Open
Abstract
The binding of proteins can shield DNA from mutagenic processes but also interfere with efficient repair. How the presence of DNA-binding proteins shapes intra-genomic differences in mutability and, ultimately, sequence variation in natural populations, however, remains poorly understood. In this study, we examine sequence evolution in Escherichia coli in relation to the binding of four abundant nucleoid-associated proteins: Fis, H-NS, IhfA, and IhfB. We find that, for a subset of mutations, protein occupancy is associated with both increased and decreased mutability in the underlying sequence depending on when the protein is bound during the bacterial growth cycle. On average, protein-bound DNA exhibits reduced mutability compared to protein-free DNA. However, this net protective effect is weak and can be abolished or even reversed during stages of colony growth where binding coincides – and hence likely interferes with – DNA repair activity. We suggest that the four nucleoid-associated proteins analyzed here have played a minor but significant role in patterning extant sequence variation in E. coli. Mutations can be more or less likely to occur depending on whether DNA is naked or bound by proteins. On the one hand, DNA-binding proteins can shield the DNA from certain mutagenic processes. On the other hand, the very same proteins can interfere with efficient DNA repair. In this study, we reconstruct the history of mutations across 54 E. coli genomes and ask whether mutation risk is higher or lower in regions occupied by proteins that help organize bacterial DNA into chromatin. Intriguingly, we find that the effect of binding depends on its timing. When we consider genomic regions bound during stationary phase, we observe that binding is associated with lower mutation risk for some mutation classes compared to naked DNA, albeit weakly. However, when binding occurs during exponential phase, bound regions actually experience more mutations on average. We argue that this is because, during exponential phase, the major effect of binding is that it interferes with efficient DNA repair, whereas in stationary phase – when many repair pathways are inactive – the protective effect of binding dominates. Our results suggest that the four DNA-binding proteins considered here have a small but significant growth phase-specific effect on mutation dynamics in E. coli.
Collapse
Affiliation(s)
- Tobias Warnecke
- Bioinformatics and Genomics Program, Centre for Genomic Regulation (CRG), Barcelona, Spain.
| | | | | |
Collapse
|
30
|
Park C, Qian W, Zhang J. Genomic evidence for elevated mutation rates in highly expressed genes. EMBO Rep 2012; 13:1123-9. [PMID: 23146897 DOI: 10.1038/embor.2012.165] [Citation(s) in RCA: 85] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2012] [Revised: 09/10/2012] [Accepted: 10/05/2012] [Indexed: 11/09/2022] Open
Abstract
Reporter gene assays have demonstrated both transcription-associated mutagenesis (TAM) and transcription-coupled repair, but the net impact of transcription on mutation rate remains unclear, especially at the genomic scale. Using comparative genomics of related species as well as mutation accumulation lines, we show in yeast that the rate of point mutation in a gene increases with the expression level of the gene. Transcription induces mutagenesis on both DNA strands, indicating simultaneous actions of several TAM mechanisms. A significant positive correlation is also detected between the human germline mutation rate and expression level. These results indicate that transcription is overall mutagenic.
Collapse
Affiliation(s)
- Chungoo Park
- Department of Ecology and Evolutionary Biology, University of Michigan, 1075 Natural Science Building, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
| | | | | |
Collapse
|
31
|
|
32
|
The transcript-centric mutations in human genomes. GENOMICS PROTEOMICS & BIOINFORMATICS 2012; 10:11-22. [PMID: 22449397 PMCID: PMC5054492 DOI: 10.1016/s1672-0229(11)60029-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/06/2012] [Accepted: 02/15/2012] [Indexed: 01/30/2023]
Abstract
Since the human genome is mostly transcribed, genetic variations must exhibit sequence signatures reflecting the relationship between transcription processes and chromosomal structures as we have observed in unicellular organisms. In this study, a set of 646 ubiquitous expression-invariable genes (EIGs) which are present in germline cells were defined and examined based on RNA-sequencing data from multiple high-throughput transcriptomic data. We demonstrated a relationship between gene expression level and transcript-centric mutations in the human genome based on single nucleotide polymorphism (SNP) data. A significant positive correlation was shown between gene expression and mutation, where highly-expressed genes accumulate more mutations than lowly-expressed genes. Furthermore, we found four major types of transcript-centric mutations: C→T, A→G, C→G, and G→T in human genomes and identified a negative gradient of the sequence variations aligning from the 5′ end to the 3′ end of the transcription units (TUs). The periodical occurrence of these genetic variations across TUs is associated with nucleosome phasing. We propose that transcript-centric mutations are one of the major driving forces for gene and genome evolution along with creation of new genes, gene/genome duplication, and horizontal gene transfer.
Collapse
|
33
|
Chen X, Chen Z, Chen H, Su Z, Yang J, Lin F, Shi S, He X. Nucleosomes suppress spontaneous mutations base-specifically in eukaryotes. Science 2012; 335:1235-8. [PMID: 22403392 DOI: 10.1126/science.1217580] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
It is unknown how the composition and structure of DNA within the cell affect spontaneous mutations. Theory suggests that in eukaryotic genomes, nucleosomal DNA undergoes fewer C→T mutations because of suppressed cytosine hydrolytic deamination relative to nucleosome-depleted DNA. Comparative genomic analyses and a mutation accumulation experiment showed that nucleosome occupancy nearly eliminated cytosine deamination, resulting in an ~50% decrease of the C→T mutation rate in nucleosomal DNA. Furthermore, the rates of G→T and A→T mutations were also about twofold suppressed by nucleosomes. On the basis of these results, we conclude that nucleosome-dependent mutation spectra affect eukaryotic genome structure and evolution and may have implications for understanding the origin of mutations in cancers and in induced pluripotent stem cells.
Collapse
Affiliation(s)
- Xiaoshu Chen
- State Key Laboratory of Bio-control, College of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Ma X, Rogacheva MV, Nishant KT, Zanders S, Bustamante CD, Alani E. Mutation hot spots in yeast caused by long-range clustering of homopolymeric sequences. Cell Rep 2012; 1:36-42. [PMID: 22832106 DOI: 10.1016/j.celrep.2011.10.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2011] [Revised: 09/29/2011] [Accepted: 10/21/2011] [Indexed: 11/18/2022] Open
Abstract
Evolutionary theory assumes that mutations occur randomly in the genome; however, studies performed in a variety of organisms indicate the existence of context-dependent mutation biases. Sources of mutagenesis variation across large genomic contexts (e.g., hundreds of bases) have not been identified. Here, we use high-coverage whole-genome sequencing of a conditional mismatch repair mutant line of diploid yeast to identify mutations that accumulated after 160 generations of growth. The vast majority of the mutations accumulated as insertion/deletions (in/dels) in homopolymeric [poly(dA:dT)] and repetitive DNA tracts. Surprisingly, the likelihood of an in/del mutation in a given poly(dA:dT) tract is increased by the presence of nearby poly(dA:dT) tracts in up to a 1,000 bp region centered on the given tract. Our work suggests that specific mutation hot spots can contribute disproportionately to the genetic variation that is introduced into populations and provides long-range genomic sequence context that contributes to mutagenesis.
Collapse
Affiliation(s)
- Xin Ma
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | | | | |
Collapse
|
35
|
Lin MF, Kheradpour P, Washietl S, Parker BJ, Pedersen JS, Kellis M. Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes. Genome Res 2011; 21:1916-28. [PMID: 21994248 DOI: 10.1101/gr.108753.110] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The degeneracy of the genetic code allows protein-coding DNA and RNA sequences to simultaneously encode additional, overlapping functional elements. A sequence in which both protein-coding and additional overlapping functions have evolved under purifying selection should show increased evolutionary conservation compared to typical protein-coding genes--especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ∼2% of their synonymous sites. We collect numerous lines of evidence that the observed synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian genes, despite the vast genomic landscape.
Collapse
Affiliation(s)
- Michael F Lin
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | | | | | | | | | | |
Collapse
|
36
|
Onishi-Seebacher M, Korbel JO. Challenges in studying genomic structural variant formation mechanisms: The short-read dilemma and beyond. Bioessays 2011; 33:840-50. [DOI: 10.1002/bies.201100075] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
|
37
|
Guan Y, Yao V, Tsui K, Gebbia M, Dunham MJ, Nislow C, Troyanskaya OG. Nucleosome-coupled expression differences in closely-related species. BMC Genomics 2011; 12:466. [PMID: 21942931 PMCID: PMC3209474 DOI: 10.1186/1471-2164-12-466] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2010] [Accepted: 09/26/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genome-wide nucleosome occupancy is negatively related to the average level of transcription factor motif binding based on studies in yeast and several other model organisms. The degree to which nucleosome-motif interactions relate to phenotypic changes across species is, however, unknown. RESULTS We address this challenge by generating nucleosome positioning and cell cycle expression data for Saccharomyces bayanus and show that differences in nucleosome occupancy reflect cell cycle expression divergence between two yeast species, S. bayanus and S. cerevisiae. Specifically, genes with nucleosome-depleted MBP1 motifs upstream of their coding sequence show periodic expression during the cell cycle, whereas genes with nucleosome-shielded motifs do not. In addition, conserved cell cycle regulatory motifs across these two species are more nucleosome-depleted compared to those that are not conserved, suggesting that the degree of conservation of regulatory sites varies, and is reflected by nucleosome occupancy patterns. Finally, many changes in cell cycle gene expression patterns across species can be correlated to changes in nucleosome occupancy on motifs (rather than to the presence or absence of motifs). CONCLUSIONS Our observations suggest that alteration of nucleosome occupancy is a previously uncharacterized feature related to the divergence of cell cycle expression between species.
Collapse
Affiliation(s)
- Yuanfang Guan
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | | | | | | | | | | | | |
Collapse
|
38
|
Prendergast JGD, Semple CAM. Widespread signatures of recent selection linked to nucleosome positioning in the human lineage. Genome Res 2011; 21:1777-87. [PMID: 21903742 DOI: 10.1101/gr.122275.111] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
In this study we investigated the strengths and modes of selection associated with nucleosome positioning in the human lineage through the comparison of interspecies and intraspecies rates of divergence. We identify significant evidence for both positive and negative selection linked to human nucleosome positioning for the first time, implicating a widespread and important role for DNA sequence in the location of well-positioned nucleosomes. Selection appears to be acting on particular base substitutions to maintain optimum GC compositions in core and linker regions, with, e.g., unexpectedly elevated rates of C→T substitutions during recent human evolution at linker regions 60-90 bp from the nucleosome dyad but significant depletion of the same substitutions within nucleosome core regions. These patterns are strikingly consistent with the known relationships between genomic sequence composition and nucleosome assembly. By stratifying nucleosomes according to the GC content of their genomic neighborhood, we also show that the strength and direction of selection detected is dictated by local GC content. Intriguingly these signatures of selection are not restricted to nucleosomes in close proximity to exons, suggesting the correct positioning of nucleosomes is not only important in and around coding regions. This analysis provides strong evidence that the genomic sequences associated with nucleosomes are not evolving neutrally, and suggests that underlying DNA sequence is an important factor in nucleosome positioning. Recent signatures of selection linked to genomic features as ubiquitous as the nucleosome have important implications for human genome evolution and disease.
Collapse
Affiliation(s)
- James G D Prendergast
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Western General Hospital, Edinburgh EH4 2XU, United Kingdom.
| | | |
Collapse
|
39
|
Takuno S, Gaut BS. Body-Methylated Genes in Arabidopsis thaliana Are Functionally Important and Evolve Slowly. Mol Biol Evol 2011; 29:219-27. [DOI: 10.1093/molbev/msr188] [Citation(s) in RCA: 166] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
40
|
Bensasson D. Evidence for a high mutation rate at rapidly evolving yeast centromeres. BMC Evol Biol 2011; 11:211. [PMID: 21767380 PMCID: PMC3155921 DOI: 10.1186/1471-2148-11-211] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2011] [Accepted: 07/18/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Although their role in cell division is essential, centromeres evolve rapidly in animals, plants and yeasts. Unlike the complex centromeres of plants and aminals, the point centromeres of Saccharomcyes yeasts can be readily sequenced to distinguish amongst the possible explanations for fast centromere evolution. RESULTS Using DNA sequences of all 16 centromeres from 34 strains of Saccharomyces cerevisiae and population genomic data from Saccharomyces paradoxus, I show that centromeres in both species evolve 3 times more rapidly even than selectively unconstrained DNA. Exceptionally high levels of polymorphism seen in multiple yeast populations suggest that rapid centromere evolution does not result from the repeated selective sweeps expected under meiotic drive. I further show that there is little evidence for crossing-over or gene conversion within centromeres, although there is clear evidence for recombination in their immediate vicinity. Finally I show that the mutation spectrum at centromeres is consistent with the pattern of spontaneous mutation elsewhere in the genome. CONCLUSIONS These results indicate that rapid centromere evolution is a common phenomenon in yeast species. Furthermore, these results suggest that rapid centromere evolution does not result from the mutagenic effect of gene conversion, but from a generalised increase in the mutation rate, perhaps arising from the unusual chromatin structure at centromeres in yeast and other eukaryotes.
Collapse
|
41
|
Swamy KBS, Chu WY, Wang CY, Tsai HK, Wang D. Evidence of association between nucleosome occupancy and the evolution of transcription factor binding sites in yeast. BMC Evol Biol 2011; 11:150. [PMID: 21627806 PMCID: PMC3124427 DOI: 10.1186/1471-2148-11-150] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Accepted: 05/31/2011] [Indexed: 11/14/2022] Open
Abstract
Background Divergence of transcription factor binding sites is considered to be an important source of regulatory evolution. The associations between transcription factor binding sites and phenotypic diversity have been investigated in many model organisms. However, the understanding of other factors that contribute to it is still limited. Recent studies have elucidated the effect of chromatin structure on molecular evolution of genomic DNA. Though the profound impact of nucleosome positions on gene regulation has been reported, their influence on transcriptional evolution is still less explored. With the availability of genome-wide nucleosome map in yeast species, it is thus desirable to investigate their impact on transcription factor binding site evolution. Here, we present a comprehensive analysis of the role of nucleosome positioning in the evolution of transcription factor binding sites. Results We compared the transcription factor binding site frequency in nucleosome occupied regions and nucleosome depleted regions in promoters of old (orthologs among Saccharomycetaceae) and young (Saccharomyces specific) genes; and in duplicate gene pairs. We demonstrated that nucleosome occupied regions accommodate greater binding site variations than nucleosome depleted regions in young genes and in duplicate genes. This finding was confirmed by measuring the difference in evolutionary rates of binding sites in sensu stricto yeasts at nucleosome occupied regions and nucleosome depleted regions. The binding sites at nucleosome occupied regions exhibited a consistently higher evolution rate than those at nucleosome depleted regions, corroborating the difference in the selection constraints at the two regions. Finally, through site-directed mutagenesis experiment, we found that binding site gain or loss events at nucleosome depleted regions may cause more expression differences than those in nucleosome occupied regions. Conclusions Our study indicates the existence of different selection constraint on binding sites at nucleosome occupied regions than at the nucleosome depleted regions. We found that the binding sites have a different rate of evolution at nucleosome occupied and depleted regions. Finally, using transcription factor binding site-directed mutagenesis experiment, we confirmed the difference in the impact of binding site changes on expression at these regions. Thus, our work demonstrates the importance of composite analysis of chromatin and transcriptional evolution.
Collapse
Affiliation(s)
- Krishna B S Swamy
- Institute of Information Science, Academia Sinica, Taipei, 115, Taiwan
| | | | | | | | | |
Collapse
|
42
|
Dai Z, Dai X, Xiang Q. Genome-wide DNA sequence polymorphisms facilitate nucleosome positioning in yeast. ACTA ACUST UNITED AC 2011; 27:1758-64. [PMID: 21551148 DOI: 10.1093/bioinformatics/btr290] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
MOTIVATION The intrinsic DNA sequence is an important determinant of nucleosome positioning. Some DNA sequence patterns can facilitate nucleosome formation, while others can inhibit nucleosome formation. Nucleosome positioning influences the overall rate of sequence evolution. However, its impacts on specific patterns of sequence evolution are still poorly understood. RESULTS Here, we examined whether nucleosomal DNA and nucleosome-depleted DNA show distinct polymorphism patterns to maintain adequate nucleosome architecture on a genome scale in yeast. We found that sequence polymorphisms in nucleosomal DNA tend to facilitate nucleosome formation, whereas polymorphisms in nucleosome-depleted DNA tend to inhibit nucleosome formation, which is especially evident at nucleosome-disfavored sequences in nucleosome-free regions at both ends of genes. Sequence polymorphisms facilitating nucleosome positioning correspond to stable nucleosome positioning. These results reveal that sequence polymorphisms are under selective constraints to maintain nucleosome positioning. CONTACT zhimdai@gmail.com; issdxh@mail.sysu.edu.cn
Collapse
Affiliation(s)
- Zhiming Dai
- Department of Electronic, School of Information Science and Technology, Sun Yat-Sen University, Guangzhou 510006, China.
| | | | | |
Collapse
|
43
|
Tolstorukov MY, Volfovsky N, Stephens RM, Park PJ. Impact of chromatin structure on sequence variability in the human genome. Nat Struct Mol Biol 2011; 18:510-5. [PMID: 21399641 DOI: 10.1038/nsmb.2012] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2009] [Accepted: 12/10/2010] [Indexed: 02/02/2023]
Abstract
DNA sequence variations in individual genomes give rise to different phenotypes within the same species. One mechanism in this process is the alteration of chromatin structure due to sequence variation that influences gene regulation. We composed a high-confidence collection of human single-nucleotide polymorphisms and indels based on analysis of publicly available sequencing data and investigated whether the DNA loci associated with stable nucleosome positions are protected against mutations. We addressed how the sequence variation reflects the occupancy profiles of nucleosomes bearing different epigenetic modifications on genome scale. We found that indels are depleted around nucleosome positions of all considered types, whereas single-nucleotide polymorphisms are enriched around the positions of bulk nucleosomes but depleted around the positions of epigenetically modified nucleosomes. These findings indicate an increased level of conservation for the sequences associated with epigenetically modified nucleosomes, highlighting complex organization of the human chromatin.
Collapse
Affiliation(s)
- Michael Y Tolstorukov
- Center for Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | | | | | | |
Collapse
|
44
|
Abstract
The medaka fish, Oryzias latipes, is an emerging vertebrate model and now has a high quality draft genome and a number of unique mutants. The long history of medaka research in Japan has provided medaka with unique features, which are complementary to other vertebrate models. A large collection of spontaneous mutants collected over a century, the presence of highly polymorphic inbred lines established over decades, and the recently completed genome sequence all give the medaka a big boost. This review focuses on the state of the art in medaka genetics and genomics, such as the first isolation of active transposons in vertebrates, the influence of chromatin structure on sequence variation, fine quantitative trait locus (QTL) analysis, and versatile mutants as human disease models.
Collapse
Affiliation(s)
- Hiroyuki Takeda
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo 113-0033, Japan.
| | | |
Collapse
|
45
|
Kenigsberg E, Bar A, Segal E, Tanay A. Widespread compensatory evolution conserves DNA-encoded nucleosome organization in yeast. PLoS Comput Biol 2010; 6:e1001039. [PMID: 21203484 PMCID: PMC3009600 DOI: 10.1371/journal.pcbi.1001039] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2010] [Accepted: 11/22/2010] [Indexed: 12/02/2022] Open
Abstract
Evolution maintains organismal fitness by preserving genomic information. This is widely assumed to involve conservation of specific genomic loci among species. Many genomic encodings are now recognized to integrate small contributions from multiple genomic positions into quantitative dispersed codes, but the evolutionary dynamics of such codes are still poorly understood. Here we show that in yeast, sequences that quantitatively affect nucleosome occupancy evolve under compensatory dynamics that maintain heterogeneous levels of A+T content through spatially coupled A/T-losing and A/T-gaining substitutions. Evolutionary modeling combined with data on yeast polymorphisms supports the idea that these substitution dynamics are a consequence of weak selection. This shows that compensatory evolution, so far believed to affect specific groups of epistatically linked loci like paired RNA bases, is a widespread phenomenon in the yeast genome, affecting the majority of intergenic sequences in it. The model thus derived suggests that compensation is inevitable when evolution conserves quantitative and dispersed genomic functions. Purifying selection is a major force in conserving genomic features. It pushes deleterious mutations to extinction while conserving the specific DNA sequence. Here we show that a large proportion of the yeast genome evolves under compensatory dynamics that conserve genomic properties while modifying the genomic sequence. Such compensatory evolution conserves the local G+C content of the genome, which influences nucleosome organization. Since purifying selection is too weak to eliminate every weakly deleterious mutation in nucleosome bound or unbound sequences, the local G+C content is frequently stabilized by compensatory G+C gaining and G+C losing mutations in proximal loci. Theoretical analysis shows that compensatory evolution is inevitable when natural selection is weak and the genomic feature is distributed over many loci. These results imply that sequence conservation may not always be equated with overall selection. They demonstrate that cycles of weakly deleterious substitutions followed by positive selection for corrective mutations, which were so far studied mostly in RNA coding genes, are observed broadly and profoundly affect genome evolution.
Collapse
Affiliation(s)
- Ephraim Kenigsberg
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel
| | - Amir Bar
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel
- Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot, Israel
| | - Eran Segal
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Amos Tanay
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
46
|
Detection of heterozygous mutations in the genome of mismatch repair defective diploid yeast using a Bayesian approach. Genetics 2010; 186:493-503. [PMID: 20660644 DOI: 10.1534/genetics.110.120105] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
DNA replication errors that escape polymerase proofreading and mismatch repair (MMR) can lead to base substitution and frameshift mutations. Such mutations can disrupt gene function, reduce fitness, and promote diseases such as cancer and are also the raw material of molecular evolution. To analyze with limited bias genomic features associated with DNA polymerase errors, we performed a genome-wide analysis of mutations that accumulate in MMR-deficient diploid lines of Saccharomyces cerevisiae. These lines were derived from a common ancestor and were grown for 160 generations, with bottlenecks reducing the population to one cell every 20 generations. We sequenced to between 8- and 20-fold coverage one wild-type and three mutator lines using Illumina Solexa 36-bp reads. Using an experimentally aware Bayesian genotype caller developed to pool experimental data across sequencing runs for all strains, we detected 28 heterozygous single-nucleotide polymorphisms (SNPs) and 48 single-nt insertion/deletions (indels) from the data set. This method was evaluated on simulated data sets and found to have a very low false-positive rate (∼6 × 10(-5)) and a false-negative rate of 0.08 within the unique mapping regions of the genome that contained at least sevenfold coverage. The heterozygous mutations identified by the Bayesian genotype caller were confirmed by Sanger sequencing. All of the mutations were unique to a given line, except for a single-nt deletion mutation which occurred independently in two lines. All 48 indels, composed of 46 deletions and two insertions, occurred in homopolymer (HP) tracts [i.e., 47 poly(A) or (T) tracts, 1 poly(G) or (C) tract] between 5 and 13 bp long. Our findings are of interest because HP tracts are present at high levels in the yeast genome (>77,400 for 5- to 20-nt HP tracts), and frameshift mutations in these regions are likely to disrupt gene function. In addition, they demonstrate that the mutation pattern seen previously in mismatch repair defective strains using a limited number of reporters holds true for the entire genome.
Collapse
|
47
|
Tsankov AM, Thompson DA, Socha A, Regev A, Rando OJ. The role of nucleosome positioning in the evolution of gene regulation. PLoS Biol 2010; 8:e1000414. [PMID: 20625544 PMCID: PMC2897762 DOI: 10.1371/journal.pbio.1000414] [Citation(s) in RCA: 170] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2010] [Accepted: 05/27/2010] [Indexed: 11/18/2022] Open
Abstract
Chromatin organization plays a major role in gene regulation and can affect the function and evolution of new transcriptional programs. However, it can be difficult to decipher the basis of changes in chromatin organization and their functional effect on gene expression. Here, we present a large-scale comparative genomic analysis of the relationship between chromatin organization and gene expression, by measuring mRNA abundance and nucleosome positions genome-wide in 12 Hemiascomycota yeast species. We found substantial conservation of global and functional chromatin organization in all species, including prominent nucleosome-free regions (NFRs) at gene promoters, and distinct chromatin architecture in growth and stress genes. Chromatin organization has also substantially diverged in both global quantitative features, such as spacing between adjacent nucleosomes, and in functional groups of genes. Expression levels, intrinsic anti-nucleosomal sequences, and trans-acting chromatin modifiers all play important, complementary, and evolvable roles in determining NFRs. We identify five mechanisms that couple chromatin organization to evolution of gene regulation and have contributed to the evolution of respiro-fermentation and other key systems, including (1) compensatory evolution of alternative modifiers associated with conserved chromatin organization, (2) a gradual transition from constitutive to trans-regulated NFRs, (3) a loss of intrinsic anti-nucleosomal sequences accompanying changes in chromatin organization and gene expression, (4) re-positioning of motifs from NFRs to nucleosome-occluded regions, and (5) the expanded use of NFRs by paralogous activator-repressor pairs. Our study sheds light on the molecular basis of chromatin organization, and on the role of chromatin organization in the evolution of gene regulation.
Collapse
Affiliation(s)
- Alexander M. Tsankov
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Dawn Anne Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Amanda Socha
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Aviv Regev
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Howard Hughes Medical Institute, Cambridge, Massachusetts, United States of America
| | - Oliver J. Rando
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, United States of America
| |
Collapse
|
48
|
Prohaska SJ, Stadler PF, Krakauer DC. Innovation in gene regulation: The case of chromatin computation. J Theor Biol 2010; 265:27-44. [DOI: 10.1016/j.jtbi.2010.03.011] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2009] [Accepted: 03/06/2010] [Indexed: 11/17/2022]
|
49
|
Dai Z, Dai X, Xiang Q, Feng J. Nucleosomal context of binding sites influences transcription factor binding affinity and gene regulation. GENOMICS PROTEOMICS & BIOINFORMATICS 2010; 7:155-62. [PMID: 20172488 PMCID: PMC5054407 DOI: 10.1016/s1672-0229(08)60045-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Transcription factor (TF) binding to its DNA target site plays an essential role in gene regulation. The location, orientation and spacing of transcription factor binding sites (TFBSs) also affect regulatory function of the TF. However, how nucleosomal context of TFBSs influences TF binding and subsequent gene regulation remains to be elucidated. Using genome-wide nucleosome positioning and TF binding data in budding yeast, we found that binding affinities of TFs to DNA tend to decrease with increasing nucleosome occupancy of the associated binding sites. We further demonstrated that nucleosomal context of binding sites is correlated with gene regulation of the corresponding TF. Nucleosome-depleted TFBSs are linked to high gene activity and low expression noise, whereas nucleosome-covered TFBSs are associated with low gene activity and high expression noise. Moreover, nucleosome-covered TFBSs tend to disrupt coexpression of the corresponding TF target genes. We conclude that nucleosomal context of binding sites influences TF binding affinity, subsequently affecting the regulation of TFs on their target genes. This emphasizes the need to include nucleosomal context of TFBSs in modeling gene regulation.
Collapse
Affiliation(s)
- Zhiming Dai
- Electronic Department, Sun Yat-Sen University, Guangzhou 510006, China
| | | | | | | |
Collapse
|
50
|
Chromatin density and splicing destiny: on the cross-talk between chromatin structure and splicing. EMBO J 2010; 29:1629-36. [PMID: 20407423 DOI: 10.1038/emboj.2010.71] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2010] [Accepted: 03/26/2010] [Indexed: 12/11/2022] Open
Abstract
How are short exonic sequences recognized within the vast intronic oceans in which they reside? Despite decades of research, this remains one of the most fundamental, yet enigmatic, questions in the field of pre-mRNA splicing research. For many years, studies aiming to shed light on this process were focused at the RNA level, characterizing the manner by which splicing factors and auxiliary proteins interact with splicing signals, thereby enabling, facilitating and regulating splicing. However, we increasingly understand that splicing is not an isolated process; rather it occurs co-transcriptionally and is presumably also regulated by transcription-related processes. In fact, studies by our group and others over the past year suggest that DNA structure in terms of nucleosome positioning and specific histone modifications, which have a well established role in transcription, may also have a role in splicing. In this review we discuss evidence for the coupling between transcription and splicing, focusing on recent findings suggesting a link between chromatin structure and splicing, and highlighting challenges this emerging field is facing.
Collapse
|