1
MIR retrotransposons link the epigenome and the transcriptome of coding genes in acute myeloid leukemia. Nat Commun 2022; 13:6524. [PMID: 36316347 PMCID: PMC9622910 DOI: 10.1038/s41467-022-34211-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2021] [Accepted: 10/18/2022] [Indexed: 11/06/2022] Open
Abstract
DNMT3A and IDH1/2 mutations combinatorically regulate the transcriptome and the epigenome in acute myeloid leukemia; yet the mechanisms of this interplay are unknown. Using a systems approach within topologically associating domains, we find that genes with significant expression-methylation correlations are enriched in signaling and metabolic pathways. The common denominator across these methylation-regulated genes is the density in MIR retrotransposons of their introns. Moreover, a discrete number of CpGs overlapping enhancers are responsible for regulating most of these genes. Established mouse models recapitulate the dependency of MIR-rich genes on the balanced expression of epigenetic modifiers, while projection of leukemic profiles onto normal hematopoiesis ones further consolidates the dependencies of methylation-regulated genes on MIRs. Collectively, MIR elements on genes and enhancers are susceptible to changes in DNA methylation activity and explain the cooperativity of proteins in this pathway in normal and malignant hematopoiesis.
2
Shi L, Magee P, Fassan M, Sahoo S, Leong HS, Lee D, Sellers R, Brullé-Soumaré L, Cairo S, Monteverde T, Volinia S, Smith DD, Di Leva G, Galuppini F, Paliouras AR, Zeng K, O'Keefe R, Garofalo M. A KRAS-responsive long non-coding RNA controls microRNA processing. Nat Commun 2021; 12:2038. [PMID: 33795683 PMCID: PMC8016872 DOI: 10.1038/s41467-021-22337-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Received: 04/15/2020] [Accepted: 03/02/2021] [Indexed: 12/13/2022] Open
Abstract
Wild-type KRAS (KRASWT) amplification has been shown to be a secondary means of KRAS activation in cancer and associated with poor survival. Nevertheless, the precise role of KRASWT overexpression in lung cancer progression is largely unexplored. Here, we identify and characterize a KRAS-responsive lncRNA, KIMAT1 (ENSG00000228709) and show that it correlates with KRAS levels both in cell lines and in lung cancer specimens. Mechanistically, KIMAT1 is a MYC target and drives lung tumorigenesis by promoting the processing of oncogenic microRNAs (miRNAs) through DHX9 and NPM1 stabilization while halting the biogenesis of miRNAs with tumor suppressor function via MYC-dependent silencing of p21, a component of the Microprocessor Complex. KIMAT1 knockdown suppresses not only KRAS expression but also KRAS downstream signaling, thereby arresting lung cancer growth in vitro and in vivo. Taken together, this study uncovers a role for KIMAT1 in maintaining a positive feedback loop that sustains KRAS signaling during lung cancer progression and provides a proof of principle that interfering with KIMAT1 could be a strategy to hamper KRAS-induced tumorigenesis.
Affiliation(s)
- Lei Shi
- Transcriptional Networks in Lung Cancer Group, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Cancer Research UK Lung Cancer Centre of Excellence, at Manchester and University College London, London, UK
- Peter Magee
- Transcriptional Networks in Lung Cancer Group, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Cancer Research UK Lung Cancer Centre of Excellence, at Manchester and University College London, London, UK
- Matteo Fassan
- Department of Medicine, Surgical Pathology & Cytopathology Unit, University of Padua, Padua, Italy
- Sudhakar Sahoo
- Computational Biology Support, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Hui Sun Leong
- Computational Biology Support, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Dave Lee
- Computational Biology Support, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Robert Sellers
- Computational Biology Support, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Tiziana Monteverde
- Transcriptional Networks in Lung Cancer Group, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Cancer Research UK Lung Cancer Centre of Excellence, at Manchester and University College London, London, UK
- Stefano Volinia
- Department of Morphology, Surgery and Experimental Medicine, University of Ferrara, Ferrara, Italy
- Duncan D Smith
- Biological Mass Spectrometry Facility, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Gianpiero Di Leva
- School of Pharmacy and Bioengineering, Keele University, Stoke-on-Trent, UK
- Francesca Galuppini
- Department of Medicine, Surgical Pathology & Cytopathology Unit, University of Padua, Padua, Italy
- Athanasios R Paliouras
- Transcriptional Networks in Lung Cancer Group, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Cancer Research UK Lung Cancer Centre of Excellence, at Manchester and University College London, London, UK
- Kang Zeng
- Imaging & Cytometry Facility, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Raymond O'Keefe
- Division of Evolution & Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Michela Garofalo
- Transcriptional Networks in Lung Cancer Group, Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
- Cancer Research UK Lung Cancer Centre of Excellence, at Manchester and University College London, London, UK
3
Clayton EA, Rishishwar L, Huang TC, Gulati S, Ban D, McDonald JF, Jordan IK. An atlas of transposable element-derived alternative splicing in cancer. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190342. [PMID: 32075558 PMCID: PMC7061986 DOI: 10.1098/rstb.2019.0342] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Accepted: 11/06/2019] [Indexed: 12/18/2022] Open
Abstract
Transposable element (TE)-derived sequences comprise more than half of the human genome, and their presence has been documented to alter gene expression in a number of different ways, including the generation of alternatively spliced transcript isoforms. Alternative splicing has been associated with tumorigenesis for a number of different cancers. The objective of this study was to broadly characterize the role of human TEs in generating alternatively spliced transcript isoforms in cancer. To do so, we screened for the presence of TE-derived sequences co-located with alternative splice sites that are differentially used in normal versus cancer tissues. We analysed a comprehensive set of alternative splice variants characterized for 614 matched normal-tumour tissue pairs across 13 cancer types, resulting in the discovery of 4820 TE-generated alternative splice events distributed among 723 cancer-associated genes. Short interspersed nuclear elements (Alu) and long interspersed nuclear elements (L1) were found to contribute the majority of TE-generated alternative splice sites in cancer genes. A number of cancer-associated genes, including MYH11, WHSC1 and CANT1, were shown to have overexpressed TE-derived isoforms across a range of cancer types. TE-derived isoforms were also linked to cancer-specific fusion transcripts, suggesting a novel mechanism for the generation of transcriptome diversity via trans-splicing mediated by dispersed TE repeats. This article is part of a discussion meeting issue 'Crossroads between transposons and gene regulation'.
Affiliation(s)
- Evan A. Clayton
- Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- Lavanya Rishishwar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- PanAmerican Bioinformatics Institute, Cali, Colombia
- Applied Bioinformatics Laboratory, Atlanta, GA, USA
- Tzu-Chuan Huang
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- Saurabh Gulati
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- Dongjo Ban
- Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- John F. McDonald
- Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- I. King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- PanAmerican Bioinformatics Institute, Cali, Colombia
- Applied Bioinformatics Laboratory, Atlanta, GA, USA
4
Villanueva‐Cañas JL, Rech GE, Cara MAR, González J. Beyond SNPs: how to detect selection on transposable element insertions. Methods Ecol Evol 2017. [DOI: 10.1111/2041-210x.12781] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Indexed: 11/26/2022]
Affiliation(s)
- Gabriel E. Rech
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra) Barcelona Spain
- Maria Angeles Rodriguez Cara
- Ecoanthropology and Ethnobiology Laboratory, UMR 7206, CNRS/MNHN/Universite Paris 7 Museum National d'Histoire Naturelle F‐75116 Paris France
- Josefa González
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra) Barcelona Spain
5
Shapiro JA. Nothing in Evolution Makes Sense Except in the Light of Genomics: Read-Write Genome Evolution as an Active Biological Process. BIOLOGY 2016; 5:E27. [PMID: 27338490 PMCID: PMC4929541 DOI: 10.3390/biology5020027] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Received: 02/12/2016] [Revised: 05/20/2016] [Accepted: 06/02/2016] [Indexed: 01/15/2023]
Abstract
The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago, GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
6
Jjingo D, Conley AB, Wang J, Mariño-Ramírez L, Lunyak VV, Jordan IK. Mammalian-wide interspersed repeat (MIR)-derived enhancers and the regulation of human gene expression. Mob DNA 2014; 5:14. [PMID: 25018785 PMCID: PMC4090950 DOI: 10.1186/1759-8753-5-14] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Received: 08/30/2013] [Accepted: 04/10/2014] [Indexed: 11/26/2022] Open
Abstract
Background Mammalian-wide interspersed repeats (MIRs) are the most ancient family of transposable elements (TEs) in the human genome. The deep conservation of MIRs initially suggested the possibility that they had been exapted to play functional roles for their host genomes. MIRs also happen to be the only TEs whose presence in-and-around human genes is positively correlated to tissue-specific gene expression. Similar associations of enhancer prevalence within genes and tissue-specific expression, along with MIRs’ previous implication as providing regulatory sequences, suggested a possible link between MIRs and enhancers. Results To test the possibility that MIRs contribute functional enhancers to the human genome, we evaluated the relationship between MIRs and human tissue-specific enhancers in terms of genomic location, chromatin environment, regulatory function, and mechanistic attributes. This analysis revealed MIRs to be highly concentrated in enhancers of the K562 and HeLa human cell-types. Significantly more enhancers were found to be linked to MIRs than would be expected by chance, and putative MIR-derived enhancers are characterized by a chromatin environment highly similar to that of canonical enhancers. MIR-derived enhancers show strong associations with gene expression levels, tissue-specific gene expression and tissue-specific cellular functions, including a number of biological processes related to erythropoiesis. MIR-derived enhancers were found to be a rich source of transcription factor binding sites, underscoring one possible mechanistic route for the element sequences co-option as enhancers. There is also tentative evidence to suggest that MIR-enhancer function is related to the transcriptional activity of non-coding RNAs. 
Conclusions Taken together, these data reveal enhancers to be an important cis-regulatory platform from which MIRs can exercise a regulatory function in the human genome and help to resolve a long-standing conundrum as to the reason for MIRs’ deep evolutionary conservation.
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
- Andrew B Conley
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
- Jianrong Wang
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
- Leonardo Mariño-Ramírez
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA; PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
- Victoria V Lunyak
- PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia; Buck Institute for Research on Aging, Novato, CA, USA
- I King Jordan
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
7
Hénaff E, Vives C, Desvoyes B, Chaurasia A, Payet J, Gutierrez C, Casacuberta JM. Extensive amplification of the E2F transcription factor binding sites by transposons during evolution of Brassica species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2014; 77:852-62. [PMID: 24447172 DOI: 10.1111/tpj.12434] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Received: 10/31/2013] [Revised: 12/24/2013] [Accepted: 01/09/2014] [Indexed: 05/10/2023]
Abstract
Transposable elements (TEs) are major players in genome evolution. The effects of their movement vary from gene knockouts to more subtle effects such as changes in gene expression. It has recently been shown that TEs may contain transcription factor binding sites (TFBSs), and it has been proposed that they may rewire new genes into existing transcriptional networks. However, little is known about the dynamics of this process and its effect on transcription factor binding. Here we show that TEs have extensively amplified the number of sequences that match the E2F TFBS during Brassica speciation, and, as a result, as many as 85% of the sequences that fit the E2F TFBS consensus are within TEs in some Brassica species. We show that these sequences found within TEs bind E2Fa in vivo, which indicates a direct effect of these TEs on E2F-mediated gene regulation. Our results suggest that the TEs located close to genes may directly participate in gene promoters, whereas those located far from genes may have an indirect effect by diluting the effective amount of E2F protein able to bind to its cognate promoters. These results illustrate an extreme case of the effect of TEs in TFBS evolution, and suggest a singular way by which they affect host genes by modulating essential transcriptional networks.
Affiliation(s)
- Elizabeth Hénaff
- Center for Research in Agricultural Genomics, Consejo Superior de Investigaciones Científicas-Institut de Recerca i Tecnologia Agroalimentàries-Universitat Autònoma de Barcelona-Universitat de Barcelona, Campus Universitat Autònoma de Barcelona, Bellaterra - Cerdanyola del Vallès, 08193, Barcelona, Spain
8
Abstract
The development of rigorous molecular taxonomy pioneered by Carl Woese has freed evolution science to explore numerous cellular activities that lead to genome change in evolution. These activities include symbiogenesis, inter- and intracellular horizontal DNA transfer, incorporation of DNA from infectious agents, and natural genetic engineering, especially the activity of mobile elements. This article reviews documented examples of all these processes and proposes experiments to extend our understanding of cell-mediated genome change.
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology; University of Chicago; Chicago, IL USA
9
Carareto CMA, Hernandez EH, Vieira C. Genomic regions harboring insecticide resistance-associated Cyp genes are enriched by transposable element fragments carrying putative transcription factor binding sites in two sibling Drosophila species. Gene 2013; 537:93-9. [PMID: 24361809 DOI: 10.1016/j.gene.2013.11.080] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Received: 07/17/2011] [Revised: 11/27/2013] [Accepted: 11/30/2013] [Indexed: 11/27/2022]
Abstract
In the present study, an in silico analysis was performed to identify transposable element (TE) fragments inserted in Cyps with functions associated with resistance to insecticides and developmental regulation as well as in neighboring genes in two sibling species, Drosophila melanogaster and Drosophila simulans. The Cyps associated with insecticide resistance and their neighboring non-Cyp genes have accumulated a greater number of TE fragments than the other Cyps or a random sample of genes, predominantly in the 5'-flanking regions. Most of the insertions were due to DNA transposons, with DNAREP1 fragments being the most common. These fragments carry putative binding sites for transcription factors, which reinforces the hypothesis that DNAREP1 may influence gene regulation and play a role in the adaptation of the Drosophila species.
Affiliation(s)
- Claudia M A Carareto
- UNESP-Univ. Estadual Paulista, Departamento de Biologia, Laboratório de Evolução Molecular, 15054-1000 São José do Rio Preto, São Paulo, Brazil
- Eric H Hernandez
- UNESP-Univ. Estadual Paulista, Departamento de Biologia, Laboratório de Evolução Molecular, 15054-1000 São José do Rio Preto, São Paulo, Brazil
- Cristina Vieira
- Université de Lyon, F-69000, Lyon, Université Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622, Villeurbanne, France; Institut Universitaire de France, France
10
Testori A, Caizzi L, Cutrupi S, Friard O, De Bortoli M, Cora' D, Caselle M. The role of Transposable Elements in shaping the combinatorial interaction of Transcription Factors. BMC Genomics 2012; 13:400. [PMID: 22897927 PMCID: PMC3478180 DOI: 10.1186/1471-2164-13-400] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 04/12/2012] [Accepted: 06/28/2012] [Indexed: 12/22/2022] Open
Abstract
Background In the last few years several studies have shown that Transposable Elements (TEs) in the human genome are significantly associated with Transcription Factor Binding Sites (TFBSs) and that in several cases their expansion within the genome led to a substantial rewiring of the regulatory network. Another important feature of the regulatory network which has been thoroughly studied is the combinatorial organization of transcriptional regulation. In this paper we combine these two observations and suggest that TEs, besides rewiring the network, also played a central role in the evolution of particular patterns of combinatorial gene regulation. Results To address this issue we searched for TEs overlapping Estrogen Receptor α (ERα) binding peaks in two publicly available ChIP-seq datasets from the MCF7 cell line corresponding to different modalities of exposure to estrogen. We found a remarkable enrichment of a few specific classes of Transposons. Among these a prominent role was played by MIR (Mammalian Interspersed Repeats) transposons. These TEs underwent a dramatic expansion at the beginning of the mammalian radiation and then stabilized. We conjecture that the special affinity of ERα for the MIR class of TEs could be at the origin of the important role assumed by ERα in Mammalians. We then searched for TFBSs within the TEs overlapping ChIP-seq peaks. We found a strong enrichment of a few precise combinations of TFBS. In several cases the corresponding Transcription Factors (TFs) were known cofactors of ERα, thus supporting the idea of a co-regulatory role of TFBS within the same TE. Moreover, most of these correlations turned out to be strictly associated to specific classes of TEs thus suggesting the presence of a well-defined "transposon code" within the regulatory network.
Conclusions In this work we tried to shed light into the role of Transposable Elements (TEs) in shaping the regulatory network of higher eukaryotes. To test this idea we focused on a particular transcription factor: the Estrogen Receptor α (ERα) and we found that ERα preferentially targets a well defined set of TEs and that these TEs host combinations of transcriptional regulators involving several of known co-regulators of ERα. Moreover, a significant number of these TEs turned out to be conserved between human and mouse and located in the vicinity (and thus candidate to be regulators) of important estrogen-related genes.
Affiliation(s)
- Alessandro Testori
- Center for Molecular Systems Biology, University of Turin, Turin, Candiolo I-10060, Italy.
11
Rebollo R, Romanish MT, Mager DL. Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu Rev Genet 2012; 46:21-42. [PMID: 22905872 DOI: 10.1146/annurev-genet-110711-155621] [Citation(s) in RCA: 352] [Impact Index Per Article: 29.3] [Indexed: 02/06/2023]
Abstract
The fact that transposable elements (TEs) can influence host gene expression was first recognized more than 50 years ago. However, since that time, TEs have been widely regarded as harmful genetic parasites-selfish elements that are rarely co-opted by the genome to serve a beneficial role. Here, we survey recent findings that relate to TE impact on host genes and remind the reader that TEs, in contrast to other noncoding parts of the genome, are uniquely suited to gene regulatory functions. We review recent studies that demonstrate the role of TEs in establishing and rewiring gene regulatory networks and discuss the overall ubiquity of exaptation. We suggest that although individuals within a population can be harmed by the deleterious effects of new TE insertions, the presence of TE sequences in a genome is of overall benefit to the population.
Affiliation(s)
- Rita Rebollo
- Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, Canada.
12
Bire S, Rouleux-Bonnin F. Transposable elements as tools for reshaping the genome: it is a huge world after all! Methods Mol Biol 2012; 859:1-28. [PMID: 22367863 DOI: 10.1007/978-1-61779-603-6_1] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Indexed: 05/31/2023]
Abstract
Transposable elements (TEs) are discrete pieces of DNA that can move from one site to another within genomes and sometimes between genomes. They are found in all major branches of life. Because of their wide distribution and considerable diversity, they are a considerable source of genomic variation and as such, they constitute powerful drivers of genome evolution. Moreover, it is becoming clear that the epigenetic regulation of certain genes is derived from defense mechanisms against the activity of ancestral transposable elements. TEs now tend to be viewed as natural molecular tools that can reshape the genome, which challenges the idea that TEs are natural tools used to answer biological questions. In the first part of this chapter, we review the classification and distribution of TEs, and look at how they have contributed to the structural and transcriptional reshaping of genomes. In the second part, we describe methodological innovations that have modified their contribution as molecular tools.
Affiliation(s)
- Solenne Bire
- GICC, UMR CNRS 6239, Université François Rabelais, UFR des Sciences et Techniques, Tours, France
13
Zemojtel T, Kielbasa SM, Arndt PF, Behrens S, Bourque G, Vingron M. CpG deamination creates transcription factor-binding sites with high efficiency. Genome Biol Evol 2011; 3:1304-11. [PMID: 22016335 PMCID: PMC3228489 DOI: 10.1093/gbe/evr107] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Indexed: 12/11/2022] Open
Abstract
The formation of new transcription factor–binding sites (TFBSs) has a major impact on the evolution of gene regulatory networks. Clearly, single nucleotide mutations arising within genomic DNA can lead to the creation of TFBSs. Are molecular processes inducing single nucleotide mutations contributing equally to the creation of TFBSs? In the human genome, a spontaneous deamination of methylated cytosine in the context of CpG dinucleotides results in the creation of thymine (C → T), and this mutation has the highest rate among all base substitutions. CpG deamination has been ascribed a role in silencing of transposons and induction of variation in regional methylation. We have previously shown that CpG deamination created thousands of p53-binding sites within genomic sequences of Alu transposons. Interestingly, we have defined a ∼30 bp region in Alu sequence, which, depending on a pattern of CpG deamination, can be converted to functional p53-, PAX-6-, and Myc-binding sites. Here, we have studied single nucleotide mutational events leading to creation of TFBSs in promoters of human genes and in genomic regions bound by such key transcription factors as Oct4, NANOG, and c-Myc. We document that CpG deamination events can create TFBSs with much higher efficiency than other types of mutational events. Our findings add a new role to CpG methylation: We propose that deamination of methylated CpGs constitutes one of the evolutionary forces acting on mutational trajectories of TFBSs formation contributing to variability in gene regulation.
Affiliation(s)
- Tomasz Zemojtel
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany.
14
Jjingo D, Huda A, Gundapuneni M, Mariño-Ramírez L, Jordan IK. Effect of the transposable element environment of human genes on gene length and expression. Genome Biol Evol 2011; 3:259-71. [PMID: 21362639 PMCID: PMC3070429 DOI: 10.1093/gbe/evr015] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Indexed: 11/17/2022] Open
Abstract
Independent lines of investigation have documented effects of both transposable elements (TEs) and gene length (GL) on gene expression. However, TE gene fractions are highly correlated with GL, suggesting that they cannot be considered independently. We evaluated the TE environment of human genes and GL jointly in an attempt to tease apart their relative effects. TE gene fractions and GL were compared with the overall level of gene expression and the breadth of expression across tissues. GL is strongly correlated with overall expression level but weakly correlated with the breadth of expression, confirming the selection hypothesis that attributes the compactness of highly expressed genes to selection for economy of transcription. However, TE gene fractions overall, and for the L1 family in particular, show stronger anticorrelations with expression level than GL, indicating that GL may not be the most important target of selection for transcriptional economy. These results suggest a specific mechanism, removal of TEs, by which highly expressed genes are selectively tuned for efficiency. MIR elements are the only family of TEs with gene fractions that show a positive correlation with tissue-specific expression, suggesting that they may provide regulatory sequences that help to control human gene expression. Consistent with this notion, MIR fractions are relatively enriched close to transcription start sites and associated with coexpression in specific sets of related tissues. Our results confirm the overall relevance of the TE environment to gene expression and point to distinct mechanisms by which different TE families may contribute to gene regulation.
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology, GA, USA
15
Shou C, Bhardwaj N, Lam HYK, Yan KK, Kim PM, Snyder M, Gerstein MB. Measuring the evolutionary rewiring of biological networks. PLoS Comput Biol 2011; 7:e1001050. [PMID: 21253555 PMCID: PMC3017101 DOI: 10.1371/journal.pcbi.1001050] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.7] [Received: 06/08/2010] [Accepted: 12/03/2010] [Indexed: 11/18/2022] Open
Abstract
We have accumulated a large amount of biological network data and expect even more to come. Soon, we anticipate being able to compare many different biological networks as we commonly do for molecular sequences. It has long been believed that many of these networks change, or "rewire", at different rates. It is therefore important to develop a framework to quantify the differences between networks in a unified fashion. We developed such a formalism based on analogy to simple models of sequence evolution, and used it to conduct a systematic study of network rewiring on all the currently available biological networks. We found that, similar to sequences, biological networks show a decreased rate of change at large time divergences, because of saturation in potential substitutions. However, different types of biological networks consistently rewire at different rates. Using comparative genomics and proteomics data, we found a consistent ordering of the rewiring rates: transcription regulatory, phosphorylation regulatory, genetic interaction, miRNA regulatory, protein interaction, and metabolic pathway network, from fast to slow. This ordering was found in all comparisons we did of matched networks between organisms. To gain further intuition on network rewiring, we compared our observed rewirings with those obtained from simulation. We also investigated how readily our formalism could be mapped to other network contexts; in particular, we showed how it could be applied to analyze changes in a range of "commonplace" networks such as family trees, co-authorships and linux-kernel function dependencies.
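To make the idea of a rewiring rate concrete, here is a minimal sketch that compares two networks over their shared (for example, orthologous) nodes and reports the fraction of changed edges per unit of divergence time. This is not the formalism from the paper: the normalization by the union of edges, the undirected treatment of edges, and the name `rewiring_rate` are assumptions chosen for simplicity.

```python
def rewiring_rate(edges_a, edges_b, shared_nodes, divergence_time):
    """Fraction of edges gained or lost among shared nodes, per unit time.

    Edges are treated as undirected pairs; a directed network (such as a
    TF-to-target network) would need ordered tuples instead of frozensets.
    """
    def restrict(edges):
        # Keep only non-self edges whose endpoints are both shared.
        return {
            frozenset(e)
            for e in edges
            if len(set(e)) == 2 and set(e) <= shared_nodes
        }

    a, b = restrict(edges_a), restrict(edges_b)
    union = a | b
    if not union:
        return 0.0
    changed = a ^ b  # edges present in exactly one of the two networks
    return len(changed) / len(union) / divergence_time

# Hypothetical toy networks (not data from the paper).
net_a = [("A", "B"), ("B", "C"), ("C", "D")]
net_b = [("A", "B"), ("B", "D"), ("C", "D"), ("D", "E")]
shared = {"A", "B", "C", "D"}

print(rewiring_rate(net_a, net_b, shared, divergence_time=100.0))
```

A raw fraction like this one would saturate at large divergences, which is the effect the abstract describes; correcting for it requires an explicit substitution model, which this sketch omits.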
Affiliation(s)
- Chong Shou
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Nitin Bhardwaj
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Hugo Y. K. Lam
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Koon-Kiu Yan
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Philip M. Kim
- Terrence Donnelly Center for Cellular and Biomolecular Research, Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Michael Snyder
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Mark B. Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
- Department of Computer Science, Yale University, New Haven, Connecticut, United States of America
|
16
|
Early Career Research Award Lecture. Structure, evolution and dynamics of transcriptional regulatory networks. Biochem Soc Trans 2011; 38:1155-78. [PMID: 20863280 DOI: 10.1042/bst0381155] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Indexed: 01/09/2023]
Abstract
The availability of entire genome sequences and the wealth of literature on gene regulation have enabled researchers to model an organism's transcriptional regulation system in the form of a network. In such a network, TFs (transcription factors) and TGs (target genes) are represented as nodes and regulatory interactions between TFs and TGs are represented as directed links. In the present review, I address the following topics pertaining to transcriptional regulatory networks. (i) Structure and organization: first, I introduce the concept of networks and discuss our understanding of the structure and organization of transcriptional networks. (ii) Evolution: I then describe the different mechanisms and forces that influence network evolution and shape network structure. (iii) Dynamics: I discuss studies that have integrated information on dynamics, such as mRNA abundance or half-life, with data on transcriptional networks in order to elucidate general principles of regulatory network dynamics. In particular, I discuss how cell-to-cell variability in the expression level of TFs could permit differential utilization of the same underlying network by distinct members of a genetically identical cell population. Finally, I conclude by discussing open questions for future research and highlighting the implications for evolution, development, disease and applications such as genetic engineering.
|
17
|
Genomic neighbourhood and the regulation of gene expression. Curr Opin Cell Biol 2010; 22:326-33. [DOI: 10.1016/j.ceb.2010.04.004] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Received: 02/01/2010] [Revised: 04/12/2010] [Accepted: 04/13/2010] [Indexed: 12/31/2022]
|
18
|
Ladunga I. An overview of the computational analyses and discovery of transcription factor binding sites. Methods Mol Biol 2010; 674:1-22. [PMID: 20827582 DOI: 10.1007/978-1-60761-854-6_1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 05/29/2023]
Abstract
Here we provide a pragmatic, high-level overview of the computational approaches and tools for the discovery of transcription factor binding sites. Unraveling transcription regulatory networks and their malfunctions such as cancer became feasible due to recent stellar progress in experimental techniques and computational analyses. While predictions of isolated sites still pose notorious challenges, cis-regulatory modules (clusters) of binding sites can now be identified with high accuracy. Further support comes from conserved DNA segments, co-regulation, transposable elements, nucleosomes, and three-dimensional chromosomal structures. We introduce computational tools for the analysis and interpretation of chromatin immunoprecipitation, next-generation sequencing, SELEX, and protein-binding microarray results. Because immunoprecipitation produces overly large DNA segments and well over half of the sequencing reads constitute background noise, methods are presented for background correction, sequence read mapping, peak calling, false discovery rate estimation, and co-localization analyses. To discover short binding site motifs from extensive immunoprecipitation segments, we recommend algorithms and software based on expectation maximization and Gibbs sampling. Data integration using several databases further improves performance. Binding sites can be visualized in genomic and chromatin context using genome browsers. Binding site information, integrated with co-expression in large compendia of gene expression experiments, allows us to reveal complex transcriptional regulatory networks.
Affiliation(s)
- Istvan Ladunga
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE, USA.
|
19
|
Identification of transcription factor binding sites derived from transposable element sequences using ChIP-seq. Methods Mol Biol 2010; 674:225-40. [PMID: 20827595 DOI: 10.1007/978-1-60761-854-6_14] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 01/27/2023]
Abstract
Transposable elements (TEs) form a substantial fraction of the non-coding DNA of many eukaryotic genomes. There are numerous examples of TEs being exapted for regulatory function by the host, many of which were identified through their high conservation. However, given that TEs are often the youngest part of a genome and typically exhibit a high turnover, conservation-based methods will fail to identify lineage- or species-specific exaptations. ChIP-seq has become a very popular and effective method for identifying in vivo DNA-protein interactions, such as those seen at transcription factor binding sites (TFBS), and has been used to show that there are a large number of TE-derived TFBS. Many of these TE-derived TFBS show poor conservation and would go unnoticed using conservation screens. Here, we describe a simple pipeline method for using data generated through ChIP-seq to identify TE-derived TFBS.
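The abstract does not spell out the pipeline, but a central step in any such analysis is likely to be intersecting ChIP-seq peak intervals with annotated repeat intervals. The sketch below shows that step only, assuming BED-style half-open coordinates; the function name `te_derived_peaks`, the `min_overlap` parameter, and the toy intervals are hypothetical.

```python
from collections import defaultdict

def te_derived_peaks(peaks, repeats, min_overlap=1):
    """Return peaks that overlap an annotated TE by at least min_overlap bp.

    peaks:   iterable of (chrom, start, end), half-open coordinates
    repeats: iterable of (chrom, start, end, family), half-open coordinates
    """
    by_chrom = defaultdict(list)
    for chrom, start, end, family in repeats:
        by_chrom[chrom].append((start, end, family))

    hits = []
    for chrom, p_start, p_end in peaks:
        # Simple quadratic scan per chromosome; fine for a sketch, but a
        # sorted index or interval tree would be needed at genome scale.
        for r_start, r_end, family in by_chrom.get(chrom, []):
            overlap = min(p_end, r_end) - max(p_start, r_start)
            if overlap >= min_overlap:
                hits.append((chrom, p_start, p_end, family, overlap))
    return hits

# Hypothetical toy intervals (not data from the chapter).
peaks = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 50, 80)]
repeats = [
    ("chr1", 150, 300, "MIR"),
    ("chr1", 600, 700, "L1"),
    ("chr2", 10, 60, "Alu"),
]

for hit in te_derived_peaks(peaks, repeats):
    print(hit)
```

Because the coordinates are half-open, the peak ending at 600 and the repeat starting at 600 merely touch and are not reported as overlapping.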
|