1
|
Mañes-García J, Marco-Ferreres R, Beccari L. Shaping gene expression and its evolution by chromatin architecture and enhancer activity. Curr Top Dev Biol 2024; 159:406-437. [PMID: 38729683 DOI: 10.1016/bs.ctdb.2024.01.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2024]
Abstract
Transcriptional regulation plays a pivotal role in orchestrating the intricate genetic programs governing embryonic development. The expression of developmental genes relies on the combined activity of several cis-regulatory elements (CREs), such as enhancers and silencers, which can be located at long linear distances from the genes that they regulate and that interact with them through establishment of chromatin loops. Mutations affecting their activity or interaction with their target genes can lead to developmental disorders and are thought to have importantly contributed to the evolution of the animal body plan. The income of next-generation-sequencing approaches has allowed identifying over a million of sequences with putative regulatory potential in the human genome. Characterizing their function and establishing gene-CREs maps is essential to decode the logic governing developmental gene expression and is one of the major challenges of the post-genomic era. Chromatin 3D organization plays an essential role in determining how CREs specifically contact their target genes while avoiding deleterious off-target interactions. Our understanding of these aspects has greatly advanced with the income of chromatin conformation capture techniques and fluorescence microscopy approaches to visualize the organization of DNA elements in the nucleus. Here we will summarize relevant aspects of how the interplay between CRE activity and chromatin 3D organization regulates developmental gene expression and how it relates to pathological conditions and the evolution of animal body plan.
Collapse
Affiliation(s)
| | | | - Leonardo Beccari
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, Madrid, Spain.
| |
Collapse
|
2
|
Liu X, Chen M, Qu X, Liu W, Dou Y, Liu Q, Shi D, Jiang M, Li H. Cis-Regulatory Elements in Mammals. Int J Mol Sci 2023; 25:343. [PMID: 38203513 PMCID: PMC10779164 DOI: 10.3390/ijms25010343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 12/21/2023] [Accepted: 12/23/2023] [Indexed: 01/12/2024] Open
Abstract
In cis-regulatory elements, enhancers and promoters with complex molecular interactions are used to coordinate gene transcription through physical proximity and chemical modifications. These processes subsequently influence the phenotypic characteristics of an organism. An in-depth exploration of enhancers and promoters can substantially enhance our understanding of gene regulatory networks, shedding new light on mammalian development, evolution and disease pathways. In this review, we provide a comprehensive overview of the intrinsic structural attributes, detection methodologies as well as the operational mechanisms of enhancers and promoters, coupled with the relevant novel and innovative investigative techniques used to explore their actions. We further elucidated the state-of-the-art research on the roles of enhancers and promoters in the realms of mammalian development, evolution and disease, and we conclude with forward-looking insights into prospective research avenues.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Mingsheng Jiang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science and Technology, Guangxi University, Nanning 530005, China
| | - Hui Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi Key Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science and Technology, Guangxi University, Nanning 530005, China
| |
Collapse
|
3
|
Gebrie A. Transposable elements as essential elements in the control of gene expression. Mob DNA 2023; 14:9. [PMID: 37596675 PMCID: PMC10439571 DOI: 10.1186/s13100-023-00297-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 08/08/2023] [Indexed: 08/20/2023] Open
Abstract
Interspersed repetitions called transposable elements (TEs), commonly referred to as mobile elements, make up a significant portion of the genomes of higher animals. TEs contribute in controlling the expression of genes locally and even far away at the transcriptional and post-transcriptional levels, which is one of their significant functional effects on gene function and genome evolution. There are different mechanisms through which TEs control the expression of genes. First, TEs offer cis-regulatory regions in the genome with their inherent regulatory features for their own expression, making them potential factors for controlling the expression of the host genes. Promoter and enhancer elements contain cis-regulatory sites generated from TE, which function as binding sites for a variety of trans-acting factors. Second, a significant portion of miRNAs and long non-coding RNAs (lncRNAs) have been shown to have TEs that encode for regulatory RNAs, revealing the TE origin of these RNAs. Furthermore, it was shown that TE sequences are essential for these RNAs' regulatory actions, which include binding to the target mRNA. By being a member of cis-regulatory and regulatory RNA sequences, TEs therefore play essential regulatory roles. Additionally, it has been suggested that TE-derived regulatory RNAs and cis-regulatory regions both contribute to the evolutionary novelty of gene regulation. Additionally, these regulatory systems arising from TE frequently have tissue-specific functions. The objective of this review is to discuss TE-mediated gene regulation, with a particular emphasis on the processes, contributions of various TE types, differential roles of various tissue types, based mostly on recent studies on humans.
Collapse
Affiliation(s)
- Alemu Gebrie
- Department of Biomedical Sciences, School of Medicine, Debre Markos University, Debre Markos, Ethiopia.
| |
Collapse
|
4
|
Fueyo R, Judd J, Feschotte C, Wysocka J. Roles of transposable elements in the regulation of mammalian transcription. Nat Rev Mol Cell Biol 2022; 23:481-497. [PMID: 35228718 PMCID: PMC10470143 DOI: 10.1038/s41580-022-00457-y] [Citation(s) in RCA: 109] [Impact Index Per Article: 54.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/25/2022] [Indexed: 12/16/2022]
Abstract
Transposable elements (TEs) comprise about half of the mammalian genome. TEs often contain sequences capable of recruiting the host transcription machinery, which they use to express their own products and promote transposition. However, the regulatory sequences carried by TEs may affect host transcription long after the TEs have lost the ability to transpose. Recent advances in genome analysis and engineering have facilitated systematic interrogation of the regulatory activities of TEs. In this Review, we discuss diverse mechanisms by which TEs contribute to transcription regulation. Notably, TEs can donate enhancer and promoter sequences that influence the expression of host genes, modify 3D chromatin architecture and give rise to novel regulatory genes, including non-coding RNAs and transcription factors. We discuss how TEs spur regulatory evolution and facilitate the emergence of genetic novelties in mammalian physiology and development. By virtue of their repetitive and interspersed nature, TEs offer unique opportunities to dissect the effects of mutation and genomic context on the function and evolution of cis-regulatory elements. We argue that TE-centric studies hold the key to unlocking general principles of transcription regulation and evolution.
Collapse
Affiliation(s)
- Raquel Fueyo
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Julius Judd
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Cedric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University School of Medicine, Stanford, CA, USA.
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
5
|
Frith MC. Paleozoic Protein Fossils Illuminate the Evolution of Vertebrate Genomes and Transposable Elements. Mol Biol Evol 2022; 39:6555113. [PMID: 35348724 PMCID: PMC9004415 DOI: 10.1093/molbev/msac068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genomes hold a treasure trove of protein fossils: fragments of formerly protein-coding DNA, which mainly come from transposable elements (TEs) or host genes. These fossils reveal ancient evolution of TEs and genomes, and many fossils have been exapted to perform diverse functions important for the host's fitness. However, old and highly-degraded fossils are hard to identify, standard methods (e.g. BLAST) are not optimized for this task, and few Paleozoic protein fossils have been found. Here, a recently optimized method is used to find protein fossils in vertebrate genomes. It finds Paleozoic fossils predating the amphibian/amniote divergence from most major TE categories, including virus-related Polinton and Gypsy elements. It finds 10 fossils in the human genome (8 from TEs and 2 from host genes) that predate the last common ancestor of all jawed vertebrates, probably from the Ordovician period. It also finds types of transposon and retrotransposon not found in human before. These fossils have extreme sequence conservation, indicating exaptation: some have evidence of gene-regulatory function, and they tend to lienearest to developmental genes. Some ancient fossils suggest "genome tectonics", where two fragments of one TE have drifted apart by up to megabases, possibly explaining gene deserts and large introns. This paints a picture of great TE diversity in our aquatic ancestors, with patchy TE inheritance by later vertebrates, producing new genes and regulatory elements on the way. Host-gene fossils too have contributed anciently-conserved DNA segments. This paves the way to further studies of ancient protein fossils.
Collapse
Affiliation(s)
- Martin C Frith
- Artificial Intelligence Research Center, AIST, Tokyo, Japan.,Graduate School of Frontier Sciences, University of Tokyo, Chiba, Japan.,Computational Bio Big-Data Open Innovation Laboratory, AIST, Tokyo, Japan
| |
Collapse
|
6
|
Diehl AG, Ouyang N, Boyle AP. Transposable elements contribute to cell and species-specific chromatin looping and gene regulation in mammalian genomes. Nat Commun 2020; 11:1796. [PMID: 32286261 PMCID: PMC7156512 DOI: 10.1038/s41467-020-15520-5] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Accepted: 03/14/2020] [Indexed: 01/12/2023] Open
Abstract
Chromatin looping is important for gene regulation, and studies of 3D chromatin structure across species and cell types have improved our understanding of the principles governing chromatin looping. However, 3D genome evolution and its relationship with natural selection remains largely unexplored. In mammals, the CTCF protein defines the boundaries of most chromatin loops, and variations in CTCF occupancy are associated with looping divergence. While many CTCF binding sites fall within transposable elements (TEs), their contribution to 3D chromatin structural evolution is unknown. Here we report the relative contributions of TE-driven CTCF binding site expansions to conserved and divergent chromatin looping in human and mouse. We demonstrate that TE-derived CTCF binding divergence may explain a large fraction of variable loops. These variable loops contribute significantly to corresponding gene expression variability across cells and species, possibly by refining sub-TAD-scale loop contacts responsible for cell-type-specific enhancer-promoter interactions.
Collapse
Affiliation(s)
- Adam G Diehl
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Ningxin Ouyang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Alan P Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
7
|
Choudhary MN, Friedman RZ, Wang JT, Jang HS, Zhuo X, Wang T. Co-opted transposons help perpetuate conserved higher-order chromosomal structures. Genome Biol 2020; 21:16. [PMID: 31973766 PMCID: PMC6979391 DOI: 10.1186/s13059-019-1916-8] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Accepted: 12/08/2019] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Transposable elements (TEs) make up half of mammalian genomes and shape genome regulation by harboring binding sites for regulatory factors. These include binding sites for architectural proteins, such as CTCF, RAD21, and SMC3, that are involved in tethering chromatin loops and marking domain boundaries. The 3D organization of the mammalian genome is intimately linked to its function and is remarkably conserved. However, the mechanisms by which these structural intricacies emerge and evolve have not been thoroughly probed. RESULTS Here, we show that TEs contribute extensively to both the formation of species-specific loops in humans and mice through deposition of novel anchoring motifs, as well as to the maintenance of conserved loops across both species through CTCF binding site turnover. The latter function demonstrates the ability of TEs to contribute to genome plasticity and reinforce conserved genome architecture as redundant loop anchors. Deleting such candidate TEs in human cells leads to the collapse of conserved loop and domain structures. These TEs are also marked by reduced DNA methylation and bear mutational signatures of hypomethylation through evolutionary time. CONCLUSIONS TEs have long been considered a source of genetic innovation. By examining their contribution to genome topology, we show that TEs can contribute to regulatory plasticity by inducing redundancy and potentiating genetic drift locally while conserving genome architecture globally, revealing a paradigm for defining regulatory conservation in the noncoding genome beyond classic sequence-level conservation.
Collapse
Affiliation(s)
- Mayank Nk Choudhary
- The Edison Family Center for Genome Sciences & Systems Biology, Department of Genetics, Washington University, 4515 McKinley Avenue, Campus Box 8510, St. Louis, MO, 63110, USA
| | - Ryan Z Friedman
- The Edison Family Center for Genome Sciences & Systems Biology, Department of Genetics, Washington University, 4515 McKinley Avenue, Campus Box 8510, St. Louis, MO, 63110, USA
| | - Julia T Wang
- The Edison Family Center for Genome Sciences & Systems Biology, Department of Genetics, Washington University, 4515 McKinley Avenue, Campus Box 8510, St. Louis, MO, 63110, USA
| | - Hyo Sik Jang
- The Edison Family Center for Genome Sciences & Systems Biology, Department of Genetics, Washington University, 4515 McKinley Avenue, Campus Box 8510, St. Louis, MO, 63110, USA
| | - Xiaoyu Zhuo
- The Edison Family Center for Genome Sciences & Systems Biology, Department of Genetics, Washington University, 4515 McKinley Avenue, Campus Box 8510, St. Louis, MO, 63110, USA
| | - Ting Wang
- The Edison Family Center for Genome Sciences & Systems Biology, Department of Genetics, Washington University, 4515 McKinley Avenue, Campus Box 8510, St. Louis, MO, 63110, USA.
| |
Collapse
|
8
|
Casanova EL, Konkel MK. The Developmental Gene Hypothesis for Punctuated Equilibrium: Combined Roles of Developmental Regulatory Genes and Transposable Elements. Bioessays 2020; 42:e1900173. [PMID: 31943266 DOI: 10.1002/bies.201900173] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 11/30/2019] [Indexed: 12/13/2022]
Abstract
Theories of the genetics underlying punctuated equilibrium (PE) have been vague to date. Here the developmental gene hypothesis is proposed, which states that: 1) developmental regulatory (DevReg) genes are responsible for the orchestration of metazoan morphogenesis and their extreme conservation and mutation intolerance generates the equilibrium or stasis present throughout much of the fossil record and 2) the accumulation of regulatory elements and recombination within these same genes-often derived from transposable elements-drives punctuated bursts of morphological divergence and speciation across metazoa. This two-part hypothesis helps to explain the features that characterize PE, providing a theoretical genetic basis for the once-controversial theory. Also see the video abstract here https://youtu.be/C-fu-ks5yDs.
Collapse
Affiliation(s)
- Emily L Casanova
- Department of Biomedical Sciences, University of South Carolina School of Medicine at Greenville, 200A Patewood Dr., Greenville, SC, 29615, USA
| | - Miriam K Konkel
- Department of Genetics, Clemson Center for Human Genetics, Biomedical Data Science and Informatics Program, Clemson University, 105 Collings St., Clemson, SC, 29631, USA
| |
Collapse
|
9
|
Mustafin RN, Kazantseva AV, Enikeeva RF, Davydova YD, Karunas AS, Malykh SB, Khusnutdinova EK. Epigenetics of Aggressive Behavior. RUSS J GENET+ 2019. [DOI: 10.1134/s1022795419090096] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
10
|
Mustafin RN, Khusnutdinova EK. The role of transposable elements in the ecological morphogenesis under the influence of stress. Vavilovskii Zhurnal Genet Selektsii 2019. [DOI: 10.18699/vj19.506] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
In natural selection, insertional mutagenesis is an important source of genome variability. Transposons are sensors of environmental stress effects, which contribute to adaptation and speciation. These effects are due to changes in the mechanisms of morphogenesis, since transposons contain regulatory sequences that have cis and trans effects on specific protein-coding genes. In variability of genomes, the horizontal transfer of transposons plays an important role, because it contributes to changing the composition of transposons and the acquisition of new properties. Transposons are capable of site-specific transpositions, which lead to the activation of stress response genes. Transposons are sources of non-coding RNA, transcription factors binding sites and protein-coding genes due to domestication, exonization, and duplication. These genes contain nucleotide sequences that interact with non-coding RNAs processed from transposons transcripts, and therefore they are under the control of epigenetic regulatory networks involving transposons. Therefore, inherited features of the location and composition of transposons, along with a change in the phenotype, play an important role in the characteristics of responding to a variety of environmental stressors. This is the basis for the selection and survival of organisms with a specific composition and arrangement of transposons that contribute to adaptation under certain environmental conditions. In evolution, the capability to transpose into specific genome sites, regulate gene expression, and interact with transcription factors, along with the ability to respond to stressors, is the basis for rapid variability and speciation by altering the regulation of ontogenesis. The review presents evidence of tissue-specific and stage-specific features of transposon activation and their role in the regulation of cell differentiation to confirm their role in ecological morphogenesis.
Collapse
Affiliation(s)
| | - E. K. Khusnutdinova
- Bashkir State Medical University;
Institute of Biochemistry and Genetics – Subdivision of the Ufa Federal Research Centre of RAS
| |
Collapse
|
11
|
Guliaev AS, Semyenova SK. MGERT: a pipeline to retrieve coding sequences of mobile genetic elements from genome assemblies. Mob DNA 2019; 10:21. [PMID: 31114637 PMCID: PMC6515669 DOI: 10.1186/s13100-019-0163-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 04/26/2019] [Indexed: 12/17/2022] Open
Abstract
Background Genomes of eukaryotes are inhabited by myriads of mobile genetic elements (MGEs) – transposons and retrotransposons - which play a great role in genome plasticity and evolution. A lot of computational tools were developed to annotate them either in genomic assemblies or raw reads using de novo or homology-based approaches. But there has been no pipeline enabling users to get coding and flanking sequences of MGEs suitable for a downstream analysis from genome assemblies. Results We developed a new pipeline, MGERT (Mobile Genetic Elements Retrieving Tool), that automates all the steps necessary to obtain protein-coding sequences of mobile genetic elements from genomic assemblies even if no previous knowledge on MGE content of a particular genome is available. Conclusions Using MGERT, researchers can easily find MGEs, their coding and flanking sequences in the genome of interest. Thus, this pipeline helps researchers to focus on the biological analysis of MGEs rather than excessive scripting and pipelining. Electronic supplementary material The online version of this article (10.1186/s13100-019-0163-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andrei S Guliaev
- Laboratory of Genome Organization, Institute of Gene Biology of the Russian Academy of Sciences, Vavilov Str., 34/5, Moscow, 119334 Russia
| | - Seraphima K Semyenova
- Laboratory of Genome Organization, Institute of Gene Biology of the Russian Academy of Sciences, Vavilov Str., 34/5, Moscow, 119334 Russia
| |
Collapse
|
12
|
Mustafin RN, Khusnutdinova EK. Prospects in the Search for Peptides for Specific Regulation of Aging. ADVANCES IN GERONTOLOGY 2019. [DOI: 10.1134/s2079057019020176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
13
|
Mustafin RN. Functional Dualism of Transposon Transcripts in Evolution of Eukaryotic Genomes. Russ J Dev Biol 2019. [DOI: 10.1134/s1062360418070019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
14
|
Mustafin RN. The Relationship between Transposons and Transcription Factors in the Evolution of Eukaryotes. J EVOL BIOCHEM PHYS+ 2019. [DOI: 10.1134/s0022093019010022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
15
|
Exaptation at the molecular genetic level. SCIENCE CHINA-LIFE SCIENCES 2018; 62:437-452. [PMID: 30798493 DOI: 10.1007/s11427-018-9447-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Accepted: 12/01/2018] [Indexed: 12/22/2022]
Abstract
The realization that body parts of animals and plants can be recruited or coopted for novel functions dates back to, or even predates the observations of Darwin. S.J. Gould and E.S. Vrba recognized a mode of evolution of characters that differs from adaptation. The umbrella term aptation was supplemented with the concept of exaptation. Unlike adaptations, which are restricted to features built by selection for their current role, exaptations are features that currently enhance fitness, even though their present role was not a result of natural selection. Exaptations can also arise from nonaptations; these are characters which had previously been evolving neutrally. All nonaptations are potential exaptations. The concept of exaptation was expanded to the molecular genetic level which aided greatly in understanding the enormous potential of neutrally evolving repetitive DNA-including transposed elements, formerly considered junk DNA-for the evolution of genes and genomes. The distinction between adaptations and exaptations is outlined in this review and examples are given. Also elaborated on is the fact that such distinctions are sometimes more difficult to determine; this is a widespread phenomenon in biology, where continua abound and clear borders between states and definitions are rare.
Collapse
|
16
|
Dunn-Fletcher CE, Muglia LM, Pavlicev M, Wolf G, Sun MA, Hu YC, Huffman E, Tumukuntala S, Thiele K, Mukherjee A, Zoubovsky S, Zhang X, Swaggart KA, Lamm KYB, Jones H, Macfarlan TS, Muglia LJ. Anthropoid primate-specific retroviral element THE1B controls expression of CRH in placenta and alters gestation length. PLoS Biol 2018; 16:e2006337. [PMID: 30231016 PMCID: PMC6166974 DOI: 10.1371/journal.pbio.2006337] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Revised: 10/01/2018] [Accepted: 09/10/2018] [Indexed: 01/22/2023] Open
Abstract
Pregnancy and parturition are intricately regulated to ensure successful reproductive outcomes. However, the factors that control gestational length in humans and other anthropoid primates remain poorly defined. Here, we show the endogenous retroviral long terminal repeat transposon-like human element 1B (THE1B) selectively controls placental expression of corticotropin-releasing hormone (CRH) that, in turn, influences gestational length and birth timing. Placental expression of CRH and subsequently prolonged gestational length were found in two independent strains of transgenic mice carrying a 180-kb human bacterial artificial chromosome (BAC) DNA that contained the full length of CRH and extended flanking regions, including THE1B. Restricted deletion of THE1B silenced placental CRH expression and normalized birth timing in these transgenic lines. Furthermore, we revealed an interaction at the 5′ insertion site of THE1B with distal-less homeobox 3 (DLX3), a transcription factor expressed in placenta. Together, these findings suggest that retroviral insertion of THE1B into the anthropoid primate genome may have initiated expression of CRH in placental syncytiotrophoblasts via DLX3 and that this placental CRH is sufficient to alter the timing of birth. The proper timing of delivery is critical during pregnancy; if too early or too late, the baby will be at risk of serious health problems and even death. Corticotropin-releasing hormone (CRH) is a protein that can be detected in maternal blood, and its concentration correlates with the timing of birth. In humans and other anthropoid primates, CRH is made by the placenta, whereas in other mammals, it is produced in a specialized region of the brain. To understand the regulation and evolution of this key protein, we inserted the human CRH gene and nearby regions into the mouse genome, which resulted in human CRH expression in the mouse placenta. Mouse litters that make CRH in their placentas are born later than control mice, showing that CRH can directly affect birth timing. Using our mouse model, we then selectively deleted a remnant of an ancient retrovirus that is normally found in the DNA of anthropoid primates and demonstrated that this specific region controls expression of CRH in the placenta. Deletion of this region also restored normal birth timing in the mice by eliminating CRH production from the placenta. We propose that retroviral regulation of CRH in the placenta may be a mechanism of controlling birth timing in humans and other anthropoid primates.
Collapse
Affiliation(s)
- Caitlin E. Dunn-Fletcher
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
- * E-mail: (CED); (LJM)
| | - Lisa M. Muglia
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Mihaela Pavlicev
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Gernot Wolf
- The Eunice Kennedy Shriver National Institute of Child Health and Human Development, The National Institutes of Health, Bethesda, Maryland, United States of America
| | - Ming-An Sun
- The Eunice Kennedy Shriver National Institute of Child Health and Human Development, The National Institutes of Health, Bethesda, Maryland, United States of America
| | - Yueh-Chiang Hu
- Division of Developmental Biology, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Elizabeth Huffman
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Shivani Tumukuntala
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Katri Thiele
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Amrita Mukherjee
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Sandra Zoubovsky
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Xuzhe Zhang
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Kayleigh A. Swaggart
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Katherine Y. Bezold Lamm
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Helen Jones
- Division of Pediatric Surgery, Cincinnati Children’s Hospital Medical Center, Department of Surgery, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
| | - Todd S. Macfarlan
- The Eunice Kennedy Shriver National Institute of Child Health and Human Development, The National Institutes of Health, Bethesda, Maryland, United States of America
| | - Louis J. Muglia
- Division of Human Genetics, Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States of America
- * E-mail: (CED); (LJM)
| |
Collapse
|
17
|
Mustafin RN, Khusnutdinova EK. Epigenetic Hypothesis of the Role of Peptides in Aging. ADVANCES IN GERONTOLOGY 2018. [DOI: 10.1134/s2079057018030128] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
18
|
Lv X, Cui Z, Li H, Li J, Yang Z, Bi Y, Gao M, Zhou B, Yin Z. Polymorphism in lncRNA AC008392.1 and its interaction with smoking on the risk of lung cancer in a Chinese population. Cancer Manag Res 2018; 10:1377-1387. [PMID: 29881308 PMCID: PMC5985799 DOI: 10.2147/cmar.s160818] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Purpose To evaluate the association between rs7248320 in lncRNA AC008392.1 and the risk of lung cancer, this case–control study was carried out in a Chinese population. This study also evaluated the gene–environment interaction between rs7248320 and exposure to smoking status on the risk of lung cancer. Patients and methods We conducted a hospital-based case–control study including 512 lung cancer cases and 588 healthy controls. The association between rs7248320 and the risk of lung cancer was analyzed, and the gene–environment interaction was estimated on an additive scale. Results The variant genotype of rs7248320 was significantly related to the risk of non-small-cell lung cancer (NSCLC). Individuals carrying homozygous GG genotype had decreased risk of NSCLC, compared with individuals carrying the homozygous wild AA genotype/heterozygote GA genotype (adjusted odds ratio [OR] =0.653, 95% confidence interval [CI] =0.442–0.966, P=0.033). Moreover, in the subgroup of ages, there were statistically significant associations between rs7248320 and the risk of lung cancer and NSCLC in the population over 60 years of age. Compared with individuals carrying genotypes AA/GA, individuals with genotype GG had the lower risk of lung cancer and NSCLC (adjusted ORs were 0.579 and 0.433, 95% CIs were 0.338–0.994 and 0.231–0.811, P-values were 0.048 and 0.009, respectively). Compared with homozygote AA, the homozygote GG was associated with a decreased risk in NSCLC (OR =0.456, 95% CI =0.235–0.887, P=0.021). There were no statistically significant results in gene–environment interactions on an additive scale. Conclusion These findings suggest that lncRNA AC008392.1 rs7248320 may be involved in genetic susceptibility to NSCLC in a Chinese population.
Collapse
Affiliation(s)
- Xiaoting Lv
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Zhigang Cui
- School of Nursing, China Medical University, Shenyang, 110122, People's Republic of China
| | - Hang Li
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Juan Li
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Zitai Yang
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Yanhong Bi
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Min Gao
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Baosen Zhou
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| | - Zhihua Yin
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, 110122, People's Republic of China.,Key Laboratory of Cancer Etiology and Intervention, University of Liaoning Province, Shenyang, 110122, People's Republic of China
| |
Collapse
|
19
|
Diehl AG, Boyle AP. Conserved and species-specific transcription factor co-binding patterns drive divergent gene regulation in human and mouse. Nucleic Acids Res 2018; 46:1878-1894. [PMID: 29361190 PMCID: PMC5829737 DOI: 10.1093/nar/gky018] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Revised: 12/15/2017] [Accepted: 01/08/2018] [Indexed: 12/24/2022] Open
Abstract
The mouse is widely used as system to study human genetic mechanisms. However, extensive rewiring of transcriptional regulatory networks often confounds translation of findings between human and mouse. Site-specific gain and loss of individual transcription factor binding sites (TFBS) has caused functional divergence of orthologous regulatory loci, and so we must look beyond this positional conservation to understand common themes of regulatory control. Fortunately, transcription factor co-binding patterns shared across species often perform conserved regulatory functions. These can be compared to 'regulatory sentences' that retain the same meanings regardless of sequence and species context. By analyzing TFBS co-occupancy patterns observed in four human and mouse cell types, we learned a regulatory grammar: the rules by which TFBS are combined into meaningful regulatory sentences. Different parts of this grammar associate with specific sets of functional annotations regardless of sequence conservation and predict functional signatures more accurately than positional conservation. We further show that both species-specific and conserved portions of this grammar are involved in gene expression divergence and human disease risk. These findings expand our understanding of transcriptional regulatory mechanisms, suggesting that phenotypic divergence and disease risk are driven by a complex interplay between deeply conserved and species-specific transcriptional regulatory pathways.
Collapse
Affiliation(s)
- Adam G Diehl
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Alan P Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
20
|
Comparative Analysis of Immune Cells Reveals a Conserved Regulatory Lexicon. Cell Syst 2018; 6:381-394.e7. [PMID: 29454939 DOI: 10.1016/j.cels.2018.01.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Revised: 10/06/2017] [Accepted: 12/30/2017] [Indexed: 12/13/2022]
Abstract
Most well-characterized enhancers are deeply conserved. In contrast, genome-wide comparative studies of steady-state systems showed that only a small fraction of active enhancers are conserved. To better understand conservation of enhancer activity, we used a comparative genomics approach that integrates temporal expression and epigenetic profiles in an innate immune system. We found that gene expression programs diverge among mildly induced genes, while being highly conserved for strongly induced genes. The fraction of conserved enhancers varies greatly across gene expression programs, with induced genes and early-response genes, in particular, being regulated by a higher fraction of conserved enhancers. Clustering of conserved accessible DNA sequences within enhancers resulted in over 60 sequence motifs including motifs for known factors, as well as many with unknown function. We further show that the number of instances of these motifs is a strong predictor of the responsiveness of a gene to pathogen detection.
Collapse
|
21
|
Platt RN, Vandewege MW, Ray DA. Mammalian transposable elements and their impacts on genome evolution. Chromosome Res 2018; 26:25-43. [PMID: 29392473 PMCID: PMC5857283 DOI: 10.1007/s10577-017-9570-z] [Citation(s) in RCA: 136] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Revised: 12/12/2017] [Accepted: 12/28/2017] [Indexed: 12/22/2022]
Abstract
Transposable elements (TEs) are genetic elements with the ability to mobilize and replicate themselves in a genome. Mammalian genomes are dominated by TEs, which can reach copy numbers in the hundreds of thousands. As a result, TEs have had significant impacts on mammalian evolution. Here we summarize the current understanding of TE content in mammal genomes and find that, with a few exceptions, most fall within a predictable range of observations. First, one third to one half of the genome is derived from TEs. Second, most mammalian genomes are dominated by LINE and SINE retrotransposons, more limited LTR retrotransposons, and minimal DNA transposon accumulation. Third, most mammal genome contains at least one family of actively accumulating retrotransposon. Finally, horizontal transfer of TEs among lineages is rare. TE exaptation events are being recognized with increasing frequency. Despite these beneficial aspects of TE content and activity, the majority of TE insertions are neutral or deleterious. To limit the deleterious effects of TE proliferation, the genome has evolved several defense mechanisms that act at the epigenetic, transcriptional, and post-transcriptional levels. The interaction between TEs and these defense mechanisms has led to an evolutionary arms race where TEs are suppressed, evolve to escape suppression, then are suppressed again as the defense mechanisms undergo compensatory change. The result is complex and constantly evolving interactions between TEs and host genomes.
Collapse
Affiliation(s)
- Roy N Platt
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA.
| | | | - David A Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| |
Collapse
|
22
|
Craig RJ, Suh A, Wang M, Ellegren H. Natural selection beyond genes: Identification and analyses of evolutionarily conserved elements in the genome of the collared flycatcher (Ficedula albicollis). Mol Ecol 2018; 27:476-492. [DOI: 10.1111/mec.14462] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 11/28/2017] [Accepted: 11/28/2017] [Indexed: 12/13/2022]
Affiliation(s)
- Rory J. Craig
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
- Institute of Evolutionary Biology; School of Biological Sciences; University of Edinburgh; Edinburgh UK
| | - Alexander Suh
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| | - Mi Wang
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| |
Collapse
|
23
|
Simonti CN, Pavličev M, Capra JA. Transposable Element Exaptation into Regulatory Regions Is Rare, Influenced by Evolutionary Age, and Subject to Pleiotropic Constraints. Mol Biol Evol 2017; 34:2856-2869. [PMID: 28961735 PMCID: PMC5850124 DOI: 10.1093/molbev/msx219] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Transposable element (TE)-derived sequences make up approximately half of most mammalian genomes, and many TEs have been co-opted into gene regulatory elements. However, we lack a comprehensive tissue- and genome-wide understanding of how and when TEs gain regulatory activity in their hosts. We evaluated the prevalence of TE-derived DNA in enhancers and promoters across hundreds of human and mouse cell lines and primary tissues. Promoters are significantly depleted of TEs in all tissues compared with their overall prevalence in the genome (P < 0.001); enhancers are also depleted of TEs, though not as strongly as promoters. The degree of enhancer depletion also varies across contexts (1.5-3×), with reproductive and immune cells showing the highest levels of TE regulatory activity in humans. Overall, in spite of the regulatory potential of many TE sequences, they are significantly less active in gene regulation than expected from their prevalence. TE age is predictive of the likelihood of enhancer activity; TEs originating before the divergence of amniotes are 9.2 times more likely to have enhancer activity than TEs that integrated in great apes. Context-specific enhancers are more likely to be TE-derived than enhancers active in multiple tissues, and young TEs are more likely to overlap context-specific enhancers than old TEs (86% vs. 47%). Once TEs obtain enhancer activity in the host, they have similar functional dynamics to one another and non-TE-derived enhancers, likely driven by pleiotropic constraints. However, a few TE families, most notably endogenous retroviruses, have greater regulatory potential. Our observations suggest a model of regulatory co-option in which TE-derived sequences are initially repressed, after which a small fraction obtains context-specific enhancer activity, with further gains subject to pleiotropic constraints.
Collapse
Affiliation(s)
| | - Mihaela Pavličev
- Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH
| | - John A. Capra
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN
- Department of Biological Sciences, Vanderbilt University, Nashville, TN
| |
Collapse
|
24
|
Abstract
What made us human? Gene expression changes clearly played a significant part in human evolution, but pinpointing the causal regulatory mutations is hard. Comparative genomics enabled the identification of human accelerated regions (HARs) and other human-specific genome sequences. The major challenge in the past decade has been to link diverged sequences to uniquely human biology. This review discusses approaches to this problem, progress made at the molecular level, and prospects for moving towards genetic causes for uniquely human biology.
Collapse
Affiliation(s)
- Lucía F Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA, 94158, USA. .,Department of Epidemiology & Biostatistics, Institute for Human Genetics, Institute for Computational Health Sciences, University of California, San Francisco, CA, 94158, USA.
| |
Collapse
|
25
|
Genetic susceptibility in childhood acute lymphoblastic leukemia. Med Oncol 2017; 34:179. [PMID: 28905228 DOI: 10.1007/s12032-017-1038-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Accepted: 09/05/2017] [Indexed: 12/27/2022]
Abstract
Acute lymphoblastic leukemia (ALL) is the most common childhood malignancy and a leading cause of death due to disease in children. The genetic basis of ALL susceptibility has been supported by its association with certain congenital disorders and, more recently, by several genome-wide association studies (GWAS). These GWAS identified common variants in ARID5B, IKZF1, CEBPE, CDKN2A, PIP4K2A, LHPP and ELK3 influencing ALL risk. However, the risk variants of these SNPs were not validated in all populations, suggesting that some of the loci could be population specific. On the other hand, the currently identified risk SNPs in these genes only account for 19% of the additive heritable risk. This estimation indicates that additional susceptibility variants could be discovered. In this review, we will provide an overview of the most important findings carried out in genetic susceptibility of childhood ALL in all GWAS and subsequent studies and we will also point to future directions that could be explored in the near future.
Collapse
|
26
|
Villanueva‐Cañas JL, Rech GE, Cara MAR, González J. Beyond
SNP
s: how to detect selection on transposable element insertions. Methods Ecol Evol 2017. [DOI: 10.1111/2041-210x.12781] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
| | - Gabriel E. Rech
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra) Barcelona Spain
| | - Maria Angeles Rodriguez Cara
- Ecoanthropology and Ethnobiology Laboratory, UMR 7206, CNRS/MNHN/Universite Paris 7 Museum National d'HistoireNaturelle F‐75116 Paris France
| | - Josefa González
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra) Barcelona Spain
| |
Collapse
|
27
|
Zovoilis A, Cifuentes-Rojas C, Chu HP, Hernandez AJ, Lee JT. Destabilization of B2 RNA by EZH2 Activates the Stress Response. Cell 2017; 167:1788-1802.e13. [PMID: 27984727 DOI: 10.1016/j.cell.2016.11.041] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 10/03/2016] [Accepted: 11/22/2016] [Indexed: 12/12/2022]
Abstract
More than 98% of the mammalian genome is noncoding, and interspersed transposable elements account for ∼50% of noncoding space. Here, we demonstrate that a specific interaction between the polycomb protein EZH2 and RNA made from B2 SINE retrotransposons controls stress-responsive genes in mouse cells. In the heat-shock model, B2 RNA binds stress genes and suppresses their transcription. Upon stress, EZH2 is recruited and triggers cleavage of B2 RNA. B2 degradation in turn upregulates stress genes. Evidence indicates that B2 RNA operates as a "speed bump" against advancement of RNA polymerase II, and temperature stress releases the brakes on transcriptional elongation. These data attribute a new function to EZH2 that is independent of its histone methyltransferase activity and reconcile how EZH2 can be associated with both gene repression and activation. Our study reveals that EZH2 and B2 together control activation of a large network of genes involved in thermal stress.
Collapse
Affiliation(s)
- Athanasios Zovoilis
- Howard Hughes Medical Institute; Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Genetics, Harvard Medical School, Boston, MA 02114, USA
| | - Catherine Cifuentes-Rojas
- Howard Hughes Medical Institute; Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Genetics, Harvard Medical School, Boston, MA 02114, USA
| | - Hsueh-Ping Chu
- Howard Hughes Medical Institute; Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Genetics, Harvard Medical School, Boston, MA 02114, USA
| | - Alfredo J Hernandez
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Jeannie T Lee
- Howard Hughes Medical Institute; Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA; Department of Genetics, Harvard Medical School, Boston, MA 02114, USA.
| |
Collapse
|
28
|
Chuong EB, Elde NC, Feschotte C. Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet 2016; 18:71-86. [PMID: 27867194 DOI: 10.1038/nrg.2016.139] [Citation(s) in RCA: 760] [Impact Index Per Article: 95.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Transposable elements (TEs) are a prolific source of tightly regulated, biochemically active non-coding elements, such as transcription factor-binding sites and non-coding RNAs. Many recent studies reinvigorate the idea that these elements are pervasively co-opted for the regulation of host genes. We argue that the inherent genetic properties of TEs and the conflicting relationships with their hosts facilitate their recruitment for regulatory functions in diverse genomes. We review recent findings supporting the long-standing hypothesis that the waves of TE invasions endured by organisms for eons have catalysed the evolution of gene-regulatory networks. We also discuss the challenges of dissecting and interpreting the phenotypic effect of regulatory activities encoded by TEs in health and disease.
Collapse
Affiliation(s)
- Edward B Chuong
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84103, USA
| | - Nels C Elde
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84103, USA
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84103, USA
| |
Collapse
|
29
|
Schemberger MO, Nogaroto V, Almeida MC, Artoni RF, Valente GT, Martins C, Moreira-Filho O, Cestari MM, Vicari MR. Sequence analyses and chromosomal distribution of the Tc1/Mariner element in Parodontidae fish (Teleostei: Characiformes). Gene 2016; 593:308-14. [DOI: 10.1016/j.gene.2016.08.034] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2016] [Revised: 07/31/2016] [Accepted: 08/20/2016] [Indexed: 10/21/2022]
|
30
|
Göke J, Ng HH. CTRL+INSERT: retrotransposons and their contribution to regulation and innovation of the transcriptome. EMBO Rep 2016; 17:1131-44. [PMID: 27402545 DOI: 10.15252/embr.201642743] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Accepted: 06/20/2016] [Indexed: 12/25/2022] Open
Abstract
The human genome contains millions of fragments from retrotransposons-highly repetitive DNA sequences that were once able to "copy and paste" themselves to other regions in the genome. However, the majority of retrotransposons have lost this capacity through acquisition of mutations or through endogenous silencing mechanisms. Without this imminent threat of transposition, retrotransposons have the potential to act as a major source of genomic innovation. Indeed, large numbers of retrotransposons have been found to be active in specific contexts: as gene regulatory elements and promoters for protein-coding genes or long noncoding RNAs, among others. In this review, we summarise recent findings about retrotransposons, with implications in gene expression regulation, the expansion of gene isoform diversity and the generation of long noncoding RNAs. We highlight key examples that demonstrate their role in cellular identity and their versatility as markers of cell states, and we discuss how their dysregulation may contribute to the formation of and possibly therapeutic response in human cancers.
Collapse
Affiliation(s)
- Jonathan Göke
- Computational and Systems Biology, Genome Institute of Singapore, Singapore
| | - Huck Hui Ng
- Gene Regulation Laboratory, Genome Institute of Singapore, Singapore Department of Biochemistry, National University of Singapore, Singapore Department of Biological Sciences, National University of Singapore, Singapore School of Biological Sciences, Nanyang Technological University, Singapore
| |
Collapse
|
31
|
Thompson PJ, Macfarlan TS, Lorincz MC. Long Terminal Repeats: From Parasitic Elements to Building Blocks of the Transcriptional Regulatory Repertoire. Mol Cell 2016; 62:766-76. [PMID: 27259207 PMCID: PMC4910160 DOI: 10.1016/j.molcel.2016.03.029] [Citation(s) in RCA: 162] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
The life cycle of endogenous retroviruses (ERVs), also called long terminal repeat (LTR) retrotransposons, begins with transcription by RNA polymerase II followed by reverse transcription and re-integration into the host genome. While most ERVs are relics of ancient integration events, "young" proviruses competent for retrotransposition-found in many mammals, but not humans-represent an ongoing threat to host fitness. As a consequence, several restriction pathways have evolved to suppress their activity at both transcriptional and post-transcriptional stages of the viral life cycle. Nevertheless, accumulating evidence has revealed that LTR sequences derived from distantly related ERVs have been exapted as regulatory sequences for many host genes in a wide range of cell types throughout mammalian evolution. Here, we focus on emerging themes from recent studies cataloging the diversity of ERV LTRs acting as important transcriptional regulatory elements in mammals and explore the molecular features that likely account for LTR exaptation in developmental and tissue-specific gene regulation.
Collapse
Affiliation(s)
- Peter J Thompson
- Department of Medical Genetics, Life Sciences Institute, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Todd S Macfarlan
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20892, USA.
| | - Matthew C Lorincz
- Department of Medical Genetics, Life Sciences Institute, University of British Columbia, Vancouver, BC V6T 1Z3, Canada.
| |
Collapse
|
32
|
Li H, Lv X. Functional annotation of noncoding variants and prioritization of cancer-associated lncRNAs in lung cancer. Oncol Lett 2016; 12:222-230. [PMID: 27347129 DOI: 10.3892/ol.2016.4604] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 04/01/2016] [Indexed: 11/05/2022] Open
Abstract
Multiple computational tools have been widely applied to the detection of coding driver mutations in cancer; however, the prioritization of pathogenic non-coding variants remains a difficult and demanding task. The present study was performed to distinguish non-coding disease-causing mutations from neutral ones, and to prioritize potential cancer-associated long non-coding RNAs (lncRNAs) with a logistic regression model in lung cancer. A logistic regression model was constructed, combining 19,153 disease-associated ClinVar and Human Gene Mutation Database pathogenic variants as the response variable and non-coding features as the predictor variable. Validation of the model was conducted with genome-wide association study (GWAS) disease- or trait-associated single nucleotide polymorphisms (SNPs) and recurrent somatic mutations. High scoring regions were characterized with respect to their distribution in various features and gene classes; potential cancer-associated lncRNA candidates were prioritized, combining the fraction of high-scoring regions and average score predicted by the logistic regression model. H3K79me2 was the most negative factor that contributed to the model, while conserved regions were most positively informative to the model. The area under the receiver operating characteristic curve of the model was 0.89. The model assigned a significantly higher score to GWAS SNPs and recurrent somatic mutations compared with neutral SNPs (mean, 5.9012 vs. 5.5238; P<0.001, Mann-Whitney U test) and non-recurrent mutations (mean, 5.4677 vs. 5.2277, P<0.001, Mann-Whitney U test), respectively. It was observed that regions, including splicing sites and untranslated regions, and gene classes, including cancer genes and cancer-associated lncRNAs, had an increased enrichment of high-scoring regions. In total, 2,679 cancer-associated lncRNAs were determined and characterized. A total of 104 of these lncRNAs were differentially expressed between lung cancer and normal specimens. The logistic regression model is a useful and efficient scoring system to prioritize non-coding pathogenic variants and lncRNAs, and may provide the basis for detecting non-coding driver lncRNAs in lung cancer.
Collapse
Affiliation(s)
- Hua Li
- Department of Anesthesiology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200072, P.R. China
| | - Xin Lv
- Department of Anesthesiology, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200072, P.R. China
| |
Collapse
|
33
|
Sciamanna I, De Luca C, Spadafora C. The Reverse Transcriptase Encoded by LINE-1 Retrotransposons in the Genesis, Progression, and Therapy of Cancer. Front Chem 2016; 4:6. [PMID: 26904537 PMCID: PMC4749692 DOI: 10.3389/fchem.2016.00006] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 01/26/2016] [Indexed: 12/24/2022] Open
Abstract
In higher eukaryotic genomes, Long Interspersed Nuclear Element 1 (LINE-1) retrotransposons represent a large family of repeated genomic elements. They transpose using a reverse transcriptase (RT), which they encode as part of the ORF2p product. RT inhibition in cancer cells, either via RNA interference-dependent silencing of active LINE-1 elements, or using RT inhibitory drugs, reduces cancer cell proliferation, promotes their differentiation and antagonizes tumor progression in animal models. Indeed, the non-nucleoside RT inhibitor efavirenz has recently been tested in a phase II clinical trial with metastatic prostate cancer patients. An in-depth analysis of ORF2p in a mouse model of breast cancer showed ORF2p to be precociously expressed in precancerous lesions and highly abundant in advanced cancer stages, while being barely detectable in normal breast tissue, providing a rationale for the finding that RT-expressing tumors are therapeutically sensitive to RT inhibitors. We summarize mechanistic and gene profiling studies indicating that abundant LINE-1-derived RT can “sequester” RNA substrates for reverse transcription in tumor cells, entailing the formation of RNA:DNA hybrid molecules and impairing the overall production of regulatory miRNAs, with a global impact on the cell transcriptome. Based on these data, LINE-1-ORF2 encoded RT has a tumor-promoting potential that is exerted at an epigenetic level. We propose a model whereby LINE1-RT drives a previously unrecognized global regulatory process, the deregulation of which drives cell transformation and tumorigenesis with possible implications for cancer cell heterogeneity.
Collapse
Affiliation(s)
| | | | - Corrado Spadafora
- Institute of Translational Pharmacology, National Resarch Council of Italy Rome, Italy
| |
Collapse
|
34
|
Fu Y, Liu Z, Lou S, Bedford J, Mu XJ, Yip KY, Khurana E, Gerstein M. FunSeq2: a framework for prioritizing noncoding regulatory variants in cancer. Genome Biol 2015; 15:480. [PMID: 25273974 PMCID: PMC4203974 DOI: 10.1186/s13059-014-0480-5] [Citation(s) in RCA: 226] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2014] [Indexed: 12/15/2022] Open
Abstract
Identification of noncoding drivers from thousands of somatic alterations in a typical tumor is a difficult and unsolved problem. We report a computational framework, FunSeq2, to annotate and prioritize these mutations. The framework combines an adjustable data context integrating large-scale genomics and cancer resources with a streamlined variant-prioritization pipeline. The pipeline has a weighted scoring system combining: inter- and intra-species conservation; loss- and gain-of-function events for transcription-factor binding; enhancer-gene linkages and network centrality; and per-element recurrence across samples. We further highlight putative drivers with information specific to a particular sample, such as differential expression. FunSeq2 is available from funseq2.gersteinlab.org.
Collapse
Affiliation(s)
- Yao Fu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | | | | | | | | | | | | | | |
Collapse
|
35
|
Li J, Drubay D, Michiels S, Gautheret D. Mining the coding and non-coding genome for cancer drivers. Cancer Lett 2015; 369:307-15. [PMID: 26433158 DOI: 10.1016/j.canlet.2015.09.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Revised: 09/24/2015] [Accepted: 09/24/2015] [Indexed: 12/20/2022]
Abstract
Progress in next-generation sequencing provides unprecedented opportunities to fully characterize the spectrum of somatic mutations of cancer genomes. Given the large number of somatic mutations identified by such technologies, the prioritization of cancer-driving events is a consistent bottleneck. Most bioinformatics tools concentrate on driver mutations in the coding fraction of the genome, those causing changes in protein products. As more non-coding pathogenic variants are identified and characterized, the development of computational approaches to effectively prioritize cancer-driving variants within the non-coding fraction of human genome is becoming critical. After a short summary of methods for coding variant prioritization, we here review the highly diverse non-coding elements that may act as cancer drivers and describe recent methods that attempt to evaluate the deleteriousness of sequence variation in these elements. With such tools, the prioritization and identification of cancer-implicated regulatory elements and non-coding RNAs is becoming a reality.
Collapse
Affiliation(s)
- Jia Li
- Institute for Integrative Biology of the Cell (I2BC), CNRS, CEA, Université Paris-Sud, Université Paris-Saclay, 91198 Gif sur Yvette, France
| | - Damien Drubay
- Service de Biostatistique et d'Epidemiologie, Gustave Roussy, Villejuif, France; INSERM U1018, CESP, Université Paris-Sud, Université Paris-Saclay, Villejuif, France
| | - Stefan Michiels
- Service de Biostatistique et d'Epidemiologie, Gustave Roussy, Villejuif, France; INSERM U1018, CESP, Université Paris-Sud, Université Paris-Saclay, Villejuif, France
| | - Daniel Gautheret
- Institute for Integrative Biology of the Cell (I2BC), CNRS, CEA, Université Paris-Sud, Université Paris-Saclay, 91198 Gif sur Yvette, France.
| |
Collapse
|
36
|
Nelson ADL, Shippen DE. Evolution of TERT-interacting lncRNAs: expanding the regulatory landscape of telomerase. Front Genet 2015; 6:277. [PMID: 26442096 PMCID: PMC4564757 DOI: 10.3389/fgene.2015.00277] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 08/17/2015] [Indexed: 12/17/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) evolve rapidly and are functionally diverse. The emergence of new lncRNAs is driven by genome disturbance events, including whole genome duplication, and transposition. One of the few lncRNAs with a conserved role throughout eukaryotes is the telomerase RNA, TER. TER works in concert with the telomerase reverse transcriptase (TERT) to maintain telomeres. Here we discuss recent findings from Arabidopsis thaliana and its relatives illustrating the remarkable evolutionary flexibility within TER and the potential for non-canonical TERT-lncRNA interactions. We highlight the two TERs in A. thaliana. One is a conventional telomerase template. The other lncRNA negatively regulates telomerase activity in response to DNA damage, a function mediated by co-option of a transposable element. In addition, we discuss evidence for multiple independent TER loci throughout the plant family Brassicaceae, and how these loci not only reflect rapid convergent evolution, but also the flexibility of having a lncRNA at the core of telomerase. Lastly, we discuss the propensity for TERT to bind a suite of non-templating lncRNAs, and how such RNAs may facilitate telomerase regulation and off-telomere functions.
Collapse
Affiliation(s)
| | - Dorothy E Shippen
- Department of Biochemistry and Biophysics, Texas A&M University , College Station, TX, USA
| |
Collapse
|
37
|
Young RS, Hayashizaki Y, Andersson R, Sandelin A, Kawaji H, Itoh M, Lassmann T, Carninci P, Bickmore WA, Forrest AR, Taylor MS. The frequent evolutionary birth and death of functional promoters in mouse and human. Genome Res 2015; 25:1546-57. [PMID: 26228054 PMCID: PMC4579340 DOI: 10.1101/gr.190546.115] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Accepted: 07/28/2015] [Indexed: 12/04/2022]
Abstract
Promoters are central to the regulation of gene expression. Changes in gene regulation are thought to underlie much of the adaptive diversification between species and phenotypic variation within populations. In contrast to earlier work emphasizing the importance of enhancer evolution and subtle sequence changes at promoters, we show that dramatic changes such as the complete gain and loss (collectively, turnover) of functional promoters are common. Using quantitative measures of transcription initiation in both humans and mice across 52 matched tissues, we discriminate promoter sequence gains from losses and resolve the lineage of changes. We also identify expression divergence and functional turnover between orthologous promoters, finding only the latter is associated with local sequence changes. Promoter turnover has occurred at the majority (>56%) of protein-coding genes since humans and mice diverged. Tissue-restricted promoters are the most evolutionarily volatile where retrotransposition is an important, but not the sole, source of innovation. There is considerable heterogeneity of turnover rates between promoters in different tissues, but the consistency of these in both lineages suggests that the same biological systems are similarly inclined to transcriptional rewiring. The genes affected by promoter turnover show evidence of adaptive evolution. In mice, promoters are primarily lost through deletion of the promoter containing sequence, whereas in humans, many promoters appear to be gradually decaying with weak transcriptional output and relaxed selective constraint. Our results suggest that promoter gain and loss is an important process in the evolutionary rewiring of gene regulation and may be a significant source of phenotypic diversification.
Collapse
Affiliation(s)
- Robert S Young
- MRC Human Genetics Unit, MRC Institute for Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, United Kingdom
| | - Yoshihide Hayashizaki
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama, 351-0198, Japan
| | - Robin Andersson
- Department of Biology and Biotech Research and Innovation Centre, Copenhagen University, 2200 Copenhagen N, Denmark
| | - Albin Sandelin
- Department of Biology and Biotech Research and Innovation Centre, Copenhagen University, 2200 Copenhagen N, Denmark
| | - Hideya Kawaji
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama, 351-0198, Japan; RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Masayoshi Itoh
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Saitama, 351-0198, Japan; RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Timo Lassmann
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Tsurumi-ku, Yokohama, 230-0045, Japan
| | - Piero Carninci
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Tsurumi-ku, Yokohama, 230-0045, Japan
| | | | - Wendy A Bickmore
- MRC Human Genetics Unit, MRC Institute for Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, United Kingdom
| | - Alistair R Forrest
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Tsurumi-ku, Yokohama, 230-0045, Japan; Systems Biology and Genomics, Harry Perkins Institute of Medical Research, QEII Medical Centre, Nedlands, Western Australia 6009, Australia
| | - Martin S Taylor
- MRC Human Genetics Unit, MRC Institute for Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, United Kingdom
| |
Collapse
|
38
|
Dhadi SR, Xu Z, Shaik R, Driscoll K, Ramakrishna W. Differential regulation of genes by retrotransposons in rice promoters. PLANT MOLECULAR BIOLOGY 2015; 87:603-13. [PMID: 25697955 DOI: 10.1007/s11103-015-0300-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2014] [Accepted: 02/16/2015] [Indexed: 05/06/2023]
Abstract
Rice genome harbors genes and promoters with retrotransposon insertions. There is very little information about their function. The effect of retrotransposon insertions in four rice promoter regions on gene regulation, was investigated using promoter-reporter gene constructs with and without retrotransposons. Differences in expression levels of gus and egfp reporter genes in forward orientation and rfp in reverse orientation were evaluated in rice plants with transient expression employing quantitative RT-PCR analysis, histochemical GUS staining, and eGFP and RFP fluorescent microscopy. The presence of SINE in the promoter 1 (P1) resulted in higher expression levels of the reporter genes, whereas the presence of LINE in P2 or gypsy LTR retrotransposon in P3 reduced expression of the reporter genes. Furthermore, the SINE in P1 acts as an enhancer in contrast with the LINE in P2 and the gypsy LTR retrotransposon in P3 which act as silencers. CTAA and CGG motifs in these retrotransposons are the likely candidates for the downregulation compared to TCTT motif (SINE) which is a candidate for the upregulation of gene expression. The effect of retrotransposons on gene regulation correlated with the earlier investigation of conservation patterns of these four retrotransposon insertions in several rice accessions implying their evolutionary significance.
Collapse
Affiliation(s)
- Surendar Reddy Dhadi
- Department of Biological Sciences, Michigan Technological University, Houghton, MI, 49931, USA
| | | | | | | | | |
Collapse
|
39
|
Grandbastien MA. LTR retrotransposons, handy hitchhikers of plant regulation and stress response. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2015; 1849:403-16. [DOI: 10.1016/j.bbagrm.2014.07.017] [Citation(s) in RCA: 110] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2014] [Revised: 07/21/2014] [Accepted: 07/23/2014] [Indexed: 11/30/2022]
|
40
|
Farré M, Robinson TJ, Ruiz-Herrera A. An Integrative Breakage Model of genome architecture, reshuffling and evolution: The Integrative Breakage Model of genome evolution, a novel multidisciplinary hypothesis for the study of genome plasticity. Bioessays 2015; 37:479-88. [PMID: 25739389 DOI: 10.1002/bies.201400174] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2014] [Revised: 02/12/2015] [Accepted: 02/13/2015] [Indexed: 12/23/2022]
Abstract
Our understanding of genomic reorganization, the mechanics of genomic transmission to offspring during germ line formation, and how these structural changes contribute to the speciation process, and genetic disease is far from complete. Earlier attempts to understand the mechanism(s) and constraints that govern genome remodeling suffered from being too narrowly focused, and failed to provide a unified and encompassing view of how genomes are organized and regulated inside cells. Here, we propose a new multidisciplinary Integrative Breakage Model for the study of genome evolution. The analysis of the high-level structural organization of genomes (nucleome), together with the functional constrains that accompany genome reshuffling, provide insights into the origin and plasticity of genome organization that may assist with the detection and isolation of therapeutic targets for the treatment of complex human disorders.
Collapse
Affiliation(s)
- Marta Farré
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, Campus UAB, Barcelona, Spain
| | | | | |
Collapse
|
41
|
Hoen DR, Bureau TE. Discovery of novel genes derived from transposable elements using integrative genomic analysis. Mol Biol Evol 2015; 32:1487-506. [PMID: 25713212 DOI: 10.1093/molbev/msv042] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Complex eukaryotes contain millions of transposable elements (TEs), comprising large fractions of their nuclear genomes. TEs consist of structural, regulatory, and coding sequences that are ordinarily associated with transposition, but that occasionally confer on the organism a selective advantage and may thereby become exapted. Exapted transposable element genes (ETEs) are known to play critical roles in diverse systems, from vertebrate adaptive immunity to plant development. Yet despite their evident importance, most ETEs have been identified fortuitously and few systematic searches have been conducted, suggesting that additional ETEs may await discovery. To explore this possibility, we develop a comprehensive systematic approach to searching for ETEs. We use TE-specific conserved domains to identify with high precision genes derived from TEs and screen them for signatures of exaptation based on their similarities to reference sets of known ETEs, conventional (non-TE) genes, and TE genes across diverse genetic attributes including repetitiveness, conservation of genomic location and sequence, and levels of expression and repressive small RNAs. Applying this approach in the model plant Arabidopsis thaliana, we discover a surprisingly large number of novel high confidence ETEs. Intriguingly, unlike known plant ETEs, several of the novel ETE families form tandemly arrayed gene clusters, whereas others are relatively young. Our results not only identify novel TE-derived genes that may have practical applications but also challenge the notion that TE exaptation is merely a relic of ancient life, instead suggesting that it may continue to fundamentally drive evolution.
Collapse
Affiliation(s)
- Douglas R Hoen
- Department of Biology, McGill University, Montréal, QC, Canada
| | - Thomas E Bureau
- Department of Biology, McGill University, Montréal, QC, Canada
| |
Collapse
|
42
|
Lynch VJ, Nnamani MC, Kapusta A, Brayer K, Plaza SL, Mazur EC, Emera D, Sheikh SZ, Grützner F, Bauersachs S, Graf A, Young SL, Lieb JD, DeMayo FJ, Feschotte C, Wagner GP. Ancient transposable elements transformed the uterine regulatory landscape and transcriptome during the evolution of mammalian pregnancy. Cell Rep 2015; 10:551-61. [PMID: 25640180 PMCID: PMC4447085 DOI: 10.1016/j.celrep.2014.12.052] [Citation(s) in RCA: 179] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Revised: 11/14/2014] [Accepted: 12/22/2014] [Indexed: 11/24/2022] Open
Abstract
A major challenge in biology is determining how evolutionarily novel characters originate; however, mechanistic explanations for the origin of new characters are almost completely unknown. The evolution of pregnancy is an excellent system in which to study the origin of novelties because mammals preserve stages in the transition from egg laying to live birth. To determine the molecular bases of this transition, we characterized the pregnant/gravid uterine transcriptome from tetrapods to trace the evolutionary history of uterine gene expression. We show that thousands of genes evolved endometrial expression during the origins of mammalian pregnancy, including genes that mediate maternal-fetal communication and immunotolerance. Furthermore, thousands of cis-regulatory elements that mediate decidualization and cell-type identity in decidualized stromal cells are derived from ancient mammalian transposable elements (TEs). Our results indicate that one of the defining mammalian novelties evolved from DNA sequences derived from ancient mammalian TEs coopted into hormone-responsive regulatory elements distributed throughout the genome.
Collapse
Affiliation(s)
- Vincent J Lynch
- Department of Human Genetics, The University of Chicago, 920 East 58(th) Street, CLSC 319C, Chicago, IL 60637, USA.
| | - Mauris C Nnamani
- Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Aurélie Kapusta
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Kathryn Brayer
- Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Silvia L Plaza
- Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Erik C Mazur
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Deena Emera
- Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Shehzad Z Sheikh
- Division of Gastroenterology and Hepatology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Frank Grützner
- The Robinson Institute, School of Molecular and Biomedical Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Stefan Bauersachs
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Feodor Lynen Strasse 25, 81377 Munich, Germany
| | - Alexander Graf
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Feodor Lynen Strasse 25, 81377 Munich, Germany
| | - Steven L Young
- Department of Obstetrics and Gynecology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27705, USA
| | - Jason D Lieb
- Department of Human Genetics, The University of Chicago, 920 East 58(th) Street, CLSC 319C, Chicago, IL 60637, USA
| | - Francesco J DeMayo
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA; Department of Obstetrics and Gynecology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Günter P Wagner
- Yale Systems Biology Institute and Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| |
Collapse
|
43
|
Brosius J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb Perspect Biol 2014; 6:a016089. [PMID: 25081515 DOI: 10.1101/cshperspect.a016089] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery.
Collapse
Affiliation(s)
- Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
44
|
Kamm GB, López-Leal R, Lorenzo JR, Franchini LF. A fast-evolving human NPAS3 enhancer gained reporter expression in the developing forebrain of transgenic mice. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130019. [PMID: 24218632 DOI: 10.1098/rstb.2013.0019] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The developmental brain gene NPAS3 stands out as a hot spot in human evolution because it contains the largest number of human-specific, fast-evolving, conserved, non-coding elements. In this paper we studied 2xHAR142, one of these elements that is located in the fifth intron of NPAS3. Using transgenic mice, we show that the mouse and chimp 2xHAR142 orthologues behave as transcriptional enhancers driving expression of the reporter gene lacZ to a similar NPAS3 expression subdomain in the mouse central nervous system. Interestingly, the human 2xHAR142 orthologue drives lacZ expression to an extended expression pattern in the nervous system. Thus, molecular evolution of 2xHAR142 provides the first documented example of human-specific heterotopy in the forebrain promoted by a transcriptional enhancer and suggests that it may have contributed to assemble the unique properties of the human brain.
Collapse
Affiliation(s)
- Gretel B Kamm
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), , Buenos Aires, Argentina
| | | | | | | |
Collapse
|
45
|
Rubinstein M, de Souza FSJ. Evolution of transcriptional enhancers and animal diversity. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130017. [PMID: 24218630 DOI: 10.1098/rstb.2013.0017] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Deciphering the genetic bases that drive animal diversity is one of the major challenges of modern biology. Although four decades ago it was proposed that animal evolution was mainly driven by changes in cis-regulatory DNA elements controlling gene expression rather than in protein-coding sequences, only now are powerful bioinformatics and experimental approaches available to accelerate studies into how the evolution of transcriptional enhancers contributes to novel forms and functions. In the introduction to this Theme Issue, we start by defining the general properties of transcriptional enhancers, such as modularity and the coexistence of tight sequence conservation with transcription factor-binding site shuffling as different mechanisms that maintain the enhancer grammar over evolutionary time. We discuss past and current methods used to identify cell-type-specific enhancers and provide examples of how enhancers originate de novo, change and are lost in particular lineages. We then focus in the central part of this Theme Issue on analysing examples of how the molecular evolution of enhancers may change form and function. Throughout this introduction, we present the main findings of the articles, reviews and perspectives contributed to this Theme Issue that together illustrate some of the great advances and current frontiers in the field.
Collapse
Affiliation(s)
- Marcelo Rubinstein
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, , C1428ADN Buenos Aires, Argentina
| | | |
Collapse
|
46
|
Harmston N, Baresic A, Lenhard B. The mystery of extreme non-coding conservation. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130021. [PMID: 24218634 PMCID: PMC3826495 DOI: 10.1098/rstb.2013.0021] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Regions of several dozen to several hundred base pairs of extreme conservation have been found in non-coding regions in all metazoan genomes. The distribution of these elements within and across genomes has suggested that many have roles as transcriptional regulatory elements in multi-cellular organization, differentiation and development. Currently, there is no known mechanism or function that would account for this level of conservation at the observed evolutionary distances. Previous studies have found that, while these regions are under strong purifying selection, and not mutational coldspots, deletion of entire regions in mice does not necessarily lead to identifiable changes in phenotype during development. These opposing findings lead to several questions regarding their functional importance and why they are under strong selection in the first place. In this perspective, we discuss the methods and techniques used in identifying and dissecting these regions, their observed patterns of conservation, and review the current hypotheses on their functional significance.
Collapse
Affiliation(s)
- Nathan Harmston
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London and MRC Clinical Sciences Centre, , Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | | | | |
Collapse
|
47
|
Abstract
When the human genome project started, the major challenge was how to sequence a 3 billion letter code in an organized and cost-effective manner. When completed, the project had laid the foundation for a huge variety of biomedical fields through the production of a complete human genome sequence, but also had driven the development of laboratory and analytical methods that could produce large amounts of sequencing data cheaply. These technological developments made possible the sequencing of many more vertebrate genomes, which have been necessary for the interpretation of the human genome. They have also enabled large-scale studies of vertebrate genome evolution, as well as comparative and human medicine. In this review, we give examples of evolutionary analysis using a wide variety of time frames—from the comparison of populations within a species to the comparison of species separated by at least 300 million years. Furthermore, we anticipate discoveries related to evolutionary mechanisms, adaptation, and disease to quickly accelerate in the coming years.
Collapse
Affiliation(s)
- Jessica Alföldi
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | | |
Collapse
|
48
|
Ivancevic AM, Walsh AM, Kortschak RD, Adelson DL. Jumping the fine LINE between species: horizontal transfer of transposable elements in animals catalyses genome evolution. Bioessays 2013; 35:1071-82. [PMID: 24003001 DOI: 10.1002/bies.201300072] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Horizontal transfer (HT) is the transmission of genetic material between non-mating species, a phenomenon thought to occur rarely in multicellular eukaryotes. However, many transposable elements (TEs) are not only capable of HT, but have frequently jumped between widely divergent species. Here we review and integrate reported cases of HT in retrotransposons of the BovB family, and DNA transposons, over a broad range of animals spanning all continents. Our conclusions challenge the paradigm that HT in vertebrates is restricted to infective long terminal repeat (LTR) retrotransposons or retroviruses. This raises the possibility that other non-LTR retrotransposons, such as L1 or CR1 elements, believed to be only vertically transmitted, can horizontally transfer between species. Growing evidence indicates that the process of HT is much more general across different TEs and species than previously believed, and that it likely shapes eukaryotic genomes and catalyses genome evolution.
Collapse
Affiliation(s)
- Atma M Ivancevic
- School of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA, Australia
| | | | | | | |
Collapse
|
49
|
Shapiro JA. Implications of the Read-Write Genome view. Phys Life Rev 2013; 10:347-50. [PMID: 23988879 DOI: 10.1016/j.plrev.2013.07.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2013] [Accepted: 07/16/2013] [Indexed: 10/26/2022]
Affiliation(s)
- James A Shapiro
- Dept. of Biochemistry and Molecular Biology, University of Chicago, GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
50
|
Shapiro JA. How life changes itself: the Read-Write (RW) genome. Phys Life Rev 2013; 10:287-323. [PMID: 23876611 DOI: 10.1016/j.plrev.2013.07.001] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Accepted: 07/02/2013] [Indexed: 01/06/2023]
Abstract
The genome has traditionally been treated as a Read-Only Memory (ROM) subject to change by copying errors and accidents. In this review, I propose that we need to change that perspective and understand the genome as an intricately formatted Read-Write (RW) data storage system constantly subject to cellular modifications and inscriptions. Cells operate under changing conditions and are continually modifying themselves by genome inscriptions. These inscriptions occur over three distinct time-scales (cell reproduction, multicellular development and evolutionary change) and involve a variety of different processes at each time scale (forming nucleoprotein complexes, epigenetic formatting and changes in DNA sequence structure). Research dating back to the 1930s has shown that genetic change is the result of cell-mediated processes, not simply accidents or damage to the DNA. This cell-active view of genome change applies to all scales of DNA sequence variation, from point mutations to large-scale genome rearrangements and whole genome duplications (WGDs). This conceptual change to active cell inscriptions controlling RW genome functions has profound implications for all areas of the life sciences.
Collapse
Affiliation(s)
- James A Shapiro
- Dept. of Biochemistry and Molecular Biology, University of Chicago, GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA. http://www.huffingtonpost.com/james-a-shapiro
| |
Collapse
|