1
|
Majid M, Liu X, Khan H, Huang Y. Transcriptional dynamics and tissue-specific expression patterns of transposable elements in orthopteran insects. Gene 2025; 936:149090. [PMID: 39549779 DOI: 10.1016/j.gene.2024.149090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 10/25/2024] [Accepted: 11/11/2024] [Indexed: 11/18/2024]
Abstract
Transposable elements (TEs) are prevalent in the genomes of orthopteran insects, contributing significantly to their genome evolution and diversity. In light of the existing gap in our understanding of TEs transcript dynamics in orthopteran insects, we recognize the critical need to undertake comprehensive analyses in this area. Therefore, we have decided to delve into the characterization of TE transcripts, their abundance profiles, and the formation of chimeric transcripts using RNA-seq data and genome assemblies. The transcript analysis of TEs across various species revealed significant differences in TE abundance and expression patterns. In particular, Schistocerca americana exhibited twice the number of transcripts within the genus Schistocerca compared to the average of other species, while Gryllus bimaculatus displayed the lowest number of transcripts. Despite this, all Schistocerca species shared similar fractions of TE transcripts at the clade level, with DNA transposons (45%) being the most abundant, followed by LINE (19%) and LTR elements (18%). Interestingly, Acrida cinerea displayed different TE abundance patterns compared to Schistocerca species, particularly with an increased proportion of LTR transcripts, accounting for 31% of the total transcripts. Further analysis revealed tissue-specific transcriptional activity of TE clades, with notable differences between male and female specimens. In Gryllus bimaculatus, TEs were highly transcribed across ovaries and gut tissues in females compared to male testes and gut. Conversely, Gastrimargus marmoratus displayed higher TE transcription in male tissues compared to females, indicating species-specific expression patterns. A similar pattern has been observed in Acrida cinerea, except in female gonads, where 4618 TEs were transcribed compared to 3757 in male gonads. Despite these variations, no correlation was found between genome size and TE transcript abundance. Additionally, highly conserved TEs were involved in the formation of chimeric transcripts, indicating potential regulatory roles in gene expression. The expression quantification analysis of chimeric TEs and genes revealed tissue-specific expression patterns, and TEs do not control the overall expression of all genes except some, suggesting regulatory roles of TEs in gene expression. Overall, our study underscores tissue-specific variations in TE expression and transcript abundance among different species. Additionally, our findings suggest the involvement of highly conserved TEs in the formation of chimeric transcripts across different species.
Collapse
Affiliation(s)
- Muhammad Majid
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
| | - Xuanzeng Liu
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
| | - Hashim Khan
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
| | - Yuan Huang
- College of Life Sciences, Shaanxi Normal University, Xi'an, China.
| |
Collapse
|
2
|
Du AY, Chobirko JD, Zhuo X, Feschotte C, Wang T. Regulatory transposable elements in the encyclopedia of DNA elements. Nat Commun 2024; 15:7594. [PMID: 39217141 PMCID: PMC11366022 DOI: 10.1038/s41467-024-51921-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 08/16/2024] [Indexed: 09/04/2024] Open
Abstract
Transposable elements (TEs) comprise ~50% of our genome, but knowledge of how TEs affect genome evolution remains incomplete. Leveraging ENCODE4 data, we provide the most comprehensive study to date of TE contributions to the regulatory genome. We find 236,181 (~25%) human candidate cis-regulatory elements (cCREs) are TE-derived, with over 90% lineage-specific since the human-mouse split, accounting for 8-36% of lineage-specific cCREs. Except for SINEs, cCRE-associated transcription factor (TF) motifs in TEs are derived from ancestral TE sequence more than expected by chance. We show that TEs may adopt similar regulatory activities of elements near their integration site. Since human-mouse divergence, TEs have contributed 3-56% of TF binding site turnover events across 30 examined TFs. Finally, TE-derived cCREs are similar to non-TE cCREs in terms of MPRA activity and GWAS variant enrichment. Overall, our results substantiate the notion that TEs have played an important role in shaping the human regulatory genome.
Collapse
Affiliation(s)
- Alan Y Du
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Jason D Chobirko
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Xiaoyu Zhuo
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.
| | - Ting Wang
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA.
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA.
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
3
|
Trovato M, Bunina D, Yildiz U, Fernandez-Novel Marx N, Uckelmann M, Levina V, Perez Y, Janeva A, Garcia BA, Davidovich C, Zaugg JB, Noh KM. Histone H3.3 lysine 9 and 27 control repressive chromatin at cryptic enhancers and bivalent promoters. Nat Commun 2024; 15:7557. [PMID: 39214979 PMCID: PMC11364623 DOI: 10.1038/s41467-024-51785-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 08/15/2024] [Indexed: 09/04/2024] Open
Abstract
Histone modifications are associated with distinct transcriptional states, but it is unclear whether they instruct gene expression. To investigate this, we mutate histone H3.3 K9 and K27 residues in mouse embryonic stem cells (mESCs). Here, we find that H3.3K9 is essential for controlling specific distal intergenic regions and for proper H3K27me3 deposition at promoters. The H3.3K9A mutation resulted in decreased H3K9me3 at regions encompassing endogenous retroviruses and induced a gain of H3K27ac and nascent transcription. These changes in the chromatin environment unleash cryptic enhancers, resulting in the activation of distinctive transcriptional programs and culminating in protein expression normally restricted to specialized immune cell types. The H3.3K27A mutant disrupts the deposition and spreading of the repressive H3K27me3 mark, particularly impacting bivalent genes with higher basal levels of H3.3 at promoters. Therefore, H3.3K9 and K27 crucially orchestrate repressive chromatin states at cis-regulatory elements and bivalent promoters, respectively, and instruct proper transcription in mESCs.
Collapse
Affiliation(s)
- Matteo Trovato
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
| | - Daria Bunina
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- European Molecular Biology Laboratory (EMBL), Structural and Computational Biology Unit, Heidelberg, Germany
| | - Umut Yildiz
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of Biosciences, Heidelberg, Germany
| | | | - Michael Uckelmann
- Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Faculty of Medicine, Nursing and Health Sciences, Monash University, and EMBL-Australia, Clayton, VIC, Australia
| | - Vita Levina
- Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Faculty of Medicine, Nursing and Health Sciences, Monash University, and EMBL-Australia, Clayton, VIC, Australia
| | - Yekaterina Perez
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
| | - Ana Janeva
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Benjamin A Garcia
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
| | - Chen Davidovich
- Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Faculty of Medicine, Nursing and Health Sciences, Monash University, and EMBL-Australia, Clayton, VIC, Australia
| | - Judith B Zaugg
- European Molecular Biology Laboratory (EMBL), Structural and Computational Biology Unit, Heidelberg, Germany
| | - Kyung-Min Noh
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany.
| |
Collapse
|
4
|
Deaville LA, Berrens RV. Technology to the rescue: how to uncover the role of transposable elements in preimplantation development. Biochem Soc Trans 2024; 52:1349-1362. [PMID: 38752836 PMCID: PMC11346443 DOI: 10.1042/bst20231262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 04/23/2024] [Accepted: 04/24/2024] [Indexed: 06/27/2024]
Abstract
Transposable elements (TEs) are highly expressed in preimplantation development. Preimplantation development is the phase when the cells of the early embryo undergo the first cell fate choice and change from being totipotent to pluripotent. A range of studies have advanced our understanding of TEs in preimplantation, as well as their epigenetic regulation and functional roles. However, many questions remain about the implications of TE expression during early development. Challenges originate first due to the abundance of TEs in the genome, and second because of the limited cell numbers in preimplantation. Here we review the most recent technological advancements promising to shed light onto the role of TEs in preimplantation development. We explore novel avenues to identify genomic TE insertions and improve our understanding of the regulatory mechanisms and roles of TEs and their RNA and protein products during early development.
Collapse
Affiliation(s)
- Lauryn A. Deaville
- Institute for Developmental and Regenerative Medicine, Oxford University, IMS-Tetsuya Nakamura Building, Old Road Campus, Roosevelt Dr, Oxford OX3 7TY, U.K
- Department of Paediatrics, Oxford University, Level 2, Children's Hospital, John Radcliffe Headington, Oxford OX3 9DU, U.K
- MRC Weatherall Institute of Molecular Medicine, Oxford University, John Radcliffe Hospital, Oxford OX3 9DS, U.K
| | - Rebecca V. Berrens
- Institute for Developmental and Regenerative Medicine, Oxford University, IMS-Tetsuya Nakamura Building, Old Road Campus, Roosevelt Dr, Oxford OX3 7TY, U.K
- Department of Paediatrics, Oxford University, Level 2, Children's Hospital, John Radcliffe Headington, Oxford OX3 9DU, U.K
| |
Collapse
|
5
|
Aurigemma I, Lanzetta O, Cirino A, Allegretti S, Lania G, Ferrentino R, Poondi Krishnan V, Angelini C, Illingworth E, Baldini A. Endothelial gene regulatory elements associated with cardiopharyngeal lineage differentiation. Commun Biol 2024; 7:351. [PMID: 38514806 PMCID: PMC10957928 DOI: 10.1038/s42003-024-06017-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 03/06/2024] [Indexed: 03/23/2024] Open
Abstract
Endothelial cells (EC) differentiate from multiple sources, including the cardiopharyngeal mesoderm, which gives rise also to cardiac and branchiomeric muscles. The enhancers activated during endothelial differentiation within the cardiopharyngeal mesoderm are not completely known. Here, we use a cardiogenic mesoderm differentiation model that activates an endothelial transcription program to identify endothelial regulatory elements activated in early cardiogenic mesoderm. Integrating chromatin remodeling and gene expression data with available single-cell RNA-seq data from mouse embryos, we identify 101 putative regulatory elements of EC genes. We then apply a machine-learning strategy, trained on validated enhancers, to predict enhancers. Using this computational assay, we determine that 50% of these sequences are likely enhancers, some of which are already reported. We also identify a smaller set of regulatory elements of well-known EC genes and validate them using genetic and epigenetic perturbation. Finally, we integrate multiple data sources and computational tools to search for transcriptional factor binding motifs. In conclusion, we show EC regulatory sequences with a high likelihood to be enhancers, and we validate a subset of them using computational and cell culture models. Motif analyses show that the core EC transcription factors GATA/ETS/FOS is a likely driver of EC regulation in cardiopharyngeal mesoderm.
Collapse
Affiliation(s)
- Ilaria Aurigemma
- PhD program in Molecular Medicine and Medical Biotechnology, University Federico II, Via Sergio Pansini 5, 80131, Naples, Italy
- Department of Chemistry and Biology, University of Salerno, Via Giovanni Paolo II 132, 84084, Fisciano, Italy
| | - Olga Lanzetta
- Institute of Genetics and Biophysics, National Research Council, Via Pietro Castellino 111, 80131, Naples, Italy
| | - Andrea Cirino
- Institute of Genetics and Biophysics, National Research Council, Via Pietro Castellino 111, 80131, Naples, Italy
| | - Sara Allegretti
- PhD program in Molecular Medicine and Medical Biotechnology, University Federico II, Via Sergio Pansini 5, 80131, Naples, Italy
| | - Gabriella Lania
- Institute of Genetics and Biophysics, National Research Council, Via Pietro Castellino 111, 80131, Naples, Italy
| | - Rosa Ferrentino
- Institute of Genetics and Biophysics, National Research Council, Via Pietro Castellino 111, 80131, Naples, Italy
| | - Varsha Poondi Krishnan
- Institute of Genetics and Biophysics, National Research Council, Via Pietro Castellino 111, 80131, Naples, Italy
| | - Claudia Angelini
- Istituto Applicazioni del Calcolo, National Research Council, Via Pietro Castellino 111, 80131, Naples, Italy
| | - Elizabeth Illingworth
- Department of Chemistry and Biology, University of Salerno, Via Giovanni Paolo II 132, 84084, Fisciano, Italy
| | - Antonio Baldini
- PhD program in Molecular Medicine and Medical Biotechnology, University Federico II, Via Sergio Pansini 5, 80131, Naples, Italy.
- Department of Molecular Medicine and Medical Biotechnology, University Federico II, Via Sergio Pansini 5, 80131, Naples, Italy.
| |
Collapse
|
6
|
Hossain MJ, Nyame P, Monde K. Species-Specific Transcription Factors Associated with Long Terminal Repeat Promoters of Endogenous Retroviruses: A Comprehensive Review. Biomolecules 2024; 14:280. [PMID: 38540701 PMCID: PMC10968565 DOI: 10.3390/biom14030280] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 02/19/2024] [Accepted: 02/24/2024] [Indexed: 11/11/2024] Open
Abstract
Endogenous retroviruses (ERVs) became a part of the eukaryotic genome through endogenization millions of years ago. Moreover, they have lost their innate capability of virulence or replication. Nevertheless, in eukaryotic cells, they actively engage in various activities that may be advantageous or disadvantageous to the cells. The mechanisms by which transcription is triggered and implicated in cellular processes are complex. Owing to the diversity in the expression of transcription factors (TFs) in cells and the TF-binding motifs of viruses, the comprehensibility of ERV initiation and its impact on cellular functions are unclear. Currently, several factors are known to be related to their initiation. TFs that bind to the viral long-terminal repeat (LTR) are critical initiators. This review discusses the TFs shown to actively associate with ERV stimulation across species such as humans, mice, pigs, monkeys, zebrafish, Drosophila, and yeast. A comprehensive summary of the expression of previously reported TFs may aid in identifying similarities between animal species and endogenous viruses. Moreover, an in-depth understanding of ERV expression will assist in elucidating their physiological roles in eukaryotic cell development and in clarifying their relationship with endogenous retrovirus-associated diseases.
Collapse
Affiliation(s)
| | | | - Kazuaki Monde
- Department of Microbiology, Faculty of Life Sciences, Kumamoto University, Kumamoto 860-8556, Japan; (M.J.H.); (P.N.)
| |
Collapse
|
7
|
Jiang T, Zhou ZM, Ling ZQ, Zhang Q, Wu ZZ, Yang JW, Yang SY, Yang B, Huang LS. Pig H3K4me3, H3K27ac, and gene expression profiles reveal reproductive tissue-specific activity of transposable elements. Zool Res 2024; 45:138-151. [PMID: 38155423 PMCID: PMC10839656 DOI: 10.24272/j.issn.2095-8137.2023.060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 09/04/2023] [Indexed: 12/30/2023] Open
Abstract
Regulatory sequences and transposable elements (TEs) account for a large proportion of the genomic sequences of species; however, their roles in gene transcription, especially tissue-specific expression, remain largely unknown. Pigs serve as an excellent animal model for studying genomic sequence biology due to the extensive diversity among their wild and domesticated populations. Here, we conducted an integrated analysis using H3K27ac ChIP-seq, H3K4me3 ChIP-seq, and RNA-seq data from 10 different tissues of seven fetuses and eight closely related adult pigs. We aimed to annotate the regulatory elements and TEs to elucidate their associations with histone modifications and mRNA expression across different tissues and developmental stages. Based on correlation analysis between mRNA expression and H3K27ac and H3K4me3 peak activity, results indicated that H3K27ac exhibited stronger associations with gene expression than H3K4me3. Furthermore, 1.45% of TEs overlapped with either the H3K27ac or H3K4me3 peaks, with the majority displaying tissue-specific activity. Notably, a TE subfamily (LTR4C_SS), containing binding motifs for SIX1 and SIX4, showed specific enrichment in the H3K27ac peaks of the adult and fetal ovaries. RNA-seq analysis also revealed widespread expression of TEs in the exons or promoters of genes, including 4 688 TE-containing transcripts with distinct development stage-specific and tissue-specific expression. Of note, 1 967 TE-containing transcripts were enriched in the testes. We identified a long terminal repeat (LTR), MLT1F1, acting as a testis-specific alternative promoter in SRPK2 (a cell cycle-related protein kinase) in our pig dataset. This element was also conserved in humans and mice, suggesting either an ancient integration of TEs in genes specifically expressed in the testes or parallel evolutionary patterns. Collectively, our findings demonstrate that TEs are deeply embedded in the genome and exhibit important tissue-specific biological functions, particularly in the reproductive organs.
Collapse
Affiliation(s)
- Tao Jiang
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Zhi-Min Zhou
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Zi-Qi Ling
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Qing Zhang
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Zhong-Zi Wu
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Jia-Wen Yang
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Si-Yu Yang
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China
| | - Bin Yang
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China. E-mail:
| | - Lu-Sheng Huang
- National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, Jiangxi 330045, China. E-mail:
| |
Collapse
|
8
|
Lu X. Regulation of endogenous retroviruses in murine embryonic stem cells and early embryos. J Mol Cell Biol 2024; 15:mjad052. [PMID: 37604781 PMCID: PMC10794949 DOI: 10.1093/jmcb/mjad052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 11/24/2022] [Accepted: 08/19/2023] [Indexed: 08/23/2023] Open
Abstract
Endogenous retroviruses (ERVs) are important components of transposable elements that constitute ∼40% of the mouse genome. ERVs exhibit dynamic expression patterns during early embryonic development and are engaged in numerous biological processes. Therefore, ERV expression must be closely monitored in cells. Most studies have focused on the regulation of ERV expression in mouse embryonic stem cells (ESCs) and during early embryonic development. This review touches on the classification, expression, and functions of ERVs in mouse ESCs and early embryos and mainly discusses ERV modulation strategies from the perspectives of transcription, epigenetic modification, nucleosome/chromatin assembly, and post-transcriptional control.
Collapse
Affiliation(s)
- Xinyi Lu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin 300350, China
| |
Collapse
|
9
|
Abstract
Enhancers are cis-regulatory elements that can stimulate gene expression from distance, and drive precise spatiotemporal gene expression profiles during development. Functional enhancers display specific features including an open chromatin conformation, Histone H3 lysine 27 acetylation, Histone H3 lysine 4 mono-methylation enrichment, and enhancer RNAs production. These features are modified upon developmental cues which impacts their activity. In this review, we describe the current state of knowledge about enhancer functions and the diverse chromatin signatures found on enhancers. We also discuss the dynamic changes of enhancer chromatin signatures, and their impact on lineage specific gene expression profiles, during development or cellular differentiation.
Collapse
Affiliation(s)
- Amandine Barral
- Institute for Regenerative Medicine, Epigenetics Institute, Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA,CONTACT Amandine Barral Institute for Regenerative Medicine, Epigenetics Institute, Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania. 3400 Civic Blvd, Philadelphia, Pennsylvania19104, USA
| | - Jérôme Déjardin
- Biology of repetitive sequences, Institute of Human Genetics CNRS-Université de Montpellier UMR 9002, Montpellier, France,Jérôme Déjardin Biology of repetitive sequences, Institute of Human Genetics CNRS-Université de Montpellier UMR 9002, 141 rue de la Cardonille, Montpellier34000, France
| |
Collapse
|
10
|
Zemke NR, Armand EJ, Wang W, Lee S, Zhou J, Li YE, Liu H, Tian W, Nery JR, Castanon RG, Bartlett A, Osteen JK, Li D, Zhuo X, Xu V, Chang L, Dong K, Indralingam HS, Rink JA, Xie Y, Miller M, Krienen FM, Zhang Q, Taskin N, Ting J, Feng G, McCarroll SA, Callaway EM, Wang T, Lein ES, Behrens MM, Ecker JR, Ren B. Conserved and divergent gene regulatory programs of the mammalian neocortex. Nature 2023; 624:390-402. [PMID: 38092918 PMCID: PMC10719095 DOI: 10.1038/s41586-023-06819-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 11/01/2023] [Indexed: 12/17/2023]
Abstract
Divergence of cis-regulatory elements drives species-specific traits1, but how this manifests in the evolution of the neocortex at the molecular and cellular level remains unclear. Here we investigated the gene regulatory programs in the primary motor cortex of human, macaque, marmoset and mouse using single-cell multiomics assays, generating gene expression, chromatin accessibility, DNA methylome and chromosomal conformation profiles from a total of over 200,000 cells. From these data, we show evidence that divergence of transcription factor expression corresponds to species-specific epigenome landscapes. We find that conserved and divergent gene regulatory features are reflected in the evolution of the three-dimensional genome. Transposable elements contribute to nearly 80% of the human-specific candidate cis-regulatory elements in cortical cells. Through machine learning, we develop sequence-based predictors of candidate cis-regulatory elements in different species and demonstrate that the genomic regulatory syntax is highly preserved from rodents to primates. Finally, we show that epigenetic conservation combined with sequence similarity helps to uncover functional cis-regulatory elements and enhances our ability to interpret genetic variants contributing to neurological disease and traits.
Collapse
Affiliation(s)
- Nathan R Zemke
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Ethan J Armand
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
| | - Wenliang Wang
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Seoyeon Lee
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Jingtian Zhou
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Yang Eric Li
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Hanqing Liu
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
- Division of Biological Sciences, University of California, San Diego, La Jolla, CA, USA
| | - Wei Tian
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joseph R Nery
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Rosa G Castanon
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Anna Bartlett
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Julia K Osteen
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Daofeng Li
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Xiaoyu Zhuo
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Vincent Xu
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
| | - Lei Chang
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Keyi Dong
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Hannah S Indralingam
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Jonathan A Rink
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Yang Xie
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Michael Miller
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA
| | - Fenna M Krienen
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
- Department of Genetics, Harvard Medical School, Boston, USA
| | - Qiangge Zhang
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Naz Taskin
- Allen Institute for Brain Science, Seattle, WA, USA
| | | | - Guoping Feng
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Steven A McCarroll
- Department of Genetics, Harvard Medical School, Boston, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Edward M Callaway
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA, USA
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA
| | - Ting Wang
- Department of Genetics, The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, MO, USA
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Ed S Lein
- Allen Institute for Brain Science, Seattle, WA, USA
- Department of Neurological Surgery, University of Washington, Seattle, WA, USA
| | - M Margarita Behrens
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joseph R Ecker
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA, USA.
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA, USA.
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA, USA.
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA, USA.
- Institute of Genomic Medicine, Moores Cancer Center, School of Medicine, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
11
|
Tam PLF, Leung D. The Molecular Impacts of Retrotransposons in Development and Diseases. Int J Mol Sci 2023; 24:16418. [PMID: 38003607 PMCID: PMC10671454 DOI: 10.3390/ijms242216418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 11/11/2023] [Accepted: 11/14/2023] [Indexed: 11/26/2023] Open
Abstract
Retrotransposons are invasive genetic elements that constitute substantial portions of mammalian genomes. They have the potential to influence nearby gene expression through their cis-regulatory sequences, reverse transcription machinery, and the ability to mold higher-order chromatin structures. Due to their multifaceted functions, it is crucial for host fitness to maintain strict regulation of these parasitic sequences to ensure proper growth and development. This review explores how subsets of retrotransposons have undergone evolutionary exaptation to enhance the complexity of mammalian genomes. It also highlights the significance of regulating these elements, drawing on recent studies conducted in human and murine systems.
Collapse
Affiliation(s)
- Phoebe Lut Fei Tam
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China;
| | - Danny Leung
- Division of Life Science, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China;
- Center for Epigenomics Research, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
| |
Collapse
|
12
|
Pulver C, Grun D, Duc J, Sheppard S, Planet E, Coudray A, de Fondeville R, Pontis J, Trono D. Statistical learning quantifies transposable element-mediated cis-regulation. Genome Biol 2023; 24:258. [PMID: 37950299 PMCID: PMC10637000 DOI: 10.1186/s13059-023-03085-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Accepted: 10/09/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND Transposable elements (TEs) have colonized the genomes of most metazoans, and many TE-embedded sequences function as cis-regulatory elements (CREs) for genes involved in a wide range of biological processes from early embryogenesis to innate immune responses. Because of their repetitive nature, TEs have the potential to form CRE platforms enabling the coordinated and genome-wide regulation of protein-coding genes by only a handful of trans-acting transcription factors (TFs). RESULTS Here, we directly test this hypothesis through mathematical modeling and demonstrate that differences in expression at protein-coding genes alone are sufficient to estimate the magnitude and significance of TE-contributed cis-regulatory activities, even in contexts where TE-derived transcription fails to do so. We leverage hundreds of overexpression experiments and estimate that, overall, gene expression is influenced by TE-embedded CREs situated within approximately 500 kb of promoters. Focusing on the cis-regulatory potential of TEs within the gene regulatory network of human embryonic stem cells, we find that pluripotency-specific and evolutionarily young TE subfamilies can be reactivated by TFs involved in post-implantation embryogenesis. Finally, we show that TE subfamilies can be split into truly regulatorily active versus inactive fractions based on additional information such as matched epigenomic data, observing that TF binding may better predict TE cis-regulatory activity than differences in histone marks. CONCLUSION Our results suggest that TE-embedded CREs contribute to gene regulation during and beyond gastrulation. On a methodological level, we provide a statistical tool that infers TE-dependent cis-regulation from RNA-seq data alone, thus facilitating the study of TEs in the next-generation sequencing era.
Collapse
Affiliation(s)
- Cyril Pulver
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
| | - Delphine Grun
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
| | - Julien Duc
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
| | - Shaoline Sheppard
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
| | - Evarist Planet
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
| | - Alexandre Coudray
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
| | - Raphaël de Fondeville
- Swiss Data Science Center, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland.
| | - Julien Pontis
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland.
- SOPHiA GENETICS SA, La Pièce 12, CH-1180, Rolle, Switzerland.
| | - Didier Trono
- School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland.
| |
Collapse
|
13
|
Sun X, Zhang T, Tong B, Cheng L, Jiang W, Sun Y. POGZ suppresses 2C transcriptional program and retrotransposable elements. Cell Rep 2023; 42:112867. [PMID: 37494184 DOI: 10.1016/j.celrep.2023.112867] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 03/23/2023] [Accepted: 07/11/2023] [Indexed: 07/28/2023] Open
Abstract
The POGZ gene has been found frequently mutated in neurodevelopmental disorders (NDDs) such as autism spectrum disorder (ASD) and intellectual disability (ID). We have recently shown that POGZ maintains mouse embryonic stem cells (ESCs). However, the exact mechanisms remain unclear. Here, we show that POGZ plays an important role in the maintenance of ESCs by silencing Dux and endogenous retroviruses (ERVs). POGZ maintains a silent chromatin state at Dux and ERVs by associating with and recruiting TRIM28 and SETDB1, and its loss leads to decreased levels of H3K9me3/H4K20me3, resulting in up-regulation of 2C transcripts and ESC transition to a 2C-like state. POGZ suppresses different classes of ERVs through direct (IAPEy, the intracisternal A-type particle elements) and indirect regulation (MERVL). Activation of POGZ-bound ERVs is associated with up-regulation of nearby neural disease genes such as Serpina3m. Our findings provide important insights into understanding the disease mechanism caused by POGZ dysfunction.
Collapse
Affiliation(s)
- Xiaoyun Sun
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, P.R. China
| | - Tianzhe Zhang
- Department of Biological Repositories, Frontier Science Center for Immunology and Metabolism, Medical Research Institute, Wuhan University, Wuhan 430071, China
| | - Bei Tong
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, P.R. China
| | - Linxi Cheng
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, P.R. China
| | - Wei Jiang
- Department of Biological Repositories, Frontier Science Center for Immunology and Metabolism, Medical Research Institute, Wuhan University, Wuhan 430071, China
| | - Yuhua Sun
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, P.R. China; The Innovation of Seed Design, Chinese Academy of Sciences, Wuhan 430072, P.R. China; Hubei Hongshan Laboratory, Wuhan 430070, P.R. China.
| |
Collapse
|
14
|
DiRusso JA, Clark AT. Transposable elements in early human embryo development and embryo models. Curr Opin Genet Dev 2023; 81:102086. [PMID: 37441874 PMCID: PMC10917458 DOI: 10.1016/j.gde.2023.102086] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 05/31/2023] [Accepted: 06/15/2023] [Indexed: 07/15/2023]
Abstract
Transposable elements (TEs), long discounted as 'selfish genomic elements,' are increasingly appreciated as the drivers of genomic evolution, genome organization, and gene regulation. TEs are particularly important in early embryo development, where advances in stem cell technologies, in tandem with improved computational and next-generation sequencing approaches, have provided an unprecedented opportunity to study the contribution of TEs to early mammalian development. Here, we summarize advances in our understanding of TEs in early human development and expand on how new stem cell-based embryo models can be leveraged to augment this understanding.
Collapse
Affiliation(s)
- Jonathan A DiRusso
- Department of Molecular, Cell and Developmental Biology, University of California, 90095 Los Angeles, CA, USA; Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, 90095 Los Angeles, CA, USA.; Molecular Biology Institute, University of California, 90095 Los Angeles, CA, USA; Center for Reproductive Science, Health and Education, University of California, 90095 Los Angeles, CA, USA
| | - Amander T Clark
- Department of Molecular, Cell and Developmental Biology, University of California, 90095 Los Angeles, CA, USA; Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, 90095 Los Angeles, CA, USA.; Molecular Biology Institute, University of California, 90095 Los Angeles, CA, USA; Center for Reproductive Science, Health and Education, University of California, 90095 Los Angeles, CA, USA.
| |
Collapse
|
15
|
Pal D, Patel M, Boulet F, Sundarraj J, Grant OA, Branco MR, Basu S, Santos SDM, Zabet NR, Scaffidi P, Pradeepa MM. H4K16ac activates the transcription of transposable elements and contributes to their cis-regulatory function. Nat Struct Mol Biol 2023; 30:935-947. [PMID: 37308596 PMCID: PMC10352135 DOI: 10.1038/s41594-023-01016-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 05/05/2023] [Indexed: 06/14/2023]
Abstract
Mammalian genomes harbor abundant transposable elements (TEs) and their remnants, with numerous epigenetic repression mechanisms enacted to silence TE transcription. However, TEs are upregulated during early development, neuronal lineage, and cancers, although the epigenetic factors contributing to the transcription of TEs have yet to be fully elucidated. Here, we demonstrate that the male-specific lethal (MSL)-complex-mediated histone H4 acetylation at lysine 16 (H4K16ac) is enriched at TEs in human embryonic stem cells (hESCs) and cancer cells. This in turn activates transcription of subsets of full-length long interspersed nuclear elements (LINE1s, L1s) and endogenous retrovirus (ERV) long terminal repeats (LTRs). Furthermore, we show that the H4K16ac-marked L1 and LTR subfamilies display enhancer-like functions and are enriched in genomic locations with chromatin features associated with active enhancers. Importantly, such regions often reside at boundaries of topologically associated domains and loop with genes. CRISPR-based epigenetic perturbation and genetic deletion of L1s reveal that H4K16ac-marked L1s and LTRs regulate the expression of genes in cis. Overall, TEs enriched with H4K16ac contribute to the cis-regulatory landscape at specific genomic locations by maintaining an active chromatin landscape at TEs.
Collapse
Affiliation(s)
- Debosree Pal
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Manthan Patel
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Fanny Boulet
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Jayakumar Sundarraj
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
- Bhabha Atomic Research Centre, Mumbai, India
| | - Olivia A Grant
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
- School of Life Sciences, University of Essex, Colchester, UK
| | - Miguel R Branco
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Srinjan Basu
- Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
| | | | - Nicolae Radu Zabet
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Paola Scaffidi
- Francis Crick Institute, London, UK
- Department of Experimental Oncology, European Institute of Oncology, Milan, Italy
| | - Madapura M Pradeepa
- Blizard Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London, UK.
| |
Collapse
|
16
|
Andrews G, Fan K, Pratt HE, Phalke N, Karlsson EK, Lindblad-Toh K, Gazal S, Moore JE, Weng Z, Andrews G, Armstrong JC, Bianchi M, Birren BW, Bredemeyer KR, Breit AM, Christmas MJ, Clawson H, Damas J, Di Palma F, Diekhans M, Dong MX, Eizirik E, Fan K, Fanter C, Foley NM, Forsberg-Nilsson K, Garcia CJ, Gatesy J, Gazal S, Genereux DP, Goodman L, Grimshaw J, Halsey MK, Harris AJ, Hickey G, Hiller M, Hindle AG, Hubley RM, Hughes GM, Johnson J, Juan D, Kaplow IM, Karlsson EK, Keough KC, Kirilenko B, Koepfli KP, Korstian JM, Kowalczyk A, Kozyrev SV, Lawler AJ, Lawless C, Lehmann T, Levesque DL, Lewin HA, Li X, Lind A, Lindblad-Toh K, Mackay-Smith A, Marinescu VD, Marques-Bonet T, Mason VC, Meadows JRS, Meyer WK, Moore JE, Moreira LR, Moreno-Santillan DD, Morrill KM, Muntané G, Murphy WJ, Navarro A, Nweeia M, Ortmann S, Osmanski A, Paten B, Paulat NS, Pfenning AR, Phan BN, Pollard KS, Pratt HE, Ray DA, Reilly SK, Rosen JR, Ruf I, Ryan L, Ryder OA, Sabeti PC, Schäffer DE, Serres A, Shapiro B, Smit AFA, Springer M, Srinivasan C, Steiner C, Storer JM, Sullivan KAM, Sullivan PF, Sundström E, Supple MA, Swofford R, Talbot JE, Teeling E, Turner-Maier J, Valenzuela A, Wagner F, Wallerman O, Wang C, Wang J, Weng Z, Wilder AP, Wirthlin ME, Xue JR, Zhang X. Mammalian evolution of human cis-regulatory elements and transcription factor binding sites. Science 2023; 380:eabn7930. [PMID: 37104580 DOI: 10.1126/science.abn7930] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Understanding the regulatory landscape of the human genome is a long-standing objective of modern biology. Using the reference-free alignment across 241 mammalian genomes produced by the Zoonomia Consortium, we charted evolutionary trajectories for 0.92 million human candidate cis-regulatory elements (cCREs) and 15.6 million human transcription factor binding sites (TFBSs). We identified 439,461 cCREs and 2,024,062 TFBSs under evolutionary constraint. Genes near constrained elements perform fundamental cellular processes, whereas genes near primate-specific elements are involved in environmental interaction, including odor perception and immune response. About 20% of TFBSs are transposable element-derived and exhibit intricate patterns of gains and losses during primate evolution whereas sequence variants associated with complex traits are enriched in constrained TFBSs. Our annotations illuminate the regulatory functions of the human genome.
Collapse
Affiliation(s)
- Gregory Andrews
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Kaili Fan
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Henry E Pratt
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Nishigandha Phalke
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Elinor K Karlsson
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Program in Molecular Medicine, UMass Chan Medical School, Worcester, MA 01605, USA
| | - Kerstin Lindblad-Toh
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 75132 Uppsala, Sweden
| | - Steven Gazal
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
| | - Jill E Moore
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Zemke NR, Armand EJ, Wang W, Lee S, Zhou J, Li YE, Liu H, Tian W, Nery JR, Castanon RG, Bartlett A, Osteen JK, Li D, Zhuo X, Xu V, Miller M, Krienen FM, Zhang Q, Taskin N, Ting J, Feng G, McCarroll SA, Callaway EM, Wang T, Behrens MM, Lein ES, Ecker JR, Ren B. Comparative single cell epigenomic analysis of gene regulatory programs in the rodent and primate neocortex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.08.536119. [PMID: 37066152 PMCID: PMC10104177 DOI: 10.1101/2023.04.08.536119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Sequence divergence of cis- regulatory elements drives species-specific traits, but how this manifests in the evolution of the neocortex at the molecular and cellular level remains to be elucidated. We investigated the gene regulatory programs in the primary motor cortex of human, macaque, marmoset, and mouse with single-cell multiomics assays, generating gene expression, chromatin accessibility, DNA methylome, and chromosomal conformation profiles from a total of over 180,000 cells. For each modality, we determined species-specific, divergent, and conserved gene expression and epigenetic features at multiple levels. We find that cell type-specific gene expression evolves more rapidly than broadly expressed genes and that epigenetic status at distal candidate cis -regulatory elements (cCREs) evolves faster than promoters. Strikingly, transposable elements (TEs) contribute to nearly 80% of the human-specific cCREs in cortical cells. Through machine learning, we develop sequence-based predictors of cCREs in different species and demonstrate that the genomic regulatory syntax is highly preserved from rodents to primates. Lastly, we show that epigenetic conservation combined with sequence similarity helps uncover functional cis -regulatory elements and enhances our ability to interpret genetic variants contributing to neurological disease and traits.
Collapse
|
18
|
Choudhary MNK, Quaid K, Xing X, Schmidt H, Wang T. Widespread contribution of transposable elements to the rewiring of mammalian 3D genomes. Nat Commun 2023; 14:634. [PMID: 36746940 PMCID: PMC9902604 DOI: 10.1038/s41467-023-36364-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 01/26/2023] [Indexed: 02/08/2023] Open
Abstract
Transposable elements (TEs) are major contributors of genetic material in mammalian genomes. These often include binding sites for architectural proteins, including the multifarious master protein, CTCF, which shapes the 3D genome by creating loops, domains, compartment borders, and RNA-DNA interactions. These play a role in the compact packaging of DNA and have the potential to facilitate regulatory function. In this study, we explore the widespread contribution of TEs to mammalian 3D genomes by quantifying the extent to which they give rise to loops and domain border differences across various cell types and species using several 3D genome mapping technologies. We show that specific families and subfamilies of TEs have contributed to lineage-specific 3D chromatin structures across mammalian species. In many cases, these loops may facilitate sustained interaction between distant cis-regulatory elements and target genes, and domains may segregate chromatin state to impact gene expression in a lineage-specific manner. An experimental validation of our analytical findings using CRISPR-Cas9 to delete a candidate TE resulted in disruption of species-specific 3D chromatin structure. Taken together, we comprehensively quantify and selectively validate our finding that TEs contribute to shaping 3D genome organization and may, in some cases, impact gene regulation during the course of mammalian evolution.
Collapse
Affiliation(s)
- Mayank N K Choudhary
- Center for Genome Sciences & Systems Biology, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Kara Quaid
- Center for Genome Sciences & Systems Biology, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Xiaoyun Xing
- Center for Genome Sciences & Systems Biology, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Heather Schmidt
- Center for Genome Sciences & Systems Biology, Washington University in St. Louis, St. Louis, MO, 63110, USA
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA
| | - Ting Wang
- Center for Genome Sciences & Systems Biology, Washington University in St. Louis, St. Louis, MO, 63110, USA.
- Department of Genetics, Washington University in St. Louis, St. Louis, MO, 63110, USA.
| |
Collapse
|
19
|
Yelagandula R, Stecher K, Novatchkova M, Michetti L, Michlits G, Wang J, Hofbauer P, Vainorius G, Pribitzer C, Isbel L, Mendjan S, Schübeler D, Elling U, Brennecke J, Bell O. ZFP462 safeguards neural lineage specification by targeting G9A/GLP-mediated heterochromatin to silence enhancers. Nat Cell Biol 2023; 25:42-55. [PMID: 36604593 PMCID: PMC10038669 DOI: 10.1038/s41556-022-01051-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 11/10/2022] [Indexed: 01/07/2023]
Abstract
ZNF462 haploinsufficiency is linked to Weiss-Kruszka syndrome, a genetic disorder characterized by neurodevelopmental defects, including autism. Though conserved in vertebrates and essential for embryonic development, the molecular functions of ZNF462 remain unclear. We identified its murine homologue ZFP462 in a screen for mediators of epigenetic gene silencing. Here we show that ZFP462 safeguards neural lineage specification of mouse embryonic stem cells (ESCs) by targeting the H3K9-specific histone methyltransferase complex G9A/GLP to silence meso-endodermal genes. ZFP462 binds to transposable elements that are potential enhancers harbouring pluripotency and meso-endoderm transcription factor binding sites. Recruiting G9A/GLP, ZFP462 seeds heterochromatin, restricting transcription factor binding. Loss of ZFP462 in ESCs results in increased chromatin accessibility at target sites and ectopic expression of meso-endodermal genes. Taken together, ZFP462 confers lineage and locus specificity to the broadly expressed epigenetic regulator G9A/GLP. Our results suggest that aberrant activation of lineage non-specific genes in the neuronal lineage underlies ZNF462-associated neurodevelopmental pathology.
Collapse
Affiliation(s)
- Ramesh Yelagandula
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria.
- Department of Biochemistry and Molecular Medicine and Norris Comprehensive Cancer Center, Keck School of Medicine of the University of Southern California, Los Angeles, CA, USA.
| | - Karin Stecher
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
- Vienna BioCenter PhD Program, Vienna, Austria
| | - Maria Novatchkova
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria
| | - Luca Michetti
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
- Università Vita-Salute San Raffaele, Milan, Italy
| | - Georg Michlits
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Jingkui Wang
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria
| | - Pablo Hofbauer
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Gintautas Vainorius
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Carina Pribitzer
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria
| | - Luke Isbel
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Sasha Mendjan
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Dirk Schübeler
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Ulrich Elling
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Julius Brennecke
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Oliver Bell
- Institute of Molecular Biotechnology of the Austrian Academy of Science (IMBA), Vienna BioCenter (VBC), Vienna, Austria.
- Department of Biochemistry and Molecular Medicine and Norris Comprehensive Cancer Center, Keck School of Medicine of the University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
20
|
Angileri KM, Bagia NA, Feschotte C. Transposon control as a checkpoint for tissue regeneration. Development 2022; 149:dev191957. [PMID: 36440631 PMCID: PMC10655923 DOI: 10.1242/dev.191957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 10/03/2022] [Indexed: 11/29/2022]
Abstract
Tissue regeneration requires precise temporal control of cellular processes such as inflammatory signaling, chromatin remodeling and proliferation. The combination of these processes forms a unique microenvironment permissive to the expression, and potential mobilization of, transposable elements (TEs). Here, we develop the hypothesis that TE activation creates a barrier to tissue repair that must be overcome to achieve successful regeneration. We discuss how uncontrolled TE activity may impede tissue restoration and review mechanisms by which TE activity may be controlled during regeneration. We posit that the diversification and co-evolution of TEs and host control mechanisms may contribute to the wide variation in regenerative competency across tissues and species.
Collapse
Affiliation(s)
- Krista M. Angileri
- Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, NY 14850, USA
| | - Nornubari A. Bagia
- Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, NY 14850, USA
| | - Cedric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, NY 14850, USA
| |
Collapse
|
21
|
Modzelewski AJ, Gan Chong J, Wang T, He L. Mammalian genome innovation through transposon domestication. Nat Cell Biol 2022; 24:1332-1340. [PMID: 36008480 PMCID: PMC9729749 DOI: 10.1038/s41556-022-00970-4] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 06/27/2022] [Indexed: 01/13/2023]
Abstract
Since the discovery of transposons, their sheer abundance in host genomes has puzzled many. While historically viewed as largely harmless 'parasitic' DNAs during evolution, transposons are not a mere record of ancient genome invasion. Instead, nearly every element of transposon biology has been integrated into host biology. Here we review how host genome sequences introduced by transposon activities provide raw material for genome innovation and document the distinct evolutionary path of each species.
Collapse
Affiliation(s)
- Andrew J Modzelewski
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, CA, USA
- Department of Biomedical Sciences, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Johnny Gan Chong
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, CA, USA
| | - Ting Wang
- Department of Genetics, Edison Family Center for Genome Science and System Biology, McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Lin He
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, CA, USA.
| |
Collapse
|
22
|
Fueyo R, Judd J, Feschotte C, Wysocka J. Roles of transposable elements in the regulation of mammalian transcription. Nat Rev Mol Cell Biol 2022; 23:481-497. [PMID: 35228718 PMCID: PMC10470143 DOI: 10.1038/s41580-022-00457-y] [Citation(s) in RCA: 168] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/25/2022] [Indexed: 12/16/2022]
Abstract
Transposable elements (TEs) comprise about half of the mammalian genome. TEs often contain sequences capable of recruiting the host transcription machinery, which they use to express their own products and promote transposition. However, the regulatory sequences carried by TEs may affect host transcription long after the TEs have lost the ability to transpose. Recent advances in genome analysis and engineering have facilitated systematic interrogation of the regulatory activities of TEs. In this Review, we discuss diverse mechanisms by which TEs contribute to transcription regulation. Notably, TEs can donate enhancer and promoter sequences that influence the expression of host genes, modify 3D chromatin architecture and give rise to novel regulatory genes, including non-coding RNAs and transcription factors. We discuss how TEs spur regulatory evolution and facilitate the emergence of genetic novelties in mammalian physiology and development. By virtue of their repetitive and interspersed nature, TEs offer unique opportunities to dissect the effects of mutation and genomic context on the function and evolution of cis-regulatory elements. We argue that TE-centric studies hold the key to unlocking general principles of transcription regulation and evolution.
Collapse
Affiliation(s)
- Raquel Fueyo
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Julius Judd
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Cedric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University School of Medicine, Stanford, CA, USA.
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
23
|
Barnada SM, Isopi A, Tejada-Martinez D, Goubert C, Patoori S, Pagliaroli L, Tracewell M, Trizzino M. Genomic features underlie the co-option of SVA transposons as cis-regulatory elements in human pluripotent stem cells. PLoS Genet 2022; 18:e1010225. [PMID: 35704668 PMCID: PMC9239442 DOI: 10.1371/journal.pgen.1010225] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 06/28/2022] [Accepted: 04/28/2022] [Indexed: 01/08/2023] Open
Abstract
Domestication of transposable elements (TEs) into functional cis-regulatory elements is a widespread phenomenon. However, the mechanisms behind why some TEs are co-opted as functional enhancers while others are not are underappreciated. SINE-VNTR-Alus (SVAs) are the youngest group of transposons in the human genome, where ~3,700 copies are annotated, nearly half of which are human-specific. Many studies indicate that SVAs are among the most frequently co-opted TEs in human gene regulation, but the mechanisms underlying such processes have not yet been thoroughly investigated. Here, we leveraged CRISPR-interference (CRISPRi), computational and functional genomics to elucidate the genomic features that underlie SVA domestication into human stem-cell gene regulation. We found that ~750 SVAs are co-opted as functional cis-regulatory elements in human induced pluripotent stem cells. These SVAs are significantly closer to genes and harbor more transcription factor binding sites than non-co-opted SVAs. We show that a long DNA motif composed of flanking YY1/2 and OCT4 binding sites is enriched in the co-opted SVAs and that these two transcription factors bind consecutively on the TE sequence. We used CRISPRi to epigenetically repress active SVAs in stem cell-like NCCIT cells. Epigenetic perturbation of active SVAs strongly attenuated YY1/OCT4 binding and influenced neighboring gene expression. Ultimately, SVA repression resulted in ~3,000 differentially expressed genes, 131 of which were the nearest gene to an annotated SVA. In summary, we demonstrated that SVAs modulate human gene expression, and uncovered that location and sequence composition contribute to SVA domestication into gene regulatory networks.
Collapse
Affiliation(s)
- Samantha M. Barnada
- Department of Biochemistry and Molecular Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
- Genetics, Genomics and Cancer Biology PhD Program, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| | - Andrew Isopi
- Department of Microbiology and Immunology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
- Biochemistry and Molecular Pharmacology PhD Program, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| | - Daniela Tejada-Martinez
- Department of Biochemistry and Molecular Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| | - Clément Goubert
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada
| | - Sruti Patoori
- Department of Biochemistry and Molecular Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| | - Luca Pagliaroli
- Department of Biochemistry and Molecular Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| | - Mason Tracewell
- Department of Biochemistry and Molecular Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
- Biochemistry and Molecular Pharmacology PhD Program, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| | - Marco Trizzino
- Department of Biochemistry and Molecular Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pennsylvania, United States of America
| |
Collapse
|
24
|
Bakoulis S, Krautz R, Alcaraz N, Salvatore M, Andersson R. OUP accepted manuscript. Nucleic Acids Res 2022; 50:2111-2127. [PMID: 35166831 PMCID: PMC8887488 DOI: 10.1093/nar/gkac088] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 01/22/2022] [Accepted: 01/27/2022] [Indexed: 11/12/2022] Open
Affiliation(s)
| | | | - Nicolas Alcaraz
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
- Novo Nordisk Foundation Center for Protein Research (CPR), University of Copenhagen, 2200 Copenhagen, Denmark
| | - Marco Salvatore
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Robin Andersson
- To whom correspondence should be addressed. Tel: +45 35330245;
| |
Collapse
|
25
|
Glaser LV, Steiger M, Fuchs A, van Bömmel A, Einfeldt E, Chung HR, Vingron M, Meijsing SH. Assessing genome-wide dynamic changes in enhancer activity during early mESC differentiation by FAIRE-STARR-seq. Nucleic Acids Res 2021; 49:12178-12195. [PMID: 34850108 PMCID: PMC8643627 DOI: 10.1093/nar/gkab1100] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 10/14/2021] [Accepted: 10/22/2021] [Indexed: 11/18/2022] Open
Abstract
Embryonic stem cells (ESCs) can differentiate into any given cell type and therefore represent a versatile model to study the link between gene regulation and differentiation. To quantitatively assess the dynamics of enhancer activity during the early stages of murine ESC differentiation, we analyzed accessible genomic regions using STARR-seq, a massively parallel reporter assay. This resulted in a genome-wide quantitative map of active mESC enhancers, in pluripotency and during the early stages of differentiation. We find that only a minority of accessible regions is active and that such regions are enriched near promoters, characterized by specific chromatin marks, enriched for distinct sequence motifs, and modeling shows that active regions can be predicted from sequence alone. Regions that change their activity upon retinoic acid-induced differentiation are more prevalent at distal intergenic regions when compared to constitutively active enhancers. Further, analysis of differentially active enhancers verified the contribution of individual TF motifs toward activity and inducibility as well as their role in regulating endogenous genes. Notably, the activity of retinoic acid receptor alpha (RARα) occupied regions can either increase or decrease upon the addition of its ligand, retinoic acid, with the direction of the change correlating with spacing and orientation of the RARα consensus motif and the co-occurrence of additional sequence motifs. Together, our genome-wide enhancer activity map elucidates features associated with enhancer activity levels, identifies regulatory regions disregarded by computational prediction tools, and provides a resource for future studies into regulatory elements in mESCs.
Collapse
Affiliation(s)
- Laura V Glaser
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Mara Steiger
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Alisa Fuchs
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
- The Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, 10115 Berlin, Germany
| | - Alena van Bömmel
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Edda Einfeldt
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Ho-Ryun Chung
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
- Institute for Medical Bioinformatics and Biostatistics, Philipps University of Marburg, 35037 Marburg, Germany
| | - Martin Vingron
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Sebastiaan H Meijsing
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
- Max Planck Unit for the Science of Pathogens, 10117 Berlin, Germany
| |
Collapse
|
26
|
Modzelewski AJ, Shao W, Chen J, Lee A, Qi X, Noon M, Tjokro K, Sales G, Biton A, Anand A, Speed TP, Xuan Z, Wang T, Risso D, He L. A mouse-specific retrotransposon drives a conserved Cdk2ap1 isoform essential for development. Cell 2021; 184:5541-5558.e22. [PMID: 34644528 PMCID: PMC8787082 DOI: 10.1016/j.cell.2021.09.021] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 07/26/2021] [Accepted: 09/14/2021] [Indexed: 12/13/2022]
Abstract
Retrotransposons mediate gene regulation in important developmental and pathological processes. Here, we characterized the transient retrotransposon induction during preimplantation development of eight mammals. Induced retrotransposons exhibit similar preimplantation profiles across species, conferring gene regulatory activities, particularly through long terminal repeat (LTR) retrotransposon promoters. A mouse-specific MT2B2 retrotransposon promoter generates an N-terminally truncated Cdk2ap1ΔN that peaks in preimplantation embryos and promotes proliferation. In contrast, the canonical Cdk2ap1 peaks in mid-gestation and represses cell proliferation. This MT2B2 promoter, whose deletion abolishes Cdk2ap1ΔN production, reduces cell proliferation and impairs embryo implantation, is developmentally essential. Intriguingly, Cdk2ap1ΔN is evolutionarily conserved in sequence and function yet is driven by different promoters across mammals. The distinct preimplantation Cdk2ap1ΔN expression in each mammalian species correlates with the duration of its preimplantation development. Hence, species-specific transposon promoters can yield evolutionarily conserved, alternative protein isoforms, bestowing them with new functions and species-specific expression to govern essential biological divergence.
Collapse
Affiliation(s)
- Andrew J Modzelewski
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Wanqing Shao
- Department of Genetics, Edison Family Center for Genome Science and System Biology, McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Jingqi Chen
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Angus Lee
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Xin Qi
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Mackenzie Noon
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Kristy Tjokro
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Gabriele Sales
- Department of Biology, University of Padova, Padova 35122, Italy
| | - Anne Biton
- Department of Statistics, University of California, Berkeley, Berkeley, CA 94720, USA; Bioinformatics and Biostatistics, Department of Computational Biology, USR 3756 CNRS, Institut Pasteur, Paris 75015, France
| | - Aparna Anand
- Department of Genetics, Edison Family Center for Genome Science and System Biology, McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Terence P Speed
- Bioinformatics Division, WEHI, Parkville, VIC 3052, Australia
| | - Zhenyu Xuan
- Department of Biological Sciences, The University of Texas at Dallas, Richardson, TX 75080, USA
| | - Ting Wang
- Department of Genetics, Edison Family Center for Genome Science and System Biology, McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63110, USA.
| | - Davide Risso
- Department of Statistical Sciences, University of Padova, Padova 35122, Italy.
| | - Lin He
- Division of Cellular and Developmental Biology, MCB Department, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
27
|
Khetan S, Kales S, Kursawe R, Jillette A, Ulirsch JC, Reilly SK, Ucar D, Tewhey R, Stitzel ML. Functional characterization of T2D-associated SNP effects on baseline and ER stress-responsive β cell transcriptional activation. Nat Commun 2021; 12:5242. [PMID: 34475398 PMCID: PMC8413311 DOI: 10.1038/s41467-021-25514-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Accepted: 08/10/2021] [Indexed: 11/08/2022] Open
Abstract
Genome-wide association studies (GWAS) have linked single nucleotide polymorphisms (SNPs) at >250 loci in the human genome to type 2 diabetes (T2D) risk. For each locus, identifying the functional variant(s) among multiple SNPs in high linkage disequilibrium is critical to understand molecular mechanisms underlying T2D genetic risk. Using massively parallel reporter assays (MPRA), we test the cis-regulatory effects of SNPs associated with T2D and altered in vivo islet chromatin accessibility in MIN6 β cells under steady state and pathophysiologic endoplasmic reticulum (ER) stress conditions. We identify 1,982/6,621 (29.9%) SNP-containing elements that activate transcription in MIN6 and 879 SNP alleles that modulate MPRA activity. Multiple T2D-associated SNPs alter the activity of short interspersed nuclear element (SINE)-containing elements that are strongly induced by ER stress. We identify 220 functional variants at 104 T2D association signals, narrowing 54 signals to a single candidate SNP. Together, this study identifies elements driving β cell steady state and ER stress-responsive transcriptional activation, nominates causal T2D SNPs, and uncovers potential roles for repetitive elements in β cell transcriptional stress response and T2D genetics.
Collapse
Affiliation(s)
- Shubham Khetan
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Department of Genetics and Genome Sciences, University of Connecticut, Farmington, CT, USA
| | - Susan Kales
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, USA
| | - Romy Kursawe
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Jacob C Ulirsch
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Duygu Ucar
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Department of Genetics and Genome Sciences, University of Connecticut, Farmington, CT, USA
- Institute of Systems Genomics, University of Connecticut, Farmington, CT, USA
| | - Ryan Tewhey
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, USA.
- Graduate School of Biomedical Sciences and Engineering, University of Maine, Orono, ME, USA.
- Tufts University School of Medicine, Boston, MA, USA.
| | - Michael L Stitzel
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
- Department of Genetics and Genome Sciences, University of Connecticut, Farmington, CT, USA.
- Institute of Systems Genomics, University of Connecticut, Farmington, CT, USA.
| |
Collapse
|
28
|
Judd J, Sanderson H, Feschotte C. Evolution of mouse circadian enhancers from transposable elements. Genome Biol 2021; 22:193. [PMID: 34187518 PMCID: PMC8240256 DOI: 10.1186/s13059-021-02409-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 06/10/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Transposable elements are increasingly recognized as a source of cis-regulatory variation. Previous studies have revealed that transposons are often bound by transcription factors and some have been co-opted into functional enhancers regulating host gene expression. However, the process by which transposons mature into complex regulatory elements, like enhancers, remains poorly understood. To investigate this process, we examined the contribution of transposons to the cis-regulatory network controlling circadian gene expression in the mouse liver, a well-characterized network serving an important physiological function. RESULTS ChIP-seq analyses reveal that transposons and other repeats contribute ~ 14% of the binding sites for core circadian regulators (CRs) including BMAL1, CLOCK, PER1/2, and CRY1/2, in the mouse liver. RSINE1, an abundant murine-specific SINE, is the only transposon family enriched for CR binding sites across all datasets. Sequence analyses and reporter assays reveal that the circadian regulatory activity of RSINE1 stems from the presence of imperfect CR binding motifs in the ancestral RSINE1 sequence. These motifs matured into canonical motifs through point mutations after transposition. Furthermore, maturation occurred preferentially within elements inserted in the proximity of ancestral CR binding sites. RSINE1 also acquired motifs that recruit nuclear receptors known to cooperate with CRs to regulate circadian gene expression specifically in the liver. CONCLUSIONS Our results suggest that the birth of enhancers from transposons is predicated both by the sequence of the transposon and by the cis-regulatory landscape surrounding their genomic integration site.
Collapse
Affiliation(s)
- Julius Judd
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14853, USA
| | - Hayley Sanderson
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, 84112, USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14853, USA.
| |
Collapse
|
29
|
Ferrari R, Grandi N, Tramontano E, Dieci G. Retrotransposons as Drivers of Mammalian Brain Evolution. Life (Basel) 2021; 11:life11050376. [PMID: 33922141 PMCID: PMC8143547 DOI: 10.3390/life11050376] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 04/20/2021] [Accepted: 04/21/2021] [Indexed: 12/11/2022] Open
Abstract
Retrotransposons, a large and diverse class of transposable elements that are still active in humans, represent a remarkable force of genomic innovation underlying mammalian evolution. Among the features distinguishing mammals from all other vertebrates, the presence of a neocortex with a peculiar neuronal organization, composition and connectivity is perhaps the one that, by affecting the cognitive abilities of mammals, contributed mostly to their evolutionary success. Among mammals, hominids and especially humans display an extraordinarily expanded cortical volume, an enrichment of the repertoire of neural cell types and more elaborate patterns of neuronal connectivity. Retrotransposon-derived sequences have recently been implicated in multiple layers of gene regulation in the brain, from transcriptional and post-transcriptional control to both local and large-scale three-dimensional chromatin organization. Accordingly, an increasing variety of neurodevelopmental and neurodegenerative conditions are being recognized to be associated with retrotransposon dysregulation. We review here a large body of recent studies lending support to the idea that retrotransposon-dependent evolutionary novelties were crucial for the emergence of mammalian, primate and human peculiarities of brain morphology and function.
Collapse
Affiliation(s)
- Roberto Ferrari
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, 43124 Parma, Italy;
| | - Nicole Grandi
- Laboratory of Molecular Virology, Department of Life and Environmental Sciences, University of Cagliari, Cittadella Universitaria di Monserrato, 09042 Monserrato, Italy; (N.G.); (E.T.)
| | - Enzo Tramontano
- Laboratory of Molecular Virology, Department of Life and Environmental Sciences, University of Cagliari, Cittadella Universitaria di Monserrato, 09042 Monserrato, Italy; (N.G.); (E.T.)
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, 09042 Monserrato, Italy
| | - Giorgio Dieci
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, 43124 Parma, Italy;
- Correspondence:
| |
Collapse
|
30
|
Avsec Ž, Weilert M, Shrikumar A, Krueger S, Alexandari A, Dalal K, Fropf R, McAnany C, Gagneur J, Kundaje A, Zeitlinger J. Base-resolution models of transcription-factor binding reveal soft motif syntax. Nat Genet 2021; 53:354-366. [PMID: 33603233 PMCID: PMC8812996 DOI: 10.1038/s41588-021-00782-6] [Citation(s) in RCA: 306] [Impact Index Per Article: 76.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Accepted: 01/07/2021] [Indexed: 01/30/2023]
Abstract
The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)-nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.
Collapse
Affiliation(s)
- Žiga Avsec
- Department of Informatics, Technical University of Munich, Garching, Germany,Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, Munich, Germany,Currently at DeepMind, London, UK
| | - Melanie Weilert
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Avanti Shrikumar
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Sabrina Krueger
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Amr Alexandari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Khyati Dalal
- Stowers Institute for Medical Research, Kansas City, MO, USA,The University of Kansas Medical Center, Kansas City, KS, USA
| | - Robin Fropf
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Charles McAnany
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Julien Gagneur
- Department of Informatics, Technical University of Munich, Garching, Germany
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA,Department of Genetics, Stanford University, Stanford, CA, USA,correspondence: ,
| | - Julia Zeitlinger
- Stowers Institute for Medical Research, Kansas City, MO, USA,The University of Kansas Medical Center, Kansas City, KS, USA,correspondence: ,
| |
Collapse
|
31
|
Pérez-Molina R, Arzate-Mejía RG, Ayala-Ortega E, Guerrero G, Meier K, Suaste-Olmos F, Recillas-Targa F. An Intronic Alu Element Attenuates the Transcription of a Long Non-coding RNA in Human Cell Lines. Front Genet 2020; 11:928. [PMID: 33061937 PMCID: PMC7489498 DOI: 10.3389/fgene.2020.00928] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Accepted: 07/27/2020] [Indexed: 12/12/2022] Open
Abstract
Alu elements are primate-specific repeats and represent the most abundant type of transposable elements (TE) in the human genome. Genome-wide analysis of the enrichment of histone post-translational modifications suggests that human Alu sequences could function as transcriptional enhancers; however, no functional experiments have evaluated the role of Alu sequences in the control of transcription in situ. The present study analyses the regulatory activity of a human Alu sequence from the AluSx family located in the second intron of the long intergenic non-coding RNA Linc00441, found in divergent orientation to the RB1 gene. We observed that the Alu sequence acts as an enhancer element based on reporter gene assays while CRISPR-Cas9 deletions of the Alu sequence in K562 cells resulted in a marked transcriptional upregulation of Linc00441 and a decrease in proliferation. Our results suggest that an intragenic Alu sequence with enhancer activity can act as a transcriptional attenuator of its host lincRNA.
Collapse
Affiliation(s)
- Rosario Pérez-Molina
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Rodrigo G Arzate-Mejía
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Erandi Ayala-Ortega
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Georgina Guerrero
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Karin Meier
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Fernando Suaste-Olmos
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Félix Recillas-Targa
- Departamento de Genética Molecular, Instituto de Fisiología Celular, Universidad Nacional Autónoma de México, Mexico City, Mexico
| |
Collapse
|
32
|
Miao B, Fu S, Lyu C, Gontarz P, Wang T, Zhang B. Tissue-specific usage of transposable element-derived promoters in mouse development. Genome Biol 2020; 21:255. [PMID: 32988383 PMCID: PMC7520981 DOI: 10.1186/s13059-020-02164-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 09/07/2020] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Transposable elements (TEs) are a significant component of eukaryotic genomes and play essential roles in genome evolution. Mounting evidence indicates that TEs are highly transcribed in early embryo development and contribute to distinct biological functions and tissue morphology. RESULTS We examine the epigenetic dynamics of mouse TEs during the development of five tissues: intestine, liver, lung, stomach, and kidney. We found that TEs are associated with over 20% of open chromatin regions during development. Close to half of these accessible TEs are only activated in a single tissue and a specific developmental stage. Most accessible TEs are rodent-specific. Across these five tissues, 453 accessible TEs are found to create the transcription start sites of downstream genes in mouse, including 117 protein-coding genes and 144 lincRNA genes, 93.7% of which are mouse-specific. Species-specific TE-derived transcription start sites are found to drive the expression of tissue-specific genes and change their tissue-specific expression patterns during evolution. CONCLUSION Our results suggest that TE insertions increase the regulatory potential of the genome, and some TEs have been domesticated to become a crucial component of gene and regulate tissue-specific expression during mouse tissue development.
Collapse
Affiliation(s)
- Benpeng Miao
- Department of Developmental Biology, Center of Regenerative Medicine, Washington University School of Medicine, St. Louis, MO, 63108, USA
- Department of Genetics, Edison Family Center for Genomic Sciences and Systems Biology, McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, 63108, USA
| | - Shuhua Fu
- Department of Developmental Biology, Center of Regenerative Medicine, Washington University School of Medicine, St. Louis, MO, 63108, USA
| | - Cheng Lyu
- Department of Developmental Biology, Center of Regenerative Medicine, Washington University School of Medicine, St. Louis, MO, 63108, USA
| | - Paul Gontarz
- Department of Developmental Biology, Center of Regenerative Medicine, Washington University School of Medicine, St. Louis, MO, 63108, USA
| | - Ting Wang
- Department of Genetics, Edison Family Center for Genomic Sciences and Systems Biology, McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, 63108, USA.
| | - Bo Zhang
- Department of Developmental Biology, Center of Regenerative Medicine, Washington University School of Medicine, St. Louis, MO, 63108, USA.
| |
Collapse
|
33
|
Lee HE, Huh JW, Kim HS. Bioinformatics Analysis of Evolution and Human Disease Related Transposable Element-Derived microRNAs. Life (Basel) 2020; 10:life10060095. [PMID: 32630504 PMCID: PMC7345915 DOI: 10.3390/life10060095] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 06/22/2020] [Accepted: 06/23/2020] [Indexed: 12/22/2022] Open
Abstract
Transposable element (TE) has the ability to insert into certain parts of the genome, and due to this event, it is possible for TEs to generate new factors and one of these factors are microRNAs (miRNA). miRNAs are non-coding RNAs made up of 19 to 24 nucleotides and numerous miRNAs are derived from TE. In this study, to support general knowledge on TE and miRNAs derived from TE, several bioinformatics tools and databases were used to analyze miRNAs derived from TE in two aspects: evolution and human disease. The distribution of TEs in diverse species presents that almost half of the genome is covered with TE in mammalians and less than a half in other vertebrates and invertebrates. Based on selected evolution-related miRNAs studies, a total of 51 miRNAs derived from TE were found and analyzed. For the human disease-related miRNAs, total of 34 miRNAs derived from TE were organized from the previous studies. In summary, abundant miRNAs derived from TE are found, however, the function of miRNAs derived from TE is not informed either. Therefore, this study provides theoretical understanding of miRNAs derived from TE by using various bioinformatics tools.
Collapse
Affiliation(s)
- Hee-Eun Lee
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 28116, Korea; (H.-E.L.); (J.-W.H.)
| | - Jae-Won Huh
- National Primate Research Center, Korea Research Institute of Bioscience and Biotechnology, Cheongju 28116, Korea; (H.-E.L.); (J.-W.H.)
- Department of Functional Genomics, KRIBB School of Bioscience, Korea University of Science and Technology (UST), Daejeon 34113, Korea
| | - Heui-Soo Kim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 46241, Korea
- Institute of Systems Biology, Pusan National University, Busan 46241, Korea
- Correspondence: ; Tel.: +82-51-510-2259; Fax: +82-51-581-2962
| |
Collapse
|
34
|
Nishihara H. Retrotransposons spread potential cis-regulatory elements during mammary gland evolution. Nucleic Acids Res 2020; 47:11551-11562. [PMID: 31642473 PMCID: PMC7145552 DOI: 10.1093/nar/gkz1003] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2018] [Revised: 10/14/2019] [Accepted: 10/17/2019] [Indexed: 12/18/2022] Open
Abstract
Acquisition of cis-elements is a major driving force for rewiring a gene regulatory network. Several kinds of transposable elements (TEs), mostly retrotransposons that propagate via a copy-and-paste mechanism, are known to possess transcription factor binding motifs and have provided source sequences for enhancers/promoters. However, it remains largely unknown whether retrotransposons have spread the binding sites of master regulators of morphogenesis and accelerated cis-regulatory expansion involved in common mammalian morphological features during evolution. Here, I demonstrate that thousands of binding sites for estrogen receptor α (ERα) and three related pioneer factors (FoxA1, GATA3 and AP2γ) that are essential regulators of mammary gland development arose from a spreading of the binding motifs by retrotransposons. The TE-derived functional elements serve primarily as distal enhancers and are enriched around genes associated with mammary gland morphogenesis. The source TEs occurred via a two-phased expansion consisting of mainly L2/MIR in a eutherian ancestor and endogenous retrovirus 1 (ERV1) in simian primates and murines. Thus the build-up of potential sources for cis-elements by retrotransposons followed by their frequent utilization by the host (co-option/exaptation) may have a general accelerating effect on both establishing and diversifying a gene regulatory network, leading to morphological innovation.
Collapse
Affiliation(s)
- Hidenori Nishihara
- Department of Life Science and Technology, Tokyo Institute of Technology, 4259-S2-17, Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| |
Collapse
|
35
|
Carter AC, Xu J, Nakamoto MY, Wei Y, Zarnegar BJ, Shi Q, Broughton JP, Ransom RC, Salhotra A, Nagaraja SD, Li R, Dou DR, Yost KE, Cho SW, Mistry A, Longaker MT, Khavari PA, Batey RT, Wuttke DS, Chang HY. Spen links RNA-mediated endogenous retrovirus silencing and X chromosome inactivation. eLife 2020; 9:e54508. [PMID: 32379046 PMCID: PMC7282817 DOI: 10.7554/elife.54508] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 05/05/2020] [Indexed: 12/12/2022] Open
Abstract
The Xist lncRNA mediates X chromosome inactivation (XCI). Here we show that Spen, an Xist-binding repressor protein essential for XCI , binds to ancient retroviral RNA, performing a surveillance role to recruit chromatin silencing machinery to these parasitic loci. Spen loss activates a subset of endogenous retroviral (ERV) elements in mouse embryonic stem cells, with gain of chromatin accessibility, active histone modifications, and ERV RNA transcription. Spen binds directly to ERV RNAs that show structural similarity to the A-repeat of Xist, a region critical for Xist-mediated gene silencing. ERV RNA and Xist A-repeat bind the RRM domains of Spen in a competitive manner. Insertion of an ERV into an A-repeat deficient Xist rescues binding of Xist RNA to Spen and results in strictly local gene silencing in cis. These results suggest that Xist may coopt transposable element RNA-protein interactions to repurpose powerful antiviral chromatin silencing machinery for sex chromosome dosage compensation.
Collapse
Affiliation(s)
- Ava C Carter
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Jin Xu
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Meagan Y Nakamoto
- Department of Biochemistry, University of ColoradoBoulderUnited States
| | - Yuning Wei
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Brian J Zarnegar
- Department of Dermatology, Stanford University School of MedicineStanfordUnited States
| | - Quanming Shi
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - James P Broughton
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Ryan C Ransom
- Department of Surgery, Division of Plastic and Reconstructive Surgery, Stanford University School of MedicineStanfordUnited States
| | - Ankit Salhotra
- Department of Surgery, Division of Plastic and Reconstructive Surgery, Stanford University School of MedicineStanfordUnited States
| | - Surya D Nagaraja
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford UniversityStanfordUnited States
| | - Rui Li
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Diana R Dou
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Kathryn E Yost
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Seung-Woo Cho
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
| | - Anil Mistry
- Novartis Institute for Biomedical ResearchCambridgeUnited States
| | - Michael T Longaker
- Department of Surgery, Division of Plastic and Reconstructive Surgery, Stanford University School of MedicineStanfordUnited States
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford UniversityStanfordUnited States
| | - Paul A Khavari
- Department of Dermatology, Stanford University School of MedicineStanfordUnited States
| | - Robert T Batey
- Department of Biochemistry, University of ColoradoBoulderUnited States
| | - Deborah S Wuttke
- Department of Biochemistry, University of ColoradoBoulderUnited States
| | - Howard Y Chang
- Center for Personal Dynamic Regulomes, Stanford UniversityStanfordUnited States
- Department of Dermatology, Stanford University School of MedicineStanfordUnited States
- Howard Hughes Medical Institute, Stanford UniversityStanfordUnited States
| |
Collapse
|
36
|
Abstract
Since Barbara McClintock’s groundbreaking discovery of mobile DNA sequences some 70 years ago, transposable elements have come to be recognized as important mutagenic agents impacting genome composition, genome evolution, and human health. Transposable elements are a major constituent of prokaryotic and eukaryotic genomes, and the transposition mechanisms enabling transposon proliferation over evolutionary time remain engaging topics for study, suggesting complex interactions with the host, both antagonistic and mutualistic. The impact of transposition is profound, as over 100 human heritable diseases have been attributed to transposon insertions. Transposition can be highly mutagenic, perturbing genome integrity and gene expression in a wide range of organisms. This mutagenic potential has been exploited in the laboratory, where transposons have long been utilized for phenotypic screening and the generation of defined mutant libraries. More recently, barcoding applications and methods for RNA-directed transposition are being used towards new phenotypic screens and studies relevant for gene therapy. Thus, transposable elements are significant in affecting biology both
in vivo and in the laboratory, and this review will survey advances in understanding the biological role of transposons and relevant laboratory applications of these powerful molecular tools.
Collapse
Affiliation(s)
- Anuj Kumar
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI, USA.,Program in Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
37
|
Sundaram V, Wysocka J. Transposable elements as a potent source of diverse cis-regulatory sequences in mammalian genomes. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190347. [PMID: 32075564 PMCID: PMC7061989 DOI: 10.1098/rstb.2019.0347] [Citation(s) in RCA: 119] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Eukaryotic gene regulation is mediated by cis-regulatory elements, which are embedded within the vast non-coding genomic space and recognized by the transcription factors in a sequence- and context-dependent manner. A large proportion of eukaryotic genomes, including at least half of the human genome, are composed of transposable elements (TEs), which in their ancestral form carried their own cis-regulatory sequences able to exploit the host trans environment to promote TE transcription and facilitate transposition. Although not all present-day TE copies have retained this regulatory function, the preexisting regulatory potential of TEs can provide a rich source of cis-regulatory innovation for the host. Here, we review recent evidence documenting diverse contributions of TE sequences to gene regulation by functioning as enhancers, promoters, silencers and boundary elements. We discuss how TE-derived enhancer sequences can rapidly facilitate changes in existing gene regulatory networks and mediate species- and cell-type-specific regulatory innovations, and we postulate a unique contribution of TEs to species-specific gene expression divergence in pluripotency and early embryogenesis. With advances in genome-wide technologies and analyses, systematic investigation of TEs' cis-regulatory potential is now possible and our understanding of the biological impact of genomic TEs is increasing. This article is part of a discussion meeting issue 'Crossroads between transposons and gene regulation'.
Collapse
Affiliation(s)
- Vasavi Sundaram
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, USA.,Department of Developmental Biology, Stanford University School of Medicine, Stanford, USA.,Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, USA
| |
Collapse
|
38
|
Pehrsson EC, Choudhary MNK, Sundaram V, Wang T. The epigenomic landscape of transposable elements across normal human development and anatomy. Nat Commun 2019; 10:5640. [PMID: 31822674 PMCID: PMC6904449 DOI: 10.1038/s41467-019-13555-x] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 11/12/2019] [Indexed: 12/21/2022] Open
Abstract
Transposable elements (TEs) have deposited functional regulatory elements throughout the human genome. Although most are silenced, certain TEs have been co-opted by the host. However, a comprehensive, multidimensional picture of the contribution of TEs to normal human gene regulation is still lacking. Here, we quantify the epigenomic status of TEs across human anatomy and development using data from the Roadmap Epigenomics Project. We find that TEs encompass a quarter of the human regulatory epigenome, and 47% of elements can be in an active regulatory state. We demonstrate that SINEs are enriched relative to other classes for active and transcribed marks, that TEs encompass a higher proportion of enhancer states in the hematopoietic lineage, and that DNA methylation of Alu elements decreases with age, corresponding with a loss of CpG islands. Finally, we identify TEs that may perform an evolutionarily conserved regulatory function, providing a systematic profile of TE activity in normal human tissue.
Collapse
Affiliation(s)
- Erica C Pehrsson
- Department of Genetics, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, MO, 63110, USA.
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
| | - Mayank N K Choudhary
- Department of Genetics, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, MO, 63110, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Vasavi Sundaram
- Department of Genetics, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, MO, 63110, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Ting Wang
- Department of Genetics, Washington University School of Medicine, 4515 McKinley Avenue, St. Louis, MO, 63110, USA.
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, 63108, USA.
| |
Collapse
|
39
|
Dechaud C, Volff JN, Schartl M, Naville M. Sex and the TEs: transposable elements in sexual development and function in animals. Mob DNA 2019; 10:42. [PMID: 31700550 PMCID: PMC6825717 DOI: 10.1186/s13100-019-0185-0] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Accepted: 10/21/2019] [Indexed: 12/23/2022] Open
Abstract
Transposable elements are endogenous DNA sequences able to integrate into and multiply within genomes. They constitute a major source of genetic innovations, as they can not only rearrange genomes but also spread ready-to-use regulatory sequences able to modify host gene expression, and even can give birth to new host genes. As their evolutionary success depends on their vertical transmission, transposable elements are intrinsically linked to reproduction. In organisms with sexual reproduction, this implies that transposable elements have to manifest their transpositional activity in germ cells or their progenitors. The control of sexual development and function can be very versatile, and several studies have demonstrated the implication of transposable elements in the evolution of sex. In this review, we report the functional and evolutionary relationships between transposable elements and sexual reproduction in animals. In particular, we highlight how transposable elements can influence expression of sexual development genes, and how, reciprocally, they are tightly controlled in gonads. We also review how transposable elements contribute to the organization, expression and evolution of sexual development genes and sex chromosomes. This underscores the intricate co-evolution between host functions and transposable elements, which regularly shift from a parasitic to a domesticated status useful to the host.
Collapse
Affiliation(s)
- Corentin Dechaud
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d’Italie, F-69364 Lyon, France
| | - Jean-Nicolas Volff
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d’Italie, F-69364 Lyon, France
| | - Manfred Schartl
- Entwicklungsbiochemie, Biozentrum, Universität Würzburg, Würzburg, Germany
- The Xiphophorus Genetic Stock Center, Department of Chemistry and Biochemistry, Texas State University, San Marcos, TX USA
| | - Magali Naville
- Institut de Genomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Superieure de Lyon, Universite Claude Bernard Lyon 1, 46 allee d’Italie, F-69364 Lyon, France
| |
Collapse
|
40
|
Ray DA, Grimshaw JR, Halsey MK, Korstian JM, Osmanski AB, Sullivan KAM, Wolf KA, Reddy H, Foley N, Stevens RD, Knisbacher BA, Levy O, Counterman B, Edelman NB, Mallet J. Simultaneous TE Analysis of 19 Heliconiine Butterflies Yields Novel Insights into Rapid TE-Based Genome Diversification and Multiple SINE Births and Deaths. Genome Biol Evol 2019; 11:2162-2177. [PMID: 31214686 PMCID: PMC6685494 DOI: 10.1093/gbe/evz125] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/11/2019] [Indexed: 12/21/2022] Open
Abstract
Transposable elements (TEs) play major roles in the evolution of genome structure and function. However, because of their repetitive nature, they are difficult to annotate and discovering the specific roles they may play in a lineage can be a daunting task. Heliconiine butterflies are models for the study of multiple evolutionary processes including phenotype evolution and hybridization. We attempted to determine how TEs may play a role in the diversification of genomes within this clade by performing a detailed examination of TE content and accumulation in 19 species whose genomes were recently sequenced. We found that TE content has diverged substantially and rapidly in the time since several subclades shared a common ancestor with each lineage harboring a unique TE repertoire. Several novel SINE lineages have been established that are restricted to a subset of species. Furthermore, the previously described SINE, Metulj, appears to have gone extinct in two subclades while expanding to significant numbers in others. This diversity in TE content and activity has the potential to impact how heliconiine butterflies continue to evolve and diverge.
Collapse
Affiliation(s)
- David A Ray
- Department of Biological Science, Texas Tech University
| | | | | | | | | | | | | | - Harsith Reddy
- Department of Biological Science, Texas Tech University
| | - Nicole Foley
- Department of Biological Science, Texas Tech University
- Department of Veterinary Integrative Biosciences, College of Veterinary Medicine, Texas A&M University, College Station, TX
| | | | - Binyamin A Knisbacher
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel
- Broad Institute of MIT and Harvard, Cambridge, MA
| | - Orr Levy
- Department of Physics, Bar-Ilan University, Ramat Gan, Israel
| | | | | | - James Mallet
- Department of Organismic and Evolutionary Biology, Harvard University
| |
Collapse
|
41
|
Brind'Amour J, Mager DL. Reality check for transposon enhancers. eLife 2019; 8:47900. [PMID: 31148541 PMCID: PMC6544458 DOI: 10.7554/elife.47900] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Accepted: 05/28/2019] [Indexed: 12/13/2022] Open
Abstract
Hundreds of retrovirus-like sequences have features that suggest they might be gene enhancers, but only a small fraction displays gene-regulating activity in experiments on mouse stem cells.
Collapse
Affiliation(s)
- Julie Brind'Amour
- Department of Medical Genetics, University of British Columbia, Vancouver, Canada
| | | |
Collapse
|
42
|
Todd CD, Deniz Ö, Taylor D, Branco MR. Functional evaluation of transposable elements as enhancers in mouse embryonic and trophoblast stem cells. eLife 2019; 8:e44344. [PMID: 31012843 PMCID: PMC6544436 DOI: 10.7554/elife.44344] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 04/20/2019] [Indexed: 12/18/2022] Open
Abstract
Transposable elements (TEs) are thought to have helped establish gene regulatory networks. Both the embryonic and extraembryonic lineages of the early mouse embryo have seemingly co-opted TEs as enhancers, but there is little evidence that they play significant roles in gene regulation. Here we tested a set of long terminal repeat TE families for roles as enhancers in mouse embryonic and trophoblast stem cells. Epigenomic and transcriptomic data suggested that a large number of TEs helped to establish tissue-specific gene expression programmes. Genetic editing of individual TEs confirmed a subset of these regulatory relationships. However, a wider survey via CRISPR interference of RLTR13D6 elements in embryonic stem cells revealed that only a minority play significant roles in gene regulation. Our results suggest that a subset of TEs are important for gene regulation in early mouse development, and highlight the importance of functional experiments when evaluating gene regulatory roles of TEs.
Collapse
Affiliation(s)
- Christopher D Todd
- Blizard Institute, Barts and The London School of Medicine and DentistryQueen Mary University of LondonLondonUnited Kingdom
- Centre for Genomic Health, Life Sciences InstituteQueen Mary University of LondonLondonUnited Kingdom
| | - Özgen Deniz
- Blizard Institute, Barts and The London School of Medicine and DentistryQueen Mary University of LondonLondonUnited Kingdom
- Centre for Genomic Health, Life Sciences InstituteQueen Mary University of LondonLondonUnited Kingdom
| | - Darren Taylor
- Centre for Genomic Health, Life Sciences InstituteQueen Mary University of LondonLondonUnited Kingdom
| | - Miguel R Branco
- Centre for Genomic Health, Life Sciences InstituteQueen Mary University of LondonLondonUnited Kingdom
| |
Collapse
|
43
|
Guio L, González J. New Insights on the Evolution of Genome Content: Population Dynamics of Transposable Elements in Flies and Humans. Methods Mol Biol 2019; 1910:505-530. [PMID: 31278675 DOI: 10.1007/978-1-4939-9074-0_16] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Understanding the abundance, diversity, and distribution of TEs in genomes is crucial to understand genome structure, function, and evolution. Advances in whole-genome sequencing techniques, as well as in bioinformatics tools, have increased our ability to detect and analyze the transposable element content in genomes. In addition to reference genomes, we now have access to population datasets in which multiple individuals within a species are sequenced. In this chapter, we highlight the recent advances in the study of TE population dynamics focusing on fruit flies and humans, which represent two extremes in terms of TE abundance, diversity, and activity. We review the most recent methodological approaches applied to the study of TE dynamics as well as the new knowledge on host factors involved in the regulation of TE activity. In addition to transposition rates, we also focus on TE deletion rates and on the selective forces that affect the dynamics of TEs in genomes.
Collapse
Affiliation(s)
- Lain Guio
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain.
| |
Collapse
|
44
|
Lu J, Cao X, Zhong S. A likelihood approach to testing hypotheses on the co-evolution of epigenome and genome. PLoS Comput Biol 2018; 14:e1006673. [PMID: 30586383 PMCID: PMC6324829 DOI: 10.1371/journal.pcbi.1006673] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Revised: 01/08/2019] [Accepted: 11/26/2018] [Indexed: 01/03/2023] Open
Abstract
Central questions to epigenome evolution include whether interspecies changes of histone modifications are independent of evolutionary changes of DNA, and if there is dependence whether they depend on any specific types of DNA sequence changes. Here, we present a likelihood approach for testing hypotheses on the co-evolution of genome and histone modifications. The gist of this approach is to convert evolutionary biology hypotheses into probabilistic forms, by explicitly expressing the joint probability of multispecies DNA sequences and histone modifications, which we refer to as a class of Joint Evolutionary Model for the Genome and the Epigenome (JEMGE). JEMGE can be summarized as a mixture model of four components representing four evolutionary hypotheses, namely dependence and independence of interspecies epigenomic variations to underlying sequence substitutions and to underlying sequence insertions and deletions (indels). We implemented a maximum likelihood method to fit the models to the data. Based on comparison of likelihoods, we inferred whether interspecies epigenomic variations depended on substitution or indels in local genomic sequences based on DNase hypersensitivity and spermatid H3K4me3 ChIP-seq data from human and rhesus macaque. Approximately 5.5% of homologous regions in the genomes exhibited H3K4me3 modification in either species, among which approximately 67% homologous regions exhibited local-sequence-dependent interspecies H3K4me3 variations. Substitutions accounted for less local-sequence-dependent H3K4me3 variations than indels. Among transposon-mediated indels, ERV1 insertions and L1 insertions were most strongly associated with H3K4me3 gains and losses, respectively. By initiating probabilistic formulation on the co-evolution of genomes and epigenomes, JEMGE helps to bring evolutionary biology principles to comparative epigenomic studies. Epigenetic modifications play a significant role in gene regulations and thus heavily influence phenotypic outcomes. Whereas cross-species epigenomic comparisons have been fruitful in revealing the function of epigenetic modifications, it still remains unclear how the epigenome changes across species. A central question in epigenome evolution studies is whether interspecies epigenomic variations rely on genomic changes in cis and, if partially yes, whether different genomic changes have distinct impacts. To tackle this question, we initiated a likelihood-based approach, in which different hypotheses related to the co-evolution of the genome and the epigenome could be converted into probabilistic models. By fitting the models to actual data, each model yielded a likelihood, and the hypothesis corresponded to the largest likelihood was selected as most supported by observed data. In this work, we focused on the influence of two types of underlying sequence changes: substitutions, and insertions and deletions (indels). We quantitatively assessed the dependence of H3K4me3 variations on substitutions and indels between human and rhesus, and separated their relative impacts within each genomic region with H3K4me3. The methodology presented here provides a framework for modeling the epigenome together with the genome and a quantitative approach to test different evolutionary hypotheses.
Collapse
Affiliation(s)
- Jia Lu
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Xiaoyi Cao
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Sheng Zhong
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- * E-mail:
| |
Collapse
|
45
|
Raviram R, Rocha PP, Luo VM, Swanzey E, Miraldi ER, Chuong EB, Feschotte C, Bonneau R, Skok JA. Analysis of 3D genomic interactions identifies candidate host genes that transposable elements potentially regulate. Genome Biol 2018; 19:216. [PMID: 30541598 PMCID: PMC6292174 DOI: 10.1186/s13059-018-1598-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 11/28/2018] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND The organization of chromatin in the nucleus plays an essential role in gene regulation. About half of the mammalian genome comprises transposable elements. Given their repetitive nature, reads associated with these elements are generally discarded or randomly distributed among elements of the same type in genome-wide analyses. Thus, it is challenging to identify the activities and properties of individual transposons. As a result, we only have a partial understanding of how transposons contribute to chromatin folding and how they impact gene regulation. RESULTS Using PCR and Capture-based chromosome conformation capture (3C) approaches, collectively called 4Tran, we take advantage of the repetitive nature of transposons to capture interactions from multiple copies of endogenous retrovirus (ERVs) in the human and mouse genomes. With 4Tran-PCR, reads are selectively mapped to unique regions in the genome. This enables the identification of transposable element interaction profiles for individual ERV families and integration events specific to particular genomes. With this approach, we demonstrate that transposons engage in long-range intra-chromosomal interactions guided by the separation of chromosomes into A and B compartments as well as topologically associated domains (TADs). In contrast to 4Tran-PCR, Capture-4Tran can uniquely identify both ends of an interaction that involve retroviral repeat sequences, providing a powerful tool for uncovering the individual transposable element insertions that interact with and potentially regulate target genes. CONCLUSIONS 4Tran provides new insight into the manner in which transposons contribute to chromosome architecture and identifies target genes that transposable elements can potentially control.
Collapse
Affiliation(s)
- Ramya Raviram
- Department of Pathology, New York University School of Medicine, New York, NY 10016 USA
- Department of Biology, New York University, New York, NY 10003 USA
- Ludwig Institute for Cancer Research, La Jolla, CA USA
| | - Pedro P. Rocha
- Department of Pathology, New York University School of Medicine, New York, NY 10016 USA
- Division of Developmental Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20892 USA
| | - Vincent M. Luo
- Department of Pathology, New York University School of Medicine, New York, NY 10016 USA
- Department of Biology, New York University, New York, NY 10003 USA
| | - Emily Swanzey
- Department of Developmental Genetics, New York University School of Medicine, New York, NY 10016 USA
| | - Emily R. Miraldi
- Department of Biology, New York University, New York, NY 10003 USA
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York, NY 10003 USA
- Simons Center for Data Analysis, New York, NY 10010 USA
- Divisions of Immunobiology and Biomedical Informatics, Cincinnati Children’s Hospital, Cincinnati, OH 45229 USA
| | - Edward B. Chuong
- BioFrontiers Institute, Department of Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, CO 80309 USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14850 USA
| | - Richard Bonneau
- Department of Biology, New York University, New York, NY 10003 USA
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York, NY 10003 USA
- Simons Center for Data Analysis, New York, NY 10010 USA
| | - Jane A. Skok
- Department of Pathology, New York University School of Medicine, New York, NY 10016 USA
| |
Collapse
|
46
|
Ohtani H, Liu M, Zhou W, Liang G, Jones PA. Switching roles for DNA and histone methylation depend on evolutionary ages of human endogenous retroviruses. Genome Res 2018; 28:1147-1157. [PMID: 29970451 PMCID: PMC6071641 DOI: 10.1101/gr.234229.118] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 06/29/2018] [Indexed: 12/15/2022]
Abstract
We provide a comprehensive genomic and epigenomic map of the more than 500,000 endogenous retroviruses (ERVs) and fragments that populate the intergenic regions of the human genome. The repressive epigenetic marks associated with the ERVs, particularly long terminal repeats (LTRs), show a remarkable switch in silencing mechanisms, depending on the evolutionary age of the LTRs. Young LTRs tend to be CpG rich and are mainly suppressed by DNA methylation, whereas intermediate age LTRs are associated predominantly with histone modifications, particularly histone H3 lysine 9 (H3K9) methylation. Young LTRs can be reactivated by treatment with the DNA methylation inhibitor 5-aza-2'-deoxycytidine (5-aza-CdR) alone, but their level of expression is much increased by 5-aza-CdR treatment plus knockdown of one of several H3K9 methyltransferases or of the H3K27 methyltransferase EZH2. The removal of cytosine methylation led to rapid, widespread increases in H3K9me3 in the LTRs. Intermediate age LTRs had lower CpG densities and were not up-regulated by 5-aza-CdR treatment, but they were sensitive to knockdown of H3K9 methyltransferases. Unlike the situation in embryonic stem cells, the polycomb repressive complex (PRC2) has a minor role in LTR suppression by itself and is only a player after removal of cytosine methylation in the analyzed cancer cell line. Up-regulation of LTRs and induction of "viral mimicry" is rapidly becoming of interest for predicting cancer patient response to epigenetic therapies. Understanding the mechanism for LTR suppression is of major importance in order to improve patient treatment strategies.
Collapse
Affiliation(s)
- Hitoshi Ohtani
- Van Andel Research Institute, Grand Rapids, Michigan 49503, USA
| | - Minmin Liu
- Van Andel Research Institute, Grand Rapids, Michigan 49503, USA
| | - Wanding Zhou
- Van Andel Research Institute, Grand Rapids, Michigan 49503, USA
| | - Gangning Liang
- Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, California 90089, USA
| | - Peter A Jones
- Van Andel Research Institute, Grand Rapids, Michigan 49503, USA
| |
Collapse
|
47
|
Ward MC, Zhao S, Luo K, Pavlovic BJ, Karimi MM, Stephens M, Gilad Y. Silencing of transposable elements may not be a major driver of regulatory evolution in primate iPSCs. eLife 2018; 7:33084. [PMID: 29648536 PMCID: PMC5943035 DOI: 10.7554/elife.33084] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2017] [Accepted: 04/11/2018] [Indexed: 12/16/2022] Open
Abstract
Transposable elements (TEs) comprise almost half of primate genomes and their aberrant regulation can result in deleterious effects. In pluripotent stem cells, rapidly evolving KRAB-ZNF genes target TEs for silencing by H3K9me3. To investigate the evolution of TE silencing, we performed H3K9me3 ChIP-seq experiments in induced pluripotent stem cells from 10 human and 7 chimpanzee individuals. We identified four million orthologous TEs and found the SVA and ERV families to be marked most frequently by H3K9me3. We found little evidence of inter-species differences in TE silencing, with as many as 82% of putatively silenced TEs marked at similar levels in humans and chimpanzees. TEs that are preferentially silenced in one species are a similar age to those silenced in both species and are not more likely to be associated with expression divergence of nearby orthologous genes. Our data suggest limited species-specificity of TE silencing across 6 million years of primate evolution.
Collapse
Affiliation(s)
- Michelle C Ward
- Department of Human Genetics, University of Chicago, Chicago, United States.,Department of Medicine, University of Chicago, Chicago, United States
| | - Siming Zhao
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Kaixuan Luo
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Bryan J Pavlovic
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Mohammad M Karimi
- MRC London Institute of Medical Sciences, Imperial College, London, United Kingdom
| | - Matthew Stephens
- Department of Human Genetics, University of Chicago, Chicago, United States.,Department of Statistics, University of Chicago, Chicago, United States
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, United States.,Department of Medicine, University of Chicago, Chicago, United States
| |
Collapse
|
48
|
Sundaram V, Wang T. Transposable Element Mediated Innovation in Gene Regulatory Landscapes of Cells: Re-Visiting the "Gene-Battery" Model. Bioessays 2018; 40:10.1002/bies.201700155. [PMID: 29206283 PMCID: PMC5912915 DOI: 10.1002/bies.201700155] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 10/25/2017] [Indexed: 01/31/2023]
Abstract
Transposable elements (TEs) are no longer considered to be "junk" DNA. Here, we review how TEs can impact gene regulation systematically. TEs encode various regulatory elements that enables them to regulate gene expression. RJ Britten and EH Davidson hypothesized that TEs can integrate the function of various transcriptional regulators into gene regulatory networks. Uniquely TEs can deposit regulatory sites across the genome when they transpose, and thereby bring multiple genes under control of the same regulatory logic. Several studies together have robustly established that TEs participate in embryonic development and oncogenesis. We discuss the regulatory characteristics of TEs in context of evolution to understand the extent of their impact on gene networks. Understanding these features of TEs is central to future investigations of TEs in cellular processes and phenotypic presentations, which are applicable to development and disease studies. We re-visit the Britten-Davidson "gene-battery" model and understand the genetic and transcriptional impact of TEs in innovating gene regulatory networks.
Collapse
Affiliation(s)
- Vasavi Sundaram
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Ting Wang
- Department of Genetics, Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St Louis, Missouri 63110, United States of America
| |
Collapse
|
49
|
Shapiro JA. Living Organisms Author Their Read-Write Genomes in Evolution. BIOLOGY 2017; 6:E42. [PMID: 29211049 PMCID: PMC5745447 DOI: 10.3390/biology6040042] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 11/17/2017] [Accepted: 11/28/2017] [Indexed: 12/18/2022]
Abstract
Evolutionary variations generating phenotypic adaptations and novel taxa resulted from complex cellular activities altering genome content and expression: (i) Symbiogenetic cell mergers producing the mitochondrion-bearing ancestor of eukaryotes and chloroplast-bearing ancestors of photosynthetic eukaryotes; (ii) interspecific hybridizations and genome doublings generating new species and adaptive radiations of higher plants and animals; and, (iii) interspecific horizontal DNA transfer encoding virtually all of the cellular functions between organisms and their viruses in all domains of life. Consequently, assuming that evolutionary processes occur in isolated genomes of individual species has become an unrealistic abstraction. Adaptive variations also involved natural genetic engineering of mobile DNA elements to rewire regulatory networks. In the most highly evolved organisms, biological complexity scales with "non-coding" DNA content more closely than with protein-coding capacity. Coincidentally, we have learned how so-called "non-coding" RNAs that are rich in repetitive mobile DNA sequences are key regulators of complex phenotypes. Both biotic and abiotic ecological challenges serve as triggers for episodes of elevated genome change. The intersections of cell activities, biosphere interactions, horizontal DNA transfers, and non-random Read-Write genome modifications by natural genetic engineering provide a rich molecular and biological foundation for understanding how ecological disruptions can stimulate productive, often abrupt, evolutionary transformations.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|