1
|
Correa M, Lerat E, Birmelé E, Samson F, Bouillon B, Normand K, Rizzon C. The Transposable Element Environment of Human Genes Differs According to Their Duplication Status and Essentiality. Genome Biol Evol 2021; 13:6273345. [PMID: 33973013 PMCID: PMC8155550 DOI: 10.1093/gbe/evab062] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/17/2021] [Indexed: 12/13/2022] Open
Abstract
Transposable elements (TEs) are major components of eukaryotic genomes and represent approximately 45% of the human genome. TEs can be important sources of novelty in genomes and there is increasing evidence that TEs contribute to the evolution of gene regulation in mammals. Gene duplication is an evolutionary mechanism that also provides new genetic material and opportunities to acquire new functions. To investigate how duplicated genes are maintained in genomes, here, we explored the TE environment of duplicated and singleton genes. We found that singleton genes have more short-interspersed nuclear elements and DNA transposons in their vicinity than duplicated genes, whereas long-interspersed nuclear elements and long-terminal repeat retrotransposons have accumulated more near duplicated genes. We also discovered that this result is highly associated with the degree of essentiality of the genes with an unexpected accumulation of short-interspersed nuclear elements and DNA transposons around the more-essential genes. Our results underline the importance of taking into account the TE environment of genes to better understand how duplicated genes are maintained in genomes.
Collapse
Affiliation(s)
- Margot Correa
- Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), UMR CNRS 8071, ENSIIE, USC INRA, Université d'Evry Val d'Essonne, Evry, France
| | - Emmanuelle Lerat
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, Université de Lyon, Université Lyon 1, CNRS, Villeurbanne, France
| | - Etienne Birmelé
- Laboratoire MAP5 UMR 8145, Université de Paris, Paris, France
| | - Franck Samson
- Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), UMR CNRS 8071, ENSIIE, USC INRA, Université d'Evry Val d'Essonne, Evry, France
| | - Bérengère Bouillon
- Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), UMR CNRS 8071, ENSIIE, USC INRA, Université d'Evry Val d'Essonne, Evry, France
| | - Kévin Normand
- Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), UMR CNRS 8071, ENSIIE, USC INRA, Université d'Evry Val d'Essonne, Evry, France
| | - Carène Rizzon
- Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), UMR CNRS 8071, ENSIIE, USC INRA, Université d'Evry Val d'Essonne, Evry, France
| |
Collapse
|
2
|
Mkannez G, Gagné-Ouellet V, Jalloul Nsaibia M, Boulanger MC, Rosa M, Argaud D, Hadji F, Gaudreault N, Rhéaume G, Bouchard L, Bossé Y, Mathieu P. DNA methylation of a PLPP3 MIR transposon-based enhancer promotes an osteogenic programme in calcific aortic valve disease. Cardiovasc Res 2019; 114:1525-1535. [PMID: 29726894 DOI: 10.1093/cvr/cvy111] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Accepted: 05/01/2018] [Indexed: 12/12/2022] Open
Abstract
Aims Calcific aortic valve disease (CAVD) is characterized by the osteogenic transition of valve interstitial cells (VICs). In CAVD, lysophosphatidic acid (LysoPA), a lipid mediator with potent osteogenic activity, is produced in the aortic valve (AV) and is degraded by membrane-associated phospholipid phosphatases (PLPPs). We thus hypothesized that a dysregulation of PLPPs could participate to the osteogenic reprograming of VICs during CAVD. Methods and results The expression of PLPPs was examined in human control and mineralized AVs and comprehensive analyses were performed to document the gene regulation and impact of PLPPs on the osteogenic transition of VICs. We found that PLPP3 gene and enzymatic activity were downregulated in mineralized AVs. Multidimensional gene profiling in 21 human AVs showed that expression of PLPP3 was inversely correlated with the level of 5-methylcytosine (5meC) located in an intronic mammalian interspersed repeat (MIR) element. Bisulphite pyrosequencing in a larger series of 67 AVs confirmed that 5meC in intron 1 was increased by 2.2-fold in CAVD compared with control AVs. In isolated cells, epigenome editing with clustered regularly interspersed short palindromic repeats-Cas9 system containing a deficient Cas9 fused with DNA methyltransferase (dCas9-DNMT) was used to increase 5meC in the intronic enhancer and showed that it reduced significantly the expression of PLPP3. Knockdown experiments showed that lower expression of PLPP3 in VICs promotes an osteogenic programme. Conclusions DNA methylation of a MIR-based enhancer downregulates the expression of PLPP3 and promotes the mineralization of the AV.
Collapse
Affiliation(s)
- Ghada Mkannez
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | - Valérie Gagné-Ouellet
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, QC, Canada.,ECOGENE-21 Biocluster, Chicoutimi Hospital, Saguenay, QC, Canada
| | - Mohamed Jalloul Nsaibia
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | - Marie-Chloé Boulanger
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | - Mickael Rosa
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | - Deborah Argaud
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | - Fayez Hadji
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | | | - Gabrielle Rhéaume
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| | - Luigi Bouchard
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, QC, Canada.,ECOGENE-21 Biocluster, Chicoutimi Hospital, Saguenay, QC, Canada
| | - Yohan Bossé
- Department of Molecular Medicine, Laval University, QC, Canada
| | - Patrick Mathieu
- Laboratory of Cardiovascular Pathobiology, Department of Surgery, Quebec Heart and Lung Institute/Research Center, Laval University, QC, Canada
| |
Collapse
|
3
|
Carnevali D, Conti A, Pellegrini M, Dieci G. Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines. DNA Res 2017; 24:59-69. [PMID: 28028040 PMCID: PMC5381342 DOI: 10.1093/dnares/dsw048] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2016] [Accepted: 10/09/2016] [Indexed: 01/06/2023] Open
Abstract
With more than 500,000 copies, mammalian-wide interspersed repeats (MIRs), a sub-group of SINEs, represent ∼2.5% of the human genome and one of the most numerous family of potential targets for the RNA polymerase (Pol) III transcription machinery. Since MIR elements ceased to amplify ∼130 myr ago, previous studies primarily focused on their genomic impact, while the issue of their expression has not been extensively addressed. We applied a dedicated bioinformatic pipeline to ENCODE RNA-Seq datasets of seven human cell lines and, for the first time, we were able to define the Pol III-driven MIR transcriptome at single-locus resolution. While the majority of Pol III-transcribed MIR elements are cell-specific, we discovered a small set of ubiquitously transcribed MIRs mapping within Pol II-transcribed genes in antisense orientation that could influence the expression of the overlapping gene. We also identified novel Pol III-transcribed ncRNAs, deriving from transcription of annotated MIR fragments flanked by unique MIR-unrelated sequences, and confirmed the role of Pol III-specific internal promoter elements in MIR transcription. Besides demonstrating widespread transcription at these retrotranspositionally inactive elements in human cells, the ability to profile MIR expression at single-locus resolution will facilitate their study in different cell types and states including pathological alterations.
Collapse
Affiliation(s)
| | - Anastasia Conti
- Department of Life Sciences, University of Parma, Parma, Italy
| | - Matteo Pellegrini
- Department of Molecular, Cell, and Developmental Biology, University of California, Los Angeles, CA 90095 723, USA
| | - Giorgio Dieci
- Department of Life Sciences, University of Parma, Parma, Italy
| |
Collapse
|
4
|
Chuong EB, Elde NC, Feschotte C. Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet 2016; 18:71-86. [PMID: 27867194 DOI: 10.1038/nrg.2016.139] [Citation(s) in RCA: 827] [Impact Index Per Article: 91.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Transposable elements (TEs) are a prolific source of tightly regulated, biochemically active non-coding elements, such as transcription factor-binding sites and non-coding RNAs. Many recent studies reinvigorate the idea that these elements are pervasively co-opted for the regulation of host genes. We argue that the inherent genetic properties of TEs and the conflicting relationships with their hosts facilitate their recruitment for regulatory functions in diverse genomes. We review recent findings supporting the long-standing hypothesis that the waves of TE invasions endured by organisms for eons have catalysed the evolution of gene-regulatory networks. We also discuss the challenges of dissecting and interpreting the phenotypic effect of regulatory activities encoded by TEs in health and disease.
Collapse
Affiliation(s)
- Edward B Chuong
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84103, USA
| | - Nels C Elde
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84103, USA
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, Utah 84103, USA
| |
Collapse
|
5
|
Jjingo D, Conley AB, Wang J, Mariño-Ramírez L, Lunyak VV, Jordan IK. Mammalian-wide interspersed repeat (MIR)-derived enhancers and the regulation of human gene expression. Mob DNA 2014; 5:14. [PMID: 25018785 PMCID: PMC4090950 DOI: 10.1186/1759-8753-5-14] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 04/10/2014] [Indexed: 11/26/2022] Open
Abstract
Background Mammalian-wide interspersed repeats (MIRs) are the most ancient family of transposable elements (TEs) in the human genome. The deep conservation of MIRs initially suggested the possibility that they had been exapted to play functional roles for their host genomes. MIRs also happen to be the only TEs whose presence in-and-around human genes is positively correlated to tissue-specific gene expression. Similar associations of enhancer prevalence within genes and tissue-specific expression, along with MIRs’ previous implication as providing regulatory sequences, suggested a possible link between MIRs and enhancers. Results To test the possibility that MIRs contribute functional enhancers to the human genome, we evaluated the relationship between MIRs and human tissue-specific enhancers in terms of genomic location, chromatin environment, regulatory function, and mechanistic attributes. This analysis revealed MIRs to be highly concentrated in enhancers of the K562 and HeLa human cell-types. Significantly more enhancers were found to be linked to MIRs than would be expected by chance, and putative MIR-derived enhancers are characterized by a chromatin environment highly similar to that of canonical enhancers. MIR-derived enhancers show strong associations with gene expression levels, tissue-specific gene expression and tissue-specific cellular functions, including a number of biological processes related to erythropoiesis. MIR-derived enhancers were found to be a rich source of transcription factor binding sites, underscoring one possible mechanistic route for the element sequences co-option as enhancers. There is also tentative evidence to suggest that MIR-enhancer function is related to the transcriptional activity of non-coding RNAs. Conclusions Taken together, these data reveal enhancers to be an important cis-regulatory platform from which MIRs can exercise a regulatory function in the human genome and help to resolve a long-standing conundrum as to the reason for MIRs’ deep evolutionary conservation.
Collapse
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Andrew B Conley
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Jianrong Wang
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Leonardo Mariño-Ramírez
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA ; PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
| | - Victoria V Lunyak
- PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia ; Buck Institute for Research on Aging, Novato, CA, USA
| | - I King Jordan
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA ; PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
| |
Collapse
|
6
|
Campos-Sánchez R, Kapusta A, Feschotte C, Chiaromonte F, Makova KD. Genomic landscape of human, bat, and ex vivo DNA transposon integrations. Mol Biol Evol 2014; 31:1816-32. [PMID: 24809961 DOI: 10.1093/molbev/msu138] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The integration and fixation preferences of DNA transposons, one of the major classes of eukaryotic transposable elements, have never been evaluated comprehensively on a genome-wide scale. Here, we present a detailed study of the distribution of DNA transposons in the human and bat genomes. We studied three groups of DNA transposons that integrated at different evolutionary times: 1) ancient (>40 My) and currently inactive human elements, 2) younger (<40 My) bat elements, and 3) ex vivo integrations of piggyBat and Sleeping Beauty elements in HeLa cells. Although the distribution of ex vivo elements reflected integration preferences, the distribution of human and (to a lesser extent) bat elements was also affected by selection. We used regression techniques (linear, negative binomial, and logistic regression models with multiple predictors) applied to 20-kb and 1-Mb windows to investigate how the genomic landscape in the vicinity of DNA transposons contributes to their integration and fixation. Our models indicate that genomic landscape explains 16-79% of variability in DNA transposon genome-wide distribution. Importantly, we not only confirmed previously identified predictors (e.g., DNA conformation and recombination hotspots) but also identified several novel predictors (e.g., signatures of double-strand breaks and telomere hexamer). Ex vivo integrations showed a bias toward actively transcribed regions. Older DNA transposons were located in genomic regions scarce in most conserved elements-likely reflecting purifying selection. Our study highlights how DNA transposons are integral to the evolution of bat and human genomes, and has implications for the development of DNA transposon assays for gene therapy and mutagenesis applications.
Collapse
Affiliation(s)
- Rebeca Campos-Sánchez
- Genetics Program, The Huck Institutes of the Life Sciences, Penn State University, University Park, PA
| | - Aurélie Kapusta
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT
| | - Francesca Chiaromonte
- Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, PADepartment of Statistics, Penn State University, University Park, PA
| | - Kateryna D Makova
- Center for Medical Genomics, The Huck Institutes of the Life Sciences, Penn State University, University Park, PADepartment of Biology, Penn State University, University Park, PA
| |
Collapse
|
7
|
Visualized computational predictions of transcriptional effects by intronic endogenous retroviruses. PLoS One 2013; 8:e71971. [PMID: 23936536 PMCID: PMC3735543 DOI: 10.1371/journal.pone.0071971] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 07/05/2013] [Indexed: 11/19/2022] Open
Abstract
When endogenous retroviruses (ERVs) or other transposable elements (TEs) insert into an intron, the consequence on gene transcription can range from negligible to a complete ablation of normal transcripts. With the advance of sequencing technology, more and more insertionally polymorphic or private TE insertions are being identified in humans and mice, of which some could have a significant impact on host gene expression. Nevertheless, an efficient and low cost approach to prioritize their potential effect on gene transcription has been lacking. By building a computational model based on artificial neural networks (ANN), we demonstrate the feasibility of using machine-learning approaches to predict the likelihood that intronic ERV insertions will have major effects on gene transcription, focusing on the two ERV families, namely Intracisternal A-type Particle (IAP) and Early Transposon (ETn)/MusD elements, which are responsible for the majority of ERV-induced mutations in mice. We trained the ANN model using properties associated with these ERVs known to cause germ-line mutations (positive cases) and properties associated with likely neutral ERVs of the same families (negative cases), and derived a set of prediction plots that can visualize the likelihood of affecting gene transcription by ERV insertions. Our results show a highly reliable prediction power of our model, and offer a potential approach to computationally screen for other types of TE insertions that may affect gene transcription or even cause disease.
Collapse
|
8
|
Evolutionary rate of human tissue-specific genes are related with transposable element insertions. Genetica 2013; 140:513-23. [PMID: 23337972 DOI: 10.1007/s10709-013-9700-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Accepted: 01/12/2013] [Indexed: 01/05/2023]
Abstract
The influence of transposable elements (TEs) on genome evolution has been widely studied. However, it remains unclear whether TE insertions also impact on evolutionary rate of human genes. In this study, we have compared the differences in TEs and evolutionary rates between human tissue-specific genes. Our results showed that various functional categories of human tissue-specific genes contained different TE numbers and divergent values of Ka/Ks, with human nucleic acid binding transcription factor activity genes having the fewest TE density and Ka/Ks value. Interestingly, we also found that human tissue-specific genes with TEs have also undergone faster evolution than those without TEs. Therefore, TEs have significant impact on the evolutionary rates of human tissue-specific genes. Furthermore, local genomic properties such as gene length, GC content and recombination rate may reflect a true transpositional bias for the particular TEs. Our results may provide important insights for further elucidating the evolution of human tissue-specific genes.
Collapse
|
9
|
Lorite P, Maside X, Sanllorente O, Torres MI, Periquet G, Palomeque T. The ant genomes have been invaded by several types of mariner transposable elements. Naturwissenschaften 2012; 99:1007-20. [PMID: 23097152 DOI: 10.1007/s00114-012-0982-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2011] [Revised: 10/10/2012] [Accepted: 10/11/2012] [Indexed: 11/25/2022]
Abstract
To date, only three types of full-length mariner elements have been described in ants, each one in a different genus of the Myrmicinae subfamily: Sinvmar was isolated from various Solenopsis species, Myrmar from Myrmica ruginodis, and Mboumar from Messor bouvieri. In this study, we report the coexistence of three mariner elements (Tnigmar-Si, Tnigmar-Mr, and Tnigmar-Mb) in the genome of a single species, Tapinoma nigerrimum (subfamily Dolichoderinae). Molecular evolutionary analyses of the nucleotide sequence data revealed a general agreement between the evolutionary history of most the elements and the ant species that harbour them, and suggest that they are at the vertical inactivation stage of the so-called Mariner Life Cycle. In contrast, significantly reduced levels of synonymous divergence between Mboumar and Tnigmar-Mb and between Myrmar and Botmar (a mariner element isolated from Bombus terrestris), relative to those observed between their hosts, suggest that these elements arrived to the species that host them by horizontal transfer, long after the species' split. The horizontal transfer events for the two pairs of elements could be roughly dated within the last 2 million years and about 14 million years, respectively. As would be expected under this scenario, the coding sequences of the youngest elements, Tnigmar-Mb and Mboumar, are intact and, thus, potentially functional. Each mariner element has a different chromosomal distribution pattern according to their stage within the Mariner Life Cycle. Finally, a new defective transposable element (Azteca) has also been found inserted into the Tnigmar-Mr sequences showing that the ant genomes have been invaded by at least four different types of mariner elements.
Collapse
Affiliation(s)
- Pedro Lorite
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, 23071, Jaén, Spain
| | | | | | | | | | | |
Collapse
|
10
|
Testori A, Caizzi L, Cutrupi S, Friard O, De Bortoli M, Cora' D, Caselle M. The role of Transposable Elements in shaping the combinatorial interaction of Transcription Factors. BMC Genomics 2012; 13:400. [PMID: 22897927 PMCID: PMC3478180 DOI: 10.1186/1471-2164-13-400] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2012] [Accepted: 06/28/2012] [Indexed: 12/22/2022] Open
Abstract
Background In the last few years several studies have shown that Transposable Elements (TEs) in the human genome are significantly associated with Transcription Factor Binding Sites (TFBSs) and that in several cases their expansion within the genome led to a substantial rewiring of the regulatory network. Another important feature of the regulatory network which has been thoroughly studied is the combinatorial organization of transcriptional regulation. In this paper we combine these two observations and suggest that TEs, besides rewiring the network, also played a central role in the evolution of particular patterns of combinatorial gene regulation. Results To address this issue we searched for TEs overlapping Estrogen Receptor α (ERα) binding peaks in two publicly available ChIP-seq datasets from the MCF7 cell line corresponding to different modalities of exposure to estrogen. We found a remarkable enrichment of a few specific classes of Transposons. Among these a prominent role was played by MIR (Mammalian Interspersed Repeats) transposons. These TEs underwent a dramatic expansion at the beginning of the mammalian radiation and then stabilized. We conjecture that the special affinity of ERα for the MIR class of TEs could be at the origin of the important role assumed by ERα in Mammalians. We then searched for TFBSs within the TEs overlapping ChIP-seq peaks. We found a strong enrichment of a few precise combinations of TFBS. In several cases the corresponding Transcription Factors (TFs) were known cofactors of ERα, thus supporting the idea of a co-regulatory role of TFBS within the same TE. Moreover, most of these correlations turned out to be strictly associated to specific classes of TEs thus suggesting the presence of a well-defined "transposon code" within the regulatory network. Conclusions In this work we tried to shed light into the role of Transposable Elements (TEs) in shaping the regulatory network of higher eukaryotes. To test this idea we focused on a particular transcription factor: the Estrogen Receptor α (ERα) and we found that ERα preferentially targets a well defined set of TEs and that these TEs host combinations of transcriptional regulators involving several of known co-regulators of ERα. Moreover, a significant number of these TEs turned out to be conserved between human and mouse and located in the vicinity (and thus candidate to be regulators) of important estrogen-related genes.
Collapse
Affiliation(s)
- Alessandro Testori
- Center for Molecular Systems Biology, University of Turin, Turin, Candiolo I-10060, Italy.
| | | | | | | | | | | | | |
Collapse
|
11
|
Wang D, Su Y, Wang X, Lei H, Yu J. Transposon-derived and satellite-derived repetitive sequences play distinct functional roles in Mammalian intron size expansion. Evol Bioinform Online 2012; 8:301-19. [PMID: 22807622 PMCID: PMC3396637 DOI: 10.4137/ebo.s9758] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Background Repetitive sequences (RSs) are redundant, complex at times, and often lineage-specific, representing significant “building” materials for genes and genomes. According to their origins, sequence characteristics, and ways of propagation, repetitive sequences are divided into transposable elements (TEs) and satellite sequences (SSs) as well as related subfamilies and subgroups hierarchically. The combined changes attributable to the repetitive sequences alter gene and genome architectures, such as the expansion of exonic, intronic, and intergenic sequences, and most of them propagate in a seemingly random fashion and contribute very significantly to the entire mutation spectrum of mammalian genomes. Principal findings Our analysis is focused on evolutional features of TEs and SSs in the intronic sequence of twelve selected mammalian genomes. We divided them into four groups—primates, large mammals, rodents, and primary mammals—and used four non-mammalian vertebrate species as the out-group. After classifying intron size variation in an intron-centric way based on RS-dominance (TE-dominant or SS-dominant intron expansions), we observed several distinct profiles in intron length and positioning in different vertebrate lineages, such as retrotransposon-dominance in mammals and DNA transposon-dominance in the lower vertebrates, amphibians and fishes. The RS patterns of mouse and rat genes are most striking, which are not only distinct from those of other mammals but also different from that of the third rodent species analyzed in this study—guinea pig. Looking into the biological functions of relevant genes, we observed a two-dimensional divergence; in particular, genes that possess SS-dominant and/or RS-free introns are enriched in tissue-specific development and transcription regulation in all mammalian lineages. In addition, we found that the tendency of transposons in increasing intron size is much stronger than that of satellites, and the combined effect of both RSs is greater than either one of them alone in a simple arithmetic sum among the mammals and the opposite is found among the four non-mammalian vertebrates. Conclusions TE- and SS-derived RSs represent major mutational forces shaping the size and composition of vertebrate genes and genomes, and through natural selection they either fine-tune or facilitate changes in size expansion, position variation, and duplication, and thus in functions and evolutionary paths for better survival and fitness. When analyzed globally, not only are such changes significantly diversified but also comprehensible in lineages and biological implications.
Collapse
Affiliation(s)
- Dapeng Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, P.R. China
| | | | | | | | | |
Collapse
|
12
|
Zhang Y, Mager DL. Gene properties and chromatin state influence the accumulation of transposable elements in genes. PLoS One 2012; 7:e30158. [PMID: 22272293 PMCID: PMC3260225 DOI: 10.1371/journal.pone.0030158] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Accepted: 12/14/2011] [Indexed: 12/03/2022] Open
Abstract
Transposable elements (TEs) are mobile DNA sequences found in the genomes of almost all species. By measuring the normalized coverage of TE sequences within genes, we identified sets of genes with conserved extremes of high/low TE density in the genomes of human, mouse and cow and denoted them as ‘shared upper/lower outliers (SUOs/SLOs)’. By comparing these outlier genes to the genomic background, we show that a large proportion of SUOs are involved in metabolic pathways and tend to be mammal-specific, whereas many SLOs are related to developmental processes and have more ancient origins. Furthermore, the proportions of different types of TEs within human and mouse orthologous SUOs showed high similarity, even though most detectable TEs in these two genomes inserted after their divergence. Interestingly, our computational analysis of polymerase-II (Pol-II) occupancy at gene promoters in different mouse tissues showed that 60% of tissue-specific SUOs show strong Pol-II binding only in embryonic stem cells (ESCs), a proportion significantly higher than the genomic background (37%). In addition, our analysis of histone marks such as H3K4me3 and H3K27me3 in mouse ESCs also suggest a strong association between TE-rich genes and open-chromatin at promoters. Finally, two independent whole-transcriptome datasets show a positive association between TE density and gene expression level in ESCs. While this study focuses on genes with extreme TE densities, the above results clearly show that the probability of TE accumulation/fixation in mammalian genes is not random and is likely associated with different factors/gene properties and, most importantly, an association between the TE insertion/fixation rate and gene activity status in ES cells.
Collapse
Affiliation(s)
- Ying Zhang
- Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, British Columbia, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Dixie L. Mager
- Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, British Columbia, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
- * E-mail:
| |
Collapse
|
13
|
Jjingo D, Huda A, Gundapuneni M, Mariño-Ramírez L, Jordan IK. Effect of the transposable element environment of human genes on gene length and expression. Genome Biol Evol 2011; 3:259-71. [PMID: 21362639 PMCID: PMC3070429 DOI: 10.1093/gbe/evr015] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Independent lines of investigation have documented effects of both transposable elements (TEs) and gene length (GL) on gene expression. However, TE gene fractions are highly correlated with GL, suggesting that they cannot be considered independently. We evaluated the TE environment of human genes and GL jointly in an attempt to tease apart their relative effects. TE gene fractions and GL were compared with the overall level of gene expression and the breadth of expression across tissues. GL is strongly correlated with overall expression level but weakly correlated with the breadth of expression, confirming the selection hypothesis that attributes the compactness of highly expressed genes to selection for economy of transcription. However, TE gene fractions overall, and for the L1 family in particular, show stronger anticorrelations with expression level than GL, indicating that GL may not be the most important target of selection for transcriptional economy. These results suggest a specific mechanism, removal of TEs, by which highly expressed genes are selectively tuned for efficiency. MIR elements are the only family of TEs with gene fractions that show a positive correlation with tissue-specific expression, suggesting that they may provide regulatory sequences that help to control human gene expression. Consistent with this notion, MIR fractions are relatively enriched close to transcription start sites and associated with coexpression in specific sets of related tissues. Our results confirm the overall relevance of the TE environment to gene expression and point to distinct mechanisms by which different TE families may contribute to gene regulation.
Collapse
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology, GA, USA
| | | | | | | | | |
Collapse
|
14
|
Mortada H, Vieira C, Lerat E. Genes devoid of full-length transposable element insertions are involved in development and in the regulation of transcription in human and closely related species. J Mol Evol 2010; 71:180-91. [PMID: 20798934 DOI: 10.1007/s00239-010-9376-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2010] [Accepted: 07/26/2010] [Indexed: 02/04/2023]
Abstract
Transposable elements (TEs) are major components of mammalian genomes, and their impact on genome evolution is now well established. In recent years several findings have shown that they are associated with the expression level and function of genes. In this study, we analyze the relationships between human genes and full-length TE copies in terms of three factors (gene function, expression level, and selective pressure). We classified human genes according to their TE density, and found that TE-free genes are involved in important functions such as development, transcription, and the regulation of transcription, whereas TE-rich genes are involved in functions such as transport and metabolism. This trend is conserved through evolution. We show that this could be explained by a stronger selection pressure acting on both the coding and non-coding regions of TE-free genes than on those of TE-rich genes. The higher level of expression found for TE-rich genes in tumor and immune system tissues suggests that TEs play an important role in gene regulation.
Collapse
|
15
|
Sela N, Mersch B, Hotz-Wagenblatt A, Ast G. Characteristics of transposable element exonization within human and mouse. PLoS One 2010; 5:e10907. [PMID: 20532223 PMCID: PMC2879366 DOI: 10.1371/journal.pone.0010907] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Accepted: 05/06/2010] [Indexed: 12/16/2022] Open
Abstract
Insertion of transposed elements within mammalian genes is thought to be an important contributor to mammalian evolution and speciation. Insertion of transposed elements into introns can lead to their activation as alternatively spliced cassette exons, an event called exonization. Elucidation of the evolutionary constraints that have shaped fixation of transposed elements within human and mouse protein coding genes and subsequent exonization is important for understanding of how the exonization process has affected transcriptome and proteome complexities. Here we show that exonization of transposed elements is biased towards the beginning of the coding sequence in both human and mouse genes. Analysis of single nucleotide polymorphisms (SNPs) revealed that exonization of transposed elements can be population-specific, implying that exonizations may enhance divergence and lead to speciation. SNP density analysis revealed differences between Alu and other transposed elements. Finally, we identified cases of primate-specific Alu elements that depend on RNA editing for their exonization. These results shed light on TE fixation and the exonization process within human and mouse genes.
Collapse
Affiliation(s)
- Noa Sela
- Department of Human Molecular Genetics, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Britta Mersch
- Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Agnes Hotz-Wagenblatt
- Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- * E-mail: (GA); (AHW)
| | - Gil Ast
- Department of Human Molecular Genetics, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- * E-mail: (GA); (AHW)
| |
Collapse
|
16
|
Shen-Orr SS, Pilpel Y, Hunter CP. Composition and regulation of maternal and zygotic transcriptomes reflects species-specific reproductive mode. Genome Biol 2010; 11:R58. [PMID: 20515465 PMCID: PMC2911106 DOI: 10.1186/gb-2010-11-6-r58] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2009] [Revised: 04/23/2010] [Accepted: 06/01/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Early embryos contain mRNA transcripts expressed from two distinct origins; those expressed from the mother's genome and deposited in the oocyte (maternal) and those expressed from the embryo's genome after fertilization (zygotic). The transition from maternal to zygotic control occurs at different times in different animals according to the extent and form of maternal contributions, which likely reflect evolutionary and ecological forces. Maternally deposited transcripts rely on post-transcriptional regulatory mechanisms for precise spatial and temporal expression in the embryo, whereas zygotic transcripts can use both transcriptional and post-transcriptional regulatory mechanisms. The differences in maternal contributions between animals may be associated with gene regulatory changes detectable by the size and complexity of the associated regulatory regions. RESULTS We have used genomic data to identify and compare maternal and/or zygotic expressed genes from six different animals and find evidence for selection acting to shape gene regulatory architecture in thousands of genes. We find that mammalian maternal genes are enriched for complex regulatory regions, suggesting an increase in expression specificity, while egg-laying animals are enriched for maternal genes that lack transcriptional specificity. CONCLUSIONS We propose that this lack of specificity for maternal expression in egg-laying animals indicates that a large fraction of maternal genes are expressed non-functionally, providing only supplemental nutritional content to the developing embryo. These results provide clear predictive criteria for analysis of additional genomes.
Collapse
Affiliation(s)
- Shai S Shen-Orr
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Ave, Cambridge, MA 02138, USA
| | | | | |
Collapse
|
17
|
Babenko VN, Makunin IV, Brusentsova IV, Belyaeva ES, Maksimov DA, Belyakin SN, Maroy P, Vasil'eva LA, Zhimulev IF. Paucity and preferential suppression of transgenes in late replication domains of the D. melanogaster genome. BMC Genomics 2010; 11:318. [PMID: 20492674 PMCID: PMC2887417 DOI: 10.1186/1471-2164-11-318] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2010] [Accepted: 05/21/2010] [Indexed: 01/17/2023] Open
Abstract
Background Eukaryotic genomes are organized in extended domains with distinct features intimately linking genome structure, replication pattern and chromatin state. Recently we identified a set of long late replicating euchromatic regions that are underreplicated in salivary gland polytene chromosomes of D. melanogaster. Results Here we demonstrate that these underreplicated regions (URs) have a low density of P-element and piggyBac insertions compared to the genome average or neighboring regions. In contrast, Minos-based transposons show no paucity in URs but have a strong bias to testis-specific genes. We estimated the suppression level in 2,852 stocks carrying a single P-element by analysis of eye color determined by the mini-white marker gene and demonstrate that the proportion of suppressed transgenes in URs is more than three times higher than in the flanking regions or the genomic average. The suppressed transgenes reside in intergenic, genic or promoter regions of the annotated genes. We speculate that the low insertion frequency of P-elements and piggyBacs in URs partially results from suppression of transgenes that potentially could prevent identification of transgenes due to complete suppression of the marker gene. In a similar manner, the proportion of suppressed transgenes is higher in loci replicating late or very late in Kc cells and these loci have a lower density of P-elements and piggyBac insertions. In transgenes with two marker genes suppression of mini-white gene in eye coincides with suppression of yellow gene in bristles. Conclusions Our results suggest that the late replication domains have a high inactivation potential apparently linked to the silenced or closed chromatin state in these regions, and that such inactivation potential is largely maintained in different tissues.
Collapse
Affiliation(s)
- Vladimir N Babenko
- Department of Molecular and Cellular Biology, Institute of Chemical Biology and Fundamental Medicine SB RAS, Novosibirsk, 630090, Russia
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Kvikstad EM, Makova KD. The (r)evolution of SINE versus LINE distributions in primate genomes: sex chromosomes are important. Genome Res 2010; 20:600-13. [PMID: 20219940 DOI: 10.1101/gr.099044.109] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The densities of transposable elements (TEs) in the human genome display substantial variation both within individual chromosomes and among chromosome types (autosomes and the two sex chromosomes). Finding an explanation for this variability has been challenging, especially in light of genome landscapes unique to the sex chromosomes. Here, using a multiple regression framework, we investigate primate Alu and L1 densities shaped by regional genome features and location on a particular chromosome type. As a result of our analysis, first, we build statistical models explaining up to 79% and 44% of variation in Alu and L1 element density, respectively. Second, we analyze sex chromosome versus autosome TE densities corrected for regional genomic effects. We discover that sex-chromosome bias in Alu and L1 distributions not only persists after accounting for these effects, but even presents differences in patterns, confirming preferential Alu integration in the male germline, yet likely integration of L1s in both male and female germlines or in early embryogenesis. Additionally, our models reveal that local base composition (measured by GC content and density of L1 target sites) and natural selection (inferred via density of most conserved elements) are significant to predicting densities of L1s. Interestingly, measurements of local double-stranded breaks (a 13-mer associated with genome instability) strongly correlate with densities of Alu elements; little evidence was found for the role of recombination-driven deletion in driving TE distributions over evolutionary time. Thus, Alu and L1 densities have been influenced by the combination of distinct local genome landscapes and the unique evolutionary dynamics of sex chromosomes.
Collapse
Affiliation(s)
- Erika M Kvikstad
- Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania 16802, USA.
| | | |
Collapse
|
19
|
Levy A, Schwartz S, Ast G. Large-scale discovery of insertion hotspots and preferential integration sites of human transposed elements. Nucleic Acids Res 2009; 38:1515-30. [PMID: 20008508 PMCID: PMC2836564 DOI: 10.1093/nar/gkp1134] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Throughout evolution, eukaryotic genomes have been invaded by transposable elements (TEs). Little is known about the factors leading to genomic proliferation of TEs, their preferred integration sites and the molecular mechanisms underlying their insertion. We analyzed hundreds of thousands nested TEs in the human genome, i.e. insertions of TEs into existing ones. We first discovered that most TEs insert within specific ‘hotspots’ along the targeted TE. In particular, retrotransposed Alu elements contain a non-canonical single nucleotide hotspot for insertion of other Alu sequences. We next devised a method for identification of integration sequence motifs of inserted TEs that are conserved within the targeted TEs. This method revealed novel sequences motifs characterizing insertions of various important TE families: Alu, hAT, ERV1 and MaLR. Finally, we performed a global assessment to determine the extent to which young TEs tend to nest within older transposed elements and identified a 4-fold higher tendency of TEs to insert into existing TEs than to insert within non-TE intergenic regions. Our analysis demonstrates that TEs are highly biased to insert within certain TEs, in specific orientations and within specific targeted TE positions. TE nesting events also reveal new characteristics of the molecular mechanisms underlying transposition.
Collapse
Affiliation(s)
- Asaf Levy
- Department of Human Molecular Genetics and Biochemistry, Tel-Aviv University, Ramat Aviv 69978, Israel
| | | | | |
Collapse
|
20
|
Vorechovsky I. Transposable elements in disease-associated cryptic exons. Hum Genet 2009; 127:135-54. [PMID: 19823873 DOI: 10.1007/s00439-009-0752-4] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 09/27/2009] [Indexed: 11/28/2022]
Abstract
Transposable elements (TEs) make up a half of the human genome, but the extent of their contribution to cryptic exon activation that results in genetic disease is unknown. Here, a comprehensive survey of 78 mutation-induced cryptic exons previously identified in 51 disease genes revealed the presence of TEs in 40 cases (51%). Most TE-containing exons were derived from short interspersed nuclear elements (SINEs), with Alus and mammalian interspersed repeats (MIRs) covering >18 and >16% of the exonized sequences, respectively. The majority of SINE-derived cryptic exons had splice sites at the same positions of the Alu/MIR consensus as existing SINE exons and their inclusion in the mRNA was facilitated by phylogenetically conserved changes that improved both traditional and auxiliary splicing signals, thus marking intronic TEs amenable for pathogenic exonization. The overrepresentation of MIRs among TE exons is likely to result from their high average exon inclusion levels, which reflect their strong splice sites, a lack of splicing silencers and a high density of enhancers, particularly (G)AA(G) motifs. These elements were markedly depleted in antisense Alu exons, had the most prominent position on the exon-intron gradient scale and are proposed to promote exon definition through enhanced tertiary RNA interactions involving unpaired (di)adenosines. The identification of common mechanisms by which the most dynamic parts of the genome contribute both to new exon creation and genetic disease will facilitate detection of intronic mutations and the development of computational tools that predict TE hot-spots of cryptic exon activation.
Collapse
Affiliation(s)
- Igor Vorechovsky
- Division of Human Genetics, University of Southampton School of Medicine, MP808, Tremona Road, Southampton SO16 6YD, UK.
| |
Collapse
|
21
|
Chen JJ, Wang Y. [Recent progress in plant genome size evolution]. YI CHUAN = HEREDITAS 2009; 31:464-470. [PMID: 19586839 DOI: 10.3724/sp.j.1005.2009.00464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
It has been known that eukaryotic genomes span a wide range of sizes regardless of organism complexity. The observed differences in genome size are primarily due to polyploidy level and abundance of non-coding DNA, especially the contribution of transposable elements (TEs). Here we reviewed the current progress in genome size variation of plant species and the underlying evolutionary forces that contribute to genome expansion or contraction. Polyploidization and the accumulation of transposable element are the primary contributors to genome expansion. As to the mechanisms of DNA loss, unequal homologous recombination and illegitimate recombination are thought to be the counterbalances to the unlimited expansion of a genome. The evolutionary direction of plant genome size is also discussed, which tends to favor larger genomes with deletion mechanisms acting to only attenuate genome expansion but not reverse.
Collapse
Affiliation(s)
- Jian-Jun Chen
- Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China.
| | | |
Collapse
|
22
|
Does selection against transcriptional interference shape retroelement-free regions in mammalian genomes? PLoS One 2008; 3:e3760. [PMID: 19018283 PMCID: PMC2582637 DOI: 10.1371/journal.pone.0003760] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2008] [Accepted: 10/31/2008] [Indexed: 11/29/2022] Open
Abstract
Background Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic regions being intolerant to insertions of retroelements. The inadvertent transcriptional activity of retroelements may affect neighbouring genes, which in turn could be detrimental to an organism. We speculate that such retroelement transcription, or transcriptional interference, is a contributing factor in generating and maintaining retroelement-free regions in the human genome. Methodology/Principal Findings Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs) to be able to display a high degree of transcriptional interference. In contrast, we expect short interspersed elements (SINEs) to display very low levels of transcriptional interference. We find that genomic regions devoid of long interspersed elements (LINEs) are enriched for protein-coding genes, but that this is not the case for regions devoid of short interspersed elements (SINEs). This is expected if genes are subject to selection against transcriptional interference. We do not find microRNAs to be associated with genomic regions devoid of either SINEs or LINEs. We further observe an increased relative activity of genes overlapping LINE-free regions during early embryogenesis, where activity of LINEs has been identified previously. Conclusions/Significance Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome.
Collapse
|
23
|
Lee JY, Ji Z, Tian B. Phylogenetic analysis of mRNA polyadenylation sites reveals a role of transposable elements in evolution of the 3'-end of genes. Nucleic Acids Res 2008; 36:5581-90. [PMID: 18757892 PMCID: PMC2553571 DOI: 10.1093/nar/gkn540] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
mRNA polyadenylation is an essential step for the maturation of almost all eukaryotic mRNAs, and is tightly coupled with termination of transcription in defining the 3′-end of genes. Large numbers of human and mouse genes harbor alternative polyadenylation sites [poly(A) sites] that lead to mRNA variants containing different 3′-untranslated regions (UTRs) and/or encoding distinct protein sequences. Here, we examined the conservation and divergence of different types of alternative poly(A) sites across human, mouse, rat and chicken. We found that the 3′-most poly(A) sites tend to be more conserved than upstream ones, whereas poly(A) sites located upstream of the 3′-most exon, also termed intronic poly(A) sites, tend to be much less conserved. Genes with longer evolutionary history are more likely to have alternative polyadenylation, suggesting gain of poly(A) sites through evolution. We also found that nonconserved poly(A) sites are associated with transposable elements (TEs) to a much greater extent than conserved ones, albeit less frequently utilized. Different classes of TEs have different characteristics in their association with poly(A) sites via exaptation of TE sequences into polyadenylation elements. Our results establish a conservation pattern for alternative poly(A) sites in several vertebrate species, and indicate that the 3′-end of genes can be dynamically modified by TEs through evolution.
Collapse
Affiliation(s)
- Ju Youn Lee
- Graduate School of Biomedical Sciences and Department of Biochemistry and Molecular Biology, New Jersey Medical School, University of Medicine and Dentistry of New Jersey, Newark, NJ 07103, USA
| | | | | |
Collapse
|
24
|
Gayral P, Noa-Carrazana JC, Lescot M, Lheureux F, Lockhart BEL, Matsumoto T, Piffanelli P, Iskra-Caruana ML. A single Banana streak virus integration event in the banana genome as the origin of infectious endogenous pararetrovirus. J Virol 2008; 82:6697-710. [PMID: 18417582 PMCID: PMC2447048 DOI: 10.1128/jvi.00212-08] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2008] [Accepted: 04/07/2008] [Indexed: 12/15/2022] Open
Abstract
Sequencing of plant nuclear genomes reveals the widespread presence of integrated viral sequences known as endogenous pararetroviruses (EPRVs). Banana is one of the three plant species known to harbor infectious EPRVs. Musa balbisiana carries integrated copies of Banana streak virus (BSV), which are infectious by releasing virions in interspecific hybrids. Here, we analyze the organization of the EPRV of BSV Goldfinger (BSGfV) present in the wild diploid M. balbisiana cv. Pisang Klutuk Wulung (PKW) revealed by the study of Musa bacterial artificial chromosome resources and interspecific genetic cross. cv. PKW contains two similar EPRVs of BSGfV. Genotyping of these integrants and studies of their segregation pattern show an allelic insertion. Despite the fact that integrated BSGfV has undergone extensive rearrangement, both EPRVs contain the full-length viral genome. The high degree of sequence conservation between the integrated and episomal form of the virus indicates a recent integration event; however, only one allele is infectious. Analysis of BSGfV EPRV segregation among an F1 population from an interspecific genetic cross revealed that these EPRV sequences correspond to two alleles originating from a single integration event. We describe here for the first time the full genomic and genetic organization of the two EPRVs of BSGfV present in cv. PKW in response to the challenge facing both scientists and breeders to identify and generate genetic resources free from BSV. We discuss the consequences of this unique host-pathogen interaction in terms of genetic and genomic plant defenses versus strategies of infectious BSGfV EPRVs.
Collapse
Affiliation(s)
- Philippe Gayral
- CIRAD BIOS, UMR BGPI, Campus International de Baillarguet, TA A-54/K, 34398 Montpellier Cedex 5, France
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Arnoldi A, Tonelli A, Crippa F, Villani G, Pacelli C, Sironi M, Pozzoli U, D'Angelo MG, Meola G, Martinuzzi A, Crimella C, Redaelli F, Panzeri C, Renieri A, Comi GP, Turconi AC, Bresolin N, Bassi MT. A clinical, genetic, and biochemical characterization of SPG7 mutations in a large cohort of patients with hereditary spastic paraplegia. Hum Mutat 2008; 29:522-31. [PMID: 18200586 DOI: 10.1002/humu.20682] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Mutations in the SPG7 gene encoding a mitochondrial protein termed paraplegin, are responsible for a recessive form of hereditary spastic paraparesis. Only few studies have so far been performed in large groups of hereditary spastic paraplegia (HSP) patients to determine the frequency of SPG7 mutations. Here, we report the result of a mutation screening conducted in a large cohort of 135 Italian HSP patients with the identification of six novel point mutations and one large intragenic deletion. Sequence analysis of the deletion breakpoint, together with secondary structure predictions of the deleted region, indicate that a complex rearrangement, likely caused by extensive secondary structure formation mediated by the short interspersed nuclear element (SINE) retrotransposons, is responsible for the deletion event. Biochemical studies performed on fibroblasts from three mutant patients revealed mild and heterogeneous mitochondrial dysfunctions that would exclude a specific association of a complex I defect with the pathology at the fibroblast level. Overall, our data confirm that SPG7 point mutations are rare causes of HSP, in both sporadic and familial forms, while underlying the puzzling and intriguing aspects of histological and biochemical consequences of paraplegin loss.
Collapse
Affiliation(s)
- Alessia Arnoldi
- Laboratory of Molecular Biology, Scientific Institute E. Medea, Bosisio Parini, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Sela N, Mersch B, Gal-Mark N, Lev-Maor G, Hotz-Wagenblatt A, Ast G. Comparative analysis of transposed element insertion within human and mouse genomes reveals Alu's unique role in shaping the human transcriptome. Genome Biol 2008; 8:R127. [PMID: 17594509 PMCID: PMC2394776 DOI: 10.1186/gb-2007-8-6-r127] [Citation(s) in RCA: 181] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2007] [Revised: 06/07/2007] [Accepted: 06/27/2007] [Indexed: 01/31/2023] Open
Abstract
Analysis of transposed elements in the human and mouse genomes reveals many effects on the transcriptomes, including a higher level of exonization of Alu elements than other elements. Background Transposed elements (TEs) have a substantial impact on mammalian evolution and are involved in numerous genetic diseases. We compared the impact of TEs on the human transcriptome and the mouse transcriptome. Results We compiled a dataset of all TEs in the human and mouse genomes, identifying 3,932,058 and 3,122,416 TEs, respectively. We than extracted TEs located within human and mouse genes and, surprisingly, we found that 60% of TEs in both human and mouse are located in intronic sequences, even though introns comprise only 24% of the human genome. All TE families in both human and mouse can exonize. TE families that are shared between human and mouse exhibit the same percentage of TE exonization in the two species, but the exonization level of Alu, a primate-specific retroelement, is significantly greater than that of other TEs within the human genome, leading to a higher level of TE exonization in human than in mouse (1,824 exons compared with 506 exons, respectively). We detected a primate-specific mechanism for intron gain, in which Alu insertion into an exon creates a new intron located in the 3' untranslated region (termed 'intronization'). Finally, the insertion of TEs into the first and last exons of a gene is more frequent in human than in mouse, leading to longer exons in human. Conclusion Our findings reveal many effects of TEs on these two transcriptomes. These effects are substantially greater in human than in mouse, which is due to the presence of Alu elements in human.
Collapse
Affiliation(s)
- Noa Sela
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Britta Mersch
- HUSAR Bioinformatics Lab, Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld, D-69120 Heidelberg, Germany
| | - Nurit Gal-Mark
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Galit Lev-Maor
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Agnes Hotz-Wagenblatt
- HUSAR Bioinformatics Lab, Department of Molecular Biophysics, German Cancer Research Center (DKFZ), Im Neuenheimer Feld, D-69120 Heidelberg, Germany
| | - Gil Ast
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| |
Collapse
|
27
|
Simons C, Makunin IV, Pheasant M, Mattick JS. Maintenance of transposon-free regions throughout vertebrate evolution. BMC Genomics 2007; 8:470. [PMID: 18093339 PMCID: PMC2241635 DOI: 10.1186/1471-2164-8-470] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2007] [Accepted: 12/20/2007] [Indexed: 01/23/2023] Open
Abstract
Background We recently reported the existence of large numbers of regions up to 80 kb long that lack transposon insertions in the human, mouse and opossum genomes. These regions are significantly associated with loci involved in developmental and transcriptional regulation. Results Here we report that transposon-free regions (TFRs) are prominent genomic features of amphibian and fish lineages, and that many have been maintained throughout vertebrate evolution, although most transposon-derived sequences have entered these lineages after their divergence. The zebrafish genome contains 470 TFRs over 10 kb and a further 3,951 TFRs over 5 kb, which is comparable to the number identified in mammals. Two thirds of zebrafish TFRs over 10 kb are orthologous to TFRs in at least one mammal, and many have orthologous TFRs in all three mammalian genomes as well as in the genome of Xenopus tropicalis. This indicates that the mechanism responsible for the maintenance of TFRs has been active at these loci for over 450 million years. However, the majority of TFR bases cannot be aligned between distantly related species, demonstrating that TFRs are not the by-product of strong primary sequence conservation. Syntenically conserved TFRs are also more enriched for regulatory genes compared to lineage-specific TFRs. Conclusion We suggest that TFRs contain extended regulatory sequences that contribute to the precise expression of genes central to early vertebrate development, and can be used as predictors of important regulatory regions.
Collapse
Affiliation(s)
- Cas Simons
- Australian Research Council Special Research Center for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St Lucia QLD 4072, Australia.
| | | | | | | |
Collapse
|