1
|
Villarreal LP, Witzany G. Editorial: Genome Invading RNA Networks. Front Microbiol 2018; 9:581. [PMID: 29651278 PMCID: PMC5885774 DOI: 10.3389/fmicb.2018.00581] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2018] [Accepted: 03/14/2018] [Indexed: 12/17/2022] Open
Affiliation(s)
- Luis P Villarreal
- Center for Virus Research, University of California, Irvine, Irvine, CA, United States
| | | |
Collapse
|
2
|
Moolhuijzen P, Kulski JK, Dunn DS, Schibeci D, Barrero R, Gojobori T, Bellgard M. The transcript repeat element: the human Alu sequence as a component of gene networks influencing cancer. Funct Integr Genomics 2016; 10:307-19. [PMID: 20393868 DOI: 10.1007/s10142-010-0168-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
A small percentage (3%) of the 1.3 million copies of Alu sequences in the human genome is expressed individually or as part of various gene transcripts with potential regulatory and pathophysiological importance. In order to better understand the role of repetitive elements within transcripts, this review focuses on Alu-containing transcripts of normal and cancerous tissue in a transcriptome-wide survey of the H-Invitational human transcript database on 106,825 tissue-derived transcripts expressed at 29,979 loci. The Alu elements in transcripts of cancerous tissues are significantly underrepresented in comparison to those in normal tissues. In this review, we propose a model for Alu-mediated siRNA down-regulation of Alu-containing transcripts in cancer tissues. In cancer or other rapidly dividing tissues, hypomethylation of repeat element regions triggers the expression of transposon elements including Alu, which can potentially form double-stranded RNA molecules for use as templates to generate Alu-derived siRNAs (Alu-siRNAs). The generated Alu-siRNAs target endogenous messenger RNAs harbouring sequence similarity to Alu elements. This model correlates with the observation that there is substantial under-representation of Alu-containing mRNAs in cancer cells. This new perspective of gene regulation in disease conditions can provide a basis for starting to account for changes in complex gene network in cancer.
Collapse
Affiliation(s)
- Paula Moolhuijzen
- Centre for Comparative Genomics, School for Information Technology, Murdoch University, Murdoch, WA, Australia
| | | | | | | | | | | | | |
Collapse
|
3
|
Abstract
DNA transposases are enzymes that catalyze the movement of discrete pieces of DNA from one location in the genome to another. Transposition occurs through a series of controlled DNA strand cleavage and subsequent integration reactions that are carried out by nucleoprotein complexes known as transpososomes. Transpososomes are dynamic assemblies which must undergo conformational changes that control DNA breaks and ensure that, once started, the transposition reaction goes to completion. They provide a precise architecture within which the chemical reactions involved in transposon movement occur, but adopt different conformational states as transposition progresses. Their components also vary as they must, at some stage, include target DNA and sometimes even host-encoded proteins. A very limited number of transpososome states have been crystallographically captured, and here we provide an overview of the various structures determined to date. These structures include examples of DNA transposases that catalyze transposition by a cut-and-paste mechanism using an RNaseH-like nuclease catalytic domain, those that transpose using only single-stranded DNA substrates and targets, and the retroviral integrases that carry out an integration reaction very similar to DNA transposition. Given that there are a number of common functional requirements for transposition, it is remarkable how these are satisfied by complex assemblies that are so architecturally different.
Collapse
|
4
|
Villarreal LP, Witzany G. The DNA Habitat and its RNA Inhabitants: At the Dawn of RNA Sociology. GENOMICS INSIGHTS 2013; 6:1-12. [PMID: 26217106 PMCID: PMC4510605 DOI: 10.4137/gei.s11490] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Most molecular biological concepts derive from physical chemical assumptions about the genetic code that are basically more than 40 years old. Additionally, systems biology, another quantitative approach, investigates the sum of interrelations to obtain a more holistic picture of nucleotide sequence order. Recent empirical data on genetic code compositions and rearrangements by mobile genetic elements and noncoding RNAs, together with results of virus research and their role in evolution, does not really fit into these concepts and compel a reexamination. In this review, we try to find an alternate hypothesis. It seems plausible now that if we look at the abundance of regulatory RNAs and persistent viruses in host genomes, we will find more and more evidence that the key players that edit the genetic codes of host genomes are consortia of RNA agents and viruses that drive evolutionary novelty and regulation of cellular processes in all steps of development. This agent-based approach may lead to a qualitative RNA sociology that investigates and identifies relevant behavioral motifs of cooperative RNA consortia. In addition to molecular biological perspectives, this may lead to a better understanding of genetic code evolution and dynamics.
Collapse
Affiliation(s)
- Luis P Villarreal
- Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA
| | | |
Collapse
|
5
|
Transposable elements domesticated and neofunctionalized by eukaryotic genomes. Plasmid 2013; 69:1-15. [DOI: 10.1016/j.plasmid.2012.08.001] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2012] [Revised: 07/30/2012] [Accepted: 08/08/2012] [Indexed: 12/21/2022]
|
6
|
Palmer SJ, Taylor KM, Santucci N, Widagdo J, Chan YKA, Yeo JL, Adams M, Gunning PW, Hardeman EC. GTF2IRD2 from the Williams-Beuren critical region encodes a mobile-element-derived fusion protein that antagonizes the action of its related family members. J Cell Sci 2012; 125:5040-50. [PMID: 22899722 DOI: 10.1242/jcs.102798] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
GTF2IRD2 belongs to a family of transcriptional regulators (including TFII-I and GTF2IRD1) that are responsible for many of the key features of Williams-Beuren syndrome (WBS). Sequence evidence suggests that GTF2IRD2 arose in eutherian mammals by duplication and divergence from the gene encoding TFII-I. However, in GTF2IRD2, most of the C-terminal domain has been lost and replaced by the domesticated remnant of an in-frame hAT-transposon mobile element. In this first experimental analysis of function, we show that transgenic expression of each of the three family members in skeletal muscle causes significant fiber type shifts, but the GTF2IRD2 protein causes an extreme shift in the opposite direction to the two other family members. Mating of GTF2IRD1 and GTF2IRD2 mice restores the fiber type balance, indicating an antagonistic relationship between these two paralogs. In cells, GTF2IRD2 localizes to cytoplasmic microtubules and discrete speckles in the nuclear periphery. We show that it can interact directly with TFII-Iβ and GTF2IRD1, and upon co-transfection changes the normal distribution of these two proteins into a punctate nuclear pattern typical of GTF2IRD2. These data suggest that GTF2IRD2 has evolved as a regulator of GTF2IRD1 and TFII-I; inhibiting their function by direct interaction and sequestration into inactive nuclear zones.
Collapse
Affiliation(s)
- Stephen J Palmer
- Neuromuscular and Regenerative Medicine Unit, School of Medical Sciences, The University of New South Wales, Sydney 2052, Australia.
| | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Xu HE, Zhang HH, Han MJ, Shen YH, Huang XZ, Xiang ZH, Zhang Z. [Computational approaches for identification and classification of transposable elements in eukaryotic genomes]. YI CHUAN = HEREDITAS 2012; 34:1009-1019. [PMID: 22917906 DOI: 10.3724/sp.j.1005.2012.01009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Repetitive sequences (repeats) represent a significant fraction of the eukaryotic genomes and can be divided into tandem repeats, segmental duplications, and interspersed repeats on the basis of their sequence characteristics and how they are formed. Most interspersed repeats are derived from transposable elements (TEs). Eukaryotic TEs have been subdivided into two major classes according to the intermediate they use to move. The transposition and amplification of TEs have a great impact on the evolution of genes and the stability of genomes. However, identification and classification of TEs are complex and difficult due to the fact that their structure and classification are complex and diverse compared with those of other types of repeats. Here, we briefly introduced the function and classification of TEs, and summarized three different steps for identification, classification and annotation of TEs in eukaryotic genomes: (1) assembly of a repeat library, (2) repeat correction and classification, and (3) genome annotation. The existing computational approaches for each step were summarized and the advantages and disadvantages of the approaches were also highlighted in this review. To accurately identify, classify, and annotate the TEs in eukaryotic genomes requires combined methods. This review provides useful information for biologists who are not familiar with these approaches to find their way through the forest of programs.
Collapse
Affiliation(s)
- Hong-En Xu
- The Institute of Sericulture and Systems Biology, Southwest University, Chongqing, China.
| | | | | | | | | | | | | |
Collapse
|
8
|
Diversification of hAT transposase paralogues in the sugarcane genome. Mol Genet Genomics 2012; 287:205-19. [PMID: 22228195 PMCID: PMC3285750 DOI: 10.1007/s00438-011-0670-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 12/18/2011] [Indexed: 12/22/2022]
Abstract
Transposons are abundant components of eukaryotic genomes, and play important role in genome evolution. The knowledge about these elements should contribute to the understanding of their impact on the host genomes. The hAT transposon superfamily is one of the best characterized superfamilies in diverse organisms, nevertheless, a detailed study of these elements was never carried in sugarcane. To address this question we analyzed 32 cDNAs similar to that of hAT superfamily of transposons previously identified in the sugarcane transcriptome. Our results revealed that these hAT-like transposases cluster in one highly homogeneous and other more heterogeneous lineage. We present evidences that support the hypothesis that the highly homogeneous group is a domesticated transposase while the remainder of the lineages are composed of transposon units. The first is common to grasses, clusters significantly with domesticated transposases from Arabidopsis, rice and sorghum and is expressed in different tissues of two sugarcane cultivars analyzed. In contrast, the more heterogeneous group represents at least two transposon lineages. We recovered five genomic versions of one lineage, characterizing a novel transposon family with conserved DDE motif, named SChAT. These results indicate the presence of at least three distinct lineages of hAT-like transposase paralogues in sugarcane genome, including a novel transposon family described in Saccharum and a domesticated transposase. Taken together, these findings permit to follow the diversification of some hAT transposase paralogues in sugarcane, aggregating knowledge about the co-evolution of transposons and their host genomes.
Collapse
|
9
|
Coates BS, Hellmich RL, Grant DM, Abel CA. Mobilizing the genome of Lepidoptera through novel sequence gains and end creation by non-autonomous Lep1 Helitrons. DNA Res 2011; 19:11-21. [PMID: 22086996 PMCID: PMC3276263 DOI: 10.1093/dnares/dsr038] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Transposable elements (TEs) can affect the structure of genomes through their acquisition and transposition of novel DNA sequences. The 134-bp repetitive elements, Lep1, are conserved non-autonomous Helitrons in lepidopteran genomes that have characteristic 5′-CT and 3′-CTAY nucleotide termini, a 3′-terminal hairpin structure, a 5′- and 3′-subterminal inverted repeat (SIR), and integrations that occur between AT or TT nucleotides. Lep1 Helitrons have acquired and propagated sequences downstream of their 3′-CTAY termini that are 57–344-bp in length and have termini composed of a 3′-CTRR preceded by a 3′-hairpin structure and a region complementary to the 5′-SIR (3′-SIRb). Features of both the Lep1 Helitron and multiple acquired sequences indicate that secondary structures at the 3′-terminus may have a role in rolling circle replication or genome integration mechanisms, and are a prerequisite for novel end creation by Helitron-like TEs. The preferential integration of Lep1 Helitrons in proximity to gene-coding regions results in the creation of genetic novelty that is shown to impact gene structure and function through the introduction of novel exon sequence (exon shuffling). These findings are important in understanding the structural requirements of genomic DNA sequences that are acquired and transposed by Helitron-like TEs.
Collapse
Affiliation(s)
- Brad S Coates
- 1USDA-ARS, Corn Insect and Crop Genetics Research Unit, 113 Genetics Laboratory, Iowa State University, Ames, IA 50011, USA.
| | | | | | | |
Collapse
|
10
|
Bae JH, Ahn K, Nam GH, Lee CE, Park KD, Lee HK, Cho BW, Kim HS. Molecular characterization of alternative transcripts of the horse BMAL1 gene. Zoolog Sci 2011; 28:671-5. [PMID: 21882956 DOI: 10.2108/zsj.28.671] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The horse BMAL1 gene encodes the brain and muscle Arnt-like protein 1, which is a key regulator of circadian rhythmic systems in most organs and cells. The first exon of the horse-specific BMAL1 gene is produced by an exonization event of LINE3 (CR1) and SINE (MIR) was detected by bioinformatic analysis. Alternative variants generated by cassette exon event in various horse tissues were also detected by RT-PCR amplification and sequencing. The cDNA sequences of the horse transcripts (BMAL1a, BMAL1b) contain additional 21 bp and 71 bp fragments relative to horse BMAL1. Quantitative real-time RT-PCR was performed to compare the expression patterns between transcript variants in various horse tissues. The results of these experiments showed splice variants that were widely expressed in most tissues. Furthermore, they were highly expressed in cerebellum, heart, and kidney.
Collapse
Affiliation(s)
- Jin-Han Bae
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea
| | | | | | | | | | | | | | | |
Collapse
|
11
|
When one is better than two: RNA with dual functions. Biochimie 2010; 93:633-44. [PMID: 21111023 DOI: 10.1016/j.biochi.2010.11.004] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2010] [Accepted: 11/17/2010] [Indexed: 11/23/2022]
Abstract
The central dogma of biology, until not long ago, held that genetic information stored on DNA molecules was translated into the final protein products through RNA as intermediate molecules. Then, an additional level of complexity in the regulation of genome expression was added, implicating new classes of RNA molecules called non-coding RNA (ncRNA). These ncRNA are also often referred to as functional RNA in that, although they do not contain the capacity to encode proteins, do have a function as RNA molecules. They have been thus far considered as truly non-coding RNA since no ORF long enough to be considered, nor protein, have been associated with them. However, the recent identification and characterization of bifunctional RNA, i.e. RNA for which both coding capacity and activity as functional RNA have been reported, suggests that a definite categorization of some RNA molecules is far from being straightforward. Indeed, several RNA primarily classified as non-protein-coding RNA has been showed to hold coding capacities and associated peptides. Conversely, mRNA, usually regarded as strictly protein-coding, may act as functional RNA molecules. Here, we describe several examples of these bifunctional RNA that have been already characterized from bacteria to mammals. We also extend this concept to fortuitous acquisition of dual function in pathological conditions and to the recently highlighted duality between information carried by a gene and its pseudogenes counterparts.
Collapse
|
12
|
Morescalchi MA, Barucca M, Stingo V, Capriglione T. Polypteridae (Actinopterygii: Cladistia) and DANA-SINEs insertions. Mar Genomics 2010; 3:79-84. [PMID: 21798200 DOI: 10.1016/j.margen.2010.06.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Revised: 06/07/2010] [Accepted: 06/15/2010] [Indexed: 01/09/2023]
Abstract
SINE sequences are interspersed throughout virtually all eukaryotic genomes and greatly outnumber the other repetitive elements. These sequences are of increasing interest for phylogenetic studies because of their diagnostic power for establishing common ancestry among taxa, once properly characterized. We identified and characterized a peculiar family of composite tRNA-derived short interspersed SINEs, DANA-SINEs, associated with mutational activities in Danio rerio, in a group of species belonging to one of the most basal bony fish families, the Polypteridae, in order to investigate their own inner specific phylogenetic relationships. DANA sequences were identified, sequenced and then localized, by means of fluorescent in situ hybridization (FISH), in six Polypteridae species (Polypterus delhezi, P. ornatipinnis, P. palmas, P. buettikoferi P. senegalus and Erpetoichthys calabaricus) After cloning, the sequences obtained were aligned for phylogenetic analysis, comparing them with three Dipnoan lungfish species (Protopterus annectens, P. aethiopicus, Lepidosiren paradoxa), and Lethenteron reissneri (Petromyzontidae)was used as outgroup. The obtained overlapping MP, ML and NJ tree clustered together the species belonging to the two taxonomically different Osteichthyans groups: the Polypteridae, by one side, and the Protopteridae by the other, with the monotypic genus Erpetoichthys more distantly related to the Polypterus genus comprising three distinct groups: P. palmas and P. buettikoferi, P. delhezi and P. ornatipinnis and P. senegalus. In situ hybridization with DANA probes marked along the whole chromosome arms in the metaphases of all the Polypteridae species examined.
Collapse
Affiliation(s)
- Maria Alessandra Morescalchi
- Dipartimento di Scienze della Vita, Seconda Università degli Studi di Napoli, via Vivaldi 43, 81100, Caserta, Italy.
| | | | | | | |
Collapse
|
13
|
O'Brochta DA, Stosic CD, Pilitt K, Subramanian RA, Hice RH, Atkinson PW. Transpositionally active episomal hAT elements. BMC Mol Biol 2009; 10:108. [PMID: 20003420 PMCID: PMC2803484 DOI: 10.1186/1471-2199-10-108] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2009] [Accepted: 12/14/2009] [Indexed: 01/31/2023] Open
Abstract
Background hAT elements and V(D)J recombination may have evolved from a common ancestral transposable element system. Extrachromosomal, circular forms of transposable elements (referred to here as episomal forms) have been reported yet their biological significance remains unknown. V(D)J signal joints, which resemble episomal transposable elements, have been considered non-recombinogenic products of V(D)J recombination and a safe way to dispose of excised chromosomal sequences. V(D)J signal joints can, however, participate in recombination reactions and the purpose of this study was to determine if hobo and Hermes episomal elements are also recombinogenic. Results Up to 50% of hobo/Hermes episomes contained two intact, inverted-terminal repeats and 86% of these contained from 1-1000 bp of intercalary DNA. Episomal hobo/Hermes elements were recovered from Musca domestica (a natural host of Hermes), Drosophila melanogaster (a natural host of hobo) and transgenic Drosophila melanogaster and Aedes aegypti (with autonomous Hermes elements). Episomal Hermes elements were recovered from unfertilized eggs of M. domestica and D. melanogaster demonstrating their potential for extrachromosomal, maternal transmission. Reintegration of episomal Hermes elements was observed in vitro and in vivo and the presence of Hermes episomes resulted in lower rates of canonical Hermes transposition in vivo. Conclusion Episomal hobo/Hermes elements are common products of element excision and can be maternally transmitted. Episomal forms of Hermes are capable of integration and also of influencing the transposition of canonical elements suggesting biological roles for these extrachromosomal elements in element transmission and regulation.
Collapse
Affiliation(s)
- David A O'Brochta
- Center for Biosystems Research, University of Maryland Biotechnology Institute, Rockville, MD 20850, USA.
| | | | | | | | | | | |
Collapse
|
14
|
Gasent Blesa J, Candel V. Cell-cell fusion as a potential target in cancer therapy. Ecancermedicalscience 2009; 3:145. [PMID: 22276010 PMCID: PMC3224012 DOI: 10.3332/ecancer.2009.145] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2009] [Indexed: 11/11/2022] Open
Abstract
In the fight against cancer, new and more specific targets are needed. Here, we offer an example of a potential target that has not been widely studied, namely the syncytin protein. Syncytin is expressed mainly in the human placenta and is implicated in placental syncytiotrophoblast cell fusion. Not much is known about the role of syncytin in cancer, but the existing data call for more intense research. Its retroviral origin and particular tissue distribution make syncytin an interesting potential target in cancer therapy.
Collapse
Affiliation(s)
- Jm Gasent Blesa
- Hospital General Universitari Marina Alta, Denia, Plana de l'Est no 5, Alacant, Spain
| | | |
Collapse
|
15
|
Zeh DW, Zeh JA, Ishida Y. Transposable elements and an epigenetic basis for punctuated equilibria. Bioessays 2009; 31:715-26. [DOI: 10.1002/bies.200900026] [Citation(s) in RCA: 119] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
16
|
Imamura H, Karro JE, Chuang JH. Weak preservation of local neutral substitution rates across mammalian genomes. BMC Evol Biol 2009; 9:89. [PMID: 19416516 PMCID: PMC2689173 DOI: 10.1186/1471-2148-9-89] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2008] [Accepted: 05/05/2009] [Indexed: 01/06/2023] Open
Abstract
Background The rate at which neutral (non-functional) bases undergo substitution is highly dependent on their location within a genome. However, it is not clear how fast these location-dependent rates change, or to what extent the substitution rate patterns are conserved between lineages. To address this question, which is critical not only for understanding the substitution process but also for evaluating phylogenetic footprinting algorithms, we examine ancestral repeats: a predominantly neutral dataset with a significantly higher genomic density than other datasets commonly used to study substitution rate variation. Using this repeat data, we measure the extent to which orthologous ancestral repeat sequences exhibit similar substitution patterns in separate mammalian lineages, allowing us to ascertain how well local substitution rates have been preserved across species. Results We calculated substitution rates for each ancestral repeat in each of three independent mammalian lineages (primate – from human/macaque alignments, rodent – from mouse/rat alignments, and laurasiatheria – from dog/cow alignments). We then measured the correlation of local substitution rates among these lineages. Overall we found the correlations between lineages to be statistically significant, but too weak to have much predictive power (r2 <5%). These correlations were found to be primarily driven by regional effects at the scale of several hundred kb or larger. A few repeat classes (e.g. 7SK, Charlie8, and MER121) also exhibited stronger conservation of rate patterns, likely due to the effect of repeat-specific purifying selection. These classes should be excluded when estimating local neutral substitution rates. Conclusion Although local neutral substitution rates have some correlations among mammalian species, these correlations have little predictive power on the scale of individual repeats. This indicates that local substitution rates have changed significantly among the lineages we have studied, and are likely to have changed even more for more diverged lineages. The correlations that do persist are too weak to be responsible for many of the highly conserved elements found by phylogenetic footprinting algorithms, leading us to conclude that such elements must be conserved due to selective forces.
Collapse
Affiliation(s)
- Hideo Imamura
- Boston College, Department of Biology, Chestnut Hill, MA 02467, USA.
| | | | | |
Collapse
|
17
|
Pidpala OV, Yatsishina AP, Lukash LL. Human mobile genetic elements: Structure, distribution and functional role. CYTOL GENET+ 2008. [DOI: 10.3103/s009545270806011x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
18
|
Oki N, Yano K, Okumoto Y, Tsukiyama T, Teraishi M, Tanisaka T. A genome-wide view of miniature inverted-repeat transposable elements (MITEs) in rice, Oryza sativa ssp. japonica. Genes Genet Syst 2008; 83:321-9. [PMID: 18931457 DOI: 10.1266/ggs.83.321] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Transposable elements (TEs) have played important roles in the evolution of genes and genomes of higher eukaryotes. Among the TEs in the rice genome, miniature inverted-repeat transposable elements (MITEs) exist at the highest copy number. Some of MITEs in the rice genome contain poly(A) signals and putative cis-acting regulatory domains. Insertion events of such MITEs may have caused many structural and functional changes of genomes. The genome-wide examination of MITE-derived sequences could elucidate the contribution of MITEs to gene evolution. Here we report on the MITEs in the rice genome that have contributed to the emergence of novel genes and the expansion of the sequence diversity of the genome and mRNAs. Of the MITE-derived sequences, approximately 6000 were found in gene regions (exons and introns) and 67,000 in intergenic regions. In gene regions, most MITEs are located in introns rather than exons. For over 300 protein-coding genes, coding sequences, poly(A) sites, transcription start sites, and splicing sites overlap with MITEs. These sequence alterations via MITE insertions potentially affect the biological functions of gene products. Many MITE insertions also exist in 5'-untranslated regions (UTRs), 3'-UTRs, and in the proximity of genes. Although mutations in these non-protein coding regions do not alter protein sequences, these regions have key roles for gene regulation. Moreover, MITE family sequences (Tourist, Stowaway, and others) are unevenly distributed in introns. Our findings suggest that MITEs may have contributed to expansion of genome diversity by causing alterations not only in gene functions but also in regulation of many genes.
Collapse
Affiliation(s)
- Nobuhiko Oki
- Graduate School of Agriculture, Kyoto University, Kyoto, Japan
| | | | | | | | | | | |
Collapse
|
19
|
Opgen-Rhein C, Lencz T, Burdick KE, Neuhaus AH, DeRosse P, Goldberg TE, Malhotra AK. Genetic variation in the DAOA gene complex: impact on susceptibility for schizophrenia and on cognitive performance. Schizophr Res 2008; 103:169-77. [PMID: 18541412 PMCID: PMC2605318 DOI: 10.1016/j.schres.2008.04.020] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/08/2007] [Revised: 02/28/2008] [Accepted: 04/04/2008] [Indexed: 01/19/2023]
Abstract
INTRODUCTION The genetic region coding for d-amino acid oxidase activator (DAOA) is considered an intriguing susceptibility locus for schizophrenia. However, association studies have often resulted in conflicting findings, and the risk-conferring variants and their biological impact remain elusive. Our aim in this study was to investigate the relationship between DAOA variation and schizophrenia, and the influence of DAOA on cognitive performance. METHODS We analyzed block structure and association patterns of an approximately 173 kb region on chromosome 13q33, applying genotype data of 55 SNPs derived from Caucasian North American sample (178 cases, 144 healthy controls). Haplotypes were assigned using the program PHASE and frequencies compared between cases and controls. We applied MANOVA to investigate the relationship between the identified risk haplotype on cognitive performance. RESULTS We identified multiple haplotypes within the region containing the DAOA gene. Of these, one was significantly associated with schizophrenia, being over-represented in schizophrenia versus healthy controls. This haplotype was also associated with one aspect of cognitive performance, semantic fluency. Carriers of the risk haplotype showed better semantic fluency than non-carriers. CONCLUSIONS We report a significant effect of DAOA variation on risk for schizophrenia. Moreover, we identified a relationship between DAOA genetic variation and specific aspects of neurocognitive function. As the identified DAOA risk haplotype was associated with better performance on a semantic fluency measure, further work is required to identify the mechanism of DAOA action on CNS function, including the possibility of a role for balanced selection at this locus.
Collapse
Affiliation(s)
- Carolin Opgen-Rhein
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States.,Department of Psychiatry and Psychotherapy, Charité -University Medicine Berlin, Campus Benjamin Franklin, Germany
| | - Todd Lencz
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States.,Department of Psychiatry and Behavioral Health, Albert Einstein College of Medicine, Bronx, NY, USA.,Center for Translational Psychiatry, Feinstein Institute for Medical Research, Manhasset, NY, USA
| | - Katherine E. Burdick
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States.,Department of Psychiatry and Behavioral Health, Albert Einstein College of Medicine, Bronx, NY, USA.,Center for Translational Psychiatry, Feinstein Institute for Medical Research, Manhasset, NY, USA
| | - Andres H Neuhaus
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States.,Department of Psychiatry and Psychotherapy, Charité -University Medicine Berlin, Campus Benjamin Franklin, Germany
| | - Pamela DeRosse
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States
| | - Terry E. Goldberg
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States
| | - Anil K. Malhotra
- Division of Psychiatry Research, The Zucker Hillside Hospital, North Shore-Long Island Jewish Health System, Glen Oaks, New York, United States.,Department of Psychiatry and Behavioral Health, Albert Einstein College of Medicine, Bronx, NY, USA.,Center for Translational Psychiatry, Feinstein Institute for Medical Research, Manhasset, NY, USA
| |
Collapse
|
20
|
Elisaphenko EA, Kolesnikov NN, Shevchenko AI, Rogozin IB, Nesterova TB, Brockdorff N, Zakian SM. A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements. PLoS One 2008; 3:e2521. [PMID: 18575625 PMCID: PMC2430539 DOI: 10.1371/journal.pone.0002521] [Citation(s) in RCA: 132] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2008] [Accepted: 05/15/2008] [Indexed: 11/18/2022] Open
Abstract
X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA.
Collapse
Affiliation(s)
- Eugeny A. Elisaphenko
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Department, Novosibirsk, Russia
| | - Nikolay N. Kolesnikov
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Department, Novosibirsk, Russia
| | - Alexander I. Shevchenko
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Department, Novosibirsk, Russia
| | - Igor B. Rogozin
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Department, Novosibirsk, Russia
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Tatyana B. Nesterova
- Medical Research Council, Clinical Sciences Centre, Imperial College Faculty of Medicine, London, United Kingdom
| | - Neil Brockdorff
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | - Suren M. Zakian
- Institute of Cytology and Genetics, Russian Academy of Sciences, Siberian Department, Novosibirsk, Russia
- * E-mail:
| |
Collapse
|
21
|
Chen ST, Cheng HC, Barbash DA, Yang HP. Evolution of hydra, a recently evolved testis-expressed gene with nine alternative first exons in Drosophila melanogaster. PLoS Genet 2008; 3:e107. [PMID: 17616977 PMCID: PMC1904467 DOI: 10.1371/journal.pgen.0030107] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2007] [Accepted: 05/15/2007] [Indexed: 12/26/2022] Open
Abstract
We describe here the Drosophila gene hydra that appears to have originated de novo in the melanogaster subgroup and subsequently evolved in both structure and expression level in Drosophila melanogaster and its sibling species. D. melanogaster hydra encodes a predicted protein of ~300 amino acids with no apparent similarity to any previously known proteins. The syntenic region flanking hydra on both sides is found in both D. ananassae and D. pseudoobscura, but hydra is found only in melanogaster subgroup species, suggesting that it originated less than ~13 million y ago. Exon 1 of hydra has undergone recurrent duplications, leading to the formation of nine tandem alternative exon 1s in D. melanogaster. Seven of these alternative exons are flanked on their 3′ side by the transposon DINE-1 (Drosophila interspersed element-1). We demonstrate that at least four of the nine duplicated exon 1s can function as alternative transcription start sites. The entire hydra locus has also duplicated in D. simulans and D. sechellia. D. melanogaster hydra is expressed most intensely in the proximal testis, suggesting a role in late-stage spermatogenesis. The coding region of hydra has a relatively high Ka/Ks ratio between species, but the ratio is less than 1 in all comparisons, suggesting that hydra is subject to functional constraint. Analysis of sequence polymorphism and divergence of hydra shows that it has evolved under positive selection in the lineage leading to D. melanogaster. The dramatic structural changes surrounding the first exons do not affect the tissue specificity of gene expression: hydra is expressed predominantly in the testes in D. melanogaster, D. simulans, and D. yakuba. However, we have found that expression level changed dramatically (~ >20-fold) between D. melanogaster and D. simulans. While hydra initially evolved in the absence of nearby transposable element insertions, we suggest that the subsequent accumulation of repetitive sequences in the hydra region may have contributed to structural and expression-level evolution by inducing rearrangements and causing local heterochromatinization. Our analysis further shows that recurrent evolution of both gene structure and expression level may be characteristics of newly evolved genes. We also suggest that late-stage spermatogenesis is the functional target for newly evolved and rapidly evolving male-specific genes. Similar groups of animals have similar numbers of genes, but not all of these genes are the same. While some genes are highly conserved and can be easily and uniquely identified in species ranging from yeast to plants to humans, other genes are sometimes found in only a small number or even in a single species. Such newly evolved genes may help produce traits that make species unique. We describe here a newly evolved gene called hydra that occurs only in a small subgroup of Drosophila species. hydra is expressed in the testes, suggesting that it may have a function in male fertility. hydra has evolved significantly in its structure and protein-coding sequence among species. The authors named the gene hydra after the nine-headed monster slain by Hercules because in one species, Drosophila melanogaster, hydra has nine potential alternative first exons. Perhaps because of this or other structural changes, the level of RNA made by hydra differs significantly between one pair of species. This analysis reveals that newly created genes may evolve rapidly in sequence, structure, and expression level.
Collapse
Affiliation(s)
- Shou-Tao Chen
- Faculty of Life Sciences and Institute of Genome Sciences, National Yang-Ming University, Taipei, Taiwan, Republic of China
| | - Hsin-Chien Cheng
- Faculty of Life Sciences and Institute of Genome Sciences, National Yang-Ming University, Taipei, Taiwan, Republic of China
| | - Daniel A Barbash
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Hsiao-Pei Yang
- Faculty of Life Sciences and Institute of Genome Sciences, National Yang-Ming University, Taipei, Taiwan, Republic of China
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
22
|
Abstract
Transposable elements are mobile genetic units that exhibit broad diversity in their structure and transposition mechanisms. Transposable elements occupy a large fraction of many eukaryotic genomes and their movement and accumulation represent a major force shaping the genes and genomes of almost all organisms. This review focuses on DNA-mediated or class 2 transposons and emphasizes how this class of elements is distinguished from other types of mobile elements in terms of their structure, amplification dynamics, and genomic effect. We provide an up-to-date outlook on the diversity and taxonomic distribution of all major types of DNA transposons in eukaryotes, including Helitrons and Mavericks. We discuss some of the evolutionary forces that influence their maintenance and diversification in various genomic environments. Finally, we highlight how the distinctive biological features of DNA transposons have contributed to shape genome architecture and led to the emergence of genetic innovations in different eukaryotic lineages.
Collapse
Affiliation(s)
- Cédric Feschotte
- Department of Biology, University of Texas, Arlington, TX 76019, USA.
| | | |
Collapse
|
23
|
Lopes FR, Carazzolle MF, Pereira GAG, Colombo CA, Carareto CMA. Transposable elements in Coffea (Gentianales: Rubiacea) transcripts and their role in the origin of protein diversity in flowering plants. Mol Genet Genomics 2008; 279:385-401. [PMID: 18231813 DOI: 10.1007/s00438-008-0319-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2007] [Accepted: 01/02/2008] [Indexed: 11/28/2022]
Abstract
Transposable elements are major components of plant genomes and they influence their evolution, acting as recombination hot spots, acquiring specific cell functions or becoming part of protein-coding regions. The latter is the subject of the present analysis. This study is a report on the annotation of transposable elements (TEs) in expressed sequences of Coffea arabica, Coffea canephora and Coffea racemosa, showing the occurrence of 383 ESTs and 142 unigenes with TE fragments in these three Coffea species. Based on selected unigenes, it was possible to suggest 26 putative proteins with TE-cassette insertions, demonstrating a likely contribution to protein variability. The genes for two of those proteins, the fertility restorer (FR) and the pyrophosphate-dependent phosphofructokinase (PPi-PFKs) genes, were selected for evaluating the impact of TE-cassettes on host gene evolution of other plant genomes (Arabidopsis thaliana, Oryza sativa and Populus trichocarpa). This survey allowed identifying a FR gene in O. sativa harboring multiple insertions of LTR retrotransposons that originated new exons, which however does not necessarily mean a case of molecular domestication. A possible transduction event of a fragment of the PPi-PFK beta-subunit gene mediated by Helitron ATREPX1 in Arabidopsis thaliana was also highlighted.
Collapse
Affiliation(s)
- Fabrício Ramon Lopes
- Laboratory of Molecular Evolution, Department of Biology, UNESP, São Paulo State University, 15054-000, São José do Rio Preto, São Paulo, Brazil
| | | | | | | | | |
Collapse
|
24
|
Casola C, Hucks D, Feschotte C. Convergent domestication of pogo-like transposases into centromere-binding proteins in fission yeast and mammals. Mol Biol Evol 2007; 25:29-41. [PMID: 17940212 DOI: 10.1093/molbev/msm221] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The mammalian centromere-associated protein B (CENP-B) shares significant sequence similarity with 3 proteins in fission yeast (Abp1, Cbh1, and Cbh2) that also bind centromeres and have essential function for chromosome segregation and centromeric heterochromatin formation. Each of these proteins displays extensive sequence similarity with pogo-like transposases, which have been previously identified in the genomes of various insects and vertebrates, in the protozoan Entamoeba and in plants. Based on this distribution, it has been proposed that the mammalian and fission yeast centromeric proteins are derived from "domesticated" pogo-like transposons. Here we took advantage of the vast amount of sequence information that has become recently available for a wide range of fungal and animal species to investigate the origin of the mammalian CENP-B and yeast CENP-B-like genes. A highly conserved ortholog of CENP-B was detected in 31 species of mammals, including opossum and platypus, but was absent from all nonmammalian species represented in the databases. Similarly, no ortholog of the fission yeast centromeric proteins was identified in any of the various fungal genomes currently available. In contrast, we discovered a plethora of novel pogo-like transposons in diverse invertebrates and vertebrates and in several filamentous fungi. Phylogenetic analysis revealed that the mammalian and fission yeast CENP-B proteins fall into 2 distinct monophyletic clades, each of which includes a different set of pogo-like transposons. These results are most parsimoniously explained by independent domestication events of pogo-like transposases into centromeric proteins in the mammalian and fission yeast lineages, a case of "convergent domestication." These findings highlight the propensity of transposases to give rise to new host proteins and the potential of transposons as sources of genetic innovation.
Collapse
Affiliation(s)
- Claudio Casola
- Department of Biology, University of Texas, Arlington, USA
| | | | | |
Collapse
|
25
|
Wu M, Li L, Sun Z. Transposable element fragments in protein-coding regions and their contributions to human functional proteins. Gene 2007; 401:165-71. [PMID: 17716834 DOI: 10.1016/j.gene.2007.07.012] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2007] [Revised: 07/11/2007] [Accepted: 07/11/2007] [Indexed: 11/26/2022]
Abstract
Transposable elements (TEs) and their contributions to protein-coding regions are of particular interest. Here we searched for TE fragments in Homo sapiens at both the transcript and protein levels. We found evidence in support of TE exonization and its association with alternative splicing. Despite recent findings that long evolutionary times are required to incorporate TE into proteins, we found many functional proteins with translated TE cassettes derived from young TEs. Analyses of two Bcl-family proteins and Alu-encoded segments suggest the coding and functional potential of TE sequences.
Collapse
Affiliation(s)
- Ming Wu
- Institute of Bioinformatics, MOE Key Laboratory of Bioinformatics, State Key Laboratory of Biomembrane and Membrane Biotechnology, Department of Biological Sciences and Biotechnology, Tsinghua University, Beijing 100084, China
| | | | | |
Collapse
|
26
|
Wang-Johanning F, Liu J, Rycaj K, Huang M, Tsai K, Rosen DG, Chen DT, Lu DW, Barnhart KF, Johanning GL. Expression of multiple human endogenous retrovirus surface envelope proteins in ovarian cancer. Int J Cancer 2007; 120:81-90. [PMID: 17013901 DOI: 10.1002/ijc.22256] [Citation(s) in RCA: 157] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Individual classes of human endogenous retrovirus (HERV) genes and proteins are expressed in cancer, but expression of more than one type of HERV is rare. We report here the expression of multiple HERV genes and proteins in ovarian cell lines and tissues. Expression of HERV-K env mRNA was greater in ovarian epithelial tumors than in normal ovarian tissues (N = 254). The expression of this protein on the surface and in the cytoplasm of ovarian cancer cells was confirmed using anti-HERV-K specific antibody by flow cytometric analysis. The frequency of expression of HERV-K env protein in multitissue microarrays (N = 641) was determined by immunohistochemistry and a significant correlation with tumor histotype was found. A significantly increased expression of HERV-K was observed in tumors with low malignant potential and low grade, relative to expression in normal ovarian tissues. The increase in expression of HERV-K env protein took place in a stepwise fashion in serous papillary adenocarcinoma. Interestingly, we found that other classes of HERV env mRNAs, including ERV3 and HERV-E, are expressed in the same ovarian cancer tissues that expressed HERV-K. Furthermore, anti-HERV antibodies including anti-ERV3 (30%), anti-HERV-E (40%) and anti-HERV-K (55%) were detected in patients with ovarian cancer, but not in normal female controls. HERV env proteins are frequently transcribed and translated in ovarian epithelial tumors, and multiple HERV families are detectable in ovarian cancer. HERV env proteins, and especially those expressed on the cell surface, may serve as novel tumor targets for detection, diagnosis and immunotherapy of ovarian cancer.
Collapse
MESH Headings
- Adenocarcinoma, Clear Cell/metabolism
- Adenocarcinoma, Clear Cell/virology
- Adenocarcinoma, Mucinous/metabolism
- Adenocarcinoma, Mucinous/virology
- Adult
- Aged
- Aged, 80 and over
- Amino Acid Sequence
- Base Sequence
- Carcinoma, Endometrioid/metabolism
- Carcinoma, Endometrioid/virology
- Case-Control Studies
- Cystadenocarcinoma, Serous/metabolism
- Cystadenocarcinoma, Serous/virology
- Endogenous Retroviruses/genetics
- Endogenous Retroviruses/immunology
- Endogenous Retroviruses/metabolism
- Enzyme-Linked Immunosorbent Assay
- Female
- Flow Cytometry
- Fluorescent Antibody Technique
- Gene Products, env/genetics
- Gene Products, env/metabolism
- Gene Products, env/physiology
- Humans
- Immunoenzyme Techniques
- Membrane Proteins/genetics
- Membrane Proteins/metabolism
- Middle Aged
- Molecular Sequence Data
- Ovarian Neoplasms/metabolism
- Ovarian Neoplasms/virology
- Ovary/metabolism
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Reverse Transcriptase Polymerase Chain Reaction
- Tissue Array Analysis
- Tumor Cells, Cultured
Collapse
Affiliation(s)
- Feng Wang-Johanning
- Department of Veterinary Sciences and Michale E. Keeling Center for Comparative Medicine and Research, University of Texas MD Anderson Cancer Center, Houston, TX 78602, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Chung H, Bogwitz MR, McCart C, Andrianopoulos A, Ffrench-Constant RH, Batterham P, Daborn PJ. Cis-regulatory elements in the Accord retrotransposon result in tissue-specific expression of the Drosophila melanogaster insecticide resistance gene Cyp6g1. Genetics 2006; 175:1071-7. [PMID: 17179088 PMCID: PMC1840086 DOI: 10.1534/genetics.106.066597] [Citation(s) in RCA: 181] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Transposable elements are a major mutation source and powerful agents of adaptive change. Some transposable element insertions in genomes increase to a high frequency because of the selective advantage the mutant phenotype provides. Cyp6g1-mediated insecticide resistance in Drosophila melanogaster is due to the upregulation of the cytochrome P450 gene Cyp6g1, leading to the resistance to a variety of insecticide classes. The upregulation of Cyp6g1 is correlated with the presence of the long terminal repeat (LTR) of an Accord retrotransposon inserted 291bp upstream of the Cyp6g1 transcription start site. This resistant allele (DDT-R) is currently at a high frequency in D. melanogaster populations around the world. Here, we characterize the spatial expression of Cyp6g1 in insecticide-resistant and -susceptible strains. We show that the Accord LTR insertion is indeed the resistance-associated mutation and demonstrate that the Accord LTR carries regulatory sequences that increase the expression of Cyp6g1 in tissues important for detoxification, the midgut, Malpighian tubules, and the fat body. This study provides a significant example of how changes in tissue-specific gene expression caused by transposable-element insertions can contribute to adaptation.
Collapse
Affiliation(s)
- Henry Chung
- Centre for Environmental and Stress Adaptation Research, Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Melbourne, Victoria, Australia
| | | | | | | | | | | | | |
Collapse
|
28
|
Detera-Wadleigh SD, McMahon FJ. G72/G30 in schizophrenia and bipolar disorder: review and meta-analysis. Biol Psychiatry 2006; 60:106-14. [PMID: 16581030 DOI: 10.1016/j.biopsych.2006.01.019] [Citation(s) in RCA: 214] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/13/2005] [Revised: 01/19/2006] [Accepted: 01/28/2006] [Indexed: 01/31/2023]
Abstract
Association of the G72/G30 locus with schizophrenia and bipolar disorder has now been reported in several studies. The G72/G30 locus may be one of several that account for the evidence of linkage that spans a broad region of chromosome 13q. However, the story of G72/G30 is complex. Our meta-analysis of published association studies shows highly significant evidence of association between nucleotide variations in the G72/G30 region and schizophrenia, along with compelling evidence of association with bipolar disorder. But the associated alleles and haplotypes are not identical across studies, and some strongly associated variants are located approximately 50 kb telomeric of G72. Interestingly, G72 and G30 are transcribed in opposite directions; hence, their transcripts could cross-regulate translation. A functional native protein and functional motifs for G72 or G30 remain to be demonstrated. The interaction of G72 with d-amino acid oxidase, itself of interest as a modulator of N-methyl-d-aspartate receptors through regulation of d-serine levels, has been reported in one study and could be a key functional link that deserves further investigation. The association findings in the G72/G30 region, among the most compelling in psychiatry, may expose an important molecular pathway involved in susceptibility to schizophrenia and bipolar disorder.
Collapse
Affiliation(s)
- Sevilla D Detera-Wadleigh
- National Institute of Mental Health Intramural Research Program, National Institutes of Health, U.S. Department of Health and Human Services, Bethesda, Maryland 20892-3719, USA.
| | | |
Collapse
|
29
|
Geurts AM, Hackett CS, Bell JB, Bergemann TL, Collier LS, Carlson CM, Largaespada DA, Hackett PB. Structure-based prediction of insertion-site preferences of transposons into chromosomes. Nucleic Acids Res 2006; 34:2803-11. [PMID: 16717285 PMCID: PMC1464413 DOI: 10.1093/nar/gkl301] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Mobile genetic elements with the ability to integrate genetic information into chromosomes can cause disease over short periods of time and shape genomes over eons. These elements can be used for functional genomics, gene transfer and human gene therapy. However, their integration-site preferences, which are critically important for these uses, are poorly understood. We analyzed the insertion sites of several transposons and retroviruses to detect patterns of integration that might be useful for prediction of preferred integration sites. Initially we found that a mathematical description of DNA-deformability, called Vstep, could be used to distinguish preferential integration sites for Sleeping Beauty (SB) transposons into a particular 100 bp region of a plasmid [G. Liu, A. M. Geurts, K. Yae, A. R. Srinivassan, S. C. Fahrenkrug, D. A. Largaespada,J. Takeda, K. Horie, W. K. Olson and P. B. Hackett (2005) J. Mol. Biol., 346, 161–173 ]. Based on these findings, we extended our examination of integration of SB transposons into whole plasmids and chromosomal DNA. To accommodate sequences up to 3 Mb for these analyses, we developed an automated method, ProTIS©, that can generate profiles of predicted integration events. However, a similar approach did not reveal any structural pattern of DNA that could be used to predict favored integration sites for other transposons as well as retroviruses and lentiviruses due to a limitation of available data sets. Nonetheless, ProTIS© has the utility for predicting likely SB transposon integration sites in investigator-selected regions of genomes and our general strategy may be useful for other mobile elements once a sufficiently high density of sites in a single region are obtained. ProTIS analysis can be useful for functional genomic, gene transfer and human gene therapy applications using the SB system.
Collapse
Affiliation(s)
- Aron M. Geurts
- Department of Genetics, Cell Biology and Development, The Arnold and Mabel Beckman Center for Transposon Research, University of MinnesotaMinneapolis, MN 55455, USA
| | - Christopher S. Hackett
- Biomedical Sciences Graduate Program, University of California San FranciscoSan Francisco, CA 94143-0452, USA
| | - Jason B. Bell
- Department of Genetics, Cell Biology and Development, The Arnold and Mabel Beckman Center for Transposon Research, University of MinnesotaMinneapolis, MN 55455, USA
| | - Tracy L. Bergemann
- Biostatistics Core, University of Minnesota Cancer CenterMinneapolis, MN 55455, USA
| | - Lara S. Collier
- University of Minnesota Cancer CenterMinneapolis, MN 55455, USA
| | | | - David A. Largaespada
- Department of Genetics, Cell Biology and Development, The Arnold and Mabel Beckman Center for Transposon Research, University of MinnesotaMinneapolis, MN 55455, USA
- University of Minnesota Cancer CenterMinneapolis, MN 55455, USA
| | - Perry B. Hackett
- Department of Genetics, Cell Biology and Development, The Arnold and Mabel Beckman Center for Transposon Research, University of MinnesotaMinneapolis, MN 55455, USA
- University of Minnesota Cancer CenterMinneapolis, MN 55455, USA
- To whom correspondence should be addressed at Department of Genetics, Cell Biology, and Development, University of Minnesota, 6-160 Jackson Hall, 321 Church Street SE, Minneapolis, MN 55455, USA. Tel: +1 612 624 6736; Fax: +1 612 625 6140;
| |
Collapse
|
30
|
Cordaux R, Udit S, Batzer MA, Feschotte C. Birth of a chimeric primate gene by capture of the transposase gene from a mobile element. Proc Natl Acad Sci U S A 2006; 103:8101-6. [PMID: 16672366 PMCID: PMC1472436 DOI: 10.1073/pnas.0601161103] [Citation(s) in RCA: 177] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The emergence of new genes and functions is of central importance to the evolution of species. The contribution of various types of duplications to genetic innovation has been extensively investigated. Less understood is the creation of new genes by recycling of coding material from selfish mobile genetic elements. To investigate this process, we reconstructed the evolutionary history of SETMAR, a new primate chimeric gene resulting from fusion of a SET histone methyltransferase gene to the transposase gene of a mobile element. We show that the transposase gene was recruited as part of SETMAR 40-58 million years ago, after the insertion of an Hsmar1 transposon downstream of a preexisting SET gene, followed by the de novo exonization of previously noncoding sequence and the creation of a new intron. The original structure of the fusion gene is conserved in all anthropoid lineages, but only the N-terminal half of the transposase is evolving under strong purifying selection. In vitro assays show that this region contains a DNA-binding domain that has preserved its ancestral binding specificity for a 19-bp motif located within the terminal-inverted repeats of Hsmar1 transposons and their derivatives. The presence of these transposons in the human genome constitutes a potential reservoir of approximately 1,500 perfect or nearly perfect SETMAR-binding sites. Our results not only provide insight into the conditions required for a successful gene fusion, but they also suggest a mechanism by which the circuitry underlying complex regulatory networks may be rapidly established.
Collapse
Affiliation(s)
- Richard Cordaux
- *Department of Biological Sciences, Biological Computation and Visualization Center, Center for BioModular Multi-Scale Systems, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803; and
| | - Swalpa Udit
- Department of Biology, University of Texas, Arlington, TX 76019
| | - Mark A. Batzer
- *Department of Biological Sciences, Biological Computation and Visualization Center, Center for BioModular Multi-Scale Systems, Louisiana State University, 202 Life Sciences Building, Baton Rouge, LA 70803; and
| | - Cédric Feschotte
- Department of Biology, University of Texas, Arlington, TX 76019
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
31
|
Britten R. Transposable elements have contributed to thousands of human proteins. Proc Natl Acad Sci U S A 2006; 103:1798-803. [PMID: 16443682 PMCID: PMC1413650 DOI: 10.1073/pnas.0510007103] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
This is a report of many distant but significant protein sequence relationships between human proteins and transposable elements (TEs). The libraries of human repeated sequences contain the DNA sequences of many TEs. These were translated in all reading frames, ignoring stop codons, and were used as amino acid sequence probes to search with BLASTP for similar sequences in a library of 25,193 human proteins. The probes show regions of significant amino acid sequence similarity to 1,950 different human genes, with an expectation of <10(-3). In comparison with previous REPEATMASKER (Institute for Systems Biology, Seattle) studies, these probes detect many more TE sequences in more human coding sequences with greater length than previous work using DNA sequences. If the criterion is opened, very many matches are found occurring on 4,653 different genes after correction for the number seen with random amino acid sequence probes. The processes that led to these extensive sets of sequence relationships between TEs and coding sequences of human genes have been a major source of variation and novel genes during evolution. This paper lists the number of sequence similarities seen by amino acid sequence comparison, which is surely an underestimate of the actual number of significant relationships. It appears that many of these are the result of past events of duplication of genes or gene regions, rather than a direct result of TE insertion. This report of observable relationships leaves to the future the functional implications as well as the detection of the events of TE insertion.
Collapse
Affiliation(s)
- Roy Britten
- California Institute of Technology, 101 Dahlia Avenue, Corona del Mar, CA 92625, USA.
| |
Collapse
|
32
|
Cáceres M, Thomas JW. The gene of retroviral origin Syncytin 1 is specific to hominoids and is inactive in Old World monkeys. ACTA ACUST UNITED AC 2006; 97:100-6. [PMID: 16424151 DOI: 10.1093/jhered/esj011] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Syncytin 1 is one of the best known examples of recent acquisition of a new gene from an endogenous retrovirus (HERV) in the human genome and has been implicated in placental physiology. Within primates, Syncytin 1 is conserved in all hominoids but has not been characterized in Old World monkeys (OWMs). In this study, we investigated the status of Syncytin 1 in 14 hominoid and OWM species. We show that although the HERV-W provirus responsible for the origin of this gene was present in the genome of the most recent common ancestor of hominoids and OWMs, Syncytin 1 is inactive in OWMs. In addition, we were able to determine that the evolution of Syncytin 1 in hominoids involved an accumulation of amino acid changes and showed signatures of both positive and purifying selection. Our results indicate that Syncytin 1 is indeed a hominoid-specific gene and illustrate the complex and dynamic process associated with the origin of new genes.
Collapse
Affiliation(s)
- Mario Cáceres
- Department of Human Genetics, Emory University School of Medicine, 615 Michael Street, Suite 301, Atlanta, GA 30322, USA.
| | | |
Collapse
|
33
|
Lipatov M, Lenkov K, Petrov DA, Bergman CM. Paucity of chimeric gene-transposable element transcripts in the Drosophila melanogaster genome. BMC Biol 2005; 3:24. [PMID: 16283942 PMCID: PMC1308810 DOI: 10.1186/1741-7007-3-24] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2005] [Accepted: 11/12/2005] [Indexed: 11/27/2022] Open
Abstract
Background Recent analysis of the human and mouse genomes has shown that a substantial proportion of protein coding genes and cis-regulatory elements contain transposable element (TE) sequences, implicating TE domestication as a mechanism for the origin of genetic novelty. To understand the general role of TE domestication in eukaryotic genome evolution, it is important to assess the acquisition of functional TE sequences by host genomes in a variety of different species, and to understand in greater depth the population dynamics of these mutational events. Results Using an in silico screen for host genes that contain TE sequences, we identified a set of 63 mature "chimeric" transcripts supported by expressed sequence tag (EST) evidence in the Drosophila melanogaster genome. We found a paucity of chimeric TEs relative to expectations derived from non-chimeric TEs, indicating that the majority (~80%) of TEs that generate chimeric transcripts are deleterious and are not observed in the genome sequence. Using a pooled-PCR strategy to assay the presence of gene-TE chimeras in wild strains, we found that over half of the observed chimeric TE insertions are restricted to the sequenced strain, and ~15% are found at high frequencies in North American D. melanogaster populations. Estimated population frequencies of chimeric TEs did not differ significantly from non-chimeric TEs, suggesting that the distribution of fitness effects for the observed subset of chimeric TEs is indistinguishable from the general set of TEs in the genome sequence. Conclusion In contrast to mammalian genomes, we found that fewer than 1% of Drosophila genes produce mRNAs that include bona fide TE sequences. This observation can be explained by the results of our population genomic analysis, which indicates that most potential chimeric TEs in D. melanogaster are deleterious but that a small proportion may contribute to the evolution of novel gene sequences such as nested or intercalated gene structures. Our results highlight the need to establish the fixity of putative cases of TE domestication identified using genome sequences in order to demonstrate their functional importance, and reveal that the contribution of TE domestication to genome evolution may vary drastically among animal taxa.
Collapse
Affiliation(s)
- Mikhail Lipatov
- Department of Biological Sciences, Stanford University, Stanford, CA 94305, USA
| | - Kapa Lenkov
- Department of Biological Sciences, Stanford University, Stanford, CA 94305, USA
| | - Dmitri A Petrov
- Department of Biological Sciences, Stanford University, Stanford, CA 94305, USA
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| |
Collapse
|
34
|
Dunn CA, Romanish MT, Gutierrez LE, van de Lagemaat LN, Mager DL. Transcription of two human genes from a bidirectional endogenous retrovirus promoter. Gene 2005; 366:335-42. [PMID: 16288839 DOI: 10.1016/j.gene.2005.09.003] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2005] [Revised: 08/29/2005] [Accepted: 09/01/2005] [Indexed: 11/17/2022]
Abstract
Eight percent of the human genome is derived from endogenous retrovirus (ERV) insertions. ERV long terminal repeats (LTRs) contain strong promoters that are known to contribute to the transcriptional regulation of certain human genes. While some LTRs are known to possess bidirectional promoter activity in vitro, only sense orientation LTR promoters have previously been shown to regulate human gene expression. Here we demonstrate that an ERV1 LTR acts as a bidirectional promoter for the human Down syndrome critical region 4 (DSCR4) and DSCR8 genes. We show that while DSCR4 and DSCR8 are essentially co-expressed, their shared LTR promoter is more active in the sense than the antisense orientation. Through deletion analysis of the LTR we have identified positive and negative regulatory elements, and defined a core region of the promoter that is required for transcriptional activity in both orientations. Finally, we show that the ERV LTR also exists in the genomes of several non-human primates, and present evidence that potential transcription factor binding sites in the core region have been maintained throughout primate evolution.
Collapse
Affiliation(s)
- Catherine A Dunn
- Terry Fox Laboratory, BC Cancer Agency, 675 West 10th Avenue, Vancouver, BC, Canada V5Z 1L3
| | | | | | | | | |
Collapse
|