51
|
Fattash I, Rooke R, Wong A, Hui C, Luu T, Bhardwaj P, Yang G. Miniature inverted-repeat transposable elements: discovery, distribution, and activity. Genome 2013; 56:475-86. [PMID: 24168668 DOI: 10.1139/gen-2012-0174] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Eukaryotic organisms have dynamic genomes, with transposable elements (TEs) as a major contributing factor. Although the large autonomous TEs can significantly shape genomic structures during evolution, genomes often harbor more miniature nonautonomous TEs that can infest genomic niches where large TEs are rare. In spite of their cut-and-paste transposition mechanisms that do not inherently favor copy number increase, miniature inverted-repeat transposable elements (MITEs) are abundant in eukaryotic genomes and exist in high copy numbers. Based on the large number of MITE families revealed in previous studies, accurate annotation of MITEs, particularly in newly sequenced genomes, will identify more genomes highly rich in these elements. Novel families identified from these analyses, together with the currently known families, will further deepen our understanding of the origins, transposase sources, and dramatic amplification of these elements.
Collapse
Affiliation(s)
- Isam Fattash
- a Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, Mississauga, ON L5L 1C6, Canada
| | | | | | | | | | | | | |
Collapse
|
52
|
Abstract
For decades, transposable elements have been known to produce a wide variety of changes in plant gene expression and function. This has led to the idea that transposable element activity has played a key part in adaptive plant evolution. This Review describes the kinds of changes that transposable elements can cause, discusses evidence that those changes have contributed to plant evolution and suggests future strategies for determining the extent to which these changes have in fact contributed to plant adaptation and evolution. Recent advances in genomics and phenomics for a range of plant species, particularly crops, have begun to allow the systematic assessment of these questions.
Collapse
Affiliation(s)
- Damon Lisch
- Department of Plant and Microbial Biology, UC Berkeley, Berkeley, California 94720, USA.
| |
Collapse
|
53
|
Abrusán G, Szilágyi A, Zhang Y, Papp B. Turning gold into 'junk': transposable elements utilize central proteins of cellular networks. Nucleic Acids Res 2013; 41:3190-200. [PMID: 23341038 PMCID: PMC3597677 DOI: 10.1093/nar/gkt011] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
The numerous discovered cases of domesticated transposable element (TE) proteins led to the recognition that TEs are a significant source of evolutionary innovation. However, much less is known about the reverse process, whether and to what degree the evolution of TEs is influenced by the genome of their hosts. We addressed this issue by searching for cases of incorporation of host genes into the sequence of TEs and examined the systems-level properties of these genes using the Saccharomyces cerevisiae and Drosophila melanogaster genomes. We identified 51 cases where the evolutionary scenario was the incorporation of a host gene fragment into a TE consensus sequence, and we show that both the yeast and fly homologues of the incorporated protein sequences have central positions in the cellular networks. An analysis of selective pressure (Ka/Ks ratio) detected significant selection in 37% of the cases. Recent research on retrovirus-host interactions shows that virus proteins preferentially target hubs of the host interaction networks enabling them to take over the host cell using only a few proteins. We propose that TEs face a similar evolutionary pressure to evolve proteins with high interacting capacities and take some of the necessary protein domains directly from their hosts.
Collapse
Affiliation(s)
- György Abrusán
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Center of the Hungarian Academy of Sciences, Temesváry krt. 62. Szeged H-6701, Hungary.
| | | | | | | |
Collapse
|
54
|
Li L, Petsch K, Shimizu R, Liu S, Xu WW, Ying K, Yu J, Scanlon MJ, Schnable PS, Timmermans MCP, Springer NM, Muehlbauer GJ. Mendelian and non-Mendelian regulation of gene expression in maize. PLoS Genet 2013; 9:e1003202. [PMID: 23341782 PMCID: PMC3547793 DOI: 10.1371/journal.pgen.1003202] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2012] [Accepted: 11/14/2012] [Indexed: 11/25/2022] Open
Abstract
Transcriptome variation plays an important role in affecting the phenotype of an organism. However, an understanding of the underlying mechanisms regulating transcriptome variation in segregating populations is still largely unknown. We sought to assess and map variation in transcript abundance in maize shoot apices in the intermated B73×Mo17 recombinant inbred line population. RNA–based sequencing (RNA–seq) allowed for the detection and quantification of the transcript abundance derived from 28,603 genes. For a majority of these genes, the population mean, coefficient of variation, and segregation patterns could be predicted by the parental expression levels. Expression quantitative trait loci (eQTL) mapping identified 30,774 eQTL including 96 trans-eQTL “hotspots,” each of which regulates the expression of a large number of genes. Interestingly, genes regulated by a trans-eQTL hotspot tend to be enriched for a specific function or act in the same genetic pathway. Also, genomic structural variation appeared to contribute to cis-regulation of gene expression. Besides genes showing Mendelian inheritance in the RIL population, we also found genes whose expression level and variation in the progeny could not be predicted based on parental difference, indicating that non-Mendelian factors also contribute to expression variation. Specifically, we found 145 genes that show patterns of expression reminiscent of paramutation such that all the progeny had expression levels similar to one of the two parents. Furthermore, we identified another 210 genes that exhibited unexpected patterns of transcript presence/absence. Many of these genes are likely to be gene fragments resulting from transposition, and the presence/absence of their transcripts could influence expression levels of their ancestral syntenic genes. Overall, our results contribute to the identification of novel expression patterns and broaden the understanding of transcriptional variation in plants. Phenotypes are determined by the expression of genes, the environment, and the interaction of gene expression and the environment. However, a complete understanding of the inheritance of and genome-wide regulation of gene expression is lacking. One approach, called expression quantitative trait locus (eQTL) mapping provides the opportunity to examine the genome-wide inheritance and regulation of gene expression. In this paper, we conducted high-throughput sequencing of gene transcripts to examine gene expression in the shoot apex of a maize biparental mapping population. We quantified expression levels from 28,603 genes in the population and showed that the vast majority of genes exhibited the expected pattern of Mendelian inheritance. We genetically mapped the expression patterns and identified genomic regions associated with gene expression. Notably, we detected gene expression patterns that exhibited non-Mendelian inheritance. These included 145 genes that exhibited expression patterns in the progeny that were similar to only one of the parents and 210 genes with unexpected presence/absence expression patterns. The findings of non-Mendelian inheritance underscore the complexity of gene expression and provide a framework for understanding these complexities.
Collapse
Affiliation(s)
- Lin Li
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Katherine Petsch
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Rena Shimizu
- Department of Plant Biology, Cornell University, Ithaca, New York, United States of America
| | - Sanzhen Liu
- Department of Genetics, Development, and Cell Biology, and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Wayne Wenzhong Xu
- Supercomputing Institute for Advanced Computational Research, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Kai Ying
- Department of Genetics, Development, and Cell Biology, and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Jianming Yu
- Department of Agronomy, Kansas State University, Manhattan, Kansas, United States of America
| | - Michael J. Scanlon
- Department of Plant Biology, Cornell University, Ithaca, New York, United States of America
| | - Patrick S. Schnable
- Department of Genetics, Development, and Cell Biology, and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | | | - Nathan M. Springer
- Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Gary J. Muehlbauer
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, Minnesota, United States of America
- Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- * E-mail:
| |
Collapse
|
55
|
Abstract
The initial identification of transposable elements (TEs) was attributed to the activity of DNA transposable elements, which are prevalent in plants. Unlike RNA elements, which accumulate in the gene-poor heterochromatic regions, most DNA elements are located in the gene rich regions and many of them carry genes or gene fragments. As such, DNA elements have a more intimate relationship with genes and may have an immediate impact on gene expression and gene function. DNA elements are structurally distinct from RNA elements and most of them have terminal inverted repeats (TIRs). Such structural features have been used to identify the relevant elements from genomic sequences. Among the DNA elements in plants, the most abundant type is the miniature inverted repeat transposable elements (MITEs). This chapter discusses the methods to identify MITEs, Helitrons, and other DNA transposable elements.
Collapse
Affiliation(s)
- Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
56
|
Eichten SR, Ellis NA, Makarevitch I, Yeh CT, Gent JI, Guo L, McGinnis KM, Zhang X, Schnable PS, Vaughn MW, Dawe RK, Springer NM. Spreading of heterochromatin is limited to specific families of maize retrotransposons. PLoS Genet 2012; 8:e1003127. [PMID: 23271981 PMCID: PMC3521669 DOI: 10.1371/journal.pgen.1003127] [Citation(s) in RCA: 121] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2012] [Accepted: 10/15/2012] [Indexed: 12/25/2022] Open
Abstract
Transposable elements (TEs) have the potential to act as controlling elements to influence the expression of genes and are often subject to heterochromatic silencing. The current paradigm suggests that heterochromatic silencing can spread beyond the borders of TEs and influence the chromatin state of neighboring low-copy sequences. This would allow TEs to condition obligatory or facilitated epialleles and act as controlling elements. The maize genome contains numerous families of class I TEs (retrotransposons) that are present in moderate to high copy numbers, and many are found in regions near genes, which provides an opportunity to test whether the spreading of heterochromatin from retrotransposons is prevalent. We have investigated the extent of heterochromatin spreading into DNA flanking each family of retrotransposons by profiling DNA methylation and di-methylation of lysine 9 of histone 3 (H3K9me2) in low-copy regions of the maize genome. The effects of different retrotransposon families on local chromatin are highly variable. Some retrotransposon families exhibit enrichment of heterochromatic marks within 800–1,200 base pairs of insertion sites, while other families exhibit very little evidence for the spreading of heterochromatic marks. The analysis of chromatin state in genotypes that lack specific insertions suggests that the heterochromatin in low-copy DNA flanking retrotransposons often results from the spreading of silencing marks rather than insertion-site preferences. Genes located near TEs that exhibit spreading of heterochromatin tend to be expressed at lower levels than other genes. Our findings suggest that a subset of retrotransposon families may act as controlling elements influencing neighboring sequences, while the majority of retrotransposons have little effect on flanking sequences. Transposable elements comprise a substantial portion of many eukaryotic genomes. These mobile fragments of DNA can directly mutate genes through insertions into coding regions but may also affect the gene regulation through nearby insertions. There is evidence that the majority of transposable elements are epigenetically silenced, and in some cases this silencing may spread to neighboring sequences. This spreading of heterochromatin could create a significant fitness tradeoff between transposon silencing and gene expression. The maize genome has a complex organization with many genes flanked by retrotransposons, providing an opportunity to study the interaction of retrotransposons and genes. To survey the prevalence of heterochromatin spreading associated with different retrotransposon families, we profiled the spread of heterochromatin into nearby low copy sequences for 150 high copy retrotransposon families. While many retrotransposons exhibit little to no spreading of heterochromatin, there are some retrotransposon families that do exhibit spreading. Genes located near retrotransposons that spread heterochromatin have lower expression levels. The families of retrotransposons that spread heterochromatin marks to nearby low-copy sequences may have increased fitness costs for the host genome due to their suppression of genes located near insertions.
Collapse
Affiliation(s)
- Steven R. Eichten
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Nathanael A. Ellis
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Irina Makarevitch
- Biology Department, Hamline University, Saint Paul, Minnesota, United States of America
| | - Cheng-Ting Yeh
- Center for Plant Genomics and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Jonathan I. Gent
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Lin Guo
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Karen M. McGinnis
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
| | - Xiaoyu Zhang
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Patrick S. Schnable
- Center for Plant Genomics and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Matthew W. Vaughn
- Texas Advanced Computing Center, University of Texas at Austin, Austin, Texas, United States of America
| | - R. Kelly Dawe
- Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Nathan M. Springer
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- * E-mail:
| |
Collapse
|
57
|
González LG, Deyholos MK. Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome. BMC Genomics 2012; 13:644. [PMID: 23171245 PMCID: PMC3544724 DOI: 10.1186/1471-2164-13-644] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2012] [Accepted: 11/15/2012] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. RESULTS Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. CONCLUSIONS The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of the genome. Since enrichment for TEs in genomic regions was associated with reduced expression of neighbouring genes, and many members of the Copia LTR superfamily are inserted close to coding regions, we suggest Copia elements have a greater influence on recent flax genome evolution while Gypsy elements have become residual and highly mutated.
Collapse
|
58
|
Affiliation(s)
- Nina V Fedoroff
- King Abdullah University of Science and Technology, Saudi Arabia.
| |
Collapse
|
59
|
|
60
|
Young GJ, Zhang S, Mirsky HP, Cressman RF, Cong B, Ladics GS, Zhong CX. Assessment of possible allergenicity of hypothetical ORFs in common food crops using current bioinformatic guidelines and its implications for the safety assessment of GM crops. Food Chem Toxicol 2012; 50:3741-51. [PMID: 22867756 DOI: 10.1016/j.fct.2012.07.044] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Revised: 07/18/2012] [Accepted: 07/20/2012] [Indexed: 01/14/2023]
Abstract
Before a genetically modified (GM) crop can be commercialized it must pass through a rigorous regulatory process to verify that it is safe for human and animal consumption, and to the environment. One particular area of focus is the potential introduction of a known or cross-reactive allergen not previously present within the crop. The assessment of possible allergenicity uses the guidelines outlined by the Food and Agriculture Organization (FAO) and World Health Organization's (WHO) Codex Alimentarius Commission (Codex) to evaluate all newly expressed proteins. Some regulatory authorities have broadened the scope of the assessment to include all DNA reading frames between stop codons across the insert and spanning the insert/genomic DNA junctions. To investigate the utility of this bioinformatic assessment, all naturally occurring stop-to-stop frames in the non-transgenic genomes of maize, rice, and soybean, as well as the human genome, were compared against the AllergenOnline (www.allergenonline.org) database using the Codex criteria. We discovered thousands of frames that exceeded the Codex defined threshold for potential cross-reactivity suggesting that evaluating hypothetical ORFs (stop-to-stop frames) has questionable value for making decisions on the safety of GM crops.
Collapse
Affiliation(s)
- Gregory J Young
- Pioneer Hi-Bred International, Inc., DuPont Agricultural Biotechnology, Wilmington, DE 19880, USA.
| | | | | | | | | | | | | |
Collapse
|
61
|
Yang G, Wong A, Rooke R. ATon, abundant novel nonautonomous mobile genetic elements in yellow fever mosquito (Aedes aegypti). BMC Genomics 2012; 13:283. [PMID: 22738224 PMCID: PMC3422177 DOI: 10.1186/1471-2164-13-283] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2012] [Accepted: 06/27/2012] [Indexed: 12/16/2022] Open
Abstract
Background Mosquitoes are important pathogen vectors affecting human and other animals. Studies on genetic control of mosquito mediated disease transmission gained traction recently due to mosquito transgenesis technology. Active transposons are considered valuable tools to propagate pathogen resistance transgenes among mosquitoes, rendering the whole population recalcitrant to diseases. A major hurdle in this approach is the inefficient remobilization activity after the integration of heterologous transposon vectors bearing transgenes into chromosomes. Therefore, endogenous active transposons in mosquito genomes are highly desirable. Results Starting with the transposable element database of the yellow fever mosquito Aedes aegypti genome, detailed analyses of the members of each TE family were performed to identify sequences with multiple identical copies, an indicator of their latest or current transposition activity. Among a dozen of potentially active TE families, two DNA elements (TF000728 and TF000742 in TEfam) are short and nonautonomous. Close inspection of the elements revealed that these two families were previously mis-categorized and, unlike other known TEs, insert specifically at dinucleotide “AT”. These two families were therefore designated as ATon-I and ATon-II. ATon-I has a total copy number of 294, among which three elements have more than 10 identical copies (146, 61 and 17). ATon-II has a total copy number of 317, among which three elements have more than 10 identical copies (84, 15 and 12). Genome wide searches revealed additional 24 ATon families in A. aegypti genome with nearly 6500 copies in total. Transposon display analysis of ATon-1 family using different A. aegypti strains suggests that the elements are similarly abundant in the tested mosquito strains. Conclusion ATons are novel mobile genetic elements bearing terminal inverted repeats and insert specifically at dinucleotide “AT”. Five ATon families contain elements existing at more than 10 identical copies, suggesting very recent or current transposition activity. A total of 24 new TE families with nearly 6000 copies were identified in this study.
Collapse
Affiliation(s)
- Guojun Yang
- Department of Biology, University of Toronto Mississauga, SB3058, Mississauga, ON, L5L 1 C6, Canada.
| | | | | |
Collapse
|
62
|
Freeling M, Woodhouse MR, Subramaniam S, Turco G, Lisch D, Schnable JC. Fractionation mutagenesis and similar consequences of mechanisms removing dispensable or less-expressed DNA in plants. CURRENT OPINION IN PLANT BIOLOGY 2012; 15:131-9. [PMID: 22341793 DOI: 10.1016/j.pbi.2012.01.015] [Citation(s) in RCA: 123] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Revised: 12/07/2011] [Accepted: 01/21/2012] [Indexed: 05/06/2023]
Abstract
Unlike in mammals, plants rapidly delete functionless, nonrepetitive DNA from their genomes. Following paleopolyploidies, duplicate genes are deleted by intrachromosomal recombination. This may explain how flowering plants have survived multiple whole genome duplications. Genes are disproportionately lost from one parental subgenome, the subgenome that is less expressed in the polyploid. The origin of this unbalanced expression between genomes remains unknown. The consequences of the tradeoffs between transposon repression and gene expression represent one potential explanation of genome dominance. If so, the same mechanisms may act in heterosis: genome dominance is like inbreeding depression. Regulatory DNA deletion following polyploidy combined with abundant RNA-seq expression datasets are being used to generate testable hypothesizes regarding the function of specific cis-regulatory sequences.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA.
| | | | | | | | | | | |
Collapse
|
63
|
Janicki M, Rooke R, Yang G. Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes. Chromosome Res 2012; 19:787-808. [PMID: 21850457 DOI: 10.1007/s10577-011-9230-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.
Collapse
Affiliation(s)
- Mateusz Janicki
- Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, Mississauga, ON L5L1C6, Canada
| | | | | |
Collapse
|
64
|
Jiang N, Visa S, Wu S, van der Knaap E. Rider Transposon Insertion and Phenotypic Change in Tomato. PLANT TRANSPOSABLE ELEMENTS 2012. [DOI: 10.1007/978-3-642-31842-9_15] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
65
|
Dong Y, Lu X, Song W, Shi L, Zhang M, Zhao H, Jiao Y, Lai J. Structural characterization of helitrons and their stepwise capturing of gene fragments in the maize genome. BMC Genomics 2011; 12:609. [PMID: 22177531 PMCID: PMC3288121 DOI: 10.1186/1471-2164-12-609] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Accepted: 12/17/2011] [Indexed: 12/30/2022] Open
Abstract
Background As a newly identified category of DNA transposon, helitrons have been found in a large number of eukaryotes genomes. Helitrons have contributed significantly to the intra-specific genome diversity in maize. Although many characteristics of helitrons in the maize genome have been well documented, the sequence of an intact autonomous helitrons has not been identified in maize. In addition, the process of gene fragment capturing during the transposition of helitrons has not been characterized. Results The whole genome sequences of maize inbred line B73 were analyzed, 1,649 helitron-like transposons including 1,515 helAs and 134 helBs were identified. ZmhelA1, ZmhelB1 and ZmhelB2 all encode an open reading frame (ORF) with intact replication initiator (Rep) motif and a DNA helicase (Hel) domain, which are similar to previously reported autonomous helitrons in other organisms. The putative autonomous ZmhelB1 and ZmhelB2 contain an extra replication factor-a protein1 (RPA1) transposase (RPA-TPase) including three single strand DNA-binding domains (DBD)-A/-B/-C in the ORF. Over ninety percent of maize helitrons identified have captured gene fragments. HelAs and helBs carry 4,645 and 249 gene fragments, which yield 2,507 and 187 different genes respectively. Many helitrons contain mutilple terminal sequences, but only one 3'-terminal sequence had an intact "CTAG" motif. There were no significant differences in the 5'-termini sequence between the veritas terminal sequence and the pseudo sequence. Helitrons not only can capture fragments, but were also shown to lose internal sequences during the course of transposing. Conclusions Three putative autonomous elements were identified, which encoded an intact Rep motif and a DNA helicase domain, suggesting that autonomous helitrons may exist in modern maize. The results indicate that gene fragments captured during the transposition of many helitrons happen in a stepwise way, with multiple gene fragments within one helitron resulting from several sequential transpositions. In addition, we have proposed a potential mechanism regarding how helitrons with multiple termini are generated.
Collapse
Affiliation(s)
- Yongbin Dong
- State Key Laboratory of Agrobiotechnology and National Maize Improvement Center, Department of Plant Genetics and Breeding, China Agricultural University, Beijing, 100193, China
| | | | | | | | | | | | | | | |
Collapse
|
66
|
Abstract
Helitrons are a family of mobile elements that were discovered in 2001 and are now known to exist in the entire eukaryotic kingdom. Helitrons, particularly those of maize, exhibit an intriguing property of capturing gene fragments and placing them into the mobile element. Helitron-captured genes are sometimes transcribed, giving birth to chimeric transcripts that intertwine coding regions of different captured genes. Here, we perused the B73 maize genome for high-quality, putative Helitrons that exhibit plus/minus polymorphisms and contain pieces of more than one captured gene. Selected Helitrons were monitored for expression via in silico EST analysis. Intriguingly, expression validation of selected elements by RT–PCR analysis revealed multiple transcripts not seen in the EST databases. The differing transcripts were generated by alternative selection of splice sites during pre-mRNA processing. Selection of splice sites was not random since different patterns of splicing were observed in the root and shoot tissues. In one case, an exon residing in close proximity but outside of the Helitron was found conjoined with Helitron-derived exons in the mature transcript. Hence, Helitrons have the ability to synthesize new genes not only by placing unrelated exons into common transcripts, but also by transcription readthrough and capture of nearby exons. Thus, Helitrons have a phenomenal ability to “display” new coding regions for possible selection in nature. A highly conservative, minimum estimate of the number of new transcripts expressed by Helitrons is ∼11,000 or ∼25% of the total number of genes in the maize genome.
Collapse
|
67
|
Du C, Hoffman A, He L, Caronna J, Dooner HK. The complete Ac/Ds transposon family of maize. BMC Genomics 2011; 12:588. [PMID: 22132901 PMCID: PMC3260210 DOI: 10.1186/1471-2164-12-588] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2011] [Accepted: 12/01/2011] [Indexed: 01/29/2023] Open
Abstract
Background The nonautonomous maize Ds transposons can only move in the presence of the autonomous element Ac. They comprise a heterogeneous group that share 11-bp terminal inverted repeats (TIRs) and some subterminal repeats, but vary greatly in size and composition. Three classes of Ds elements can cause mutations: Ds-del, internal deletions of the 4.6-kb Ac element; Ds1, ~400-bp in size and sharing little homology with Ac, and Ds2, variably-sized elements containing about 0.5 kb from the Ac termini and unrelated internal sequences. Here, we analyze the entire complement of Ds-related sequences in the genome of the inbred B73 and ask whether additional classes of Ds-like (Ds-l) elements, not uncovered genetically, are mobilized by Ac. We also compare the makeup of Ds-related sequences in two maize inbreds of different origin. Results We found 903 elements with 11-bp Ac/Ds TIRs flanked by 8-bp target site duplications. Three resemble Ac, but carry small rearrangements. The others are much shorter, once extraneous insertions are removed. There are 331 Ds1 and 39 Ds2 elements, many of which are likely mobilized by Ac, and two novel classes of Ds-l elements. Ds-l3 elements lack subterminal homology with Ac, but carry transposase gene fragments, and represent decaying Ac elements. There are 44 such elements in B73. Ds-l4 elements share little similarity with Ac outside of the 11-bp TIR, have a modal length of ~1 kb, and carry filler DNA which, in a few cases, could be matched to gene fragments. Most Ds-related elements in B73 (486/903) fall in this class. None of the Ds-l elements tested responded to Ac. Only half of Ds insertion sites examined are shared between the inbreds B73 and W22. Conclusions The majority of Ds-related sequences in maize correspond to Ds-l elements that do not transpose in the presence of Ac. Unlike actively transposing elements, many Ds-l elements are inserted in repetitive DNA, where they probably become methylated and begin to decay. The filler DNA present in most elements is occasionally captured from genes, a rare feature in transposons of the hAT superfamily to which Ds belongs. Maize inbreds of different origin are highly polymorphic in their DNA transposon makeup.
Collapse
Affiliation(s)
- Chunguang Du
- Dept. of Biology & Molecular Biology, Montclair State University, NJ 07043, USA.
| | | | | | | | | |
Collapse
|
68
|
Eichten SR, Foerster JM, de Leon N, Kai Y, Yeh CT, Liu S, Jeddeloh JA, Schnable PS, Kaeppler SM, Springer NM. B73-Mo17 near-isogenic lines demonstrate dispersed structural variation in maize. PLANT PHYSIOLOGY 2011; 156:1679-90. [PMID: 21705654 PMCID: PMC3149956 DOI: 10.1104/pp.111.174748] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Recombinant inbred lines developed from the maize (Zea mays ssp. mays) inbreds B73 and Mo17 have been widely used to discover quantitative trait loci controlling a wide variety of phenotypic traits and as a resource to produce high-resolution genetic maps. These two parents were used to produce a set of near-isogenic lines (NILs) with small regions of introgression into both backgrounds. A novel array-based genotyping platform was used to score genotypes of over 7,000 loci in 100 NILs with B73 as the recurrent parent and 50 NILs with Mo17 as the recurrent parent. This population contains introgressions that cover the majority of the maize genome. The set of NILs displayed an excess of residual heterozygosity relative to the amount expected based on their pedigrees, and this excess residual heterozygosity is enriched in the low-recombination regions near the centromeres. The genotyping platform provided the ability to survey copy number variants that exist in more copies in Mo17 than in B73. The majority of these Mo17-specific duplications are located in unlinked positions throughout the genome. The utility of this population for the discovery and validation of quantitative trait loci was assessed through analysis of plant height variation.
Collapse
|
69
|
Pack-Mutator-like transposable elements (Pack-MULEs) induce directional modification of genes through biased insertion and DNA acquisition. Proc Natl Acad Sci U S A 2011; 108:1537-42. [PMID: 21220310 DOI: 10.1073/pnas.1010814108] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
In monocots, many genes demonstrate a significant negative GC gradient, meaning that the GC content declines along the orientation of transcription. Such a gradient is not observed in the genes of the dicot plant Arabidopsis. In addition, a lack of homology is often observed when comparing the 5' end of the coding region of orthologous genes in rice and Arabidopsis. The reasons for these differences have been enigmatic. The presence of GC-rich sequences at the 5' end of genes may influence the conformation of chromatin, the expression level of genes, as well as the recombination rate. Here we show that Pack-Mutator-like transposable elements (Pack-MULEs) that carry gene fragments specifically acquire GC-rich fragments and preferentially insert into the 5' end of genes. The resulting Pack-MULEs form independent, GC-rich transcripts with a negative GC gradient. Alternatively, the Pack-MULEs evolve into additional exons at the 5' end of existing genes, thus altering the GC content in those regions. We demonstrate that Pack-MULEs modify the 5' end of genes and are at least partially responsible for the negative GC gradient of genes in grasses. Such a unique and global impact on gene composition and gene structure has not been observed for any other transposable elements.
Collapse
|
70
|
Lisch D, Slotkin RK. Strategies for silencing and escape: the ancient struggle between transposable elements and their hosts. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2011; 292:119-52. [PMID: 22078960 DOI: 10.1016/b978-0-12-386033-0.00003-7] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Over the past several years, there has been an explosion in our understanding of the mechanisms by which plant transposable elements (TEs) are epigenetically silenced and maintained in an inactive state over long periods of time. This highly efficient process results in vast numbers of inactive TEs; indeed, the majority of many plant genomes are composed of these quiescent elements. This observation has led to the rather static view that TEs represent an essentially inert portion of plant genomes. However, recent work has demonstrated that TE silencing is a highly dynamic process that often involves transcription of TEs at particular times and places during plant development. Plants appear to use transcripts from silenced TEs as an ongoing source of information concerning the mobile portion of the genome. In contrast to our understanding of silencing pathways, we know relatively little about the ways in which TEs evade silencing. However, vast differences in TE content between even closely related plant species suggest that they are often wildly successful at doing so. Here, we discuss TE activity in plants as the result of a constantly shifting balance between host strategies for TE silencing and TE strategies for escape and amplification.
Collapse
Affiliation(s)
- Damon Lisch
- Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
| | | |
Collapse
|
71
|
Thomas J, Schaack S, Pritham EJ. Pervasive horizontal transfer of rolling-circle transposons among animals. Genome Biol Evol 2010; 2:656-64. [PMID: 20693155 PMCID: PMC2997563 DOI: 10.1093/gbe/evq050] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Horizontal transfer (HT) of genes is known to be an important mechanism of genetic innovation, especially in prokaryotes. The impact of HT of transposable elements (TEs), however, has only recently begun to receive widespread attention and may be significant due to their mutagenic potential, inherent mobility, and abundance. Helitrons, also known as rolling-circle transposons, are a distinctive subclass of TE with a unique transposition mechanism. Here, we describe the first evidence for the repeated HT of four different families of Helitrons in an unprecedented array of organisms, including mammals, reptiles, fish, invertebrates, and insect viruses. The Helitrons present in these species have a patchy distribution and are closely related (80–98% sequence identity), despite the deep divergence times among hosts. Multiple lines of evidence indicate the extreme conservation of sequence identity is not due to selection, including the highly fragmented nature of the Helitrons identified and the lack of any signatures of selection at the nucleotide level. The presence of horizontally transferred Helitrons in insect viruses, in particular, suggests that this may represent a potential mechanism of transfer in some taxa. Unlike genes, Helitrons that have horizontally transferred into new host genomes can amplify, in some cases reaching up to several hundred copies and representing a substantial fraction of the genome. Because Helitrons are known to frequently capture and amplify gene fragments, HT of this unique group of DNA transposons could lead to horizontal gene transfer and incur dramatic shifts in the trajectory of genome evolution.
Collapse
Affiliation(s)
- Jainy Thomas
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA
| | | | | |
Collapse
|
72
|
Tenaillon MI, Hollister JD, Gaut BS. A triptych of the evolution of plant transposable elements. TRENDS IN PLANT SCIENCE 2010; 15:471-8. [PMID: 20541961 DOI: 10.1016/j.tplants.2010.05.003] [Citation(s) in RCA: 179] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Revised: 05/06/2010] [Accepted: 05/12/2010] [Indexed: 05/06/2023]
Abstract
Transposable elements (TEs) constitute the majority of angiosperm DNA, but the processes that govern their accumulation remain mysterious. Here we discuss the three major forces that govern the accumulation of TEs, corresponding to the three panels of a triptych. The first force, transposition, creates new copies of TEs, but is regulated by both host- and TE-specific mechanisms. The second force, deletion of TE DNA, is capable of removing vast swaths of genomic regions via recombinational processes, but we still have very little insight into how deletion varies across species and even among TE types. Finally, we focus on the often-ignored third panel of our triptych - the population processes that determine the ultimate evolutionary fate of TE insertions.
Collapse
Affiliation(s)
- Maud I Tenaillon
- CNRS, UMR 0320/UMR8120 Génétique Végétale, F-91190 Gif-sur-Yvette, France
| | | | | |
Collapse
|
73
|
Coates BS, Sumerford DV, Hellmich RL, Lewis LC. A helitron-like transposon superfamily from lepidoptera disrupts (GAAA)(n) microsatellites and is responsible for flanking sequence similarity within a microsatellite family. J Mol Evol 2010; 70:275-88. [PMID: 20217059 DOI: 10.1007/s00239-010-9330-6] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2008] [Accepted: 02/17/2010] [Indexed: 12/20/2022]
Abstract
Transposable elements (TEs) are mobile DNA regions that alter host genome structure and gene expression. A novel 588 bp non-autonomous high copy number TE in the Ostrinia nubilalis genome has features in common with miniature inverted-repeat transposable elements (MITEs): high A + T content (62.3%), lack of internal protein coding sequence, and secondary structure consisting of subterminal inverted repeats (SIRs). The O. nubilalis TE has inserted at (GAAA)(n) microsatellite loci, and was named the microsatellite-associated interspersed nuclear element (MINE-1). Non-autonomous MINE-1 superfamily members also were identified downstream of (GAAA)(n) microsatellites within Bombyx mori and Pectinophora gossypiella genomes. Of 316 (GAAA)(n) microsatellites from the B. mori whole genome sequence, 201 (63.6%) have associated autonomous or non-autonomous MINE-1 elements. Autonomous B. mori MINE-1s a encode a helicase and endonuclease domain RepHel-like protein (BMHELp1) indicating their classification as Helitron-like transposons and were renamed Helitron1_BM. Transposition of MINE-1 members in Lepidoptera has resulted in the disruption of (GAAA)(n) microsatellite loci, has impacted the application of microsatellite-based genetic markers, and suggests genome sequence that flanks TT/AA dinucleotides may be required for target site recognition by RepHel endonuclease domains.
Collapse
|
74
|
Muers M. Genomics: Maize looks set to amaze. Nat Rev Genet 2010; 11:6. [PMID: 20050275 DOI: 10.1038/nrg2724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
75
|
Yang L, Bennetzen JL. Distribution, diversity, evolution, and survival of Helitrons in the maize genome. Proc Natl Acad Sci U S A 2009; 106:19922-7. [PMID: 19926865 PMCID: PMC2785268 DOI: 10.1073/pnas.0908008106] [Citation(s) in RCA: 119] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2009] [Indexed: 01/11/2023] Open
Abstract
Homology and structure-based approaches were used to identify Helitrons in the genome of maize inbred B73. A total of 1,930 intact Helitrons from eight families (62 subfamilies) and >20,000 Helitron fragments were identified, accounting for approximately 2.2% of the B73 genome. Transposition of at least one of these families is ongoing, but the most prominent burst of amplification activity was approximately 250,000 years ago. Sixty percent of maize Helitrons were found to have captured fragments of nuclear genes ( approximately 840 different fragment acquisitions, with tens of thousands of predicted gene fragments inside Helitrons within the B73 assembly). Most acquired gene fragments are undergoing random drift, but 4% were calculated to be under purifying selection, whereas another 4% exhibit apparent adaptive selection, suggesting beneficial effects for the host or Helitron transposition/retention. Gene fragment capture is frequent in some Helitron subfamilies, with as many as 10 unlinked genes providing DNA inserts within a single element. Gene fragment acquisition appears to positively influence element survival and/or ability of the Helitron to acquire additional gene fragments. Helitrons with gene fragment captures in the antisense orientation have a lesser chance of survival. Helitron distribution in maize exhibits severe biases, including preferential accumulation in relatively gene-rich regions. Insertions, however, are not usually found inside genes. Rather, Helitrons preferentially insert near (but not into) other Helitrons. This biased accumulation is not caused by a preference for cis or nearby transposition, suggesting a specific association between Helitron integration functions and unknown chromatin characteristics that specifically mark Helitrons.
Collapse
Affiliation(s)
- Lixing Yang
- Department of Genetics, University of Georgia, Athens, GA 30602
| | | |
Collapse
|
76
|
|