1
Haas M, Kono T, Macchietto M, Millas R, McGilp L, Shao M, Duquette J, Qiu Y, Hirsch CN, Kimball J. Whole-genome assembly and annotation of northern wild rice, Zizania palustris L., supports a whole-genome duplication in the Zizania genus. The Plant Journal: For Cell and Molecular Biology 2021; 107:1802-1818. [PMID: 34310794 DOI: 10.1111/tpj.15419] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/14/2021] [Revised: 06/16/2021] [Accepted: 07/06/2021] [Indexed: 06/13/2023]
Abstract
Zizania palustris L. (northern wild rice, NWR) is an aquatic grass native to North America that is notable for its nutritious grain. This is an important species with ecological, cultural and agricultural significance, specifically in the Great Lakes region of the USA. Using flow cytometry, we first estimated the NWR genome size to be 1.8 Gb. Using long- and short-range sequencing, Hi-C scaffolding and RNA-seq data from eight tissues, we generated an annotated whole-genome de novo assembly of NWR. The assembly was 1.29 Gb in length, highly repetitive (approx. 76.0%) and contained 46 421 putative protein-coding genes. The expansion of retrotransposons within the genome and a whole-genome duplication (WGD) after the Zizania-Oryza speciation event have both led to an increase in the genome size of NWR in comparison with Oryza sativa L. and Zizania latifolia. Both events depict a genome rapidly undergoing change over a short evolutionary time. Comparative analyses revealed the conservation of large syntenic blocks between NWR and O. sativa, which were used to identify putative seed-shattering genes. Estimates of divergence times revealed that the Zizania genus diverged from Oryza approximately 26-30 million years ago (26-30 MYA), whereas NWR and Z. latifolia diverged from one another approximately 6-8 MYA. Comparative genomics confirmed evidence of a WGD in the Zizania genus and provided support that the event occurred prior to the NWR-Z. latifolia speciation event. This genome assembly and annotation provides a valuable resource for comparative genomics in the Oryzeae tribe and provides an important resource for future conservation and breeding efforts of NWR.
Affiliation(s)
- Matthew Haas
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
- Thomas Kono
  - Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, 55455, USA
- Marissa Macchietto
  - Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, 55455, USA
- Reneth Millas
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
- Lillian McGilp
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
- Mingqin Shao
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
- Jacques Duquette
  - North Central Research and Outreach Center, University of Minnesota, Grand Rapids, MN, 55744, USA
- Yinjie Qiu
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
- Candice N Hirsch
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
- Jennifer Kimball
  - Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
2
Záveská Drábková L, Honys D, Motyka V. Evolutionary diversification of cytokinin-specific glucosyltransferases in angiosperms and enigma of missing cis-zeatin O-glucosyltransferase gene in Brassicaceae. Sci Rep 2021; 11:7885. [PMID: 33846460 PMCID: PMC8041765 DOI: 10.1038/s41598-021-87047-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/06/2020] [Accepted: 03/23/2021] [Indexed: 11/09/2022] Open
Abstract
In the complex process of cytokinin (CK) phytohormone homeostasis, O-glucosylation catalyzed by specific O-glucosyltransferases represents one of the important mechanisms of reversible CK inactivation. The CK O-glucosyltransferases belong to a highly divergent and polyphyletic multigene superfamily of glycosyltransferases, of which subfamily 1, containing UDP-glycosyltransferases (UGTs), is the largest in the plant kingdom. It contains the recently discovered O and P subfamilies, which are present in higher plant species but not in Arabidopsis thaliana. The cis-zeatin O-glucosyltransferase (cisZOG) genes belong to the O subfamily and encode enzymes for the stereo-specific O-glucosylation of cis-zeatin-type CKs. We studied different homologous genes, their domains and motifs, and performed a phylogenetic reconstruction to elucidate the evolution of the cisZOG gene in plants. We found that the cisZOG homologs do not form a clear separate clade, indicating that diversification of the cisZOG gene took place after the diversification of the main angiosperm families, probably within genera or closely related groups. We confirmed that the gene(s) from group O is(are) not present in A. thaliana and is(are) also missing in the family Brassicaceae. However, cisZOG or its metabolites are found within the Brassicaceae clade, indicating that the remaining genes from other groups (UGT73, group D, and UGT85, group G) are able, at least in part, to substitute for the function of group O lost during evolution. This study is the first detailed evolutionary evaluation of relationships among different plant ZOGs within angiosperms.
Affiliation(s)
- Lenka Záveská Drábková
  - Laboratory of Pollen Biology, Institute of Experimental Botany of the Czech Academy of Sciences, Rozvojová 263, 165 02, Prague 6, Czech Republic
- David Honys
  - Laboratory of Pollen Biology, Institute of Experimental Botany of the Czech Academy of Sciences, Rozvojová 263, 165 02, Prague 6, Czech Republic
- Václav Motyka
  - Laboratory of Hormonal Regulations in Plants, Institute of Experimental Botany of the Czech Academy of Sciences, Rozvojová 263, 165 02, Prague 6, Czech Republic
3
Genome wide annotation and characterization of young, intact long terminal repeat retrotransposons (In-LTR-RTs) of seven legume species. Genetica 2020; 148:253-268. [PMID: 32949338 DOI: 10.1007/s10709-020-00103-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/29/2019] [Accepted: 09/02/2020] [Indexed: 10/23/2022]
Abstract
The availability of genome sequences for different legume species has provided an opportunity to characterize the abundance, distribution, and divergence of canonical intact long terminal repeat retrotransposon (In-LTR-RT) superfamilies. Among seven legume species, Arachis ipaensis (Aip) showed the highest number of full-length canonical In-LTR-RTs (3325), followed by Glycine max (Gma, 2328), Vigna angularis (Van, 1625), Arachis duranensis (Adu, 1348), Lotus japonicus (Lja, 1294), Medicago truncatula (Mtr, 788), and Cicer arietinum (Car, 124). Divergence time analysis demonstrated that the amplification timeframe of LTR-RTs varied dramatically among families. The average insertion time of Copia elements varied from 0.51 (Van) to 1.37 million years ago (Mya) (Adu and Aip), whereas that of Gypsy elements was between 0.22 (Mtr) and 1.82 Mya (Adu). Bayesian phylogenetic tree analysis suggested that the 1397 and 1917 reverse transcriptase (RT) domains of the Copia and Gypsy families of the seven legume species clustered into 7 and 14 major groups, respectively. The highest proportion (approximately 94.79-100%) of transposable element (TE)-associated genes assigned to pathways was mapped to metabolism-related pathways in all species. The results enabled a structural understanding of full-length In-LTR-RTs and will be a valuable resource for further study of the impact of TEs on gene structure and expression in legume species.
4
Wicker T, Gundlach H, Spannagl M, Uauy C, Borrill P, Ramírez-González RH, De Oliveira R, Mayer KFX, Paux E, Choulet F. Impact of transposable elements on genome structure and evolution in bread wheat. Genome Biol 2018; 19:103. [PMID: 30115100 PMCID: PMC6097303 DOI: 10.1186/s13059-018-1479-0] [Citation(s) in RCA: 155] [Impact Index Per Article: 25.8] [Received: 01/25/2018] [Accepted: 07/11/2018] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND: Transposable elements (TEs) are major components of large plant genomes and main drivers of genome evolution. The most recent assembly of hexaploid bread wheat recovered the highly repetitive TE space in an almost complete chromosomal context and enabled a detailed view into the dynamics of TEs in the A, B, and D subgenomes. RESULTS: The overall TE content is very similar between the A, B, and D subgenomes, although we find no evidence for bursts of TE amplification after the polyploidization events. Despite the near-complete turnover of TEs since the subgenome lineages diverged from a common ancestor, 76% of TE families are still present in similar proportions in each subgenome. Moreover, spacing between syntenic genes is also conserved, even though syntenic TEs have been replaced by new insertions over time, suggesting that distances between genes, but not sequences, are under evolutionary constraints. The TE composition of the immediate gene vicinity differs from the core intergenic regions. We find the same TE families to be enriched or depleted near genes in all three subgenomes. Evaluations at the subfamily level of timed long terminal repeat-retrotransposon insertions highlight the independent evolution of the diploid A, B, and D lineages before polyploidization and cases of concerted proliferation in the AB tetraploid. CONCLUSIONS: Even though the intergenic space is changed by the TE turnover, an unexpected preservation is observed between the A, B, and D subgenomes for features like TE family proportions, gene spacing, and TE enrichment near genes.
Affiliation(s)
- Thomas Wicker
  - Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
- Heidrun Gundlach
  - PGSB Plant Genome and Systems Biology, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- Manuel Spannagl
  - PGSB Plant Genome and Systems Biology, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- Cristobal Uauy
  - Department of Crop Genetics, John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
- Philippa Borrill
  - Department of Crop Genetics, John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
- Romain De Oliveira
  - GDEC, INRA, UCA (Université Clermont Auvergne), Clermont-Ferrand, France
- Klaus F X Mayer
  - PGSB Plant Genome and Systems Biology, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
  - School of Life Sciences, Technical University Munich, Munich, Germany
- Etienne Paux
  - GDEC, INRA, UCA (Université Clermont Auvergne), Clermont-Ferrand, France
- Frédéric Choulet
  - GDEC, INRA, UCA (Université Clermont Auvergne), Clermont-Ferrand, France
5
Divergent Transactivation of Maize Storage Protein Zein Genes by the Transcription Factors Opaque2 and OHPs. Genetics 2016; 204:581-591. [PMID: 27474726 PMCID: PMC5068848 DOI: 10.1534/genetics.116.192385] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Received: 06/07/2016] [Accepted: 07/25/2016] [Indexed: 11/18/2022] Open
Abstract
Maize transcription factors (TFs) opaque2 (O2) and the O2 heterodimerizing proteins (OHP1 and OHP2) originated from an ancient segmental duplication. The 22-kDa (z1C) and 19-kDa (z1A, z1B, and z1D) α-zeins are the most abundant storage proteins in maize endosperm. O2 is known to regulate α-zein gene expression, but its target motifs in the 19-kDa α-zein gene promoters have not been identified. The mechanisms underlying the regulation of α-zein genes by these TFs are also not well understood. In this study, we found that the O2 binding motifs in the α-zein gene promoters are quite flexible, with ACGT being present in the z1C and z1A promoters and a variant, ACAT, being present in the z1B and z1D promoters. OHPs recognized and transactivated all of the α-zein promoters, although to much lower levels than did O2. In the presence of O2, the suppression of OHPs did not cause a significant reduction in the transcription of α-zein genes, but in the absence of O2, OHPs were critical for the expression of residual levels of α-zeins. These findings demonstrated that O2 is the primary TF and that OHPs function as minor TFs in this process. This relationship is the converse of that involved in 27-kDa γ-zein gene regulation, indicating that the specificities of O2 and the OHPs for regulating zein genes diverged after gene duplication. The prolamine-box binding factor by itself has limited transactivation activity, but it promotes the binding of O2 to O2 motifs, resulting in the synergistic transactivation of α-zein genes.
6
Establishment of a loop-mediated isothermal amplification (LAMP) detection method for genetically modified maize MON88017. Eur Food Res Technol 2016. [DOI: 10.1007/s00217-016-2678-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Indexed: 01/16/2023]
7
Zhang M, Zhen Z, Yu Y, Gao X, Liu Y. Development of a Rapid Event-Specific Loop-Mediated Isothermal Amplification Detection Method for Genetically Modified Maize NK603. Food Anal Methods 2015. [DOI: 10.1007/s12161-015-0244-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/29/2022]
8
Transposable elements contribute to activation of maize genes in response to abiotic stress. PLoS Genet 2015; 11:e1004915. [PMID: 25569788 PMCID: PMC4287451 DOI: 10.1371/journal.pgen.1004915] [Citation(s) in RCA: 238] [Impact Index Per Article: 26.4] [Received: 08/14/2014] [Accepted: 11/24/2014] [Indexed: 11/25/2022] Open
Abstract
Transposable elements (TEs) account for a large portion of the genome in many eukaryotic species. Despite their reputation as “junk” DNA or genomic parasites deleterious for the host, TEs have complex interactions with host genes and the potential to contribute to regulatory variation in gene expression. It has been hypothesized that TEs and genes they insert near may be transcriptionally activated in response to stress conditions. The maize genome, with many different types of TEs interspersed with genes, provides an ideal system to study the genome-wide influence of TEs on gene regulation. To analyze the magnitude of the TE effect on gene expression response to environmental changes, we profiled gene and TE transcript levels in maize seedlings exposed to a number of abiotic stresses. Many genes exhibit up- or down-regulation in response to these stress conditions. The analysis of TE families inserted within upstream regions of up-regulated genes revealed that between four and nine different TE families are associated with up-regulated gene expression in each of these stress conditions, affecting up to 20% of the genes up-regulated in response to abiotic stress, and as many as 33% of genes that are only expressed in response to stress. Expression of many of these same TE families also responds to the same stress conditions. The analysis of the stress-induced transcripts and proximity of the transposon to the gene suggests that these TEs may provide local enhancer activities that stimulate stress-responsive gene expression. Our data on allelic variation for insertions of several of these TEs show strong correlation between the presence of TE insertions and stress-responsive up-regulation of gene expression. Our findings suggest that TEs provide an important source of allelic regulatory variation in gene response to abiotic stress in maize.

Transposable elements are mobile DNA elements that are a prevalent component of many eukaryotic genomes. While transposable elements can often have deleterious effects through insertions into protein-coding genes they may also contribute to regulatory variation of gene expression. There are a handful of examples in which specific transposon insertions contribute to regulatory variation of nearby genes, particularly in response to environmental stress. We sought to understand the genome-wide influence of transposable elements on gene expression responses to abiotic stress in maize, a plant with many families of transposable elements located in between genes. Our analysis suggests that a small number of maize transposable element families may contribute to the response of nearby genes to abiotic stress by providing stress-responsive enhancer-like functions. The specific insertions of transposable elements are often polymorphic within a species. Our data demonstrate that allelic variation for insertions of the transposable elements associated with stress-responsive expression can contribute to variation in the regulation of nearby genes. Thus novel insertions of transposable elements provide a potential mechanism for genes to acquire cis-regulatory influences that could contribute to heritable variation for stress response.
9
Colasuonno P, Maria MA, Blanco A, Gadaleta A. Description of durum wheat linkage map and comparative sequence analysis of wheat mapped DArT markers with rice and Brachypodium genomes. BMC Genet 2013; 14:114. [PMID: 24304553 PMCID: PMC3866978 DOI: 10.1186/1471-2156-14-114] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Received: 08/01/2013] [Accepted: 11/25/2013] [Indexed: 01/29/2023] Open
Abstract
Background: The importance of wheat to the world economy, together with progress in high-throughput next-generation DNA sequencing, has accelerated genetic research initiatives for wheat improvement. The availability of high-density linkage maps is crucial for identifying genotype-phenotype associations, and also for anchoring BAC contigs to genetic maps, a strategy followed for sequencing the wheat genome. Results: Here we report a genetic linkage map in a durum wheat segregating population and the study of mapped DArT markers. The linkage map consists of 126 gSSR, 31 EST-SSR and 351 DArT markers distributed in 24 linkage groups for a total length of 1,272 cM. Through bioinformatic approaches we have analysed 327 DArT clones to reveal their redundancy, syntenic and functional aspects. The DNA sequences of 174 DArT markers were assembled into a non-redundant set of 60 marker clusters. This explained the generation of clusters in very small chromosome regions across genomes. Of these DArT markers, 61 showed highly significant (Expectation < E-10) BLAST similarity to gene sequences in public databases of model species such as Brachypodium and rice. Based on sequence alignments, the analysis revealed a mosaic gene conservation, with 54 and 72 genes present in rice and Brachypodium species, respectively. Conclusions: In the present manuscript we provide a detailed characterization of DArT markers and the basis for future efforts in durum wheat map comparison.
Affiliation(s)
- Agata Gadaleta
  - Department of Soil, Plant and Food Sciences, University of Bari "Aldo Moro", Via Amendola 165/A, Bari 70126, Italy
10
Swanson-Wagner RA, Eichten SR, Kumari S, Tiffin P, Stein JC, Ware D, Springer NM. Pervasive gene content variation and copy number variation in maize and its undomesticated progenitor. Genome Res 2010; 20:1689-99. [PMID: 21036921 DOI: 10.1101/gr.109165.110] [Citation(s) in RCA: 206] [Impact Index Per Article: 14.7] [Indexed: 01/13/2023]
Abstract
Individuals of the same species are generally thought to have very similar genomes. However, there is growing evidence that structural variation in the form of copy number variation (CNV) and presence-absence variation (PAV) can lead to variation in the genome content of individuals within a species. Array comparative genomic hybridization (CGH) was used to compare gene content and copy number variation among 19 diverse maize inbreds and 14 genotypes of the wild ancestor of maize, teosinte. We identified 479 genes exhibiting higher copy number in some genotypes (UpCNV) and 3410 genes that have either fewer copies or are missing in the genome of at least one genotype relative to B73 (DownCNV/PAV). Many of these DownCNV/PAV are examples of genes present in B73, but missing from other genotypes. Over 70% of the CNV/PAV examples are identified in multiple genotypes, and the majority of events are observed in both maize and teosinte, suggesting that these variants predate domestication and that there is not strong selection acting against them. Many of the genes affected by CNV/PAV are either maize specific (thus possible annotation artifacts) or members of large gene families, suggesting that the gene loss can be tolerated through buffering by redundant functions encoded elsewhere in the genome. While this structural variation may not result in major qualitative variation due to genetic buffering, it may significantly contribute to quantitative variation.
Affiliation(s)
- Ruth A Swanson-Wagner
  - Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota 55108, USA
11
Zhang L, Chia JM, Kumari S, Stein JC, Liu Z, Narechania A, Maher CA, Guill K, McMullen MD, Ware D. A genome-wide characterization of microRNA genes in maize. PLoS Genet 2009; 5:e1000716. [PMID: 19936050 PMCID: PMC2773440 DOI: 10.1371/journal.pgen.1000716] [Citation(s) in RCA: 279] [Impact Index Per Article: 18.6] [Received: 07/14/2009] [Accepted: 10/12/2009] [Indexed: 01/17/2023] Open
Abstract
MicroRNAs (miRNAs) are small, non-coding RNAs that play essential roles in plant growth, development, and stress response. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling identified 150 high-confidence genes within 26 miRNA families. For 25 families, expression was verified by deep-sequencing of small RNA libraries that were prepared from an assortment of maize tissues. PCR-RACE amplification of 68 miRNA transcript precursors, representing 18 families conserved across several plant species, showed that splice variation and the use of alternative transcriptional start and stop sites is common within this class of genes. Comparison of sequence variation data from diverse maize inbred lines versus teosinte accessions suggest that the mature miRNAs are under strong purifying selection while the flanking sequences evolve equivalently to other genes. Since maize is derived from an ancient tetraploid, the effect of whole-genome duplication on miRNA evolution was examined. We found that, like protein-coding genes, duplicated miRNA genes underwent extensive gene-loss, with approximately 35% of ancestral sites retained as duplicate homoeologous miRNA genes. This number is higher than that observed with protein-coding genes. A search for putative miRNA targets indicated bias towards genes in regulatory and metabolic pathways. As maize is one of the principal models for plant growth and development, this study will serve as a foundation for future research into the functional roles of miRNA genes.
Affiliation(s)
- Lifang Zhang
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Jer-Ming Chia
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Sunita Kumari
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Joshua C. Stein
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Zhijie Liu
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Apurva Narechania
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Christopher A. Maher
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Katherine Guill
  - Plant Genetics Research Unit, United States Department of Agriculture–Agriculture Research Service, Columbia, Missouri, United States of America
- Michael D. McMullen
  - Plant Genetics Research Unit, United States Department of Agriculture–Agriculture Research Service, Columbia, Missouri, United States of America
  - Division of Plant Sciences, University of Missouri Columbia, Columbia, Missouri, United States of America
- Doreen Ware
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
  - Plant, Soil, and Nutrition Research Unit, United States Department of Agriculture–Agriculture Research Service, Ithaca, New York, United States of America
12
Stonaker JL, Lim JP, Erhard KF, Hollick JB. Diversity of Pol IV function is defined by mutations at the maize rmr7 locus. PLoS Genet 2009; 5:e1000706. [PMID: 19936246 PMCID: PMC2775721 DOI: 10.1371/journal.pgen.1000706] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.7] [Received: 08/07/2009] [Accepted: 10/15/2009] [Indexed: 12/03/2022] Open
Abstract
Mutations affecting the heritable maintenance of epigenetic states in maize identify multiple small RNA biogenesis factors including NRPD1, the largest subunit of the presumed maize Pol IV holoenzyme. Here we show that mutations defining the required to maintain repression7 locus identify a second RNA polymerase subunit related to Arabidopsis NRPD2a, the sole second largest subunit shared between Arabidopsis Pol IV and Pol V. A phylogenetic analysis shows that, in contrast to representative eudicots, grasses have retained duplicate loci capable of producing functional NRPD2-like proteins, which is indicative of increased RNA polymerase diversity in grasses relative to eudicots. Together with comparisons of rmr7 mutant plant phenotypes and their effects on the maintenance of epigenetic states with parallel analyses of NRPD1 defects, our results imply that maize utilizes multiple functional NRPD2-like proteins. Despite the observation that RMR7/NRPD2, like NRPD1, is required for the accumulation of most siRNAs, our data indicate that different Pol IV isoforms play distinct roles in the maintenance of meiotically-heritable epigenetic information in the grasses.

Multicellular plants possess a unique set of DNA–dependent RNA polymerase complexes (RNAPs) that prevent certain repetitious regions of the genome from being copied into stable RNAs. Two distinct RNAPs, termed Pol IV and Pol V, are required for this type of genome-silencing behavior in the eudicot Arabidopsis thaliana, but the mechanism by which these RNAPs accomplish this function is still relatively unknown. Using genetic and molecular methodologies, we identified a Pol IV–type subunit protein as being involved in a process of meiotically-heritable gene silencing in the maize plant known as paramutation. Our analyses of the available plant genome sequences indicate that monocots have a greater potential for RNAP diversity due to having duplicate variants of this particular subunit. Consistent with this inferred diversity, comparative analyses with plants defective in a different core Pol IV subunit indicate that the Pol IV–type RNAP in maize has distinct functional isoforms. The mechanistic and biological role(s) of these specific RNAPs in mediating genome regulation and heritable gene silencing in large genome cereals should now be tractable by biochemical approaches.
Affiliation(s)
- Jennifer L. Stonaker
  - Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
- Jana P. Lim
  - Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
- Karl F. Erhard
  - Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
- Jay B. Hollick
  - Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
13
Wicker T, Taudien S, Houben A, Keller B, Graner A, Platzer M, Stein N. A whole-genome snapshot of 454 sequences exposes the composition of the barley genome and provides evidence for parallel evolution of genome size in wheat and barley. The Plant Journal: For Cell and Molecular Biology 2009; 59:712-22. [PMID: 19453446 DOI: 10.1111/j.1365-313x.2009.03911.x] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.0] [Indexed: 05/19/2023]
Abstract
The genomes of barley and wheat, two of the world's most important crops, are very large and complex due to their high content of repetitive DNA. In order to obtain a whole-genome sequence sample, we performed two runs of 454 (GS20) sequencing on genomic DNA of barley cv. Morex, which yielded approximately 1% of a haploid genome equivalent. Almost 60% of the sequences comprised known transposable element (TE) families, and another 9% represented novel repetitive sequences. We also discovered high amounts of low-complexity DNA and non-genic low-copy DNA. We identified almost 2300 protein coding gene sequences and more than 660 putative conserved non-coding sequences. Comparison of the 454 reads with previously published genomic sequences suggested that TE families are distributed unequally along chromosomes. This was confirmed by in situ hybridizations of selected TEs. A comparison of these data for the barley genome with a large sample of publicly available wheat sequences showed that several TE families that are highly abundant in wheat are absent from the barley genome. This finding implies that the TE composition of their genomes differs dramatically, despite their very similar genome size and their close phylogenetic relationship.
Affiliation(s)
- Thomas Wicker
  - Institute of Plant Biology, University Zurich, Zurich, Switzerland
14
Goettel W, Messing J. Change of gene structure and function by non-homologous end-joining, homologous recombination, and transposition of DNA. PLoS Genet 2009; 5:e1000516. [PMID: 19521498 PMCID: PMC2686159 DOI: 10.1371/journal.pgen.1000516] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Received: 01/19/2009] [Accepted: 05/13/2009] [Indexed: 11/18/2022] Open
Abstract
An important objective in genome research is to relate genome structure to gene function. Sequence comparisons among orthologous and paralogous genes and their allelic variants can reveal sequences of functional significance. Here, we describe a 379-kb region on chromosome 1 of maize that enables us to reconstruct chromosome breakage, transposition, non-homologous end-joining, and homologous recombination events. Such a high-density composition of various mechanisms in a small chromosomal interval exemplifies the evolution of gene regulation and allelic diversity in general. It also illustrates the evolutionary pace of changes in plants, where many of the above mechanisms are of somatic origin. In contrast to animals, somatic alterations can easily be transmitted through meiosis because the germline in plants is contiguous to somatic tissue, permitting the recovery of such chromosomal rearrangements. The analyzed region contains the P1-wr allele, a variant of the genetically well-defined p1 gene, which encodes a Myb-like transcriptional activator in maize. The P1-wr allele consists of eleven nearly perfect P1-wr 12-kb repeats that are arranged in a tandem head-to-tail array. Although a technical challenge to sequence such a structure by shotgun sequencing, we overcame this problem by subcloning each repeat and ordering them based on nucleotide variations. These polymorphisms were also critical for recombination and expression analysis in presence and absence of the trans-acting epigenetic factor Ufo1. Interestingly, chimeras of the p1 and p2 genes, p2/p1 and p1/p2, are framing the P1-wr cluster. Reconstruction of sequence amplification steps at the p locus showed the evolution from a single Myb-homolog to the multi-gene P1-wr cluster. It also demonstrates how non-homologous end-joining can create novel gene fusions. 
Comparisons to orthologous regions in sorghum and rice also indicate a greater instability of the maize genome, probably due to diploidization following allotetraploidization.
Affiliation(s)
- Wolfgang Goettel
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, USA
|
15
|
Bolot S, Abrouk M, Masood-Quraishi U, Stein N, Messing J, Feuillet C, Salse J. The 'inner circle' of the cereal genomes. CURRENT OPINION IN PLANT BIOLOGY 2009; 12:119-25. [PMID: 19095493 DOI: 10.1016/j.pbi.2008.10.011] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Received: 10/01/2008] [Revised: 10/28/2008] [Accepted: 10/29/2008] [Indexed: 05/18/2023]
Abstract
Early marker-based macrocolinearity studies between the grass genomes led to arranging their chromosomes into concentric 'crop circles' of synteny blocks that initially consisted of 30 rice-independent linkage groups representing the ancestral cereal genome structure. Recently, increased marker density and genome sequencing of several cereal genomes allowed the characterization of intragenomic duplications and their integration with intergenomic colinearity data to identify paleo-duplications and propose a model for the evolution of the grass genomes from a common ancestor. On the basis of these data an 'inner circle' comprising five ancestral chromosomes was defined providing a new reference for the grass chromosomes and new insights into their ancestral relationships and origin, as well as an efficient tool to design cross-genome markers for genetic studies.
Affiliation(s)
- Stéphanie Bolot
- INRA/UBP UMR 1095, Domaine de Crouelle, 234 avenue du Brézet 63100 Clermont Ferrand, France
|
16
|
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, Schmutz J, Spannagl M, Tang H, Wang X, Wicker T, Bharti AK, Chapman J, Feltus FA, Gowik U, Grigoriev IV, Lyons E, Maher CA, Martis M, Narechania A, Otillar RP, Penning BW, Salamov AA, Wang Y, Zhang L, Carpita NC, Freeling M, Gingle AR, Hash CT, Keller B, Klein P, Kresovich S, McCann MC, Ming R, Peterson DG, Mehboob-ur-Rahman, Ware D, Westhoff P, Mayer KFX, Messing J, Rokhsar DS. The Sorghum bicolor genome and the diversification of grasses. Nature 2009; 457:551-6. [PMID: 19189423 DOI: 10.1038/nature07723] [Citation(s) in RCA: 1642] [Impact Index Per Article: 109.5] [Indexed: 12/25/2022]
Abstract
Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.
Affiliation(s)
- Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, Georgia 30602, USA.
|
17
|
Holding DR, Larkins BA. Zein Storage Proteins. MOLECULAR GENETIC APPROACHES TO MAIZE IMPROVEMENT 2008. [DOI: 10.1007/978-3-540-68922-5_19] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 12/13/2022]
|
18
|
The 172-kb genomic DNA region of the O. rufipogon yld1.1 locus: comparative sequence analysis with O. sativa ssp. japonica and O. sativa ssp. indica. Funct Integr Genomics 2008; 9:97-108. [PMID: 18633654 DOI: 10.1007/s10142-008-0091-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Received: 01/27/2008] [Revised: 06/08/2008] [Accepted: 06/08/2008] [Indexed: 10/21/2022]
Abstract
Common wild rice (Oryza rufipogon) plays an important role by contributing to modern rice breeding. In this paper, we report the sequence and analysis of a 172-kb genomic DNA region of wild rice around the RM5 locus, which is associated with the yield QTL yld1.1. Comparative sequence analysis between orthologous RM5 regions from Oryza sativa ssp. japonica, O. sativa ssp. indica and O. rufipogon revealed a high level of conserved synteny in the content, homology, structure, orientation, and physical distance of all 14 predicted genes. Twelve of the putative genes were supported by matches to proteins with known function, whereas two were predicted by homology to rice and other plant expressed sequence tags or complementary DNAs. The remarkably high level of conservation found in coding, intronic and intergenic regions may indicate high evolutionary selection on the RM5 region. Although our analysis has not defined which gene(s) determine the yld1.1 phenotype, allelic variation and the insertion of transposable elements, among other nucleotide changes, represent potential variation responsible for the yield QTL. However, as suggested previously, two putative receptor-like protein kinase genes remain the key suspects for yld1.1.
|
19
|
Danilova TV, Birchler JA. Integrated cytogenetic map of mitotic metaphase chromosome 9 of maize: resolution, sensitivity, and banding paint development. Chromosoma 2008; 117:345-56. [PMID: 18317793 DOI: 10.1007/s00412-008-0151-y] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Received: 09/30/2007] [Revised: 02/04/2008] [Accepted: 02/05/2008] [Indexed: 11/28/2022]
Abstract
To study the correlation of the sequence positions on the physical DNA finger print contig (FPC) map and cytogenetic maps of pachytene and somatic maize chromosomes, sequences located along the chromosome 9 FPC map approximately every 10 Mb were selected to place on maize chromosomes using fluorescent in situ hybridization (FISH). The probes were produced as pooled polymerase chain reaction products based on sequences of genetic markers or repeat-free portions of mapped bacterial artificial chromosome (BAC) clones. Fifteen probes were visualized on chromosome 9. The cytological positions of most sequences correspond on the pachytene, somatic, and FPC maps except some probes at the pericentromeric regions. Because of unequal condensation of mitotic metaphase chromosomes, being lower at pericentromeric regions and higher in the arms, probe positions are displaced to the distal ends of both arms. The axial resolution of FISH on somatic chromosome 9 varied from 3.3 to 8.2 Mb, which is 12-30 times lower than on pachytene chromosomes. The probe collection can be used as chromosomal landmarks or as a "banding paint" for the physical mapping of sequences including transgenes and BAC clones and for studying chromosomal rearrangements.
Affiliation(s)
- Tatiana V Danilova
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, MO 65211, USA
|
20
|
Bossolini E, Wicker T, Knobel PA, Keller B. Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2007; 49:704-17. [PMID: 17270010 DOI: 10.1111/j.1365-313x.2006.02991.x] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.1] [Indexed: 05/13/2023]
Abstract
Brachypodium sylvaticum and Brachypodium distachyon were recently proposed as new model plants because of their small genomes and their phylogenetic position between rice and Triticeae crops. We sequenced a 371-kb region in B. sylvaticum, the largest genomic sequence available so far from this species, providing quantitative data on gene conservation, collinearity and phylogeny. We compared it with orthologous regions from rice and wheat. Brachypodium and wheat show perfect macro-collinearity of genetic markers, whereas rice contains an approximately 220-kb inversion. Rice contains almost twice as many genes as Brachypodium in the region studied, whereas wheat has about 40% more. Through comparative annotation, we identified alternative transcripts and improved the annotation for several rice genes, indicating that approximately 15% of rice genes might require re-annotation. Surprisingly, our data suggest that 10-15% of functional sequences in small grass genomes may not encode any proteins. From available genomic and expressed sequence tag sequences, we estimated Brachypodium to have diverged from wheat about 35-40 Mya, significantly more recently than the divergence of rice and wheat. However, our data also indicate that orthologous regions from Brachypodium and wheat differ considerably in gene content, thus the Brachypodium genome sequence probably cannot replace genomic studies in the large Triticeae genomes.
Affiliation(s)
- Eligio Bossolini
- Institute of Plant Biology, University of Zürich, Zollikerstrasse 107, 8008 Zürich, Switzerland
|
21
|
Emrich SJ, Li L, Wen TJ, Yandeau-Nelson MD, Fu Y, Guo L, Chou HH, Aluru S, Ashlock DA, Schnable PS. Nearly identical paralogs: implications for maize (Zea mays L.) genome evolution. Genetics 2006; 175:429-39. [PMID: 17110490 PMCID: PMC1774996 DOI: 10.1534/genetics.106.064006] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Indexed: 02/02/2023] Open
Abstract
As an ancient segmental tetraploid, the maize (Zea mays L.) genome contains large numbers of paralogs that are expected to have diverged by a minimum of 10% over time. Nearly identical paralogs (NIPs) are defined as paralogous genes that exhibit ≥98% identity. Sequence analyses of the "gene space" of the maize inbred line B73 genome, coupled with wet lab validation, have revealed that, conservatively, at least approximately 1% of maize genes have a NIP, a rate substantially higher than that in Arabidopsis. In most instances, both members of maize NIP pairs are expressed and are therefore at least potentially functional. Of evolutionary significance, members of many NIP families also exhibit differential expression. The finding that some families of maize NIPs are closely linked genetically while others are genetically unlinked is consistent with multiple modes of origin. NIPs provide a mechanism for the maize genome to circumvent the inherent limitation that diploid genomes can carry at most two "alleles" per "locus." As such, NIPs may have played important roles during the evolution and domestication of maize and may contribute to the success of long-term selection experiments in this important crop species.
Affiliation(s)
- Scott J Emrich
- Interdepartmental Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, Iowa 50011, USA
|
22
|
Bruggmann R, Bharti AK, Gundlach H, Lai J, Young S, Pontaroli AC, Wei F, Haberer G, Fuks G, Du C, Raymond C, Estep MC, Liu R, Bennetzen JL, Chan AP, Rabinowicz PD, Quackenbush J, Barbazuk WB, Wing RA, Birren B, Nusbaum C, Rounsley S, Mayer KF, Messing J. Uneven chromosome contraction and expansion in the maize genome. Genome Res 2006; 16:1241-51. [PMID: 16902087 PMCID: PMC1581433 DOI: 10.1101/gr.5338906] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.2] [Indexed: 01/18/2023]
Abstract
Maize (Zea mays or corn), both a major food source and an important cytogenetic model, evolved from a tetraploid that arose about 4.8 million years ago (Mya). As a result, maize has extensive duplicated regions within its genome. We have sequenced the two copies of one such region, generating 7.8 Mb of sequence spanning 17.4 cM of the short arm of chromosome 1 and 6.6 Mb (25.6 cM) from the long arm of chromosome 9. Rice, which did not undergo a similar whole genome duplication event, has only one orthologous region (4.9 Mb) on the short arm of chromosome 3, and can be used as reference for the maize homoeologous regions. Alignment of the three regions allowed identification of syntenic blocks, and indicated that the maize regions have undergone differential contraction in genic and intergenic regions and expansion by the insertion of retrotransposable elements. Approximately 9% of the predicted genes in each duplicated region are completely missing in the rice genome, and almost 20% have moved to other genomic locations. Predicted genes within these regions tend to be larger in maize than in rice, primarily because of the presence of predicted genes in maize with larger introns. Interestingly, the general gene methylation patterns in the maize homoeologous regions do not appear to have changed with contraction or expansion of their chromosomes. In addition, no differences in methylation of single genes and tandemly repeated gene copies have been detected. These results, therefore, provide new insights into the diploidization of polyploid species.
Affiliation(s)
- Rémy Bruggmann
- Munich Information Center for Protein Sequences (MIPS), Institute for Bioinformatics, GSF Research Center for Environment and Health, D-85764 Neuherberg, Germany
- Arvind K. Bharti
- The Plant Genome Initiative at Rutgers (PGIR), Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
- Heidrun Gundlach
- Munich Information Center for Protein Sequences (MIPS), Institute for Bioinformatics, GSF Research Center for Environment and Health, D-85764 Neuherberg, Germany
- Jinsheng Lai
- The Plant Genome Initiative at Rutgers (PGIR), Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
- Sarah Young
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA
- Ana C. Pontaroli
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Fusheng Wei
- Arizona Genomics Institute (AGI), University of Arizona, Tucson, Arizona 85721, USA
- Georg Haberer
- Munich Information Center for Protein Sequences (MIPS), Institute for Bioinformatics, GSF Research Center for Environment and Health, D-85764 Neuherberg, Germany
- Galina Fuks
- The Plant Genome Initiative at Rutgers (PGIR), Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
- Chunguang Du
- The Plant Genome Initiative at Rutgers (PGIR), Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
- Christina Raymond
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA
- Matt C. Estep
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Renyi Liu
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Agnes P. Chan
- The Institute for Genomic Research (TIGR), Rockville, Maryland 20850, USA
- John Quackenbush
- The Institute for Genomic Research (TIGR), Rockville, Maryland 20850, USA
- W. Brad Barbazuk
- Donald Danforth Plant Science Center, St. Louis, Missouri 63132, USA
- Rod A. Wing
- Arizona Genomics Institute (AGI), University of Arizona, Tucson, Arizona 85721, USA
- Bruce Birren
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA
- Chad Nusbaum
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA
- Steve Rounsley
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, USA
- Klaus F.X. Mayer
- Munich Information Center for Protein Sequences (MIPS), Institute for Bioinformatics, GSF Research Center for Environment and Health, D-85764 Neuherberg, Germany
- Joachim Messing
- The Plant Genome Initiative at Rutgers (PGIR), Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA
|
23
|
Du C, Swigoňová Z, Messing J. Retrotranspositions in orthologous regions of closely related grass species. BMC Evol Biol 2006; 6:62. [PMID: 16914031 PMCID: PMC1560396 DOI: 10.1186/1471-2148-6-62] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Received: 03/08/2006] [Accepted: 08/16/2006] [Indexed: 11/10/2022] Open
Abstract
Background: Retrotransposons are commonly occurring eukaryotic transposable elements (TEs). Among these, long terminal repeat (LTR) retrotransposons are the most abundant TEs and can comprise 50–90% of the genome in higher plants. By comparing the orthologous chromosomal regions of closely related species, the effects of TEs on the evolution of plant genomes can be studied in detail. Results: Here, we compared the composition and organization of TEs within five orthologous chromosomal regions among three grass species: maize, sorghum, and rice. We identified a total of 132 full or fragmented LTR retrotransposons in these regions. As a percentage of the total cumulative sequence in each species, LTR retrotransposons occupy 45.1% of the maize, 21.1% of the rice, and 3.7% of the sorghum regions. The most common elements in the maize retrotransposon-rich regions are the copia-like retrotransposons with 39% and the gypsy-like retrotransposons with 37%. Using the contiguous sequence of the orthologous regions, we detected 108 retrotransposons with intact target duplication sites and both LTR termini. Here, we show that 74% of these elements inserted into their host genome less than 1 million years ago and that many retroelements expanded in size by the insertion of other sequences. These inserts were predominantly other retroelements, however, several of them were also fragmented genes. Unforeseen was the finding of intact genes embedded within LTR retrotransposons. Conclusion: Although the abundance of retroelements between maize and rice is consistent with their different genome sizes of 2,364 and 389 Mb respectively, the content of retrotransposons in sorghum (790 Mb) is surprisingly low. In all three species, retrotransposition is a very recent activity relative to their speciation. While it was known that genes re-insert into non-orthologous positions of plant genomes, they appear to re-insert also within retrotransposons, potentially providing an important role for retrotransposons in the evolution of gene function.
Affiliation(s)
- Chunguang Du
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ 08854, USA
- Department of Biology & Molecular Biology, Montclair State University, Montclair, NJ 07043, USA
- Zuzana Swigoňová
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ 08854, USA
- Department of Medical Genetics, University of Pittsburgh, Pittsburgh, PA 15213, USA
- Joachim Messing
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ 08854, USA
|
24
|
Messing J, Dooner HK. Organization and variability of the maize genome. CURRENT OPINION IN PLANT BIOLOGY 2006; 9:157-63. [PMID: 16459130 DOI: 10.1016/j.pbi.2006.01.009] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Received: 11/07/2005] [Accepted: 01/24/2006] [Indexed: 05/06/2023]
Abstract
With a size approximating that of the human genome, the maize genome is about to become the largest plant genome yet sequenced. Contributing to that size are a whole-genome duplication event and a retrotransposition explosion that produced a large amount of repetitive DNA. This DNA is greatly under-represented in cDNA collections, so analysis of the maize transcriptome has been an expedient way of assessing the gene content of maize. Over 2 million maize cDNA sequences are now available, making maize the third most widely studied organism, behind mouse and man. To date, the sequencing of large-sized DNA clones has been largely driven by the genetic interests of different investigators. The recent construction of a physical map that is anchored to the genetic map will aid immensely in the maize genome-sequencing effort. However, studies showing that the repetitive DNA component is highly polymorphic among maize inbred lines point to the need to sample vertically a few specific regions of the genome to evaluate the extent and importance of this variability.
Affiliation(s)
- Joachim Messing
- Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, 190 Frelinghuysen Road, Piscataway, New Jersey 08854, USA
|
25
|
Rabinowicz PD, Bennetzen JL. The maize genome as a model for efficient sequence analysis of large plant genomes. CURRENT OPINION IN PLANT BIOLOGY 2006; 9:149-56. [PMID: 16459129 DOI: 10.1016/j.pbi.2006.01.015] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Received: 01/04/2006] [Accepted: 01/20/2006] [Indexed: 05/06/2023]
Abstract
The genomes of flowering plants vary in size from about 0.1 to over 100 gigabase pairs (Gbp), mostly because of polyploidy and variation in the abundance of repetitive elements in intergenic regions. High-quality sequences of the relatively small genomes of Arabidopsis (0.14 Gbp) and rice (0.4 Gbp) have now been largely completed. The sequencing of plant genomes that have a more representative size (the mean for flowering plant genomes is 5.6 Gbp) has been seen as a daunting task, partly because of their size and partly because of the numerous highly conserved repeats. Nevertheless, creative strategies and powerful new tools have been generated recently in the plant genetics community, so that sequencing large plant genomes is now a realistic possibility. Maize (2.4-2.7 Gbp) will be the first gigabase-size plant genome to be sequenced using these novel approaches. Pilot studies on maize indicate that the new gene-enrichment, gene-finishing and gene-orientation technologies are efficient, robust and comprehensive. These strategies will succeed in sequencing the gene-space of large genome plants, and in locating all of these genes and adjacent sequences on the genetic and physical maps.
Affiliation(s)
- Pablo D Rabinowicz
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, Maryland 20850, USA
|
26
|
The sequence of rice chromosomes 11 and 12, rich in disease resistance genes and recent gene duplications. BMC Biol 2005; 3:20. [PMID: 16188032 PMCID: PMC1261165 DOI: 10.1186/1741-7007-3-20] [Citation(s) in RCA: 126] [Impact Index Per Article: 6.6] [Received: 07/11/2005] [Accepted: 09/27/2005] [Indexed: 01/13/2023] Open
Abstract
Background: Rice is an important staple food and, with the smallest cereal genome, serves as a reference species for studies on the evolution of cereals and other grasses. Therefore, decoding its entire genome will be a prerequisite for applied and basic research on this species and all other cereals. Results: We have determined and analyzed the complete sequences of two of its chromosomes, 11 and 12, which total 55.9 Mb (14.3% of the entire genome length), based on a set of overlapping clones. A total of 5,993 non-transposable element related genes are present on these chromosomes. Among them are 289 disease resistance-like and 28 defense-response genes, a higher proportion of these categories than on any other rice chromosome. A three-Mb segment on both chromosomes resulted from a duplication 7.7 million years ago (mya), the most recent large-scale duplication in the rice genome. Paralogous gene copies within this segmental duplication can be aligned with genomic assemblies from sorghum and maize. Although these gene copies are preserved on both chromosomes, their expression patterns have diverged. When the gene order of rice chromosomes 11 and 12 was compared to wheat gene loci, significant synteny between these orthologous regions was detected, illustrating the presence of conserved genes alternating with recently evolved genes. Conclusion: Because the resistance and defense response genes, enriched on these chromosomes relative to the whole genome, also occur in clusters, they provide a preferred target for breeding durable disease resistance in rice and the isolation of their allelic variants. The recent duplication of a large chromosomal segment coupled with the high density of disease resistance gene clusters makes this the most recently evolved part of the rice genome. Based on syntenic alignments of these chromosomes, rice chromosome 11 and 12 do not appear to have resulted from a single whole-genome duplication event as previously suggested.
|
27
|
Nelson WM, Bharti AK, Butler E, Wei F, Fuks G, Kim H, Wing RA, Messing J, Soderlund C. Whole-genome validation of high-information-content fingerprinting. PLANT PHYSIOLOGY 2005; 139:27-38. [PMID: 16166258 PMCID: PMC1203355 DOI: 10.1104/pp.105.061978] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.7] [Indexed: 05/04/2023]
Abstract
Fluorescent-based high-information-content fingerprinting (HICF) techniques have recently been developed for physical mapping. These techniques make use of automated capillary DNA sequencing instruments to enable both high-resolution and high-throughput fingerprinting. In this article, we report the construction of a whole-genome HICF FPC map for maize (Zea mays subsp. mays cv B73), using a variant of HICF in which a type IIS restriction enzyme is used to generate the fluorescently labeled fragments. The HICF maize map was constructed from the same three maize bacterial artificial chromosome libraries as previously used for the whole-genome agarose FPC map, providing a unique opportunity for direct comparison of the agarose and HICF methods; as a result, it was found that HICF has substantially greater sensitivity in forming contigs. An improved assembly procedure is also described that uses automatic end-merging of contigs to reduce the effects of contamination and repetitive bands. Several new features in FPC v7.2 are presented, including shared-memory multiprocessing, which allows dramatically faster assemblies, and automatic end-merging, which permits more accurate assemblies. It is further shown that sequenced clones may be digested in silico and located accurately on the HICF assembly, despite size deviations that prevent the precise prediction of experimental fingerprints. Finally, repetitive bands are isolated, and their effect on the assembly is studied.
Affiliation(s)
- William M Nelson
- Arizona Genomics Computational Laboratory, BIO5 Institute, University of Arizona, Tucson, 85721, USA
|
28
|
Morgante M, Brunner S, Pea G, Fengler K, Zuccolo A, Rafalski A. Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize. Nat Genet 2005; 37:997-1002. [PMID: 16056225 DOI: 10.1038/ng1615] [Citation(s) in RCA: 333] [Impact Index Per Article: 17.5] [Received: 04/01/2005] [Accepted: 06/27/2005] [Indexed: 12/19/2022]
Abstract
We report a whole-genome comparison of gene content in allelic BAC contigs from two maize inbred lines. Genic content polymorphisms involve as many as 10,000 sequences and are mainly generated by DNA insertions. The termini of eight of the nine genic insertions that we analyzed shared the structural hallmarks of helitron rolling-circle transposons. DNA segments defined by helitron termini contained multiple gene-derived fragments and had a structure typical of nonautonomous helitron-like transposons. Closely related insertions were found in multiple genomic locations. Some of these produced transcripts containing segments of different genes, supporting the idea that these transposition events have a role in exon shuffling and the evolution of new proteins. We identified putative autonomous helitron elements and found evidence for their transcription. Helitrons in maize seem to continually produce new nonautonomous elements responsible for the duplicative insertion of gene segments into new locations and for the unprecedented genic diversity. The maize genome is in constant flux, as transposable elements continue to change both the genic and nongenic fractions of the genome, profoundly affecting genetic diversity.
Affiliation(s)
- Michele Morgante
- Dipartimento di Scienze Agrarie ed Ambientali, Universita' di Udine, Via delle Scienze 208, 33100 Udine, Italy.
|
29
|
Ma J, SanMiguel P, Lai J, Messing J, Bennetzen JL. DNA rearrangement in orthologous orp regions of the maize, rice and sorghum genomes. Genetics 2005; 170:1209-20. [PMID: 15834137 PMCID: PMC1451190 DOI: 10.1534/genetics.105.040915] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.8] [Indexed: 11/18/2022]
Abstract
The homeologous Orp1 and Orp2 regions of maize and the orthologous regions in sorghum and rice were compared by generating sequence data for >486 kb of genomic DNA. At least three genic rearrangements differentiate the maize Orp1 and Orp2 segments, including an insertion of a single gene and two deletions that removed one gene each, while no genic rearrangements were detected in the maize Orp2 region relative to sorghum. Extended comparison of the orthologous Orp regions of sorghum and japonica rice uncovered numerous genic rearrangements and the presence of a transposon-rich region in rice. Only 11 of 27 genes (40%) are arranged in the same order and orientation between sorghum and rice. Of the 8 genes that are uniquely present in the sorghum region, 4 were found to have single-copy homologs in both rice and Arabidopsis, but none of these genes are located near each other, indicating frequent gene movement. Further comparison of the Orp segments from two rice subspecies, japonica and indica, revealed that the transposon-rich region is both an ancient and current hotspot for retrotransposon accumulation and genic rearrangement. We also identify unequal gene conversion as a mechanism for maize retrotransposon rearrangement.
Affiliation(s)
- Jianxin Ma
- Department of Genetics, University of Georgia, Athens, Georgia 30602
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907
- Phillip SanMiguel
- Genomics Core Facility, Purdue University, West Lafayette, Indiana 47907
- Jinsheng Lai
- Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854
- Joachim Messing
- Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854
- Jeffrey L. Bennetzen
- Department of Genetics, University of Georgia, Athens, Georgia 30602
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907
- Corresponding author: Department of Genetics, University of Georgia, Athens, GA 30602. E-mail:
|