301
|
Woodhouse MR, Schnable JC, Pedersen BS, Lyons E, Lisch D, Subramaniam S, Freeling M. Following tetraploidy in maize, a short deletion mechanism removed genes preferentially from one of the two homologs. PLoS Biol 2010; 8:e1000409. [PMID: 20613864 PMCID: PMC2893956 DOI: 10.1371/journal.pbio.1000409] [Citation(s) in RCA: 195] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2009] [Accepted: 05/20/2010] [Indexed: 12/02/2022] Open
Abstract
Following genome duplication and selfish DNA expansion, maize used a heretofore unknown mechanism to shed redundant genes and functionless DNA with bias toward one of the parental genomes. Previous work in Arabidopsis showed that after an ancient tetraploidy event, genes were preferentially removed from one of the two homeologs, a process known as fractionation. The mechanism of fractionation is unknown. We sought to determine whether such preferential, or biased, fractionation exists in maize and, if so, whether a specific mechanism could be implicated in this process. We studied the process of fractionation using two recently sequenced grass species: sorghum and maize. The maize lineage has experienced a tetraploidy since its divergence from sorghum approximately 12 million years ago, and fragments of many knocked-out genes retain enough sequence similarity to be easily identifiable. Using sorghum exons as the query sequence, we studied the fate of both orthologous genes in maize following the maize tetraploidy. We show that genes are predominantly lost, not relocated, and that single-gene loss by deletion is the rule. Based on comparisons with orthologous sorghum and rice genes, we also infer that the sequences present before the deletion events were flanked by short direct repeats, a signature of intra-chromosomal recombination. Evidence of this deletion mechanism is found 2.3 times more frequently on one of the maize homeologs, consistent with earlier observations of biased fractionation. The over-fractionated homeolog is also a greater than 3-fold better target for transposon removal, but does not have an observably higher synonymous base substitution rate, nor could we find differentially placed methylation domains. We conclude that fractionation is indeed biased in maize and that intra-chromosomal or possibly a similar illegitimate recombination is the primary mechanism by which fractionation occurs. The mechanism of intra-chromosomal recombination explains the observed bias in both gene and transposon loss in the maize lineage. The existence of fractionation bias demonstrates that the frequency of deletion is modulated. Among the evolutionary benefits of this deletion/fractionation mechanism is bulk DNA removal and the generation of novel combinations of regulatory sequences and coding regions. All genomes can accumulate dispensable DNA in the form of duplications of individual genes or even partial or whole genome duplications. Genomes also can accumulate selfish DNA elements. Duplication events specifically are often followed by extensive gene loss. The maize genome is particularly extreme, having become tetraploid 10 million years ago and played host to massive transposon amplifications. We compared the genome of sorghum (which is homologous to the pre-tetraploid maize genome) with the two identifiable parental genomes retained in maize. The two maize genomes differ greatly: one of the parental genomes has lost 2.3 times more genes than the other, and the selfish DNA regions between genes were even more frequently lost, suggesting maize can distinguish between the parental genomes present in the original tetraploid. We show that genes are actually lost, not simply relocated. Deletions were rarely longer than a single gene, and occurred between repeated DNA sequences, suggesting mis-recombination as a mechanism of gene removal. We hypothesize an epigenetic mechanism of genome distinction to account for the selective loss. To the extent that the rate of base substitutions tracks time, we neither support nor refute claims of maize allotetraploidy. Finally, we explain why it makes sense that purifying selection in mammals does not operate at all like the gene and genome deletion program we describe here.
Collapse
Affiliation(s)
- Margaret R. Woodhouse
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - James C. Schnable
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Brent S. Pedersen
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Eric Lyons
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Damon Lisch
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Shabarinath Subramaniam
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Michael Freeling
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
- * E-mail:
| |
Collapse
|
302
|
Ron M, Alandete Saez M, Eshed Williams L, Fletcher JC, McCormick S. Proper regulation of a sperm-specific cis-nat-siRNA is essential for double fertilization in Arabidopsis. Genes Dev 2010; 24:1010-21. [PMID: 20478994 DOI: 10.1101/gad.1882810] [Citation(s) in RCA: 132] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Natural cis-antisense siRNAs (cis-nat-siRNAs) are a recently characterized class of small regulatory RNAs that are widespread in eukaryotes. Despite their abundance, the importance of their regulatory activity is largely unknown. The only functional role for eukaryotic cis-nat-siRNAs that has been described to date is in environmental stress responses in plants. Here we demonstrate that cis-nat-siRNA-based regulation plays key roles in Arabidopsis reproductive function, as it facilitates gametophyte formation and double fertilization, a developmental process of enormous agricultural value. We show that male gametophytic kokopelli (kpl) mutants display frequent single-fertilization events, and that KPL and a inversely transcribed gene, ARIADNE14 (ARI14), which encodes a putative ubiquitin E3 ligase, generate a sperm-specific nat-siRNA pair. In the absence of KPL, ARI14 RNA levels in sperm are increased and fertilization is impaired. Furthermore, ARI14 transcripts accumulate in several siRNA biogenesis pathway mutants, and overexpression of ARI14 in sperm phenocopies the reduced seed set of the kokopelli mutants. These results extend the regulatory capacity of cis-nat-siRNAs to development by identifying a role for cis-nat-siRNAs in controlling sperm function during double fertilization.
Collapse
Affiliation(s)
- Mily Ron
- Department of Plant and Microbial Biology, Plant Gene Expression Center, US Department of Agriculture/Agricultural Research Service, University of California at Berkeley, Albany, California 94710, USA
| | | | | | | | | |
Collapse
|
303
|
Hand ML, Cogan NOI, Sawbridge TI, Spangenberg GC, Forster JW. Comparison of homoeolocus organisation in paired BAC clones from white clover (Trifolium repens L.) and microcolinearity with model legume species. BMC PLANT BIOLOGY 2010; 10:94. [PMID: 20492736 PMCID: PMC3095360 DOI: 10.1186/1471-2229-10-94] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2010] [Accepted: 05/24/2010] [Indexed: 05/24/2023]
Abstract
BACKGROUND White clover (Trifolium repens L.) is an outbreeding allotetraploid species and an important forage legume in temperate grassland agriculture. Comparison of sub-genome architecture and study of nucleotide sequence diversity within allopolyploids provides insight into evolutionary divergence mechanisms, and is also necessary for the development of whole-genome sequencing strategies. This study aimed to evaluate the degree of divergence between the O and P' sub-genomes of white clover through sequencing of BAC clones containing paired homoeoloci. The microsyntenic relationships between the genomes of white clover and the model legumes Lotus japonicus and Medicago truncatula as well as Arabidopsis thaliana were also characterised. RESULTS A total of four paired homoeologous BACs were selected and sequenced to generate 173 kb of overlapping sequence between the O and P' sub-genomes. Equivalent gene content was generally observed, apart from small-scale deletions, in contrast to conservation of intergenic sequences, which varied between the four selected regions. Measurement of the number of synonymous substitutions between homoeologous genes led to estimation of a 4.2 million year divergence time between the two sub-genomes. Microsynteny was observed between the genomes of white clover and L. japonicus for all four targeted regions, but corresponding M. truncatula genomic regions were only identified for two BAC pairs. CONCLUSIONS This study describes the first analysis of sub-genome structural conservation across selected genomic regions in white clover. Although the high levels of sequence conservation between the O and P' sub-genomes would complicate efforts for whole genome sequence assembly, the conserved microsynteny with model legume genomes, especially that of L. japonicus, will be highly valuable for the future of white clover genomics and molecular breeding.
Collapse
Affiliation(s)
- Melanie L Hand
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, Victoria 3083, Australia
- Molecular Plant Breeding Cooperative Research Centre, Australia
- La Trobe University, Bundoora, Victoria 3086, Australia
| | - Noel OI Cogan
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, Victoria 3083, Australia
- Molecular Plant Breeding Cooperative Research Centre, Australia
| | - Timothy I Sawbridge
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, Victoria 3083, Australia
- Molecular Plant Breeding Cooperative Research Centre, Australia
- La Trobe University, Bundoora, Victoria 3086, Australia
| | - German C Spangenberg
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, Victoria 3083, Australia
- Molecular Plant Breeding Cooperative Research Centre, Australia
- La Trobe University, Bundoora, Victoria 3086, Australia
| | - John W Forster
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, Victoria 3083, Australia
- Molecular Plant Breeding Cooperative Research Centre, Australia
- La Trobe University, Bundoora, Victoria 3086, Australia
| |
Collapse
|
304
|
Kane J, Freeling M, Lyons E. The evolution of a high copy gene array in Arabidopsis. J Mol Evol 2010; 70:531-44. [PMID: 20495794 PMCID: PMC2886086 DOI: 10.1007/s00239-010-9350-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2009] [Accepted: 05/03/2010] [Indexed: 11/29/2022]
Abstract
Local gene duplication is a prominent mechanism of gene copy number expansion. Elucidating the mechanisms by which local duplicates arise is necessary in understanding the evolution of genomes and their host organisms. Chromosome one of Arabidopsis thaliana contains an 81-gene array subdivided into 27 triplet units (t-units), with each t-unit containing three pre-transfer RNA genes. We utilized phylogenetic tree reconstructions and comparative genomics to order the events leading to the array's formation, and propose a model using unequal crossing-over as the primary mechanism of array formation. The model is supported by additional phylogenetic information from intergenic spacer sequences separating each t-unit, comparative analysis to an orthologous array of 12 t-units in the sister taxa Arabidopsis lyrata, and additional modeling using a stochastic simulation of orthologous array divergence. Lastly, comparative phylogenetic analysis demonstrates that the two orthologous t-unit arrays undergo concerted evolution within each taxa and are likely fluctuating in copy number under neutral evolutionary drift. These findings hold larger implications for future research concerning gene and genome evolution.
Collapse
Affiliation(s)
- Joshua Kane
- Department of Plant and Microbial Biology, University of California at Berkeley, 311 Koshland Hall, Berkeley, CA 94720, USA
| | | | | |
Collapse
|
305
|
Phylogenetic footprint of the plant clock system in angiosperms: evolutionary processes of pseudo-response regulators. BMC Evol Biol 2010; 10:126. [PMID: 20433765 PMCID: PMC2887406 DOI: 10.1186/1471-2148-10-126] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2009] [Accepted: 05/01/2010] [Indexed: 11/23/2022] Open
Abstract
Background Plant circadian clocks regulate many photoperiodic and diurnal responses that are conserved among plant species. The plant circadian clock system has been uncovered in the model plant, Arabidopsis thaliana, using genetics and systems biology approaches. However, it is still not clear how the clock system had been organized in the evolutionary history of plants. We recently revealed the molecular phylogeny of LHY/CCA1 genes, one of the essential components of the clock system. The aims of this study are to reconstruct the phylogenetic relationships of angiosperm clock-associated PRR genes, the partner of the LHY/CCA1 genes, and to clarify the evolutionary history of the plant clock system in angiosperm lineages. Results In the present study, to investigate the molecular phylogeny of PRR genes, we performed two approaches: reconstruction of phylogenetic trees and examination of syntenic relationships. Phylogenetic analyses revealed that PRR genes had diverged into three clades prior to the speciation of monocots and eudicots. Furthermore, copy numbers of PRR genes have been independently increased in monocots and eudicots as a result of ancient chromosomal duplication events. Conclusions Based on the molecular phylogenies of both PRR genes and LHY/CCA1 genes, we inferred the evolutionary process of the plant clock system in angiosperms. This scenario provides evolutionary information that a common ancestor of monocots and eudicots had retained the basic components required for reconstructing a clock system and that the plant circadian clock may have become a more elaborate mechanism after the speciation of monocots and eudicots because of the gene expansion that resulted from polyploidy events.
Collapse
|
306
|
Paterson AH, Freeling M, Tang H, Wang X. Insights from the comparison of plant genome sequences. ANNUAL REVIEW OF PLANT BIOLOGY 2010; 61:349-72. [PMID: 20441528 DOI: 10.1146/annurev-arplant-042809-112235] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
The next decade will see essentially completed sequences for multiple branches of virtually all angiosperm clades that include major crops and/or botanical models. These sequences will provide a powerful framework for relating genome-level events to aspects of morphological and physiological variation that have contributed to the colonization of much of the planet by angiosperms. Clarification of the fundamental angiosperm gene set, its arrangement, lineage-specific variations in gene repertoire and arrangement, and the fates of duplicated gene pairs will advance knowledge of functional and regulatory diversity and perhaps shed light on adaptation by lineages to whole-genome duplication, which is a distinguishing feature of angiosperm evolution. Better understanding of the relationships among angiosperm genomes promises to provide a firm foundation upon which to base translational genomics: the leveraging of hard-won structural and functional genomic information from crown botanical models to dissect novel and, in some cases, economically important features in many additional organisms.
Collapse
Affiliation(s)
- Andrew H Paterson
- Department of Plant Biology, University of Georgia, Athens, Georgia.
| | | | | | | |
Collapse
|
307
|
Proost S, Van Bel M, Sterck L, Billiau K, Van Parys T, Van de Peer Y, Vandepoele K. PLAZA: a comparative genomics resource to study gene and genome evolution in plants. THE PLANT CELL 2009; 21:3718-31. [PMID: 20040540 PMCID: PMC2814516 DOI: 10.1105/tpc.109.071506] [Citation(s) in RCA: 193] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2009] [Revised: 12/04/2009] [Accepted: 12/10/2009] [Indexed: 05/17/2023]
Abstract
The number of sequenced genomes of representatives within the green lineage is rapidly increasing. Consequently, comparative sequence analysis has significantly altered our view on the complexity of genome organization, gene function, and regulatory pathways. To explore all this genome information, a centralized infrastructure is required where all data generated by different sequencing initiatives is integrated and combined with advanced methods for data mining. Here, we describe PLAZA, an online platform for plant comparative genomics (http://bioinformatics.psb.ugent.be/plaza/). This resource integrates structural and functional annotation of published plant genomes together with a large set of interactive tools to study gene function and gene and genome evolution. Precomputed data sets cover homologous gene families, multiple sequence alignments, phylogenetic trees, intraspecies whole-genome dot plots, and genomic colinearity between species. Through the integration of high confidence Gene Ontology annotations and tree-based orthology between related species, thousands of genes lacking any functional description are functionally annotated. Advanced query systems, as well as multiple interactive visualization tools, are available through a user-friendly and intuitive Web interface. In addition, detailed documentation and tutorials introduce the different tools, while the workbench provides an efficient means to analyze user-defined gene sets through PLAZA's interface. In conclusion, PLAZA provides a comprehensible and up-to-date research environment to aid researchers in the exploration of genome information within the green plant lineage.
Collapse
Affiliation(s)
- Sebastian Proost
- Department of Plant Systems Biology, Flanders Institute for Biotechnology, B-9052 Ghent, Belgium.
| | | | | | | | | | | | | |
Collapse
|
308
|
|
309
|
Jurkuta RJ, Kaplinsky NJ, Spindel JE, Barton MK. Partitioning the apical domain of the Arabidopsis embryo requires the BOBBER1 NudC domain protein. THE PLANT CELL 2009; 21:1957-71. [PMID: 19648297 PMCID: PMC2729608 DOI: 10.1105/tpc.108.065284] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2009] [Revised: 06/24/2009] [Accepted: 07/09/2009] [Indexed: 05/20/2023]
Abstract
The apical domain of the embryo is partitioned into distinct regions that will give rise to the cotyledons and the shoot apical meristem. In this article, we describe a novel screen to identify Arabidopsis thaliana embryo arrest mutants that are defective in this partitioning, and we describe the phenotype of one such mutant, bobber1. bobber1 mutants arrest at the globular stage of development, they express the meristem-specific SHOOTMERISTEMLESS gene throughout the top half of the embryo, and they fail to express the AINTEGUMENTA transcript normally found in cotyledons. Thus, BOBBER1 is required to limit the extent of the meristem domain and/or to promote the development of the cotyledon domains. Based on expression of early markers for apical development, bobber1 mutants differentiate protodermis and undergo normal early apical development. Consistent with a role for auxin in cotyledon development, BOBBER1 mutants fail to express localized maxima of the DR5:green fluorescent protein reporter. BOBBER1 encodes a protein with homology to the Aspergillus nidulans protein NUDC that has similarity to protein chaperones, indicating a possible role for BOBBER1 in synthesis or transport of proteins involved in patterning the Arabidopsis embryo.
Collapse
Affiliation(s)
- Rebecca Joy Jurkuta
- Department of Genetics, University of Wisconsin, Madison, Wisconsin 53706, USA
| | | | | | | |
Collapse
|
310
|
Chu TC, Liu T, Lee DT, Lee GC, Shih ACC. GR-Aligner: an algorithm for aligning pairwise genomic sequences containing rearrangement events. Bioinformatics 2009; 25:2188-93. [PMID: 19542149 DOI: 10.1093/bioinformatics/btp372] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
MOTIVATION Homologous genomic sequences between species usually contain different rearrangement events. Whether some specific patterns existed in the breakpoint regions that caused such events to occur is still unclear. To resolve this question, it is necessary to determine the location of breakpoints at the nucleotide level. The availability of sequences near breakpoints would further facilitate the related studies. We thus need a tool that can identify breakpoints and align the neighboring sequences. Although local alignment tools can detect rearrangement events, they only report a set of discontinuous alignments, where the detailed alignments in the breakpoint regions are usually missing. Global alignment tools are even less appropriate for these tasks since most of them are designed to align the conserved regions between sequences in a consistent order, i.e. they do not consider rearrangement events. RESULTS We propose an effective and efficient pairwise sequence alignment algorithm, called GR-Aligner (Genomic Rearrangement Aligner), which can find breakpoints of rearrangement events by integrating the forward and reverse alignments of the breakpoint regions flanked by homologously rearranged sequences. In addition, GR-Aligner also provides an option to view the alignments of sequences extended to the breakpoints. These outputs provide materials for studying possible evolutionary mechanisms and biological functionalities of the rearrangement.
Collapse
Affiliation(s)
- Te-Chin Chu
- Institute of Information Science, Academia Sinica, National Taiwan University, Taipei, Taiwan
| | | | | | | | | |
Collapse
|
311
|
Bolduc N, Hake S. The maize transcription factor KNOTTED1 directly regulates the gibberellin catabolism gene ga2ox1. THE PLANT CELL 2009; 21:1647-58. [PMID: 19567707 PMCID: PMC2714931 DOI: 10.1105/tpc.109.068221] [Citation(s) in RCA: 196] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2009] [Revised: 05/31/2009] [Accepted: 06/08/2009] [Indexed: 05/18/2023]
Abstract
KNOTTED1 (KN1)-like homeobox (KNOX) transcription factors are involved in the establishment and maintenance of plant meristems; however, few direct targets of KNOX proteins have been recognized. Using a combination of double mutant analysis and biochemistry, we found that in maize (Zea mays), KN1 negatively modulates the accumulation of gibberellin (GA) through the control of ga2ox1, which codes for an enzyme that inactivates GA. The ga2ox1 mRNA level is elevated in immature leaves of dominant KNOX mutants and downregulated in reproductive meristems of the null allele kn1-e1. KN1 binds in vivo to an intron of ga2ox1 through a cis-regulatory element containing two TGAC motifs. VP16-KN1 activates transcription in planta from a chimeric promoter containing this binding site. The domains of expression of kn1 and ga2ox1 mRNAs overlap at the base of the shoot apical meristem and the base of newly initiated leaves, suggesting that KN1-mediated activation of ga2ox1 maintains a boundary between meristem cell identity and rapidly elongating cells of the shoot. The KN1 binding site is conserved in ga2ox1 genes of different grasses, suggesting that the local regulation of bioactive GA levels through KNOX proteins is a common theme in grasses.
Collapse
Affiliation(s)
- Nathalie Bolduc
- Plant Gene Expression Center, U.S. Department of Agriculture-Agricultural Research Service, University of California, Albany, California 94710, USA
| | | |
Collapse
|
312
|
Freeling M, Subramaniam S. Conserved noncoding sequences (CNSs) in higher plants. CURRENT OPINION IN PLANT BIOLOGY 2009; 12:126-32. [PMID: 19249238 DOI: 10.1016/j.pbi.2009.01.005] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Revised: 01/22/2009] [Accepted: 01/22/2009] [Indexed: 05/09/2023]
Abstract
Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA.
| | | |
Collapse
|
313
|
Freeling M. Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. ANNUAL REVIEW OF PLANT BIOLOGY 2009; 60:433-53. [PMID: 19575588 DOI: 10.1146/annurev.arplant.043008.092122] [Citation(s) in RCA: 605] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Each mode of gene duplication (tandem, tetraploid, segmental, transpositional) retains genes in a biased manner. A reciprocal relationship exists between plant genes retained postpaleotetraploidy versus genes retained after an ancient tandem duplication. Among the models (C, neofunctionalization, balanced gene drive) and ideas that might explain this relationship, only balanced gene drive predicts reciprocity. The gene balance hypothesis explains that more "connected" genes--by protein-protein interactions in a heteromer, for example--are less likely to be retained as a tandem or transposed duplicate and are more likely to be retained postpaleotetraploidy; otherwise, selectively negative dosage effects are created. Biased duplicate retention is an instant and neutral by-product, a spandrel, of purifying selection. Balanced gene drive expanded plant gene families, including those encoding proteasomal proteins, protein kinases, motors, and transcription factors, with each paleotetraploidy, which could explain trends involving complexity. Balanced gene drive is a saltation mechanism in the mutationist tradition.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA.
| |
Collapse
|
314
|
Lyons E, Pedersen B, Kane J, Alam M, Ming R, Tang H, Wang X, Bowers J, Paterson A, Lisch D, Freeling M. Finding and comparing syntenic regions among Arabidopsis and the outgroups papaya, poplar, and grape: CoGe with rosids. PLANT PHYSIOLOGY 2008; 148:1772-81. [PMID: 18952863 PMCID: PMC2593677 DOI: 10.1104/pp.108.124867] [Citation(s) in RCA: 277] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2008] [Accepted: 10/19/2008] [Indexed: 05/18/2023]
Abstract
In addition to the genomes of Arabidopsis (Arabidopsis thaliana) and poplar (Populus trichocarpa), two near-complete rosid genome sequences, grape (Vitis vinifera) and papaya (Carica papaya), have been recently released. The phylogenetic relationship among these four genomes and the placement of their three independent, fractionated tetraploidies sum to a powerful comparative genomic system. CoGe, a platform of multiple whole or near-complete genome sequences, provides an integrative Web-based system to find and align syntenic chromosomal regions and visualize the output in an intuitive and interactive manner. CoGe has been customized to specifically support comparisons among the rosids. Crucial facts and definitions are presented to clearly describe the sorts of biological questions that might be answered in part using CoGe, including patterns of DNA conservation, accuracy of annotation, transposability of individual genes, subfunctionalization and/or fractionation of syntenic gene sets, and conserved noncoding sequence content. This précis of an online tutorial, CoGe with Rosids (http://tinyurl.com/4a23pk), presents sample results graphically.
Collapse
Affiliation(s)
- Eric Lyons
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
315
|
Freeling M, Lyons E, Pedersen B, Alam M, Ming R, Lisch D. Many or most genes in Arabidopsis transposed after the origin of the order Brassicales. Genome Res 2008; 18:1924-37. [PMID: 18836034 DOI: 10.1101/gr.081026.108] [Citation(s) in RCA: 140] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Previous to this work, typical genes were thought to move from one position to another infrequently. On the contrary, we now estimate that between one-fourth and three-fourths of the genes in Arabidopsis transposed in the Brassicales. We used the CoGe comparative genomics system to perform and visualize multiple orthologous chromosomal alignments. Using this tool, we found large differences between different categories of genes. Ten of the gene families examined, including genes in most transcription factor families, exhibited a median frequency of 5% transposed genes. In contrast, other gene families were composed largely of transposed genes: NB-LRR disease-resistance genes, genes encoding MADS-box and B3 transcription factors, and genes encoding F-box proteins. A unique method involving transposition-rich regions of genome allowed us to obtain an indirect estimate of the positional stability of the average gene. The observed differences between gene families raise important questions concerning the causes and consequences of gene transposition.
Collapse
Affiliation(s)
- Michael Freeling
- Department of Plant and Microbial Biology, University of California at Berkeley, California 94720, USA.
| | | | | | | | | | | |
Collapse
|
316
|
Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH, Senin P, Wang W, Ly BV, Lewis KLT, Salzberg SL, Feng L, Jones MR, Skelton RL, Murray JE, Chen C, Qian W, Shen J, Du P, Eustice M, Tong E, Tang H, Lyons E, Paull RE, Michael TP, Wall K, Rice DW, Albert H, Wang ML, Zhu YJ, Schatz M, Nagarajan N, Acob RA, Guan P, Blas A, Wai CM, Ackerman CM, Ren Y, Liu C, Wang J, Wang J, Na JK, Shakirov EV, Haas B, Thimmapuram J, Nelson D, Wang X, Bowers JE, Gschwend AR, Delcher AL, Singh R, Suzuki JY, Tripathi S, Neupane K, Wei H, Irikura B, Paidi M, Jiang N, Zhang W, Presting G, Windsor A, Navajas-Pérez R, Torres MJ, Feltus FA, Porter B, Li Y, Burroughs AM, Luo MC, Liu L, Christopher DA, Mount SM, Moore PH, Sugimura T, Jiang J, Schuler MA, Friedman V, Mitchell-Olds T, Shippen DE, dePamphilis CW, Palmer JD, Freeling M, Paterson AH, Gonsalves D, Wang L, Alam M. The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 2008; 452:991-6. [PMID: 18432245 PMCID: PMC2836516 DOI: 10.1038/nature06856] [Citation(s) in RCA: 616] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2007] [Accepted: 02/22/2008] [Indexed: 11/09/2022]
Abstract
Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3x draft genome sequence of 'SunUp' papaya, the first commercial virus-resistant transgenic fruit tree to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties.
Collapse
Affiliation(s)
- Ray Ming
- Hawaii Agriculture Research Center, Aiea, Hawaii 96701, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|