101
|
Abstract
Plants respond and adapt to drought, cold, and high-salinity stresses. Stress-inducible gene products function in the stress response and tolerance in plants. Using cDNA microarrays and oligonucleotide microarrays, stress-inducible genes have been identified in various plant species so far. Recently, tiling array technology has become a powerful tool for the whole-genome transcriptome analysis. We applied the Arabidopsis Affymetrix tiling arrays to study the whole-genome transcriptome under drought, cold, and high-salinity stresses and identified a large number of drought, cold, and high-salinity stress-inducible genes and transcriptional units (TUs).
Collapse
|
102
|
|
103
|
Hanada K, Akiyama K, Sakurai T, Toyoda T, Shinozaki K, Shiu SH. sORF finder: a program package to identify small open reading frames with high coding potential. ACTA ACUST UNITED AC 2009; 26:399-400. [PMID: 20008477 DOI: 10.1093/bioinformatics/btp688] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
SUMMARY sORF finder is a program package for identifying small open reading frames (sORFs) with high-coding potential. This application allows the identification of coding sORFs according to the nucleotide composition bias among coding sequences and the potential functional constraint at the amino acid level through evaluation of synonymous and non-synonymous substitution rates. AVAILABILITY Online tools and source codes are freely available at http://evolver.psc.riken.jp/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kousuke Hanada
- Plant Science Center, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan.
| | | | | | | | | | | |
Collapse
|
104
|
Proost S, Van Bel M, Sterck L, Billiau K, Van Parys T, Van de Peer Y, Vandepoele K. PLAZA: a comparative genomics resource to study gene and genome evolution in plants. THE PLANT CELL 2009; 21:3718-31. [PMID: 20040540 PMCID: PMC2814516 DOI: 10.1105/tpc.109.071506] [Citation(s) in RCA: 193] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2009] [Revised: 12/04/2009] [Accepted: 12/10/2009] [Indexed: 05/17/2023]
Abstract
The number of sequenced genomes of representatives within the green lineage is rapidly increasing. Consequently, comparative sequence analysis has significantly altered our view on the complexity of genome organization, gene function, and regulatory pathways. To explore all this genome information, a centralized infrastructure is required where all data generated by different sequencing initiatives is integrated and combined with advanced methods for data mining. Here, we describe PLAZA, an online platform for plant comparative genomics (http://bioinformatics.psb.ugent.be/plaza/). This resource integrates structural and functional annotation of published plant genomes together with a large set of interactive tools to study gene function and gene and genome evolution. Precomputed data sets cover homologous gene families, multiple sequence alignments, phylogenetic trees, intraspecies whole-genome dot plots, and genomic colinearity between species. Through the integration of high confidence Gene Ontology annotations and tree-based orthology between related species, thousands of genes lacking any functional description are functionally annotated. Advanced query systems, as well as multiple interactive visualization tools, are available through a user-friendly and intuitive Web interface. In addition, detailed documentation and tutorials introduce the different tools, while the workbench provides an efficient means to analyze user-defined gene sets through PLAZA's interface. In conclusion, PLAZA provides a comprehensible and up-to-date research environment to aid researchers in the exploration of genome information within the green plant lineage.
Collapse
Affiliation(s)
- Sebastian Proost
- Department of Plant Systems Biology, Flanders Institute for Biotechnology, B-9052 Ghent, Belgium.
| | | | | | | | | | | | | |
Collapse
|
105
|
Robinson SJ, Tang LH, Mooney BAG, McKay SJ, Clarke WE, Links MG, Karcz S, Regan S, Wu YY, Gruber MY, Cui D, Yu M, Parkin IAP. An archived activation tagged population of Arabidopsis thaliana to facilitate forward genetics approaches. BMC PLANT BIOLOGY 2009; 9:101. [PMID: 19646253 PMCID: PMC3091532 DOI: 10.1186/1471-2229-9-101] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Accepted: 07/31/2009] [Indexed: 05/18/2023]
Abstract
BACKGROUND Functional genomics tools provide researchers with the ability to apply high-throughput techniques to determine the function and interaction of a diverse range of genes. Mutagenized plant populations are one such resource that facilitate gene characterisation. They allow complex physiological responses to be correlated with the expression of single genes in planta, through either reverse genetics where target genes are mutagenized to assay the affect, or through forward genetics where populations of mutant lines are screened to identify those whose phenotype diverges from wild type for a particular trait. One limitation of these types of populations is the prevalence of gene redundancy within plant genomes, which can mask the affect of individual genes. Activation or enhancer populations, which not only provide knock-out but also dominant activation mutations, can facilitate the study of such genes. RESULTS We have developed a population of almost 50,000 activation tagged A. thaliana lines that have been archived as individual lines to the T3 generation. The population is an excellent tool for both reverse and forward genetic screens and has been used successfully to identify a number of novel mutants. Insertion site sequences have been generated and mapped for 15,507 lines to enable further application of the population, while providing a clear distribution of T-DNA insertions across the genome. The population is being screened for a number of biochemical and developmental phenotypes, provisional data identifying novel alleles and genes controlling steps in proanthocyanidin biosynthesis and trichome development is presented. CONCLUSION This publicly available population provides an additional tool for plant researcher's to assist with determining gene function for the many as yet uncharacterised genes annotated within the Arabidopsis genome sequence http://aafc-aac.usask.ca/FST. The presence of enhancer elements on the inserted T-DNA molecule allows both knock-out and dominant activation phenotypes to be identified for traits of interest.
Collapse
Affiliation(s)
- Stephen J Robinson
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Lily H Tang
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Brent AG Mooney
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Sheldon J McKay
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Wayne E Clarke
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Matthew G Links
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Steven Karcz
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Sharon Regan
- Department of Biology, Biosciences Complex, Queens University, Kingston, Ontario, K7L 3N6, Canada
| | - Yun-Yun Wu
- Department of Biology, Biosciences Complex, Queens University, Kingston, Ontario, K7L 3N6, Canada
| | - Margaret Y Gruber
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Dejun Cui
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Min Yu
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| | - Isobel AP Parkin
- Agriculture and Agri-Food Canada, Saskatoon Research Centre, 107 Science Place, Saskatoon, S7N 0X2, Canada
| |
Collapse
|
106
|
Butenko MA, Vie AK, Brembu T, Aalen RB, Bones AM. Plant peptides in signalling: looking for new partners. TRENDS IN PLANT SCIENCE 2009; 14:255-63. [PMID: 19362511 DOI: 10.1016/j.tplants.2009.02.002] [Citation(s) in RCA: 112] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2008] [Revised: 02/05/2009] [Accepted: 02/09/2009] [Indexed: 01/11/2023]
Abstract
A novel candidate ligand-receptor system, INFLORESCENCE DEFICIENT IN ABSCISSION (IDA) and the related receptor-like kinases (RLKs) HAESA (HAE) and HAESA-LIKE (HSL)2, has been shown to control floral abscission in Arabidopsis thaliana. Furthermore, several IDA-LIKE (IDL) proteins, which contain a conserved C-terminal domain resembling that of the CLAVATA (CLV)3-ENDOSPERM SURROUNDING REGION (ESR)-RELATED (CLE) protein family, have been shown to be partially redundant with IDA. Here, we use the genetic similarities between the IDA and CLV3 signalling systems to hypothesize that closely related peptide ligands are likely to interact with families of closely related RLKs. Guided by this hypothesis and with the aid of genetics and novel methods, ligand-receptor systems can be identified to improve our understanding of developmental processes in plants.
Collapse
Affiliation(s)
- Melinka A Butenko
- Department of Molecular Biosciences, University of Oslo, N-0316 Oslo, Norway
| | | | | | | | | |
Collapse
|
107
|
Abstract
DNA microarrays have become a mainstream tool in experimental plant biology. The constant improvements in the technological platforms have enabled the development of the tiling DNA microarrays that cover the whole genome, which in turn catalyzed the wide variety of creative applications of such microarrays in the areas as diverse as global studies of genetic variation, DNA-binding proteins, DNA methylation, and chromatin and transcriptome dynamics. This chapter attempts to summarize such applications as well as discusses some technical and strategic issues that are particular to the use of tiling microarrays.
Collapse
|
108
|
Abstract
Gene annotation underpins genome science. Most often protein coding sequence is inferred from the genome based on transcript evidence and computational predictions. While generally correct, gene models suffer from errors in reading frame, exon border definition, and exon identification. To ascertain the error rate of Arabidopsis thaliana gene models, we isolated proteins from a sample of Arabidopsis tissues and determined the amino acid sequences of 144,079 distinct peptides by tandem mass spectrometry. The peptides corresponded to 1 or more of 3 different translations of the genome: a 6-frame translation, an exon splice-graph, and the currently annotated proteome. The majority of the peptides (126,055) resided in existing gene models (12,769 confirmed proteins), comprising 40% of annotated genes. Surprisingly, 18,024 novel peptides were found that do not correspond to annotated genes. Using the gene finding program AUGUSTUS and 5,426 novel peptides that occurred in clusters, we discovered 778 new protein-coding genes and refined the annotation of an additional 695 gene models. The remaining 13,449 novel peptides provide high quality annotation (>99% correct) for thousands of additional genes. Our observation that 18,024 of 144,079 peptides did not match current gene models suggests that 13% of the Arabidopsis proteome was incomplete due to approximately equal numbers of missing and incorrect gene models.
Collapse
|
109
|
Armisén D, Lecharny A, Aubourg S. Unique genes in plants: specificities and conserved features throughout evolution. BMC Evol Biol 2008; 8:280. [PMID: 18847470 PMCID: PMC2576244 DOI: 10.1186/1471-2148-8-280] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2008] [Accepted: 10/10/2008] [Indexed: 11/10/2022] Open
Abstract
Background Plant genomes contain a high proportion of duplicated genes as a result of numerous whole, segmental and local duplications. These duplications lead up to the formation of gene families, which are the usual material for many evolutionary studies. However, all characterized genomes include single-copy (unique) genes that have not received much attention. Unlike gene duplication, gene loss is not an unspecific mechanism but is rather influenced by a functional selection. In this context, we have established and used stringent criteria in order to identify suitable sets of unique genes present in plant proteomes. Comparisons of unique genes in the green phylum were used to characterize the gene and protein features exhibited by both conserved and species-specific unique genes. Results We identified the unique genes within both A. thaliana and O. sativa genomes and classified them according to the number of homologs in the alternative species: none (U{1:0}), one (U{1:1}) or several (U{1:m}). Regardless of the species, all the genes in these groups present some conserved characteristics, such as small average protein size and abnormal intron number. In order to understand the origin and function of unique genes, we further characterized the U{1:1} gene pairs. The possible involvement of sequence convergence in the creation of U{1:1} pairs was discarded due to the frequent conservation of intron positions. Furthermore, an orthology relationship between the two members of each U{1:1} pair was strongly supported by a high conservation in the protein sizes and transcription levels. Within the promoter of the unique conserved genes, we found a number of TATA and TELO boxes that specifically differed from their mean number in the whole genome. Many unique genes have been conserved as unique through evolution from the green alga Ostreococcus lucimarinus to higher plants. Plant unique genes may also have homologs in bacteria and we showed a link between the targeting towards plastids of proteins encoded by plant nuclear unique genes and their homology with a bacterial protein. Conclusion Many of the A. thaliana and O. sativa unique genes are conserved in plants for which the ancestor diverged at least 725 million years ago (MYA). Half of these genes are also present in other eukaryotic and/or prokaryotic species. Thus, our results indicate that (i) a strong negative selection pressure has conserved a number of genes as unique in genomes throughout evolution, (ii) most unique genes are subjected to a low divergence rate, (iii) they have some features observed in housekeeping genes but for most of them there is no functional annotation and (iv) they may have an ancient origin involving a possible gene transfer from ancestral chloroplasts or bacteria to the plant nucleus.
Collapse
Affiliation(s)
- David Armisén
- Unité de Recherche en Génomique Végetale , UMR INRA 1165 - CNRS 8114 - Université d'Evry Val d'Essonne, 2 rue Gaston Crémieux, CP 5708, F-91057 Evry Cedex, France.
| | | | | |
Collapse
|
110
|
Matsui A, Ishida J, Morosawa T, Mochizuki Y, Kaminuma E, Endo TA, Okamoto M, Nambara E, Nakajima M, Kawashima M, Satou M, Kim JM, Kobayashi N, Toyoda T, Shinozaki K, Seki M. Arabidopsis transcriptome analysis under drought, cold, high-salinity and ABA treatment conditions using a tiling array. PLANT & CELL PHYSIOLOGY 2008; 49:1135-49. [PMID: 18625610 DOI: 10.1093/pcp/pcn101] [Citation(s) in RCA: 330] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Plants respond and adapt to drought, cold and high-salinity stresses in order to survive. In this study, we applied Arabidopsis Affymetrix tiling arrays to study the whole genome transcriptome under drought, cold, high-salinity and ABA treatment conditions. The bioinformatic analysis using the tiling array data showed that 7,719 non-AGI transcriptional units (TUs) exist in the unannotated "intergenic" regions of Arabidopsis genome. These include 1,275 and 181 TUs that are induced and downregulated, respectively, by the stress or ABA treatments. Most of the non-AGI TUs are hypothetical non-protein-coding RNAs. About 80% of the non-AGI TUs belong to pairs of the fully overlapping sense-antisense transcripts (fSATs). Significant linear correlation between the expression ratios (treated/untreated) of the sense TUs and the ratios of the antisense TUs was observed in the SATs of AGI code/non-AGI TU. We studied the biogenesis mechanisms of the stress- or ABA-inducible antisense RNAs and found that the expression of sense TUs is necessary for the stress- or ABA-inducible expression of the antisense TUs in the fSATs (AGI code/non-AGI TU).
Collapse
Affiliation(s)
- Akihiro Matsui
- Plant Genomic Network Research Team, Plant Functional Genomics Research Group, RIKEN Plant Science Center, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045 Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
111
|
The EPIP peptide of INFLORESCENCE DEFICIENT IN ABSCISSION is sufficient to induce abscission in arabidopsis through the receptor-like kinases HAESA and HAESA-LIKE2. THE PLANT CELL 2008; 20:1805-17. [PMID: 18660431 DOI: 10.1105/tpc.108.059139] [Citation(s) in RCA: 220] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In Arabidopsis thaliana, the final step of floral organ abscission is regulated by INFLORESCENCE DEFICIENT IN ABSCISSION (IDA): ida mutants fail to abscise floral organs, and plants overexpressing IDA display earlier abscission. We show that five IDA-LIKE (IDL) genes are expressed in different tissues, but plants overexpressing these genes have phenotypes similar to IDA-overexpressing plants, suggesting functional redundancy. IDA/IDL proteins have N-terminal signal peptides and a C-terminal conserved motif (extended PIP [EPIP]) at the C terminus (EPIP-C). IDA can, similar to CLAVATA3, be processed by an activity from cauliflower meristems. The EPIP-C of IDA and IDL1 replaced IDA function in vivo, when the signal peptide was present. In addition, synthetic IDA and IDL1 EPIP peptides rescued ida and induced early floral abscission in wild-type flowers. The EPIP-C of the other IDL proteins could partially substitute for IDA function. Similarly to ida, a double mutant between the receptor-like kinases (RLKs) HAESA (HAE) and HAESA-LIKE2 (HSL2) displays nonabscising flowers. Neither overexpression of IDA nor synthetic EPIP or EPIP-C peptides could rescue the hae hsl2 abscission deficiency. We propose that IDA and the IDL proteins constitute a family of putative ligands that act through RLKs to regulate different events during plant development.
Collapse
|
112
|
Laubinger S, Zeller G, Henz SR, Sachsenberg T, Widmer CK, Naouar N, Vuylsteke M, Schölkopf B, Rätsch G, Weigel D. At-TAX: a whole genome tiling array resource for developmental expression analysis and transcript identification in Arabidopsis thaliana. Genome Biol 2008; 9:R112. [PMID: 18613972 PMCID: PMC2530869 DOI: 10.1186/gb-2008-9-7-r112] [Citation(s) in RCA: 89] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2008] [Revised: 06/12/2008] [Accepted: 07/09/2008] [Indexed: 11/10/2022] Open
Abstract
Gene expression maps for model organisms, including Arabidopsis thaliana, have typically been created using gene-centric expression arrays. Here, we describe a comprehensive expression atlas, Arabidopsis thaliana Tiling Array Express (At-TAX), which is based on whole-genome tiling arrays. We demonstrate that tiling arrays are accurate tools for gene expression analysis and identified more than 1,000 unannotated transcribed regions. Visualizations of gene expression estimates, transcribed regions, and tiling probe measurements are accessible online at the At-TAX homepage.
Collapse
Affiliation(s)
- Sascha Laubinger
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Spemannstr, 37-39, 72076 Tübingen, Germany.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
113
|
Hashimoto Y, Kondo T, Kageyama Y. Lilliputians get into the limelight: novel class of small peptide genes in morphogenesis. Dev Growth Differ 2008; 50 Suppl 1:S269-76. [PMID: 18459982 DOI: 10.1111/j.1440-169x.2008.00994.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Generally, bioactive small peptides are derived from precursors with signal sequences at their N-terminal ends, which undergo modification and proteolysis through a secretory pathway. By contrast, small peptides encoded in short open reading frames (sORF) lack signaling sequences and therefore are released into the cytoplasm, which may result in their having functions distinct from those of secreted peptides. Several small peptides encoded by sORF are involved in the morphogenesis of multicellular organisms. POLARIS, ROTUNDIFOLIA4, and Enod40 are plant peptides that are involved, respectively, in root formation, leaf shape control, and cortical cell division during nodule formation. Brick1/HSPC300 is an evolutionarily conserved component of the actin reorganization complex. polished rice/tarsal-less and mille-pattes encode related small peptides that are required for epithelial morphogenesis in Drosophila and segmentation in Tribolium. There are only a few known examples of small peptides encoded by sORF, and their molecular functions are still largely obscure. Nevertheless, an increasing number of sORF genes is being identified, and further research should reveal their roles in novel molecular mechanisms underlying developmental events.
Collapse
Affiliation(s)
- Yoshiko Hashimoto
- Laboratory of Developmental Genetics, Okazaki Institute for Integrative Bioscience, National Institutes of Natural Sciences, 5-1 Myodaiji-Higashiyama, Okazaki 444-8787, Japan
| | | | | |
Collapse
|
114
|
Gleason CA, Liu QL, Williamson VM. Silencing a candidate nematode effector gene corresponding to the tomato resistance gene Mi-1 leads to acquisition of virulence. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2008; 21:576-85. [PMID: 18393617 DOI: 10.1094/mpmi-21-5-0576] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
The Mi-1 gene in tomato confers effective resistance against several species of root-knot nematode, including Meloidogyne javanica. A strain of M. javanica that can reproduce on tomato with Mi-1 was obtained from a culture of an avirulent strain after greenhouse selection. DNA blots and amplified fragment length polymorphism (AFLP) analysis indicated that the two nematode strains are closely related. Expression patterns visualized as cDNA AFLPs were nearly identical except for a cDNA fragment, Cg-1, that was present in the avirulent strain but not in the virulent strain. DNA blots showed that Cg-1 corresponds to a member of a small gene family with one or more copies missing in the virulent strain compared with the avirulent strain. Except for the presence of a histone stem loop near the 3' end of the transcript, Cg-1 shows no similarity to other sequences in GenBank. The longest open reading frame is 32 amino acids and initiates at the fourth AUG in the predicted transcript. When nematode juveniles of the Mi-1-avirulent strain were soaked in dsRNA corresponding to part of the predicted Cg-1 transcript, they produced progeny that were virulent on tomato carrying the Mi-1 gene, strongly suggesting that Cg-1 is required in the nematode for Mi-1-mediated resistance.
Collapse
|
115
|
Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH, Senin P, Wang W, Ly BV, Lewis KLT, Salzberg SL, Feng L, Jones MR, Skelton RL, Murray JE, Chen C, Qian W, Shen J, Du P, Eustice M, Tong E, Tang H, Lyons E, Paull RE, Michael TP, Wall K, Rice DW, Albert H, Wang ML, Zhu YJ, Schatz M, Nagarajan N, Acob RA, Guan P, Blas A, Wai CM, Ackerman CM, Ren Y, Liu C, Wang J, Wang J, Na JK, Shakirov EV, Haas B, Thimmapuram J, Nelson D, Wang X, Bowers JE, Gschwend AR, Delcher AL, Singh R, Suzuki JY, Tripathi S, Neupane K, Wei H, Irikura B, Paidi M, Jiang N, Zhang W, Presting G, Windsor A, Navajas-Pérez R, Torres MJ, Feltus FA, Porter B, Li Y, Burroughs AM, Luo MC, Liu L, Christopher DA, Mount SM, Moore PH, Sugimura T, Jiang J, Schuler MA, Friedman V, Mitchell-Olds T, Shippen DE, dePamphilis CW, Palmer JD, Freeling M, Paterson AH, Gonsalves D, Wang L, Alam M. The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 2008; 452:991-6. [PMID: 18432245 PMCID: PMC2836516 DOI: 10.1038/nature06856] [Citation(s) in RCA: 616] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2007] [Accepted: 02/22/2008] [Indexed: 11/09/2022]
Abstract
Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3x draft genome sequence of 'SunUp' papaya, the first commercial virus-resistant transgenic fruit tree to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties.
Collapse
Affiliation(s)
- Ray Ming
- Hawaii Agriculture Research Center, Aiea, Hawaii 96701, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
116
|
Gregory BD, Yazaki J, Ecker JR. Utilizing tiling microarrays for whole-genome analysis in plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2008; 53:636-44. [PMID: 18269573 DOI: 10.1111/j.1365-313x.2007.03320.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The recent explosion in available genome sequence data has ushered in an era in which analysis of a whole genome can be performed in a single experiment. While DNA microarrays have long been the established technology for measuring gene expression levels, standard expression arrays use relatively few probes for each gene and are typically biased toward known and predicted gene structures. Recently, with the availability of complete genome sequences for many organisms, very-high-density oligonucleotide-based microarrays that span the entire genome have emerged as the preferred platform for genomic analysis. Whole-genome tiling microarrays can be employed for a myriad of purposes, including empirical annotation of the transcriptome, chromatin immunoprecipitation-chip studies, analysis of alternative RNA splicing, characterization of the methylation state of cytosine bases throughout a genome (methylome), and DNA polymorphism discovery. Here, we review several applications of whole-genome technology to obtain a variety of genomic-scale information in plants.
Collapse
Affiliation(s)
- Brian D Gregory
- Plant Biology Laboratory, and Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
117
|
Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L, Radenbaugh A, Singh S, Swing V, Tissier C, Zhang P, Huala E. The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Res 2008. [PMID: 17986450 DOI: 10.1093/nur/skm965] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2023] Open
Abstract
The Arabidopsis Information Resource (TAIR, http://arabidopsis.org) is the model organism database for the fully sequenced and intensively studied model plant Arabidopsis thaliana. Data in TAIR is derived in large part from manual curation of the Arabidopsis research literature and direct submissions from the research community. New developments at TAIR include the addition of the GBrowse genome viewer to the TAIR site, a redesigned home page, navigation structure and portal pages to make the site more intuitive and easier to use, the launch of several TAIR web services and a new genome annotation release (TAIR7) in April 2007. A combination of manual and computational methods were used to generate this release, which contains 27,029 protein-coding genes, 3889 pseudogenes or transposable elements and 1123 ncRNAs (32,041 genes in all, 37,019 gene models). A total of 681 new genes and 1002 new splice variants were added. Overall, 10,098 loci (one-third of all loci from the previous TAIR6 release) were updated for the TAIR7 release.
Collapse
Affiliation(s)
- David Swarbreck
- Carnegie Institution, Department of Plant Biology, 260 Panama St, Stanford, CA 94305, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
118
|
Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L, Radenbaugh A, Singh S, Swing V, Tissier C, Zhang P, Huala E. The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Res 2007; 36:D1009-14. [PMID: 17986450 PMCID: PMC2238962 DOI: 10.1093/nar/gkm965] [Citation(s) in RCA: 670] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The Arabidopsis Information Resource (TAIR, http://arabidopsis.org) is the model organism database for the fully sequenced and intensively studied model plant Arabidopsis thaliana. Data in TAIR is derived in large part from manual curation of the Arabidopsis research literature and direct submissions from the research community. New developments at TAIR include the addition of the GBrowse genome viewer to the TAIR site, a redesigned home page, navigation structure and portal pages to make the site more intuitive and easier to use, the launch of several TAIR web services and a new genome annotation release (TAIR7) in April 2007. A combination of manual and computational methods were used to generate this release, which contains 27 029 protein-coding genes, 3889 pseudogenes or transposable elements and 1123 ncRNAs (32 041 genes in all, 37 019 gene models). A total of 681 new genes and 1002 new splice variants were added. Overall, 10 098 loci (one-third of all loci from the previous TAIR6 release) were updated for the TAIR7 release.
Collapse
Affiliation(s)
- David Swarbreck
- Carnegie Institution, Department of Plant Biology, 260 Panama St, Stanford, CA 94305, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
119
|
Jones-Rhoades MW, Borevitz JO, Preuss D. Genome-wide expression profiling of the Arabidopsis female gametophyte identifies families of small, secreted proteins. PLoS Genet 2007; 3:1848-61. [PMID: 17937500 PMCID: PMC2014789 DOI: 10.1371/journal.pgen.0030171] [Citation(s) in RCA: 140] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2007] [Accepted: 08/22/2007] [Indexed: 01/10/2023] Open
Abstract
The female gametophyte of flowering plants, the embryo sac, develops within the diploid (sporophytic) tissue of the ovule. While embryo sac–expressed genes are known to be required at multiple stages of the fertilization process, the set of embryo sac–expressed genes has remained poorly defined. In particular, the set of genes responsible for mediating intracellular communication between the embryo sac and the male gametophyte, the pollen grain, is unknown. We used high-throughput cDNA sequencing and whole-genome tiling arrays to compare gene expression in wild-type ovules to that in dif1 ovules, which entirely lack embryo sacs, and myb98 ovules, which are impaired in pollen tube attraction. We identified nearly 400 genes that are downregulated in dif1 ovules. Seventy-eight percent of these embryo sac–dependent genes were predicted to encode for secreted proteins, and 60% belonged to multigenic families. Our results define a large number of candidate extracellular signaling molecules that may act during embryo sac development or fertilization; less than half of these are represented on the widely used ATH1 expression array. In particular, we found that 37 out of 40 genes encoding Domain of Unknown Function 784 (DUF784) domains require the synergid-specific transcription factor MYB98 for expression. Several DUF784 genes were transcribed in synergid cells of the embryo sac, implicating the DUF784 gene family in mediating late stages of embryo sac development or interactions with pollen tubes. The coexpression of highly similar proteins suggests a high degree of functional redundancy among embryo sac genes. During the sexual reproduction of flowering plants, a pollen tube delivers sperm cells to a specialized group of cells known as the embryo sac, which contains the egg cell. It is known that embryo sacs are active participants in guiding the growth of pollen tubes, in facilitating fertilization, and in initiating seed development. However, the genes responsible for the complex biology of embryo sacs are poorly understood. The authors use two recently developed technologies, whole-genome tiling microarrays and high-throughput cDNA sequencing, to identify hundreds of genes expressed in embryo sacs of Arabidopsis thaliana. Most embryo sac–dependent genes have no known function, and include entire families of related genes that are only expressed in embryo sacs. Furthermore, most embryo sac–dependent genes encode small proteins that are potentially secreted from their cells of origin, suggesting that they may act as intracellular signals or to modify the extracellular matrix during fertilization or embryo sac development. These results illustrate the extent to which our understanding of plant sexual reproduction is limited and identifies hundreds of candidate genes for future studies investigating the molecular biology of the embryo sac.
Collapse
Affiliation(s)
- Matthew W Jones-Rhoades
- Department of Molecular Genetics and Cell Biology, The University of Chicago, Chicago, Illinois, United States of America
| | - Justin O Borevitz
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, United States of America
| | - Daphne Preuss
- Department of Molecular Genetics and Cell Biology, The University of Chicago, Chicago, Illinois, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|