1
|
Ding G, Mugume Y, Dueñas ME, Lee YJ, Liu M, Nettleton DS, Zhao X, Li L, Bassham DC, Nikolau BJ. Biological insights from multi-omics analysis strategies: Complex pleotropic effects associated with autophagy. FRONTIERS IN PLANT SCIENCE 2023; 14:1093358. [PMID: 36875559 PMCID: PMC9978356 DOI: 10.3389/fpls.2023.1093358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
Research strategies that combine molecular data from multiple levels of genome expression (i.e., multi-omics data), often referred to as a systems biology strategy, has been advocated as a route to discovering gene functions. In this study we conducted an evaluation of this strategy by combining lipidomics, metabolite mass-spectral imaging and transcriptomics data from leaves and roots in response to mutations in two AuTophaGy-related (ATG) genes of Arabidopsis. Autophagy is an essential cellular process that degrades and recycles macromolecules and organelles, and this process is blocked in the atg7 and atg9 mutants that were the focus of this study. Specifically, we quantified abundances of ~100 lipids and imaged the cellular locations of ~15 lipid molecular species and the relative abundance of ~26,000 transcripts from leaf and root tissues of WT, atg7 and atg9 mutant plants, grown either in normal (nitrogen-replete) and autophagy-inducing conditions (nitrogen-deficient). The multi-omics data enabled detailed molecular depiction of the effect of each mutation, and a comprehensive physiological model to explain the consequence of these genetic and environmental changes in autophagy is greatly facilitated by the a priori knowledge of the exact biochemical function of the ATG7 and ATG9 proteins.
Collapse
Affiliation(s)
- Geng Ding
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA, United States
| | - Yosia Mugume
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | | | - Young Jin Lee
- Department of Chemistry, Iowa State University, Ames, IA, United States
| | - Meiling Liu
- Department of Statistics, Iowa State University, Ames, IA, United States
| | | | - Xuefeng Zhao
- Research Information Technology, College of Liberal Arts & Sciences, Iowa State University, Ames, IA, United States
| | - Ling Li
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
| | - Diane C. Bassham
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, United States
| | - Basil J. Nikolau
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, IA, United States
| |
Collapse
|
2
|
Zhao P, Ye M, Wang R, Wang D, Chen Q. Systematic identification and functional analysis of potato (Solanum tuberosum L.) bZIP transcription factors and overexpression of potato bZIP transcription factor StbZIP-65 enhances salt tolerance. Int J Biol Macromol 2020; 161:155-167. [PMID: 32512099 DOI: 10.1016/j.ijbiomac.2020.06.032] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 06/03/2020] [Indexed: 01/23/2023]
Abstract
Basic leucine zipper (bZIP) transcription factors play important roles in numerous growth and developmental processes. Potato (Solanum tuberosum L.) is a worldwide important vegetable crop; nevertheless, no systematic identification or functional analysis of the potato bZIP gene family has been reported. In this research, 65 potato bZIPs distributed on 12 potato chromosomes were identified. According to the topology of Arabidopsis and potato bZIP phylogenetic tree, the bZIPs were classified into thirteen groups, designated as A-K, M, and S, with no potato bZIPs included in groups J and M. The bZIPs from the same group shared a conserved exon-intron structure, intron phase, and motif composition. Eighteen potato bZIPs were involved in segmental duplications, and the duplicated gene pairs were under purifying selection. No tandemly duplicated potato bZIP was found. Each potato bZIP promoter contained at least one kind of stress-responsive or stress-related hormone-responsive element. RNA-seq and qRT-PCR analyses revealed different expression patterns of potato bZIPs under abiotic stresses. The overexpression of StbZIP-65 in Arabidopsis enhanced salt tolerance. The StbZIP-65 protein localized in the nucleus. β-Glucuronidase staining showed that promoter activity of StbZIP-65 was induced by exogenous methyl jasmonate. These results may aid in further functional studies of potato bZIP transcription factors.
Collapse
Affiliation(s)
- Peng Zhao
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Minghui Ye
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Ruoqiu Wang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Dongdong Wang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, Shaanxi 712100, China.
| | - Qin Chen
- College of Food Science and Engineering, Northwest A&F University, Yangling, Shaanxi 712100, China.
| |
Collapse
|
3
|
Frank MH, Chitwood DH. Plant chimeras: The good, the bad, and the 'Bizzaria'. Dev Biol 2016; 419:41-53. [PMID: 27381079 DOI: 10.1016/j.ydbio.2016.07.003] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2016] [Revised: 06/25/2016] [Accepted: 07/01/2016] [Indexed: 01/10/2023]
Abstract
Chimeras - organisms that are composed of cells of more than one genotype - captured the human imagination long before they were formally described and used in the laboratory. These organisms owe their namesake to a fire-breathing monster from Greek mythology that has the head of a lion, the body of a goat, and the tail of a serpent. The first description of a non-fictional chimera dates back to the middle of the seventeenth century when the Florentine gardener Pietro Nati discovered an adventitious shoot growing from the graft junction between sour orange (Citrus aurantium) and citron (Citrus medica). This perplexing chimera that grows with sectors phenotypically resembling each of the citrus progenitors inspired discussion and wonder from the scientific community and was fittingly named the 'Bizzaria'. Initially, the 'Bizzaria' was believed to be an asexual hybrid that formed from a cellular fusion between the grafted parents; however, in-depth cellular analyses carried out centuries later demonstrated that the 'Bizzaria', along with other chimeras, owe their unique sectored appearance to a conglomeration of cells from the two donors. Since this pivotal discovery at the turn of the twentieth century, chimeras have served both as tools and as unique biological phenomena that have contributed to our understanding of plant development at the cellular, tissue, and organismal level. Rapid advancements in genome sequencing technologies have enabled the establishment of new model species with novel morphological and developmental features that enable the generation of chimeric organisms. In this review, we show that genetic mosaic and chimera studies provide a technologically simple way to delve into the organismal, genetic, and genomic inner workings underlying the development of diverse model organisms. Moreover, we discuss the unique opportunity that chimeras present to explore universal principles governing intercellular communication and the coordination of organismal biology in a heterogenomic landscape.
Collapse
Affiliation(s)
- Margaret H Frank
- Donald Danforth Plant Science Center, 975 North Warson Rd, Saint Louis, MO 63132, United States.
| | - Daniel H Chitwood
- Donald Danforth Plant Science Center, 975 North Warson Rd, Saint Louis, MO 63132, United States
| |
Collapse
|
4
|
Song J, Wang B. Using euhalophytes to understand salt tolerance and to develop saline agriculture: Suaeda salsa as a promising model. ANNALS OF BOTANY 2015; 115:541-53. [PMID: 25288631 PMCID: PMC4332605 DOI: 10.1093/aob/mcu194] [Citation(s) in RCA: 156] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Accepted: 08/14/2014] [Indexed: 05/18/2023]
Abstract
BACKGROUND As important components in saline agriculture, halophytes can help to provide food for a growing world population. In addition to being potential crops in their own right, halophytes are also potential sources of salt-resistance genes that might help plant breeders and molecular biologists increase the salt tolerance of conventional crop plants. One especially promising halophyte is Suaeda salsa, a euhalophytic herb that occurs both on inland saline soils and in the intertidal zone. The species produces dimorphic seeds: black seeds are sensitive to salinity and remain dormant in light under high salt concentrations, while brown seeds can germinate under high salinity (e.g. 600 mm NaCl) regardless of light. Consequently, the species is useful for studying the mechanisms by which dimorphic seeds are adapted to saline environments. S. salsa has succulent leaves and is highly salt tolerant (e.g. its optimal NaCl concentration for growth is 200 mm). A series of S. salsa genes related to salt tolerance have been cloned and their functions tested: these include SsNHX1, SsHKT1, SsAPX, SsCAT1, SsP5CS and SsBADH. The species is economically important because its fresh branches have high value as a vegetable, and its seed oil is edible and rich in unsaturated fatty acids. Because it can remove salts and heavy metals from saline soils, S. salsa can also be used in the restoration of salinized or contaminated saline land. SCOPE Because of its economic and ecological value in saline agriculture, S. salsa is one of the most important halophytes in China. In this review, the value of S. salsa as a source of food, medicine and forage is discussed. Its uses in the restoration of salinized or contaminated land and as a source of salt-resistance genes are also considered.
Collapse
Affiliation(s)
- Jie Song
- Key Laboratory of Plant Stress, College of Life Science, Shandong Normal University, Jinan 250014, PR China
| | - Baoshan Wang
- Key Laboratory of Plant Stress, College of Life Science, Shandong Normal University, Jinan 250014, PR China
| |
Collapse
|
5
|
Plantagora: modeling whole genome sequencing and assembly of plant genomes. PLoS One 2011; 6:e28436. [PMID: 22174807 PMCID: PMC3236183 DOI: 10.1371/journal.pone.0028436] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2011] [Accepted: 11/08/2011] [Indexed: 01/17/2023] Open
Abstract
Background Genomics studies are being revolutionized by the next generation sequencing technologies, which have made whole genome sequencing much more accessible to the average researcher. Whole genome sequencing with the new technologies is a developing art that, despite the large volumes of data that can be produced, may still fail to provide a clear and thorough map of a genome. The Plantagora project was conceived to address specifically the gap between having the technical tools for genome sequencing and knowing precisely the best way to use them. Methodology/Principal Findings For Plantagora, a platform was created for generating simulated reads from several different plant genomes of different sizes. The resulting read files mimicked either 454 or Illumina reads, with varying paired end spacing. Thousands of datasets of reads were created, most derived from our primary model genome, rice chromosome one. All reads were assembled with different software assemblers, including Newbler, Abyss, and SOAPdenovo, and the resulting assemblies were evaluated by an extensive battery of metrics chosen for these studies. The metrics included both statistics of the assembly sequences and fidelity-related measures derived by alignment of the assemblies to the original genome source for the reads. The results were presented in a website, which includes a data graphing tool, all created to help the user compare rapidly the feasibility and effectiveness of different sequencing and assembly strategies prior to testing an approach in the lab. Some of our own conclusions regarding the different strategies were also recorded on the website. Conclusions/Significance Plantagora provides a substantial body of information for comparing different approaches to sequencing a plant genome, and some conclusions regarding some of the specific approaches. Plantagora also provides a platform of metrics and tools for studying the process of sequencing and assembly further.
Collapse
|
6
|
Henikoff S. Epigenetic Profiling of Histone Variants. Epigenomics 2008. [DOI: 10.1007/978-1-4020-9187-2_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
|
7
|
Turakulov R, Nontachaiyapoom S, Mitchelson KR, Gresshoff PM, Men AE. Ultrasensitive determination of absolute mRNA amounts at attomole levels of nearly identical plant genes with high-throughput mass spectrometry (MassARRAY). PLANT & CELL PHYSIOLOGY 2007; 48:1379-84. [PMID: 17686807 DOI: 10.1093/pcp/pcm103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Detection of very small amounts of RNA based on microdissection of plant tissue is essential for modern plant biology. Mass spectroscopy technology (MassARRAY) based on Sequenomtrade mark instrumentation was adapted to determine quickly and in a high-throughput fashion (by multiplexing) the absolute amounts of mRNA of closely related soybean genes. A sensitivity of 0.1 amol (10(-19)) was achieved, representing as few as 1,000 mRNA molecules. This methodology eliminates the use of housekeeping genes as reference standards and has multiple applications for plant functional genomics, such as the monitoring of individual expression of paralogous genes at ultra-low expression levels and/or in extremely small tissue samples.
Collapse
Affiliation(s)
- Rust Turakulov
- Australian Genome Research Facility, Level 5, Gehrmann Laboratories, University of Queensland, St Lucia, Brisbane, Australia
| | | | | | | | | |
Collapse
|
8
|
Ruebelt MC, Leimgruber NK, Lipp M, Reynolds TL, Nemeth MA, Astwood JD, Engel KH, Jany KD. Application of two-dimensional gel electrophoresis to interrogate alterations in the proteome of genetically modified crops. 1. Assessing analytical validation. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2006; 54:2154-61. [PMID: 16536590 DOI: 10.1021/jf0523566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Current tools used to assess the safety of food and feed derived from modern biotechnology emphasize the investigation of possible unintended effects caused directly by the expression of transgenes or indirectly by pleiotropy. These tools include extensive multisite and multiyear agronomic evaluations, compositional analyses, animal nutrition, and classical toxicology evaluations. Because analytical technologies are rapidly developing, proteome analysis based on two-dimensional gel electrophoresis (2DE) was investigated as a complementary tool to the existing technologies. A 2DE method was established for the qualitative and quantitative analysis of the seed proteome of Arabidopsis thaliana with the following validation parameters examined: (1) source and scope of variation; (2) repeatability; (3) sensitivity; and (4) linearity of the method. The 2DE method resolves proteins with isoelectric points between 4 and 9 and molecular masses (MM) of 6-120 kDa and is sensitive enough to detect protein levels in the low nanogram range. The separation of the proteins was demonstrated to be very reliable with relative position variations of 1.7 and 1.1% for the pI and MM directions, respectively. The mean coefficient of variation of 254 matched spot qualities was found to be 24.8% for the gel-to-gel and 26% for the overall variability. A linear relationship (R2 > 0.9) between protein amount and spot volume was demonstrated over a 100-fold range for the majority of selected proteins. Therefore, this method could be used to interrogate proteome alterations such as a novel protein, fusion protein, or any other change that affects molecular mass, isoelectric point, and/or quantity of a protein.
Collapse
Affiliation(s)
- Martin C Ruebelt
- Product Safety Center, Monsanto Company, 800 North Lindbergh Boulevard, St. Louis, Missouri 63167, USA.
| | | | | | | | | | | | | | | |
Collapse
|
9
|
Abstract
L’adoption à grande échelle des cultures transgéniques depuis dix ans a soulevé de nombreuses questions quant aux impacts possibles de ces nouvelles lignées végétales sur les écosystèmes agricoles et naturels. Des questions ont été soulevées, en particulier, sur le devenir des transgènes dans le milieu et sur une possible « pollution » du patrimoine génétique des organismes vivants à l’échelle des écosystèmes. Après une énumération des impacts environnementaux associés aux végétaux transgéniques, cet article de synthèse dresse un aperçu des connaissances actuelles sur le devenir – ou la migration – des transgènes dans le milieu. Les phénomènes d’hybridation et d’introgression génique en direction d’espèces ou de lignées apparentées sont d’abord abordés, après quoi sont considérés les phénomènes de transfert horizontal des transgènes en direction d’organismes non apparentés. Un article complémentaire publié dans ce même numéro traite de l’impact environnemental des protéines recombinantes encodées par les transgènes (Michaud 2005).
Collapse
|
10
|
Abstract
Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution-its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.
Collapse
Affiliation(s)
- Anne E Hall
- Howard Hughes Medical Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | | | | |
Collapse
|
11
|
Abstract
MITEs (Miniature inverted-repeat transposable elements) are reminiscence of non-autonomous DNA (class II) elements, which are distinguished from other transposable elements by their small size, short terminal inverted repeats (TIRs), high copy numbers, genic preference, and DNA sequence identity among family members. Although MITEs were first discovered in plants and still actively reshaping genomes, they have been isolated from a wide range of eukaryotic organisms. MITEs can be divided into Tourist-like, Stowaway-like, and pogo-like groups, according to similarities of their TIRs and TSDs (target site duplications). In despite of several models to explain the origin and amplification of MITEs, their mechanisms of transposition and accumulation in eukaryotic genomes remain poorly understood owing to insufficient experimental data. The unique properties of MITEs have been exploited as useful genetic tools for plant genome analysis. Utilization of MITEs as effective and informative genomic markers and potential application of MITEs in plants systematic, phylogenetic, and genetic studies are discussed.
Collapse
Affiliation(s)
- Ying Feng
- Agriculture and Biotechnology College, Zhejiang University, Hangzhou 310029, China.
| |
Collapse
|
12
|
|
13
|
Graham MA, Silverstein KAT, Cannon SB, VandenBosch KA. Computational identification and characterization of novel genes from legumes. PLANT PHYSIOLOGY 2004; 135:1179-97. [PMID: 15266052 PMCID: PMC519039 DOI: 10.1104/pp.104.037531] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2003] [Revised: 04/01/2004] [Accepted: 04/03/2004] [Indexed: 05/18/2023]
Abstract
The Fabaceae, the third largest family of plants and the source of many crops, has been the target of many genomic studies. Currently, only the grasses surpass the legumes for the number of publicly available expressed sequence tags (ESTs). The quantity of sequences from diverse plants enables the use of computational approaches to identify novel genes in specific taxa. We used BLAST algorithms to compare unigene sets from Medicago truncatula, Lotus japonicus, and soybean (Glycine max and Glycine soja) to nonlegume unigene sets, to GenBank's nonredundant and EST databases, and to the genomic sequences of rice (Oryza sativa) and Arabidopsis. As a working definition, putatively legume-specific genes had no sequence homology, below a specified threshold, to publicly available sequences of nonlegumes. Using this approach, 2,525 legume-specific EST contigs were identified, of which less than three percent had clear homology to previously characterized legume genes. As a first step toward predicting function, related sequences were clustered to build motifs that could be searched against protein databases. Three families of interest were more deeply characterized: F-box related proteins, Pro-rich proteins, and Cys cluster proteins (CCPs). Of particular interest were the >300 CCPs, primarily from nodules or seeds, with predicted similarity to defensins. Motif searching also identified several previously unknown CCP-like open reading frames in Arabidopsis. Evolutionary analyses of the genomic sequences of several CCPs in M. truncatula suggest that this family has evolved by local duplications and divergent selection.
Collapse
Affiliation(s)
- Michelle A Graham
- Department of Plant Biology, University of Minnesota, St. Paul, Minnesota 55108, USA
| | | | | | | |
Collapse
|
14
|
Aert R, Sági L, Volckaert G. Gene content and density in banana ( Musa acuminata) as revealed by genomic sequencing of BAC clones. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 109:129-39. [PMID: 14985976 DOI: 10.1007/s00122-004-1603-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2003] [Accepted: 01/20/2004] [Indexed: 05/20/2023]
Abstract
The complete sequence of Musa acuminata bacterial artificial chromosome (BAC) clones is presented and, consequently, the first analysis of the banana genome organization. One clone (MuH9) is 82,723 bp long with an overall G+C content of 38.2%. Twelve putative protein-coding sequences were identified, representing a gene density of one per 6.9 kb, which is slightly less than that previously reported for Arabidopsis but similar to rice. One coding sequence was identified as a partial M. acuminata malate synthase, while the remaining sequences showed a similarity to predicted or hypothetical proteins identified in genome sequence data. A second BAC clone (MuG9) is 73,268 bp long with an overall G+C content of 38.5%. Only seven putative coding regions were discovered, representing a gene density of only one gene per 10.5 kb, which is strikingly lower than that of the first BAC. One coding sequence showed significant homology to the soybean ribonucleotide reductase (large subunit). A transition point between coding regions and repeated sequences was found at approximately 45 kb, separating the coding upstream BAC end from its downstream end that mainly contained transposon-like sequences and regions similar to known repetitive sequences of M. acuminata. This gene organization resembles Gramineae genome sequences, where genes are clustered in gene-rich regions separated by gene-poor DNA containing abundant transposons.
Collapse
Affiliation(s)
- R Aert
- Laboratory of Gene Technology, Katholieke Universiteit Leuven, Kasteelpark Arenberg 21, 3001 Leuven, Belgium.
| | | | | |
Collapse
|
15
|
Bernier SP, Silo-Suh L, Woods DE, Ohman DE, Sokol PA. Comparative analysis of plant and animal models for characterization of Burkholderia cepacia virulence. Infect Immun 2003; 71:5306-13. [PMID: 12933878 PMCID: PMC187319 DOI: 10.1128/iai.71.9.5306-5313.2003] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A simple alfalfa model was developed as an alternative infection model for virulence studies of the Burkholderia cepacia complex. Symptoms of disease were observed in wounded alfalfa seedlings within 7 days following inoculation of 10(1) to 10(5) CFU of most strains of the B. cepacia complex. Strains from seven genomovars of the B. cepacia complex were tested for virulence in the alfalfa model, and the degree of virulence was generally similar in strains belonging to the same genomovar. Strains of Burkholderia multivorans and some strains of Burkholderia stabilis did not cause symptoms of disease in alfalfa seedlings. Representative strains were also tested for virulence using the rat agar bead model. Most of the strains tested were able to establish chronic lung infections; B. stabilis strains were the exception. Most of the strains that were virulent in the alfalfa infection model were also virulent in the lung infection model. The B. cepacia genomovar III mutants K56pvdA::tp and K56-H15 were significantly less virulent in the alfalfa infection model than their parent strain. Therefore, this alfalfa infection model may be a useful tool for assessing virulence of strains of the B. cepacia complex and identifying new virulence-associated genes.
Collapse
Affiliation(s)
- Steve P Bernier
- Department of Microbiology and Infectious Diseases, University of Calgary Health Sciences Center, Calgary, Alberta T2N 4N1, Canada
| | | | | | | | | |
Collapse
|
16
|
Abstract
Co-option occurs when natural selection finds new uses for existing traits, including genes, organs, and other body structures. Genes can be co-opted to generate developmental and physiological novelties by changing their patterns of regulation, by changing the functions of the proteins they encode, or both. This often involves gene duplication followed by specialization of the resulting paralogous genes into particular functions. A major role for gene co-option in the evolution of development has long been assumed, and many recent comparative developmental and genomic studies have lent support to this idea. Although there is relatively less known about the molecular basis of co-option events involving developmental pathways, much can be drawn from well-studied examples of the co-option of structural proteins. Here, we summarize several case studies of both structural gene and developmental genetic circuit co-option and discuss how co-option may underlie major episodes of adaptive change in multicellular organisms. We also examine the phenomenon of intraspecific variability in gene expression patterns, which we propose to be one form of material for the co-option process. We integrate this information with recent models of gene family evolution to provide a framework for understanding the origin of co-optive evolution and the mechanisms by which natural selection promotes evolutionary novelty by inventing new uses for the genetic toolkit.
Collapse
Affiliation(s)
- John R True
- Department of Ecology and Evolution, State University of New York at Stony Brook, 11794-5245, USA.
| | | |
Collapse
|
17
|
Georgi LL, Wang Y, Reighard GL, Mao L, Wing RA, Abbott AG. Comparison of peach and Arabidopsis genomic sequences: fragmentary conservation of gene neighborhoods. Genome 2003; 46:268-76. [PMID: 12723043 DOI: 10.1139/g03-004] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We examined the degree of conservation of gene order in two plant species, Prunus persica (peach) and Arabidopsis thaliana (thale cress), whose lineages diverged more than 90 million years ago. In the three peach genomic regions studied, segments with a gene order congruent with A. thaliana were short (two to three genes in length); and for any peach region, corresponding segments were found in diverse locations in the A. thaliana genome. At the gene level and lower, the A. thaliana sequence was enormously useful for identifying likely coding regions in peach sequences and in determining their intron-exon structure. The peach BAC sequence data reported here contained a BLAST-detectable putative coding sequence an average of every 7 kb, and the peach introns identified in this study were, on average, almost twice the length of the corresponding introns in A. thaliana.
Collapse
Affiliation(s)
- Laura L Georgi
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, U.S.A.
| | | | | | | | | | | |
Collapse
|
18
|
Abstract
The multitude of metabolites found in living organisms and the calculated, unexpected small number of genes identified during genome sequencing projects discomfit biologists. Several processes on the transcription and translation level lead to the formation of isoenzymes and can therefore explain at least parts of this surprising result. However, poor enzyme specificity may also contribute to metabolome diversity. In former studies, when enzymes were isolated from natural sources, impure protein preparations were hold responsible for broad enzyme specificity. Nowadays, highly purified enzymes are available by molecular biological methods such as heterologous expression in host organisms and they can be thoroughly analyzed. During biochemical analysis of heterologously expressed enzymes poor specificity was observed for enzymes involved in fruit ripening, e.g. in flavour and color formation. Surprisingly broad specificity was shown for the reactants in the case of alcohol acyl-CoA transferase, O-methyltransferase, glucosyltransferase, P450 monooxygenases as well as polyketide synthases and for the product in the case of monoterpene synthases. Literature data confirm the assumption of limited specificity for enzymes involved in metabolism and bioformation of secondary metabolites. It is concluded that metabolome diversity is caused by low enzyme specificity but availability of suitable substrates due to compartmentation has also taken into account.
Collapse
Affiliation(s)
- Wilfried Schwab
- Lehrstuhl für Lebensmittelchemie, Universität Würzburg, Am Hubland, D-97074, Würzburg, Germany.
| |
Collapse
|
19
|
Blanc G, Hokamp K, Wolfe KH. A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res 2003; 13:137-44. [PMID: 12566392 PMCID: PMC420368 DOI: 10.1101/gr.751803] [Citation(s) in RCA: 513] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2002] [Accepted: 11/12/2002] [Indexed: 12/30/2022]
Abstract
The Arabidopsis genome contains numerous large duplicated chromosomal segments, but the different approaches used in previous analyses led to different interpretations regarding the number and timing of ancestral large-scale duplication events. Here, using more appropriate methodology and a more recent version of the genome sequence annotation, we investigate the scale and timing of segmental duplications in Arabidopsis. We used protein sequence similarity searches to detect duplicated blocks in the genome, used the level of synonymous substitution between duplicated genes to estimate the relative ages of the blocks containing them, and analyzed the degree of overlap between adjacent duplicated blocks. We conclude that the Arabidopsis lineage underwent at least two distinct episodes of duplication. One was a polyploidy that occurred much more recently than estimated previously, before the Arabidopsis/Brassica rapa split and probably during the early emergence of the crucifer family (24-40 Mya). An older set of duplicated blocks was formed after the monocot/dicot divergence, and the relatively low level of overlap among these blocks indicates that at least some of them are remnants of a larger duplication such as a polyploidy or aneuploidy.
Collapse
Affiliation(s)
- Guillaume Blanc
- Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland
| | | | | |
Collapse
|
20
|
Reiss B. Homologous recombination and gene targeting in plant cells. INTERNATIONAL REVIEW OF CYTOLOGY 2003; 228:85-139. [PMID: 14667043 DOI: 10.1016/s0074-7696(03)28003-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Gene targeting has become an indispensable tool for functional genomics in yeast and mouse; however, this tool is still missing in plants. This review discusses the gene targeting problem in plants in the context of general knowledge on recombination and gene targeting. An overview on the history of gene targeting is followed by a general introduction to genetic recombination of bacteria, yeast, and vertebrates. This abridged discussion serves as a guide to the following sections, which cover plant-specific aspects of recombination assay systems, the mechanism of recombination, plant recombination genes, the relationship of recombination to the environment, approaches to stimulate homologous recombination and gene targeting, and a description of two plant systems, the moss Physcomitrella patens and the chloroplast, that naturally have high efficiencies of gene targeting. The review concludes with a discussion of alternatives to gene targeting.
Collapse
Affiliation(s)
- Bernd Reiss
- Max-Planck-Institut für Zuechtungsforschung, Carl-von-Linne-Weg 10, D-50829 Köln, Germany
| |
Collapse
|
21
|
Henikoff S, Comai L. Single-nucleotide mutations for plant functional genomics. ANNUAL REVIEW OF PLANT BIOLOGY 2003; 54:375-401. [PMID: 14502996 DOI: 10.1146/annurev.arplant.54.031902.135009] [Citation(s) in RCA: 114] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
In the present genomics era, powerful reverse-genetic strategies are needed to elucidate gene and protein function in the context of a whole organism. However, most current techniques lack the generality and high-throughput potential of descriptive genomic approaches, such as those that rely on microarray hybridization. For example, in plant research, effective insertional mutagenesis and transgenic methods are limited to relatively few species or are inefficient. Fortunately, single-nucleotide changes can be induced in any plant by using traditional chemical mutagens, and progress has been made in efficiently detecting changes. Because base substitutions in proteins provide allelic series, and not just knockouts, this strategy can yield refined insights into protein function. Here, we review recent progress that has been made in genome-wide screening for point mutations and natural variation in plants. Its general applicability leads to the expectation that traditional mutagenesis followed by high-throughput detection will become increasingly important for plant functional genomics.
Collapse
Affiliation(s)
- Steven Henikoff
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA.
| | | |
Collapse
|
22
|
Lynch M, Conery JS. The evolutionary demography of duplicate genes. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2003; 3:35-44. [PMID: 12836683 DOI: 10.1007/978-94-010-0263-9_4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Although gene duplication has generally been viewed as a necessary source of material for the origin of evolutionary novelties, the rates of origin, loss, and preservation of gene duplicates are not well understood. Applying steady-state demographic techniques to the age distributions of duplicate genes censused in seven completely sequenced genomes, we estimate the average rate of duplication of a eukaryotic gene to be on the order of 0.01/ gene/million years, which is of the same order of magnitude as the mutation rate per nucleotide site. However, the average half-life of duplicate genes is relatively small, on the order of 4.0 million years. Significant interspecific variation in these rates appears to be responsible for differences in species-specific genome sizes that arise as a consequence of a quasi-equilibrium birth-death process. Most duplicated genes experience a brief period of relaxed selection early in their history and a minority exhibit the signature of directional selection, but those that survive more than a few million years eventually experience strong purifying selection. Thus, although most theoretical work on the gene-duplication process has focused on issues related to adaptive evolution, the origin of a new function appears to be a very rare fate for a duplicate gene. A more significant role of the duplication process may be the generation of microchromosomal rearrangements through reciprocal silencing of alternative copies, which can lead to the passive origin of post-zygotic reproductive barriers in descendant lineages of incipient species.
Collapse
Affiliation(s)
- Michael Lynch
- Dept. of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | | |
Collapse
|
23
|
Conner AJ, Glare TR, Nap JP. The release of genetically modified crops into the environment. Part II. Overview of ecological risk assessment. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2003; 33:19-46. [PMID: 12943539 DOI: 10.1046/j.0960-7412.2002.001607.x] [Citation(s) in RCA: 146] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Despite numerous future promises, there is a multitude of concerns about the impact of GM crops on the environment. Key issues in the environmental assessment of GM crops are putative invasiveness, vertical or horizontal gene flow, other ecological impacts, effects on biodiversity and the impact of presence of GM material in other products. These are all highly interdisciplinary and complex issues. A crucial component for a proper assessment is defining the appropriate baseline for comparison and decision. For GM crops, the best and most appropriately defined reference point is the impact of plants developed by traditional breeding. The latter is an integral and accepted part of agriculture. In many instances, the putative impacts identified for GM crops are very similar to the impacts of new cultivars derived from traditional breeding. When assessing GM crops relative to existing cultivars, the increased knowledge base underpinning the development of GM crops will provide greater confidence in the assurances plant science can give on the risks of releasing such crops.
Collapse
Affiliation(s)
- Anthony J Conner
- New Zealand Institute for Crop & Food Research Ltd, Private Bag 4704, Christchurch, New Zealand.
| | | | | |
Collapse
|
24
|
Coruzzi GM. Primary N-assimilation into Amino Acids in Arabidopsis. THE ARABIDOPSIS BOOK 2003; 2:e0010. [PMID: 22303223 PMCID: PMC3243381 DOI: 10.1199/tab.0010] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Affiliation(s)
- Gloria M Coruzzi
- Department of Biology, New York University, 100 Washington Square East, Room 1009 Main Building, New York, NY 10003
| |
Collapse
|
25
|
Zhao Q, Chae YK, Markley JL. NMR solution structure of ATTp, an Arabidopsis thaliana trypsin inhibitor. Biochemistry 2002; 41:12284-96. [PMID: 12369816 DOI: 10.1021/bi025702a] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The three-dimensional structure of the precursor form of the Arabidopsis thaliana trypsin inhibitor (ATT(p), GenBank entry Z46816), a 68-residue (approximately 7.5 kDa) rapeseed class proteinase inhibitor, has been determined in solution at pH 5.0 and 25 degrees C by multinuclear magnetic resonance spectroscopy. The protein contains one alpha-helix and two strands of antiparallel beta-sheet, with a type IV beta-turn connecting the two strands. The alpha-helix and the inhibitory loop are connected to the beta-sheet through three disulfide bridges; a fourth disulfide bridge connects the N- and C-termini. The overall structural topology of ATT(p) is similar to those of the sweet tasting protein brazzein (rmsd of 3.0 A) and the antifungal protein Rs-Afp1 [a knottin protein from radish (Raphanus sativus), rmsd of 2.7 A]. The precursor segment in ATT(p) is disordered, as visualized by the final 20-conformer ensemble and as confirmed by (15)N heteronuclear NOE analysis. The overall fold of ATT(p) is distinct from those of other classes of serine proteinase inhibitors except in the inhibitor loop; therefore, it represents a new inhibitor fold.
Collapse
Affiliation(s)
- Qin Zhao
- Department of Biochemistry, University of Wisconsin-Madison, 53706, USA
| | | | | |
Collapse
|
26
|
Affiliation(s)
- Stephen K Herbert
- Department of Botany, University of Wyoming, Aven Nelson 215, Laramie, WY 82071-3165, USA.
| |
Collapse
|
27
|
Orsel M, Krapp A, Daniel-Vedele F. Analysis of the NRT2 nitrate transporter family in Arabidopsis. Structure and gene expression. PLANT PHYSIOLOGY 2002; 129:886-96. [PMID: 12068127 PMCID: PMC161709 DOI: 10.1104/pp.005280] [Citation(s) in RCA: 158] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2002] [Accepted: 03/11/2002] [Indexed: 05/18/2023]
Abstract
Nitrate is an essential element for plant growth, both as a primary nutrient in the nitrogen assimilation pathway and as an important signal for plant development. The uptake of nitrate from the soil and its translocation throughout the plant has been the subject of intensive physiological and molecular studies. Using a reverse genetic approach, the AtNRT2.1 gene has been shown to be involved in the inducible component of the high-affinity nitrate transport system in Arabidopsis. The Arabidopsis Genome Initiative has released nearly the whole genome sequence of Arabidopsis, allowing the identification of a small NRT2 multigene family in this species. Thus, we investigated the phylogenetic relationship between NRT2 proteins belonging to several kingdoms and compared the structure of the different members of the Arabidopsis family. We analyzed, by semiquantitative reverse transcriptase-polymerase chain reaction, the expression pattern of each gene depending on plant organ and development or nutritional status, and compared the relative level of each gene by real-time polymerase chain reaction. We also evaluated the significance of each paralog on the basis of the relative levels of gene expression. The results are discussed in relation with distinct roles for the individual members of the AtNRT2 family.
Collapse
Affiliation(s)
- Mathilde Orsel
- Unité de la Nutrition Azotée des Plantes, Institut National de la Recherche Agronomique, Route de St. Cyr, F-78026 Versailles cedex, France
| | | | | |
Collapse
|
28
|
Yu J, Hu S, Wang J, Wong GKS, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, Cao M, Liu J, Sun J, Tang J, Chen Y, Huang X, Lin W, Ye C, Tong W, Cong L, Geng J, Han Y, Li L, Li W, Hu G, Huang X, Li W, Li J, Liu Z, Li L, Liu J, Qi Q, Liu J, Li L, Li T, Wang X, Lu H, Wu T, Zhu M, Ni P, Han H, Dong W, Ren X, Feng X, Cui P, Li X, Wang H, Xu X, Zhai W, Xu Z, Zhang J, He S, Zhang J, Xu J, Zhang K, Zheng X, Dong J, Zeng W, Tao L, Ye J, Tan J, Ren X, Chen X, He J, Liu D, Tian W, Tian C, Xia H, Bao Q, Li G, Gao H, Cao T, Wang J, Zhao W, Li P, Chen W, Wang X, Zhang Y, Hu J, Wang J, Liu S, Yang J, Zhang G, Xiong Y, Li Z, Mao L, Zhou C, Zhu Z, Chen R, Hao B, Zheng W, Chen S, Guo W, Li G, Liu S, Tao M, Wang J, Zhu L, Yuan L, Yang H. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 2002; 296:79-92. [PMID: 11935017 DOI: 10.1126/science.1068037] [Citation(s) in RCA: 1775] [Impact Index Per Article: 80.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact 20-nucleotide oligomer repeats, and most of the transposons were in the intergenic regions between genes. Although 80.6% of predicted Arabidopsis thaliana genes had a homolog in rice, only 49.4% of predicted rice genes had a homolog in A. thaliana. The large proportion of rice genes with no recognizable homologs is due to a gradient in the GC content of rice coding sequences.
Collapse
MESH Headings
- Arabidopsis/genetics
- Base Composition
- Computational Biology
- Contig Mapping
- DNA Transposable Elements
- DNA, Intergenic
- DNA, Plant/chemistry
- DNA, Plant/genetics
- Databases, Nucleic Acid
- Exons
- Gene Duplication
- Genes, Plant
- Genome, Plant
- Genomics
- Introns
- Molecular Sequence Data
- Oryza/genetics
- Plant Proteins/chemistry
- Plant Proteins/genetics
- Polymorphism, Genetic
- Repetitive Sequences, Nucleic Acid
- Sequence Analysis, DNA
- Sequence Homology, Nucleic Acid
- Software
- Species Specificity
- Synteny
Collapse
Affiliation(s)
- Jun Yu
- Beijing Genomics Institute/Center of Genomics and Bioinformatics, Chinese Academy of Sciences, Beijing 101300, China
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Abstract
euGenes is a genome information system and database that provides a common summary of eukaryote genes and genomes, at http://iubio.bio.indiana.edu/eugenes/. Seven popular genomes are included: human, mouse, fruitfly, Caenorhabditis elegans worm, Saccharomyces yeast, Arabidopsis mustard weed and zebrafish, with more planned. This information, automatically extracted and updated from several source databases, offers features not readily available through other genome databases to bioscientists looking for gene relationships across organisms. The database describes 150 000 known, predicted and orphan genes, using consistent gene names along with their homologies and associations with a standard vocabulary of molecular functions, cell locations and biological processes. Usable whole-genome maps including features, chromosome locations and molecular data integration are available, as are options to retrieve sequences from these genomes. Search and retrieval methods for these data are easy to use and efficient, allowing one to ask combined questions of sequence features, protein functions and other gene attributes, and fetch results in reports, computable tabular outputs or bulk database forms. These summarized data are useful for integration in other projects, such as gene expression databases. euGenes provides an extensible, flexible genome information system for many organisms.
Collapse
Affiliation(s)
- Donald G Gilbert
- Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
30
|
Masson PH, Tasaka M, Morita MT, Guan C, Chen R, Boonsirichai K. Arabidopsis thaliana: A Model for the Study of Root and Shoot Gravitropism. THE ARABIDOPSIS BOOK 2002; 1:e0043. [PMID: 22303208 PMCID: PMC3243349 DOI: 10.1199/tab.0043] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
UNLABELLED For most plants, shoots grow upward and roots grow downward. These growth patterns illustrate the ability for plant organs to guide their growth at a specified angle from the gravity vector (gravitropism). They allow shoots to grow upward toward light, where they can photosynthesize, and roots to grow downward into the soil, where they can anchor the plant as well as take up water and mineral ions.Gravitropism involves several steps organized in a specific response pathway. These include the perception of a gravistimulus (reorientation within the gravity field), the transduction of this mechanical stimulus into a physiological signal, the transmission of this signal from the site of sensing to the site of response, and a curvature-response which allows the organ tip to resume growth at a predefined set angle from the gravity vector.The primary sites for gravity sensing are located in the cap for roots, and in the endodermis for shoots. The curvature response occurs in the elongation zones for each organ. Upon gravistimulation, a gradient of auxin appears to be generated across the stimulated organ, and be transmitted to the site of response where it promotes a differential growth response. Therefore, while the gravity-induced auxin gradient has to be transmitted from the cap to the elongation zones in roots, there is no need for a longitudinal transport in shoots, as sites for gravity sensing and response overlap in this organ.A combination of molecular genetics, physiology, biochemistry and cell biology, coupled with the utilization of Arabidopsis thaliana as a model system, have recently allowed the identification of a number of molecules involved in the regulation of each phase of gravitropism in shoots and roots of higher plants. In this review, we attempt to summarize the results of these experiments, and we conclude by comparing the molecular and physiological mechanisms that underlie gravitropism in these organs. ABBREVIATIONS GSPA: gravitational set point angle; IAA: indole-3-acetic acid; NAA: 1-naphthalene acetic acid; NPA: 1-N-naphthylphthalamic acid; 2,4-D: 2,4-dichlorphenoxy acetic acid; TIBA: 2,3,5-triiodobenzoic acid.
Collapse
|
31
|
Haas BJ, Volfovsky N, Town CD, Troukhan M, Alexandrov N, Feldmann KA, Flavell RB, White O, Salzberg SL. Full-length messenger RNA sequences greatly improve genome annotation. Genome Biol 2002; 3:RESEARCH0029. [PMID: 12093376 PMCID: PMC116726 DOI: 10.1186/gb-2002-3-6-research0029] [Citation(s) in RCA: 126] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2001] [Revised: 03/14/2002] [Accepted: 04/19/2002] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Annotation of eukaryotic genomes is a complex endeavor that requires the integration of evidence from multiple, often contradictory, sources. With the ever-increasing amount of genome sequence data now available, methods for accurate identification of large numbers of genes have become urgently needed. In an effort to create a set of very high-quality gene models, we used the sequence of 5,000 full-length gene transcripts from Arabidopsis to re-annotate its genome. We have mapped these transcripts to their exact chromosomal locations and, using alignment programs, have created gene models that provide a reference set for this organism. RESULTS Approximately 35% of the transcripts indicated that previously annotated genes needed modification, and 5% of the transcripts represented newly discovered genes. We also discovered that multiple transcription initiation sites appear to be much more common than previously known, and we report numerous cases of alternative mRNA splicing. We include a comparison of different alignment software and an analysis of how the transcript data improved the previously published annotation. CONCLUSIONS Our results demonstrate that sequencing of large numbers of full-length transcripts followed by computational mapping greatly improves identification of the complete exon structures of eukaryotic genes. In addition, we are able to find numerous introns in the untranslated regions of the genes.
Collapse
Affiliation(s)
- Brian J Haas
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2001. [PMCID: PMC2448396 DOI: 10.1002/cfg.59] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|