1
|
Carlson CH, Choi Y, Chan AP, Town CD, Smart LB. Nonadditive gene expression is correlated with nonadditive phenotypic expression in interspecific triploid hybrids of willow (Salix spp.). G3 (Bethesda) 2022; 12:6472355. [PMID: 35100357 PMCID: PMC9210313 DOI: 10.1093/g3journal/jkab436] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 12/10/2021] [Indexed: 12/13/2022]
Abstract
Many studies have highlighted the complex and diverse basis for heterosis in inbred crops. Despite the lack of a consensus model, it is vital that we turn our attention to understanding heterosis in undomesticated, heterozygous, and polyploid species, such as willow (Salix spp.). Shrub willow is a dedicated energy crop bred to be fast-growing and high yielding on marginal land without competing with food crops. A trend in willow breeding is the consistent pattern of heterosis in triploids produced from crosses between diploid and tetraploid species. Here, we test whether differentially expressed genes are associated with heterosis in triploid families derived from diploid Salix purpurea, diploid Salix viminalis, and tetraploid Salix miyabeana parents. Three biological replicates of shoot tips from all family progeny and parents were collected after 12 weeks in the greenhouse and RNA extracted for RNA-Seq analysis. This study provides evidence that nonadditive patterns of gene expression are correlated with nonadditive phenotypic expression in interspecific triploid hybrids of willow. Expression-level dominance was most correlated with heterosis for biomass yield traits and was highly enriched for processes involved in starch and sucrose metabolism. In addition, there was a global dosage effect of parent alleles in triploid hybrids, with expression proportional to copy number variation. Importantly, differentially expressed genes between family parents were most predictive of heterosis for both field and greenhouse collected traits. Altogether, these data will be used to progress models of heterosis to complement the growing genomic resources available for the improvement of heterozygous perennial bioenergy crops.
Collapse
Affiliation(s)
- Craig H Carlson
- Horticulture Section, School of Integrative Plant Science, Cornell University, Geneva, NY 14456, USA
| | - Yongwook Choi
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | - Agnes P Chan
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | | | - Lawrence B Smart
- Horticulture Section, School of Integrative Plant Science, Cornell University, Geneva, NY 14456, USA
| |
Collapse
|
2
|
Richards VP, Velsko IM, Alam MT, Zadoks RN, Manning SD, Pavinski Bitar PD, Hassler HB, Crestani C, Springer GH, Probert BM, Town CD, Stanhope MJ. Population Gene Introgression and High Genome Plasticity for the Zoonotic Pathogen Streptococcus agalactiae. Mol Biol Evol 2019; 36:2572-2590. [PMID: 31350563 PMCID: PMC6805230 DOI: 10.1093/molbev/msz169] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 04/04/2019] [Accepted: 07/18/2019] [Indexed: 01/06/2023] Open
Abstract
The influence that bacterial adaptation (or niche partitioning) within species has on gene spillover and transmission among bacterial populations occupying different niches is not well understood. Streptococcus agalactiae is an important bacterial pathogen that has a taxonomically diverse host range making it an excellent model system to study these processes. Here, we analyze a global set of 901 genome sequences from nine diverse host species to advance our understanding of these processes. Bayesian clustering analysis delineated 12 major populations that closely aligned with niches. Comparative genomics revealed extensive gene gain/loss among populations and a large pan genome of 9,527 genes, which remained open and was strongly partitioned among niches. As a result, the biochemical characteristics of 11 populations were highly distinctive (significantly enriched). Positive selection was detected and biochemical characteristics of the dispensable genes under selection were enriched in ten populations. Despite the strong gene partitioning, phylogenomics detected gene spillover. In particular, tetracycline resistance (which likely evolved in the human-associated population) from humans to bovine, canines, seals, and fish, demonstrating how a gene selected in one host can ultimately be transmitted into another, and biased transmission from humans to bovines was confirmed with a Bayesian migration analysis. Our findings show high bacterial genome plasticity acting in balance with selection pressure from distinct functional requirements of niches that is associated with an extensive and highly partitioned dispensable genome, likely facilitating continued and expansive adaptation.
Collapse
Affiliation(s)
- Vincent P Richards
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC
| | - Irina M Velsko
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC
- Department of Archaeogenetics, Max Planck Institute for the Science of Human History, Jena, Germany
| | - Md Tauqeer Alam
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC
- Department of Pathobiology, College of Veterinary Medicine, University of Illinois Urbana-Champaign, Urbana, IL
| | - Ruth N Zadoks
- Pentlands Science Park, Moredun Research Institute, Penicuik, United Kingdom
- Institute for Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Shannon D Manning
- Department of Microbiology and Molecular Genetics, Michigan State University, E. Lansing, MI
| | - Paulina D Pavinski Bitar
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY
| | - Hayley B Hassler
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC
| | - Chiara Crestani
- Pentlands Science Park, Moredun Research Institute, Penicuik, United Kingdom
| | - Garrett H Springer
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC
| | - Brett M Probert
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC
| | | | - Michael J Stanhope
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY
| |
Collapse
|
3
|
Bose ME, Shrivastava S, He J, Nelson MI, Bera J, Fedorova N, Halpin R, Town CD, Lorenzi HA, Amedeo P, Gupta N, Noyola DE, Videla C, Kok T, Buys A, Venter M, Vabret A, Cordey S, Henrickson KJ. Sequencing and analysis of globally obtained human parainfluenza viruses 1 and 3 genomes. PLoS One 2019; 14:e0220057. [PMID: 31318956 PMCID: PMC6638977 DOI: 10.1371/journal.pone.0220057] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Accepted: 07/08/2019] [Indexed: 12/16/2022] Open
Abstract
Human Parainfluenza viruses (HPIV) type 1 and 3 are important causes of respiratory tract infections in young children globally. HPIV infections do not confer complete protective immunity so reinfections occur throughout life. Since no effective vaccine is available for the two virus subtypes, comprehensive understanding of HPIV-1 and HPIV-3 genetic and epidemic features is important for diagnosis, prevention, and treatment of HPIV-1 and HPIV-3 infections. Relatively few whole genome sequences are available for both HPIV-1 and HPIV-3 viruses, so our study sought to provide whole genome sequences from multiple countries to further the understanding of the global diversity of HPIV at a whole-genome level. We collected HPIV-1 and HPIV-3 samples and isolates from Argentina, Australia, France, Mexico, South Africa, Switzerland, and USA from the years 2003-2011 and sequenced the genomes of 40 HPIV-1 and 75 HPIV-3 viruses with Sanger and next-generation sequencing with the Ion Torrent, Illumina, and 454 platforms. Phylogenetic analysis showed that the HPIV-1 genome is evolving at an estimated rate of 4.97 × 10-4 mutations/site/year (95% highest posterior density 4.55 × 10-4 to 5.38 × 10-4) and the HPIV-3 genome is evolving at a similar rate (3.59 × 10-4 mutations/site/year, 95% highest posterior density 3.26 × 10-4 to 3.94 × 10-4). There were multiple genetically distinct lineages of both HPIV-1 and 3 circulating on a global scale. Further surveillance and whole-genome sequencing are greatly needed to better understand the spatial dynamics of these important respiratory viruses in humans.
Collapse
Affiliation(s)
- Michael E. Bose
- Midwest Respiratory Virus Program, Medical College of Wisconsin, Milwaukee, WI, United States of America
| | | | - Jie He
- Midwest Respiratory Virus Program, Medical College of Wisconsin, Milwaukee, WI, United States of America
| | - Martha I. Nelson
- Fogarty International Center, National Institutes of Health, Bethesda, MD, United States of America
| | - Jayati Bera
- J. Craig Venter Institute, Rockville, MD, United States of America
| | - Nadia Fedorova
- J. Craig Venter Institute, Rockville, MD, United States of America
| | - Rebecca Halpin
- J. Craig Venter Institute, Rockville, MD, United States of America
| | | | | | - Paolo Amedeo
- J. Craig Venter Institute, Rockville, MD, United States of America
| | - Neha Gupta
- J. Craig Venter Institute, Rockville, MD, United States of America
| | - Daniel E. Noyola
- Departamento de Microbiología, Facultad de Medicina, Universidad Autónoma de San Luis Potosí, San Luis Potosí, Mexico
| | - Cristina Videla
- Clinical Virology Laboratory, Centro de Educación Médica e Investigaciones Clínicas (CEMIC) University Hospital, Buenos Aires, Argentina
| | - Tuckweng Kok
- School of Molecular and Biomedical Science, University of Adelaide, Adelaide, Australia
| | - Amelia Buys
- Centre for Respiratory Diseases and Meningitis, National Institute for Communicable Diseases, Sandringham, South Africa
| | - Marietjie Venter
- Centre for Respiratory Diseases and Meningitis, National Institute for Communicable Diseases, Sandringham, South Africa
- Zoonotic, arbo and respiratory virus program, Department Medical Virology, University of Pretoria, Pretoria, South Africa
| | - Astrid Vabret
- Normandie Université, Caen, France
- Groupe de Recherche sur l'Adaptation Microbienne (GRAM), Université de Caen, Caen, France
- Laboratoire de Virologie, Centre Hospitalier Universitaire de Caen, Caen, France
| | - Samuel Cordey
- Division of Infectious Diseases and Laboratory of Virology, University of Geneva Hospitals, Geneva, Switzerland
| | - Kelly J. Henrickson
- Midwest Respiratory Virus Program, Medical College of Wisconsin, Milwaukee, WI, United States of America
| |
Collapse
|
4
|
Carlson CH, Choi Y, Chan AP, Serapiglia MJ, Town CD, Smart LB. Dominance and Sexual Dimorphism Pervade the Salix purpurea L. Transcriptome. Genome Biol Evol 2017; 9:2377-2394. [PMID: 28957462 PMCID: PMC5622329 DOI: 10.1093/gbe/evx174] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/31/2017] [Indexed: 12/24/2022] Open
Abstract
The heritability of gene expression is critical in understanding heterosis and is dependent on allele-specific regulation by local and remote factors in the genome. We used RNA-Seq to test whether variation in gene expression among F1 and F2 intraspecific Salix purpurea progeny is attributable to cis- and trans-regulatory divergence. We assessed the mode of inheritance based on gene expression levels and allele-specific expression for F1 and F2 intraspecific progeny in two distinct tissue types: shoot tip and stem internode. In addition, we explored sexually dimorphic patterns of inheritance and regulatory divergence among F1 progeny individuals. We show that in S. purpurea intraspecific crosses, gene expression inheritance largely exhibits a maternal dominant pattern, regardless of tissue type or pedigree. A significantly greater number of cis- and trans-regulated genes coincided with upregulation of the maternal parent allele in the progeny, irrespective of the magnitude, whereas the paternal allele was higher expressed for genes showing cis × trans or compensatory regulation. Importantly, consistent with previous genetic mapping results for sex in shrub willow, we have delimited sex-biased gene expression to a 2 Mb pericentromeric region on S. purpurea chr15 and further refined the sex determination region. Altogether, our results offer insight into the inheritance of gene expression in S. purpurea as well as evidence of sexually dimorphic expression which may have contributed to the evolution of dioecy in Salix.
Collapse
Affiliation(s)
- Craig H. Carlson
- Horticulture Section, School of Integrative Plant Science, Cornell University, New York State Agricultural Experiment Station, Geneva, New York 14456 USA
| | - Yongwook Choi
- J. Craig Venter Institute, Rockville, Maryland 20850 USA
| | - Agnes P. Chan
- J. Craig Venter Institute, Rockville, Maryland 20850 USA
| | - Michelle J. Serapiglia
- Horticulture Section, School of Integrative Plant Science, Cornell University, New York State Agricultural Experiment Station, Geneva, New York 14456 USA
| | | | - Lawrence B. Smart
- Horticulture Section, School of Integrative Plant Science, Cornell University, New York State Agricultural Experiment Station, Geneva, New York 14456 USA
| |
Collapse
|
5
|
Cheng CY, Krishnakumar V, Chan AP, Thibaud-Nissen F, Schobel S, Town CD. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. Plant J 2017; 89:789-804. [PMID: 27862469 DOI: 10.1111/tpj.13415] [Citation(s) in RCA: 564] [Impact Index Per Article: 80.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 10/27/2016] [Accepted: 11/03/2016] [Indexed: 05/20/2023]
Abstract
The flowering plant Arabidopsis thaliana is a dicot model organism for research in many aspects of plant biology. A comprehensive annotation of its genome paves the way for understanding the functions and activities of all types of transcripts, including mRNA, the various classes of non-coding RNA, and small RNA. The TAIR10 annotation update had a profound impact on Arabidopsis research but was released more than 5 years ago. Maintaining the accuracy of the annotation continues to be a prerequisite for future progress. Using an integrative annotation pipeline, we assembled tissue-specific RNA-Seq libraries from 113 datasets and constructed 48 359 transcript models of protein-coding genes in eleven tissues. In addition, we annotated various classes of non-coding RNA including microRNA, long intergenic RNA, small nucleolar RNA, natural antisense transcript, small nuclear RNA, and small RNA using published datasets and in-house analytic results. Altogether, we identified 635 novel protein-coding genes, 508 novel transcribed regions, 5178 non-coding RNAs, and 35 846 small RNA loci that were formerly unannotated. Analysis of the splicing events and RNA-Seq based expression profiles revealed the landscapes of gene structures, untranslated regions, and splicing activities to be more intricate than previously appreciated. Furthermore, we present 692 uniformly expressed housekeeping genes, 43% of whose human orthologs are also housekeeping genes. This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further our understanding of the biological processes of this plant model but also of other species.
Collapse
Affiliation(s)
- Chia-Yi Cheng
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Vivek Krishnakumar
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Agnes P Chan
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, US National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Seth Schobel
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Christopher D Town
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| |
Collapse
|
6
|
Krishnakumar V, Contrino S, Cheng CY, Belyaeva I, Ferlanti ES, Miller JR, Vaughn MW, Micklem G, Town CD, Chan AP. ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery. Plant Cell Physiol 2017; 58:e4. [PMID: 28013278 DOI: 10.1093/pcp/pcw200] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Accepted: 11/11/2016] [Indexed: 05/08/2023]
Abstract
ThaleMine (https://apps.araport.org/thalemine/) is a comprehensive data warehouse that integrates a wide array of genomic information of the model plant Arabidopsis thaliana. The data collection currently includes the latest structural and functional annotation from the Araport11 update, the Col-0 genome sequence, RNA-seq and array expression, co-expression, protein interactions, homologs, pathways, publications, alleles, germplasm and phenotypes. The data are collected from a wide variety of public resources. Users can browse gene-specific data through Gene Report pages, identify and create gene lists based on experiments or indexed keywords, and run GO enrichment analysis to investigate the biological significance of selected gene sets. Developed by the Arabidopsis Information Portal project (Araport, https://www.araport.org/), ThaleMine uses the InterMine software framework, which builds well-structured data, and provides powerful data query and analysis functionality. The warehoused data can be accessed by users via graphical interfaces, as well as programmatically via web-services. Here we describe recent developments in ThaleMine including new features and extensions, and discuss future improvements. InterMine has been broadly adopted by the model organism research community including nematode, rat, mouse, zebrafish, budding yeast, the modENCODE project, as well as being used for human data. ThaleMine is the first InterMine developed for a plant model. As additional new plant InterMines are developed by the legume and other plant research communities, the potential of cross-organism integrative data analysis will be further enabled.
Collapse
Affiliation(s)
- Vivek Krishnakumar
- Plant Genomics, J. Craig Venter Institute, Medical Center Dr, Rockville, MD, USA
| | - Sergio Contrino
- Department of Genetics, Cambridge Systems Biology Centre, Tennis Court Road, Cambridge, UK
| | - Chia-Yi Cheng
- Plant Genomics, J. Craig Venter Institute, Medical Center Dr, Rockville, MD, USA
| | - Irina Belyaeva
- Plant Genomics, J. Craig Venter Institute, Medical Center Dr, Rockville, MD, USA
| | - Erik S Ferlanti
- Life Sciences Computing, Texas Advanced Computing Center, 10100 Burnet Rd, Austin, TX, USA
| | - Jason R Miller
- Plant Genomics, J. Craig Venter Institute, Medical Center Dr, Rockville, MD, USA
| | - Matthew W Vaughn
- Life Sciences Computing, Texas Advanced Computing Center, 10100 Burnet Rd, Austin, TX, USA
| | - Gos Micklem
- Department of Genetics, Cambridge Systems Biology Centre, Tennis Court Road, Cambridge, UK
| | - Christopher D Town
- Plant Genomics, J. Craig Venter Institute, Medical Center Dr, Rockville, MD, USA
| | - Agnes P Chan
- Plant Genomics, J. Craig Venter Institute, Medical Center Dr, Rockville, MD, USA
| |
Collapse
|
7
|
Abstract
Optical mapping has been widely used to improve de novo plant genome assemblies, including rice, maize, Medicago, Amborella, tomato and wheat, with more genomes in the pipeline. Optical mapping provides long-range information of the genome and can more easily identify large structural variations. The ability of optical mapping to assay long single DNA molecules nicely complements short-read sequencing which is more suitable for the identification of small and short-range variants. Direct use of optical mapping to study population-level genetic diversity is currently limited to microbial strain typing and human diversity studies. Nonetheless, optical mapping shows great promise in the study of plant trait development, domestication and polyploid evolution. Here we review the current applications and future prospects of optical mapping in the field of plant comparative genomics.
Collapse
Affiliation(s)
- Haibao Tang
- Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian People's Republic of China ; School of Plant Sciences, iPlant Collaborative, University of Arizona, Tucson, AZ 85721 USA
| | - Eric Lyons
- School of Plant Sciences, iPlant Collaborative, University of Arizona, Tucson, AZ 85721 USA
| | | |
Collapse
|
8
|
Zhang X, Rosen BD, Tang H, Krishnakumar V, Town CD. Polyribosomal RNA-Seq reveals the decreased complexity and diversity of the Arabidopsis translatome. PLoS One 2015; 10:e0117699. [PMID: 25706651 PMCID: PMC4338112 DOI: 10.1371/journal.pone.0117699] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 12/30/2014] [Indexed: 01/01/2023] Open
Abstract
Recent RNA-seq studies reveal that the transcriptomes in animals and plants are more complex than previously thought, leading to the inclusion of many more splice isoforms in annotated genomes. However, it is possible that a significant proportion of the transcripts are spurious isoforms that do not contribute to functional proteins. One of the current hypotheses is that commonly used mRNA extraction methods isolate both pre-mature (nuclear) mRNA and mature (cytoplasmic) mRNA, and these incompletely spliced pre-mature mRNAs may contribute to a large proportion of these spurious transcripts. To investigate this, we compared a traditional RNA-seq dataset (total RNA-seq) and a ribosome-bound RNA-seq dataset (polyribosomal RNA-seq) from Arabidopsis thaliana. An integrative framework that combined de novo assembly and genome-guided assembly was applied to reconstruct transcriptomes for the two datasets. Up to 44.8% of the de novo assembled transcripts in total RNA-seq sample were of low abundance, whereas only 0.09% in polyribosomal RNA-seq de novo assembly were of low abundance. The final round of assembly using PASA (Program to Assemble Spliced Alignments) resulted in more transcript assemblies in the total RNA-seq than those in polyribosomal sample. Comparison of alternative splicing (AS) patterns between total and polyribosomal RNA-seq showed a significant difference (G-test, p-value<0.01) in intron retention events: 46.4% of AS events in the total sample were intron retention, whereas only 23.5% showed evidence of intron retention in the polyribosomal sample. It is likely that a large proportion of retained introns in total RNA-seq result from incompletely spliced pre-mature mRNA. Overall, this study demonstrated that polyribosomal RNA-seq technology decreased the complexity and diversity of the coding transcriptome by eliminating pre-mature mRNAs, especially those of low abundance.
Collapse
Affiliation(s)
- Xingtan Zhang
- J. Craig Venter Institute, Rockville, Maryland, United States of America
- School of Life Sciences, Chongqing University, Chongqing, China
| | - Benjamin D. Rosen
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Haibao Tang
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Vivek Krishnakumar
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Christopher D. Town
- J. Craig Venter Institute, Rockville, Maryland, United States of America
- * E-mail:
| |
Collapse
|
9
|
Richards VP, Palmer SR, Pavinski Bitar PD, Qin X, Weinstock GM, Highlander SK, Town CD, Burne RA, Stanhope MJ. Phylogenomics and the dynamic genome evolution of the genus Streptococcus. Genome Biol Evol 2015; 6:741-53. [PMID: 24625962 PMCID: PMC4007547 DOI: 10.1093/gbe/evu048] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The genus Streptococcus comprises important pathogens that have a severe impact on human health and are responsible for substantial economic losses to agriculture. Here, we utilize 46 Streptococcus genome sequences (44 species), including eight species sequenced here, to provide the first genomic level insight into the evolutionary history and genetic basis underlying the functional diversity of all major groups of this genus. Gene gain/loss analysis revealed a dynamic pattern of genome evolution characterized by an initial period of gene gain followed by a period of loss, as the major groups within the genus diversified. This was followed by a period of genome expansion associated with the origins of the present extant species. The pattern is concordant with an emerging view that genomes evolve through a dynamic process of expansion and streamlining. A large proportion of the pan-genome has experienced lateral gene transfer (LGT) with causative factors, such as relatedness and shared environment, operating over different evolutionary scales. Multiple gene ontology terms were significantly enriched for each group, and mapping terms onto the phylogeny showed that those corresponding to genes born on branches leading to the major groups represented approximately one-fifth of those enriched. Furthermore, despite the extensive LGT, several biochemical characteristics have been retained since group formation, suggesting genomic cohesiveness through time, and that these characteristics may be fundamental to each group. For example, proteolysis: mitis group; urea metabolism: salivarius group; carbohydrate metabolism: pyogenic group; and transcription regulation: bovis group.
Collapse
Affiliation(s)
- Vincent P Richards
- Department of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University
| | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
Medicago truncatula, a close relative of alfalfa (Medicago sativa), is a model legume used for studying symbiotic nitrogen fixation, mycorrhizal interactions and legume genomics. J. Craig Venter Institute (JCVI; formerly TIGR) has been involved in M. truncatula genome sequencing and annotation since 2002 and has maintained a web-based resource providing data to the community for this entire period. The website (http://www.MedicagoGenome.org) has seen major updates in the past year, where it currently hosts the latest version of the genome (Mt4.0), associated data and legacy project information, presented to users via a rich set of open-source tools. A JBrowse-based genome browser interface exposes tracks for visualization. Mutant gene symbols originally assembled and curated by the Frugoli lab are now hosted at JCVI and tie into our community annotation interface, Medicago EuCAP (to be integrated soon with our implementation of WebApollo). Literature pertinent to M. truncatula is indexed and made searchable via the Textpresso search engine. The site also implements MedicMine, an instance of InterMine that offers interconnectivity with other plant 'mines' such as ThaleMine and PhytoMine, and other model organism databases (MODs). In addition to these new features, we continue to provide keyword- and locus identifier-based searches served via a Chado-backed Tripal Instance, a BLAST search interface and bulk downloads of data sets from the iPlant Data Store (iDS). Finally, we maintain an E-mail helpdesk, facilitated by a JIRA issue tracking system, where we receive and respond to questions about the website and requests for specific data sets from the community.
Collapse
Affiliation(s)
- Vivek Krishnakumar
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| | - Maria Kim
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| | - Benjamin D Rosen
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| | - Svetlana Karamycheva
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| | - Shelby L Bidwell
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| | - Haibao Tang
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| | - Christopher D Town
- Plant Genomics Group, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA
| |
Collapse
|
11
|
Krishnakumar V, Hanlon MR, Contrino S, Ferlanti ES, Karamycheva S, Kim M, Rosen BD, Cheng CY, Moreira W, Mock SA, Stubbs J, Sullivan JM, Krampis K, Miller JR, Micklem G, Vaughn M, Town CD. Araport: the Arabidopsis information portal. Nucleic Acids Res 2014; 43:D1003-9. [PMID: 25414324 PMCID: PMC4383980 DOI: 10.1093/nar/gku1200] [Citation(s) in RCA: 138] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
The Arabidopsis Information Portal (https://www.araport.org) is a new online resource for plant biology research. It houses the Arabidopsis thaliana genome sequence and associated annotation. It was conceived as a framework that allows the research community to develop and release ‘modules’ that integrate, analyze and visualize Arabidopsis data that may reside at remote sites. The current implementation provides an indexed database of core genomic information. These data are made available through feature-rich web applications that provide search, data mining, and genome browser functionality, and also by bulk download and web services. Araport uses software from the InterMine and JBrowse projects to expose curated data from TAIR, GO, BAR, EBI, UniProt, PubMed and EPIC CoGe. The site also hosts ‘science apps,’ developed as prototypes for community modules that use dynamic web pages to present data obtained on-demand from third-party servers via RESTful web services. Designed for sustainability, the Arabidopsis Information Portal strategy exploits existing scientific computing infrastructure, adopts a practical mixture of data integration technologies and encourages collaborative enhancement of the resource by its user community.
Collapse
Affiliation(s)
| | - Matthew R Hanlon
- Texas Advanced Computing Center, The University of Texas, Austin, TX 78758, USA
| | - Sergio Contrino
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, UK
| | - Erik S Ferlanti
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | | | - Maria Kim
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | - Benjamin D Rosen
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | - Chia-Yi Cheng
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | - Walter Moreira
- Texas Advanced Computing Center, The University of Texas, Austin, TX 78758, USA
| | - Stephen A Mock
- Texas Advanced Computing Center, The University of Texas, Austin, TX 78758, USA
| | - Joseph Stubbs
- Texas Advanced Computing Center, The University of Texas, Austin, TX 78758, USA
| | - Julie M Sullivan
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, UK
| | | | - Jason R Miller
- Plant Genomics, J. Craig Venter Institute, Rockville, MD 20850, USA
| | - Gos Micklem
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge CB2 1QR, UK
| | - Matthew Vaughn
- Texas Advanced Computing Center, The University of Texas, Austin, TX 78758, USA
| | | |
Collapse
|
12
|
Chalhoub B, Denoeud F, Liu S, Parkin IAP, Tang H, Wang X, Chiquet J, Belcram H, Tong C, Samans B, Corréa M, Da Silva C, Just J, Falentin C, Koh CS, Le Clainche I, Bernard M, Bento P, Noel B, Labadie K, Alberti A, Charles M, Arnaud D, Guo H, Daviaud C, Alamery S, Jabbari K, Zhao M, Edger PP, Chelaifa H, Tack D, Lassalle G, Mestiri I, Schnel N, Le Paslier MC, Fan G, Renault V, Bayer PE, Golicz AA, Manoli S, Lee TH, Thi VHD, Chalabi S, Hu Q, Fan C, Tollenaere R, Lu Y, Battail C, Shen J, Sidebottom CHD, Wang X, Canaguier A, Chauveau A, Bérard A, Deniot G, Guan M, Liu Z, Sun F, Lim YP, Lyons E, Town CD, Bancroft I, Wang X, Meng J, Ma J, Pires JC, King GJ, Brunel D, Delourme R, Renard M, Aury JM, Adams KL, Batley J, Snowdon RJ, Tost J, Edwards D, Zhou Y, Hua W, Sharpe AG, Paterson AH, Guan C, Wincker P. Plant genetics. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science 2014. [PMID: 25146293 DOI: 10.1126/science.125343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/21/2023]
Abstract
Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.
Collapse
Affiliation(s)
- Boulos Chalhoub
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France.
| | - France Denoeud
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France. Université d'Evry Val d'Essone, UMR 8030, CP5706, Evry, France. Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France
| | - Shengyi Liu
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Isobel A P Parkin
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK S7N 0X2, Canada.
| | - Haibao Tang
- J. Craig Venter Institute, Rockville, MD 20850, USA. Center for Genomics and Biotechnology, Fujian Agriculture and Forestry, University, Fuzhou 350002, Fujian Province, China
| | - Xiyin Wang
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA. Center of Genomics and Computational Biology, School of Life Sciences, Hebei United University, Tangshan, Hebei 063000, China
| | - Julien Chiquet
- Laboratoire de Mathématiques et Modélisation d'Evry-UMR 8071 CNRS/Université d'Evry val d'Essonne-USC INRA, Evry, France
| | - Harry Belcram
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Chaobo Tong
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Birgit Samans
- Department of Plant Breeding, Research Center for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany
| | - Margot Corréa
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Corinne Da Silva
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Jérémy Just
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Cyril Falentin
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Chu Shin Koh
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9, Canada
| | - Isabelle Le Clainche
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Maria Bernard
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Pascal Bento
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Benjamin Noel
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Karine Labadie
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Adriana Alberti
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Mathieu Charles
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Dominique Arnaud
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Hui Guo
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Christian Daviaud
- Laboratory for Epigenetics and Environment, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91000 Evry, France
| | - Salman Alamery
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Kamel Jabbari
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France. Cologne Center for Genomics, University of Cologne, Weyertal 115b, 50931 Köln, Germany
| | - Meixia Zhao
- Department of Agronomy, Purdue University, WSLR Building B018, West Lafayette, IN 47907, USA
| | - Patrick P Edger
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
| | - Houda Chelaifa
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - David Tack
- Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Gilles Lassalle
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Imen Mestiri
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Nicolas Schnel
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Marie-Christine Le Paslier
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Guangyi Fan
- Beijing Genome Institute-Shenzhen, Shenzhen 518083, China
| | - Victor Renault
- Fondation Jean Dausset-Centre d'Étude du Polymorphisme Humain, 27 rue Juliette Dodu, 75010 Paris, France
| | - Philippe E Bayer
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Agnieszka A Golicz
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Sahana Manoli
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Tae-Ho Lee
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Vinh Ha Dinh Thi
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Smahane Chalabi
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Qiong Hu
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Chuchuan Fan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Reece Tollenaere
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Yunhai Lu
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Christophe Battail
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Jinxiong Shen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | | | - Xinfa Wang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Aurélie Canaguier
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Aurélie Chauveau
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Aurélie Bérard
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Gwenaëlle Deniot
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Mei Guan
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China
| | - Zhongsong Liu
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China
| | - Fengming Sun
- Beijing Genome Institute-Shenzhen, Shenzhen 518083, China
| | - Yong Pyo Lim
- Molecular Genetics and Genomics Laboratory, Department of Horticulture, Chungnam National University, Daejeon-305764, South Korea
| | - Eric Lyons
- School of Plant Sciences, iPlant Collaborative, University of Arizona, Tucson, AZ, USA
| | | | - Ian Bancroft
- Department of Biology, University of York, Wentworth Way, Heslington, York YO10 5DD, UK
| | - Xiaowu Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jinling Meng
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Jianxin Ma
- Department of Agronomy, Purdue University, WSLR Building B018, West Lafayette, IN 47907, USA
| | - J Chris Pires
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Graham J King
- Southern Cross Plant Science, Southern Cross University, Lismore, NSW 2480, Australia
| | - Dominique Brunel
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Régine Delourme
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Michel Renard
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Jean-Marc Aury
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Keith L Adams
- Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Jacqueline Batley
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia. School of Plant Biology, University of Western Australia, WA 6009, Australia
| | - Rod J Snowdon
- Department of Plant Breeding, Research Center for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany
| | - Jorg Tost
- Laboratory for Epigenetics and Environment, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91000 Evry, France
| | - David Edwards
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia. School of Plant Biology, University of Western Australia, WA 6009, Australia.
| | - Yongming Zhou
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
| | - Wei Hua
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China.
| | - Andrew G Sharpe
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9, Canada.
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA.
| | - Chunyun Guan
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China.
| | - Patrick Wincker
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France. Université d'Evry Val d'Essone, UMR 8030, CP5706, Evry, France. Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France.
| |
Collapse
|
13
|
Chalhoub B, Denoeud F, Liu S, Parkin IAP, Tang H, Wang X, Chiquet J, Belcram H, Tong C, Samans B, Corréa M, Da Silva C, Just J, Falentin C, Koh CS, Le Clainche I, Bernard M, Bento P, Noel B, Labadie K, Alberti A, Charles M, Arnaud D, Guo H, Daviaud C, Alamery S, Jabbari K, Zhao M, Edger PP, Chelaifa H, Tack D, Lassalle G, Mestiri I, Schnel N, Le Paslier MC, Fan G, Renault V, Bayer PE, Golicz AA, Manoli S, Lee TH, Thi VHD, Chalabi S, Hu Q, Fan C, Tollenaere R, Lu Y, Battail C, Shen J, Sidebottom CHD, Wang X, Canaguier A, Chauveau A, Bérard A, Deniot G, Guan M, Liu Z, Sun F, Lim YP, Lyons E, Town CD, Bancroft I, Wang X, Meng J, Ma J, Pires JC, King GJ, Brunel D, Delourme R, Renard M, Aury JM, Adams KL, Batley J, Snowdon RJ, Tost J, Edwards D, Zhou Y, Hua W, Sharpe AG, Paterson AH, Guan C, Wincker P. Plant genetics. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science 2014; 345:950-3. [PMID: 25146293 DOI: 10.1126/science.1253435] [Citation(s) in RCA: 1354] [Impact Index Per Article: 135.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.
Collapse
Affiliation(s)
- Boulos Chalhoub
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France.
| | - France Denoeud
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France. Université d'Evry Val d'Essone, UMR 8030, CP5706, Evry, France. Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France
| | - Shengyi Liu
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Isobel A P Parkin
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK S7N 0X2, Canada.
| | - Haibao Tang
- J. Craig Venter Institute, Rockville, MD 20850, USA. Center for Genomics and Biotechnology, Fujian Agriculture and Forestry, University, Fuzhou 350002, Fujian Province, China
| | - Xiyin Wang
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA. Center of Genomics and Computational Biology, School of Life Sciences, Hebei United University, Tangshan, Hebei 063000, China
| | - Julien Chiquet
- Laboratoire de Mathématiques et Modélisation d'Evry-UMR 8071 CNRS/Université d'Evry val d'Essonne-USC INRA, Evry, France
| | - Harry Belcram
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Chaobo Tong
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Birgit Samans
- Department of Plant Breeding, Research Center for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany
| | - Margot Corréa
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Corinne Da Silva
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Jérémy Just
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Cyril Falentin
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Chu Shin Koh
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9, Canada
| | - Isabelle Le Clainche
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Maria Bernard
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Pascal Bento
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Benjamin Noel
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Karine Labadie
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Adriana Alberti
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Mathieu Charles
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Dominique Arnaud
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Hui Guo
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Christian Daviaud
- Laboratory for Epigenetics and Environment, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91000 Evry, France
| | - Salman Alamery
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Kamel Jabbari
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France. Cologne Center for Genomics, University of Cologne, Weyertal 115b, 50931 Köln, Germany
| | - Meixia Zhao
- Department of Agronomy, Purdue University, WSLR Building B018, West Lafayette, IN 47907, USA
| | - Patrick P Edger
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
| | - Houda Chelaifa
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - David Tack
- Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Gilles Lassalle
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Imen Mestiri
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Nicolas Schnel
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Marie-Christine Le Paslier
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Guangyi Fan
- Beijing Genome Institute-Shenzhen, Shenzhen 518083, China
| | - Victor Renault
- Fondation Jean Dausset-Centre d'Étude du Polymorphisme Humain, 27 rue Juliette Dodu, 75010 Paris, France
| | - Philippe E Bayer
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Agnieszka A Golicz
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Sahana Manoli
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Tae-Ho Lee
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Vinh Ha Dinh Thi
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Smahane Chalabi
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Qiong Hu
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Chuchuan Fan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Reece Tollenaere
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia
| | - Yunhai Lu
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Christophe Battail
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Jinxiong Shen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | | | - Xinfa Wang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Aurélie Canaguier
- Institut National de Recherche Agronomique (INRA)/Université d'Evry Val d'Essone, Unité de Recherche en Génomique Végétale, UMR1165, Organization and Evolution of Plant Genomes, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Aurélie Chauveau
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Aurélie Bérard
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Gwenaëlle Deniot
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Mei Guan
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China
| | - Zhongsong Liu
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China
| | - Fengming Sun
- Beijing Genome Institute-Shenzhen, Shenzhen 518083, China
| | - Yong Pyo Lim
- Molecular Genetics and Genomics Laboratory, Department of Horticulture, Chungnam National University, Daejeon-305764, South Korea
| | - Eric Lyons
- School of Plant Sciences, iPlant Collaborative, University of Arizona, Tucson, AZ, USA
| | | | - Ian Bancroft
- Department of Biology, University of York, Wentworth Way, Heslington, York YO10 5DD, UK
| | - Xiaowu Wang
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jinling Meng
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Jianxin Ma
- Department of Agronomy, Purdue University, WSLR Building B018, West Lafayette, IN 47907, USA
| | - J Chris Pires
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Graham J King
- Southern Cross Plant Science, Southern Cross University, Lismore, NSW 2480, Australia
| | - Dominique Brunel
- INRA, Etude du Polymorphisme des Génomes Végétaux, US1279, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91057 Evry, France
| | - Régine Delourme
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Michel Renard
- INRA, Institut de Génétique, Environnement et Protection des Plantes (IGEPP) UMR1349, BP35327, 35653 Le Rheu Cedex, France
| | - Jean-Marc Aury
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France
| | - Keith L Adams
- Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Jacqueline Batley
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia. School of Plant Biology, University of Western Australia, WA 6009, Australia
| | - Rod J Snowdon
- Department of Plant Breeding, Research Center for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany
| | - Jorg Tost
- Laboratory for Epigenetics and Environment, Centre National de Génotypage, CEA-IG, 2 rue Gaston Crémieux, 91000 Evry, France
| | - David Edwards
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, St. Lucia, QLD 4072, Australia. School of Plant Biology, University of Western Australia, WA 6009, Australia.
| | - Yongming Zhou
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
| | - Wei Hua
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture of People's Republic of China, Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan 430062, China.
| | - Andrew G Sharpe
- National Research Council Canada, 110 Gymnasium Place, Saskatoon, SK S7N 0W9, Canada.
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA.
| | - Chunyun Guan
- College of Agronomy, Hunan Agricultural University, Changsha 410128, China.
| | - Patrick Wincker
- Commissariat à l'Energie Atomique (CEA), Institut de Génomique (IG), Genoscope, BP5706, 91057 Evry, France. Université d'Evry Val d'Essone, UMR 8030, CP5706, Evry, France. Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France.
| |
Collapse
|
14
|
Moghe GD, Hufnagel DE, Tang H, Xiao Y, Dworkin I, Town CD, Conner JK, Shiu SH. Consequences of Whole-Genome Triplication as Revealed by Comparative Genomic Analyses of the Wild Radish Raphanus raphanistrum and Three Other Brassicaceae Species. Plant Cell 2014; 26:1925-1937. [PMID: 24876251 PMCID: PMC4079359 DOI: 10.1105/tpc.114.124297] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2014] [Revised: 03/30/2014] [Accepted: 04/30/2014] [Indexed: 05/18/2023]
Abstract
Polyploidization events are frequent among flowering plants, and the duplicate genes produced via such events contribute significantly to plant evolution. We sequenced the genome of wild radish (Raphanus raphanistrum), a Brassicaceae species that experienced a whole-genome triplication event prior to diverging from Brassica rapa. Despite substantial gene gains in these two species compared with Arabidopsis thaliana and Arabidopsis lyrata, ∼70% of the orthologous groups experienced gene losses in R. raphanistrum and B. rapa, with most of the losses occurring prior to their divergence. The retained duplicates show substantial divergence in sequence and expression. Based on comparison of A. thaliana and R. raphanistrum ortholog floral expression levels, retained radish duplicates diverged primarily via maintenance of ancestral expression level in one copy and reduction of expression level in others. In addition, retained duplicates differed significantly from genes that reverted to singleton state in function, sequence composition, expression patterns, network connectivity, and rates of evolution. Using these properties, we established a statistical learning model for predicting whether a duplicate would be retained postpolyploidization. Overall, our study provides new insights into the processes of plant duplicate loss, retention, and functional divergence and highlights the need for further understanding factors controlling duplicate gene fate.
Collapse
Affiliation(s)
- Gaurav D Moghe
- Programs in Genetics and Quantitative Biology, Michigan State University, East Lansing, Michigan 48824
| | - David E Hufnagel
- Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824
| | - Haibao Tang
- J. Craig Venter Institute, Rockville, Maryland 20850
| | - Yongli Xiao
- National Institute of Allergy and Infectious Disease, National Institute of Health, Bethesda, Maryland 20892
| | - Ian Dworkin
- Department of Zoology, Michigan State University, East Lansing, Michigan 48824
| | | | - Jeffrey K Conner
- Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824 Kellogg Biological Station, Michigan State University, East Lansing, Michigan 48824
| | - Shin-Han Shiu
- Programs in Genetics and Quantitative Biology, Michigan State University, East Lansing, Michigan 48824 Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824
| |
Collapse
|
15
|
Tang H, Krishnakumar V, Bidwell S, Rosen B, Chan A, Zhou S, Gentzbittel L, Childs KL, Yandell M, Gundlach H, Mayer KFX, Schwartz DC, Town CD. An improved genome release (version Mt4.0) for the model legume Medicago truncatula. BMC Genomics 2014. [PMID: 24767513 DOI: 10.1186/1471-216415-312] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/24/2023] Open
Abstract
BACKGROUND Medicago truncatula, a close relative of alfalfa, is a preeminent model for studying nitrogen fixation, symbiosis, and legume genomics. The Medicago sequencing project began in 2003 with the goal to decipher sequences originated from the euchromatic portion of the genome. The initial sequencing approach was based on a BAC tiling path, culminating in a BAC-based assembly (Mt3.5) as well as an in-depth analysis of the genome published in 2011. RESULTS Here we describe a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG. The ALLPATHS-LG scaffolds were anchored onto the pseudomolecules on the basis of alignments to both the optical map and the genotyping-by-sequencing (GBS) map. The Mt4.0 pseudomolecules encompass ~360 Mb of actual sequences spanning 390 Mb of which ~330 Mb align perfectly with the optical map, presenting a drastic improvement over the BAC-based Mt3.5 which only contained 70% sequences (~250 Mb) of the current version. Most of the sequences and genes that previously resided on the unanchored portion of Mt3.5 have now been incorporated into the Mt4.0 pseudomolecules, with the exception of ~28 Mb of unplaced sequences. With regard to gene annotation, the genome has been re-annotated through our gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences. A total of 50,894 genes (31,661 high confidence and 19,233 low confidence) are included in Mt4.0 which overlapped with ~82% of the gene loci annotated in Mt3.5. Of the remaining genes, 14% of the Mt3.5 genes have been deprecated to an "unsupported" status and 4% are absent from the Mt4.0 predictions. CONCLUSIONS Mt4.0 and its associated resources, such as genome browsers, BLAST-able datasets and gene information pages, can be found on the JCVI Medicago web site (http://www.jcvi.org/medicago). The assembly and annotation has been deposited in GenBank (BioProject: PRJNA10791). The heavily curated chromosomal sequences and associated gene models of Medicago will serve as a better reference for legume biology and comparative genomics.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | - Christopher D Town
- J, Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, USA.
| |
Collapse
|
16
|
Tang H, Krishnakumar V, Bidwell S, Rosen B, Chan A, Zhou S, Gentzbittel L, Childs KL, Yandell M, Gundlach H, Mayer KFX, Schwartz DC, Town CD. An improved genome release (version Mt4.0) for the model legume Medicago truncatula. BMC Genomics 2014; 15:312. [PMID: 24767513 PMCID: PMC4234490 DOI: 10.1186/1471-2164-15-312] [Citation(s) in RCA: 258] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Accepted: 04/22/2014] [Indexed: 11/10/2022] Open
Abstract
Background Medicago truncatula, a close relative of alfalfa, is a preeminent model for studying nitrogen fixation, symbiosis, and legume genomics. The Medicago sequencing project began in 2003 with the goal to decipher sequences originated from the euchromatic portion of the genome. The initial sequencing approach was based on a BAC tiling path, culminating in a BAC-based assembly (Mt3.5) as well as an in-depth analysis of the genome published in 2011. Results Here we describe a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG. The ALLPATHS-LG scaffolds were anchored onto the pseudomolecules on the basis of alignments to both the optical map and the genotyping-by-sequencing (GBS) map. The Mt4.0 pseudomolecules encompass ~360 Mb of actual sequences spanning 390 Mb of which ~330 Mb align perfectly with the optical map, presenting a drastic improvement over the BAC-based Mt3.5 which only contained 70% sequences (~250 Mb) of the current version. Most of the sequences and genes that previously resided on the unanchored portion of Mt3.5 have now been incorporated into the Mt4.0 pseudomolecules, with the exception of ~28 Mb of unplaced sequences. With regard to gene annotation, the genome has been re-annotated through our gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences. A total of 50,894 genes (31,661 high confidence and 19,233 low confidence) are included in Mt4.0 which overlapped with ~82% of the gene loci annotated in Mt3.5. Of the remaining genes, 14% of the Mt3.5 genes have been deprecated to an “unsupported” status and 4% are absent from the Mt4.0 predictions. Conclusions Mt4.0 and its associated resources, such as genome browsers, BLAST-able datasets and gene information pages, can be found on the JCVI Medicago web site (http://www.jcvi.org/medicago). The assembly and annotation has been deposited in GenBank (BioProject: PRJNA10791). The heavily curated chromosomal sequences and associated gene models of Medicago will serve as a better reference for legume biology and comparative genomics.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | - Christopher D Town
- J, Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, USA.
| |
Collapse
|
17
|
Tang H, Krishnakumar V, Bidwell S, Rosen B, Chan A, Zhou S, Gentzbittel L, Childs KL, Yandell M, Gundlach H, Mayer KFX, Schwartz DC, Town CD. An improved genome release (version Mt4.0) for the model legume Medicago truncatula. BMC Genomics 2014. [PMID: 24767513 DOI: 10.1186/1471‐2164‐15‐312] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Medicago truncatula, a close relative of alfalfa, is a preeminent model for studying nitrogen fixation, symbiosis, and legume genomics. The Medicago sequencing project began in 2003 with the goal to decipher sequences originated from the euchromatic portion of the genome. The initial sequencing approach was based on a BAC tiling path, culminating in a BAC-based assembly (Mt3.5) as well as an in-depth analysis of the genome published in 2011. RESULTS Here we describe a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG. The ALLPATHS-LG scaffolds were anchored onto the pseudomolecules on the basis of alignments to both the optical map and the genotyping-by-sequencing (GBS) map. The Mt4.0 pseudomolecules encompass ~360 Mb of actual sequences spanning 390 Mb of which ~330 Mb align perfectly with the optical map, presenting a drastic improvement over the BAC-based Mt3.5 which only contained 70% sequences (~250 Mb) of the current version. Most of the sequences and genes that previously resided on the unanchored portion of Mt3.5 have now been incorporated into the Mt4.0 pseudomolecules, with the exception of ~28 Mb of unplaced sequences. With regard to gene annotation, the genome has been re-annotated through our gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences. A total of 50,894 genes (31,661 high confidence and 19,233 low confidence) are included in Mt4.0 which overlapped with ~82% of the gene loci annotated in Mt3.5. Of the remaining genes, 14% of the Mt3.5 genes have been deprecated to an "unsupported" status and 4% are absent from the Mt4.0 predictions. CONCLUSIONS Mt4.0 and its associated resources, such as genome browsers, BLAST-able datasets and gene information pages, can be found on the JCVI Medicago web site (http://www.jcvi.org/medicago). The assembly and annotation has been deposited in GenBank (BioProject: PRJNA10791). The heavily curated chromosomal sequences and associated gene models of Medicago will serve as a better reference for legume biology and comparative genomics.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | - Christopher D Town
- J, Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, USA.
| |
Collapse
|
18
|
Dominguez SR, Shrivastava S, Berglund A, Qian Z, Góes LGB, Halpin RA, Fedorova N, Ransier A, Weston PA, Durigon EL, Jerez JA, Robinson CC, Town CD, Holmes KV. Isolation, propagation, genome analysis and epidemiology of HKU1 betacoronaviruses. J Gen Virol 2014; 95:836-848. [PMID: 24394697 DOI: 10.1099/vir.0.059832-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
From 1 January 2009 to 31 May 2013, 15 287 respiratory specimens submitted to the Clinical Virology Laboratory at the Children's Hospital Colorado were tested for human coronavirus RNA by reverse transcription-PCR. Human coronaviruses HKU1, OC43, 229E and NL63 co-circulated during each of the respiratory seasons but with significant year-to-year variability, and cumulatively accounted for 7.4-15.6 % of all samples tested during the months of peak activity. A total of 79 (0.5 % prevalence) specimens were positive for human betacoronavirus HKU1 RNA. Genotypes HKU1 A and B were both isolated from clinical specimens and propagated on primary human tracheal-bronchial epithelial cells cultured at the air-liquid interface and were neutralized in vitro by human intravenous immunoglobulin and by polyclonal rabbit antibodies to the spike glycoprotein of HKU1. Phylogenetic analysis of the deduced amino acid sequences of seven full-length genomes of Colorado HKU1 viruses and the spike glycoproteins from four additional HKU1 viruses from Colorado and three from Brazil demonstrated remarkable conservation of these sequences with genotypes circulating in Hong Kong and France. Within genotype A, all but one of the Colorado HKU1 sequences formed a unique subclade defined by three amino acid substitutions (W197F, F613Y and S752F) in the spike glycoprotein and exhibited a unique signature in the acidic tandem repeat in the N-terminal region of the nsp3 subdomain. Elucidating the function of and mechanisms responsible for the formation of these varying tandem repeats will increase our understanding of the replication process and pathogenicity of HKU1 and potentially of other coronaviruses.
Collapse
Affiliation(s)
- Samuel R Dominguez
- Departments of Microbiology, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA.,Departments of Pediatrics, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| | - Susmita Shrivastava
- Department of Pathology and Clinical Medicine, Children's Hospital Colorado, 13123 E 16th Ave, Aurora, CO 80045, USA
| | - Andrew Berglund
- Departments of Pediatrics, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| | - Zhaohui Qian
- Departments of Pediatrics, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| | - Luiz Gustavo Bentim Góes
- Interdisciplinary Graduate Program in Biotechnology, University of São Paulo, Av Prof. Lineu Prestes, 2415, ICB-III, Cidade Universitária, CEP: 05508-900, São Paulo, SP - Brazil.,J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Rebecca A Halpin
- Department of Pathology and Clinical Medicine, Children's Hospital Colorado, 13123 E 16th Ave, Aurora, CO 80045, USA
| | - Nadia Fedorova
- Department of Pathology and Clinical Medicine, Children's Hospital Colorado, 13123 E 16th Ave, Aurora, CO 80045, USA
| | - Amy Ransier
- Department of Pathology and Clinical Medicine, Children's Hospital Colorado, 13123 E 16th Ave, Aurora, CO 80045, USA
| | - Philip A Weston
- Departments of Pediatrics, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| | - Edison Luiz Durigon
- Interdisciplinary Graduate Program in Biotechnology, University of São Paulo, Av Prof. Lineu Prestes, 2415, ICB-III, Cidade Universitária, CEP: 05508-900, São Paulo, SP - Brazil.,J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - José Antonio Jerez
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, Av Prof. Lineu Prestes 1374, ICB-II, Cidade Universitária, CEP: 05580-900, São Paulo, SP - Brazil.,J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Christine C Robinson
- Department of Preventive Veterinary Medicine and Animal Health, Faculty of Veterinary Medicine and Animal Science, University of São Paulo, Av. Prof. Dr. Orlando Marques de Paiva, 87, Cidade Universitária, CEP: 05508-270, Sao Paulo, SP - Brazil
| | - Christopher D Town
- Department of Pathology and Clinical Medicine, Children's Hospital Colorado, 13123 E 16th Ave, Aurora, CO 80045, USA
| | - Kathryn V Holmes
- Departments of Microbiology, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| |
Collapse
|
19
|
Duangjit J, Bohanec B, Chan AP, Town CD, Havey MJ. Transcriptome sequencing to produce SNP-based genetic maps of onion. Theor Appl Genet 2013; 126:2093-101. [PMID: 23689743 DOI: 10.1007/s00122-013-2121-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 05/08/2013] [Indexed: 05/10/2023]
Abstract
We used the Roche-454 platform to sequence from normalized cDNA libraries from each of two inbred lines of onion (OH1 and 5225). From approximately 1.6 million reads from each inbred, 27,065 and 33,254 cDNA contigs were assembled from OH1 and 5225, respectively. In total, 3,364 well supported single nucleotide polymorphisms (SNPs) on 1,716 cDNA contigs were identified between these two inbreds. One SNP on each of 1,256 contigs was randomly selected for genotyping. OH1 and 5225 were crossed and 182 gynogenic haploids extracted from hybrid plants were used for SNP mapping. A total of 597 SNPs segregated in the OH1 × 5225 haploid family and a genetic map of ten linkage groups (LOD ≥8) was constructed. Three hundred and thirty-nine of the newly identified SNPs were also mapped using a previously developed segregating family from BYG15-23 × AC43, and 223 common SNPs were used to join the two maps. Because these new SNPs are in expressed regions of the genome and commonly occur among onion germplasms, they will be useful for genetic mapping, gene tagging, marker-aided selection, quality control of seed lots, and fingerprinting of cultivars.
Collapse
Affiliation(s)
- J Duangjit
- Department of Horticulture, University of Wisconsin, Madison, WI 53706, USA
| | | | | | | | | |
Collapse
|
20
|
Haudry A, Platts AE, Vello E, Hoen DR, Leclercq M, Williamson RJ, Forczek E, Joly-Lopez Z, Steffen JG, Hazzouri KM, Dewar K, Stinchcombe JR, Schoen DJ, Wang X, Schmutz J, Town CD, Edger PP, Pires JC, Schumaker KS, Jarvis DE, Mandáková T, Lysak MA, van den Bergh E, Schranz ME, Harrison PM, Moses AM, Bureau TE, Wright SI, Blanchette M. An atlas of over 90,000 conserved noncoding sequences provides insight into crucifer regulatory regions. Nat Genet 2013; 45:891-8. [PMID: 23817568 DOI: 10.1038/ng.2684] [Citation(s) in RCA: 211] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2012] [Accepted: 06/04/2013] [Indexed: 12/17/2022]
Abstract
Despite the central importance of noncoding DNA to gene regulation and evolution, understanding of the extent of selection on plant noncoding DNA remains limited compared to that of other organisms. Here we report sequencing of genomes from three Brassicaceae species (Leavenworthia alabamica, Sisymbrium irio and Aethionema arabicum) and their joint analysis with six previously sequenced crucifer genomes. Conservation across orthologous bases suggests that at least 17% of the Arabidopsis thaliana genome is under selection, with nearly one-quarter of the sequence under selection lying outside of coding regions. Much of this sequence can be localized to approximately 90,000 conserved noncoding sequences (CNSs) that show evidence of transcriptional and post-transcriptional regulation. Population genomics analyses of two crucifer species, A. thaliana and Capsella grandiflora, confirm that most of the identified CNSs are evolving under medium to strong purifying selection. Overall, these CNSs highlight both similarities and several key differences between the regulatory DNA of plants and other species.
Collapse
Affiliation(s)
- Annabelle Haudry
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Srour A, Afzal AJ, Blahut-Beatty L, Hemmati N, Simmonds DH, Li W, Liu M, Town CD, Sharma H, Arelli P, Lightfoot DA. The receptor like kinase at Rhg1-a/Rfs2 caused pleiotropic resistance to sudden death syndrome and soybean cyst nematode as a transgene by altering signaling responses. BMC Genomics 2012; 13:368. [PMID: 22857610 PMCID: PMC3439264 DOI: 10.1186/1471-2164-13-368] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2011] [Accepted: 06/12/2012] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Soybean (Glycine max (L. Merr.)) resistance to any population of Heterodera glycines (I.), or Fusarium virguliforme (Akoi, O'Donnell, Homma & Lattanzi) required a functional allele at Rhg1/Rfs2. H. glycines, the soybean cyst nematode (SCN) was an ancient, endemic, pest of soybean whereas F. virguliforme causal agent of sudden death syndrome (SDS), was a recent, regional, pest. This study examined the role of a receptor like kinase (RLK) GmRLK18-1 (gene model Glyma_18_02680 at 1,071 kbp on chromosome 18 of the genome sequence) within the Rhg1/Rfs2 locus in causing resistance to SCN and SDS. RESULTS A BAC (B73p06) encompassing the Rhg1/Rfs2 locus was sequenced from a resistant cultivar and compared to the sequences of two susceptible cultivars from which 800 SNPs were found. Sequence alignments inferred that the resistance allele was an introgressed region of about 59 kbp at the center of which the GmRLK18-1 was the most polymorphic gene and encoded protein. Analyses were made of plants that were either heterozygous at, or transgenic (and so hemizygous at a new location) with, the resistance allele of GmRLK18-1. Those plants infested with either H. glycines or F. virguliforme showed that the allele for resistance was dominant. In the absence of Rhg4 the GmRLK18-1 was sufficient to confer nearly complete resistance to both root and leaf symptoms of SDS caused by F. virguliforme and provided partial resistance to three different populations of nematodes (mature female cysts were reduced by 30-50%). In the presence of Rhg4 the plants with the transgene were nearly classed as fully resistant to SCN (females reduced to 11% of the susceptible control) as well as SDS. A reduction in the rate of early seedling root development was also shown to be caused by the resistance allele of the GmRLK18-1. Field trials of transgenic plants showed an increase in foliar susceptibility to insect herbivory. CONCLUSIONS The inference that soybean has adapted part of an existing pathogen recognition and defense cascade (H.glycines; SCN and insect herbivory) to a new pathogen (F. virguliforme; SDS) has broad implications for crop improvement. Stable resistance to many pathogens might be achieved by manipulation the genes encoding a small number of pathogen recognition proteins.
Collapse
Affiliation(s)
- Ali Srour
- Department of Molecular Biology, Microbiology and Biochemistry, Southern Illinois University at Carbondale, Carbondale, IL 62901, USA
- Department of Plant Soil and Agricultural Systems, Southern Illinois University at Carbondale, Carbondale, IL 62901-4415, USA
| | - Ahmed J Afzal
- Department of Molecular Biology, Microbiology and Biochemistry, Southern Illinois University at Carbondale, Carbondale, IL 62901, USA
- Department of Plant Soil and Agricultural Systems, Southern Illinois University at Carbondale, Carbondale, IL 62901-4415, USA
- Department of Horticulture and Crop Science, Ohio State University, 2021 Coffey Rd, Columbus, OH 43210, USA
| | - Laureen Blahut-Beatty
- Agriculture and Agri-Food Canada, Building 21, 960 Carling Ave, Ottawa, ON K1A 0C6, USA
| | - Naghmeh Hemmati
- Department of Molecular Biology, Microbiology and Biochemistry, Southern Illinois University at Carbondale, Carbondale, IL 62901, USA
| | - Daina H Simmonds
- Agriculture and Agri-Food Canada, Building 21, 960 Carling Ave, Ottawa, ON K1A 0C6, USA
| | - Wenbin Li
- Key Laboratory of Soybean Biology in the Chinese Ministry of Education, Harbin University, Harbin, China
| | - Miao Liu
- Key Laboratory of Soybean Biology in the Chinese Ministry of Education, Harbin University, Harbin, China
| | | | - Hemlata Sharma
- Department of Molecular Biology, Microbiology and Biochemistry, Southern Illinois University at Carbondale, Carbondale, IL 62901, USA
- Department of Plant Breeding & Genetics, Rajasthan College of Agriculture, MPUAT, Udaipur, India
| | | | - David A Lightfoot
- Department of Molecular Biology, Microbiology and Biochemistry, Southern Illinois University at Carbondale, Carbondale, IL 62901, USA
- Department of Plant Soil and Agricultural Systems, Southern Illinois University at Carbondale, Carbondale, IL 62901-4415, USA
- Key Laboratory of Soybean Biology in the Chinese Ministry of Education, Harbin University, Harbin, China
- Genomics Core Facility; Center for Excellence the Illinois Soybean Center, Southern Illinois University at Carbondale, Carbondale, IL 62901-4415, USA
| |
Collapse
|
22
|
Giacomelli L, Nanni V, Lenzi L, Zhuang J, Dalla Serra M, Banfield MJ, Town CD, Silverstein KAT, Baraldi E, Moser C. Identification and characterization of the defensin-like gene family of grapevine. Mol Plant Microbe Interact 2012; 25:1118-31. [PMID: 22550957 DOI: 10.1094/mpmi-12-11-0323] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Defensins are a class of small and diverse cysteine-rich proteins found in plants, insects, and vertebrates, which share a common tertiary structure and usually exert broad-spectrum antimicrobial activities. We used a bioinformatic approach to scan the Vitis vinifera genome and identified 79 defensin-like sequences (DEFL) corresponding to 46 genes and allelic variants, plus 33 pseudogenes and gene fragments. Expansion and diversification of grapevine DEFL has occurred after the split from the last common ancestor with the genera Medicago and Arabidopsis. Grapevine DEFL localization on the 'Pinot Noir' genome revealed the presence of several clusters likely evolved through local duplications. By sequencing reverse-transcription polymerase chain reaction products, we could demonstrate the expression of grapevine DEFL with no previously reported record of expression. Many of these genes are predominantly or exclusively expressed in tissues linked to plant reproduction, consistent with findings in other plant species, and some of them accumulated at fruit ripening. The transcripts of five DEFL were also significantly upregulated in tissues infected with Botrytis cinerea, a necrotrophic mold, suggesting a role of these genes in defense against this pathogen. Finally, three novel defensins were discovered among the identified DEFL. They inhibit B. cinerea conidia germination when expressed as recombinant proteins.
Collapse
Affiliation(s)
- Lisa Giacomelli
- IASMA Research and Innovation Centre, San Michele all'Adige, Italy
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Dominguez SR, Sims GE, Wentworth DE, Halpin RA, Robinson CC, Town CD, Holmes KV. Genomic analysis of 16 Colorado human NL63 coronaviruses identifies a new genotype, high sequence diversity in the N-terminal domain of the spike gene and evidence of recombination. J Gen Virol 2012; 93:2387-2398. [PMID: 22837419 DOI: 10.1099/vir.0.044628-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
This study compared the complete genome sequences of 16 NL63 strain human coronaviruses (hCoVs) from respiratory specimens of paediatric patients with respiratory disease in Colorado, USA, and characterized the epidemiology and clinical characteristics associated with circulating NL63 viruses over a 3-year period. From 1 January 2009 to 31 December 2011, 92 of 9380 respiratory specimens were found to be positive for NL63 RNA by PCR, an overall prevalence of 1 %. NL63 viruses were circulating during all 3 years, but there was considerable yearly variation in prevalence and the month of peak incidence. Phylogenetic analysis comparing the genome sequences of the 16 Colorado NL63 viruses with those of the prototypical hCoV-NL63 and three other NL63 viruses from the Netherlands demonstrated that there were three genotypes (A, B and C) circulating in Colorado from 2005 to 2010, and evidence of recombination between virus strains was found. Genotypes B and C co-circulated in Colorado in 2005, 2009 and 2010, but genotype A circulated only in 2005 when it was the predominant NL63 strain. Genotype C represents a new lineage that has not been described previously. The greatest variability in the NL63 virus genomes was found in the N-terminal domain (NTD) of the spike gene (nt 1-600, aa 1-200). Ten different amino acid sequences were found in the NTD of the spike protein among these NL63 strains and the 75 partial published sequences of NTDs from strains found at different times throughout the world.
Collapse
Affiliation(s)
- Samuel R Dominguez
- Department of Microbiology, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA.,Department of Pediatrics, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| | - Gregory E Sims
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - David E Wentworth
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Rebecca A Halpin
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Christine C Robinson
- Department of Pathology and Clinical Medicine, Children's Hospital Colorado, 13123 E 16th Ave, Aurora, CO 80045, USA
| | - Christopher D Town
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Kathryn V Holmes
- Department of Microbiology, University of Colorado School of Medicine, Anschutz Medical Campus, 12800 E 19th Ave, Room P18-9403B, Aurora, CO 80045, USA
| |
Collapse
|
24
|
Regan AD, Millet JK, Tse LPV, Chillag Z, Rinaldi VD, Licitra BN, Dubovi EJ, Town CD, Whittaker GR. Characterization of a recombinant canine coronavirus with a distinct receptor-binding (S1) domain. Virology 2012; 430:90-9. [PMID: 22609354 PMCID: PMC3377836 DOI: 10.1016/j.virol.2012.04.013] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2012] [Revised: 03/30/2012] [Accepted: 04/17/2012] [Indexed: 01/12/2023]
Abstract
Canine alphacoronaviruses (CCoV) exist in two serotypes, type I and II, both of which can cause severe gastroenteritis. Here, we characterize a canine alphacoronavirus, designated CCoV-A76, first isolated in 1976. Serological studies show that CCoV-A76 is distinct from other CCoVs, such as the prototype CCoV-1-71. Efficient replication of CCoV-A76 is restricted to canine cell lines, in contrast to the prototypical type II strain CCoV-1-71 that more efficiently replicates in feline cells. CCoV-A76 can use canine aminopeptidase N (cAPN) receptor for infection of cells, but was unable to use feline APN (fAPN). In contrast, CCoV-1-71 can utilize both. Genomic analysis shows that CCoV-A76 possesses a distinct spike, which is the result of a recombination between type I and type II CCoV, that occurred between the N- and C-terminal domains (NTD and C-domain) of the S1 subunit. These data suggest that CCoV-A76 represents a recombinant coronavirus form, with distinct host cell tropism.
Collapse
Affiliation(s)
- Andrew D Regan
- Department of Microbiology and Immunology, Veterinary Medical Center, Cornell University, Ithaca, NY 14853, United States
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Ferguson ME, Hearne SJ, Close TJ, Wanamaker S, Moskal WA, Town CD, de Young J, Marri PR, Rabbi IY, de Villiers EP. Identification, validation and high-throughput genotyping of transcribed gene SNPs in cassava. Theor Appl Genet 2012; 124:685-95. [PMID: 22069119 DOI: 10.1007/s00122-011-1739-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Accepted: 10/18/2011] [Indexed: 05/05/2023]
Abstract
The availability of genomic resources can facilitate progress in plant breeding through the application of advanced molecular technologies for crop improvement. This is particularly important in the case of less researched crops such as cassava, a staple and food security crop for more than 800 million people. Here, expressed sequence tags (ESTs) were generated from five drought stressed and well-watered cassava varieties. Two cDNA libraries were developed: one from root tissue (CASR), the other from leaf, stem and stem meristem tissue (CASL). Sequencing generated 706 contigs and 3,430 singletons. These sequences were combined with those from two other EST sequencing initiatives and filtered based on the sequence quality. Quality sequences were aligned using CAP3 and embedded in a Windows browser called HarvEST:Cassava which is made available. HarvEST:Cassava consists of a Unigene set of 22,903 quality sequences. A total of 2,954 putative SNPs were identified. Of these 1,536 SNPs from 1,170 contigs and 53 cassava genotypes were selected for SNP validation using Illumina's GoldenGate assay. As a result 1,190 SNPs were validated technically and biologically. The location of validated SNPs on scaffolds of the cassava genome sequence (v.4.1) is provided. A diversity assessment of 53 cassava varieties reveals some sub-structure based on the geographical origin, greater diversity in the Americas as opposed to Africa, and similar levels of diversity in West Africa and southern, eastern and central Africa. The resources presented allow for improved genetic dissection of economically important traits and the application of modern genomics-based approaches to cassava breeding and conservation.
Collapse
Affiliation(s)
- Morag E Ferguson
- International Institute of Tropical Agriculture (IITA), c/o ILRI, P.O. Box 30709, Nairobi, Kenya.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Young ND, Debellé F, Oldroyd GED, Geurts R, Cannon SB, Udvardi MK, Benedito VA, Mayer KFX, Gouzy J, Schoof H, Van de Peer Y, Proost S, Cook DR, Meyers BC, Spannagl M, Cheung F, De Mita S, Krishnakumar V, Gundlach H, Zhou S, Mudge J, Bharti AK, Murray JD, Naoumkina MA, Rosen B, Silverstein KAT, Tang H, Rombauts S, Zhao PX, Zhou P, Barbe V, Bardou P, Bechner M, Bellec A, Berger A, Bergès H, Bidwell S, Bisseling T, Choisne N, Couloux A, Denny R, Deshpande S, Dai X, Doyle JJ, Dudez AM, Farmer AD, Fouteau S, Franken C, Gibelin C, Gish J, Goldstein S, González AJ, Green PJ, Hallab A, Hartog M, Hua A, Humphray SJ, Jeong DH, Jing Y, Jöcker A, Kenton SM, Kim DJ, Klee K, Lai H, Lang C, Lin S, Macmil SL, Magdelenat G, Matthews L, McCorrison J, Monaghan EL, Mun JH, Najar FZ, Nicholson C, Noirot C, O'Bleness M, Paule CR, Poulain J, Prion F, Qin B, Qu C, Retzel EF, Riddle C, Sallet E, Samain S, Samson N, Sanders I, Saurat O, Scarpelli C, Schiex T, Segurens B, Severin AJ, Sherrier DJ, Shi R, Sims S, Singer SR, Sinharoy S, Sterck L, Viollet A, Wang BB, Wang K, Wang M, Wang X, Warfsmann J, Weissenbach J, White DD, White JD, Wiley GB, Wincker P, Xing Y, Yang L, Yao Z, Ying F, Zhai J, Zhou L, Zuber A, Dénarié J, Dixon RA, May GD, Schwartz DC, Rogers J, Quétier F, Town CD, Roe BA. The Medicago genome provides insight into the evolution of rhizobial symbioses. Nature 2011; 480:520-4. [PMID: 22089132 PMCID: PMC3272368 DOI: 10.1038/nature10625] [Citation(s) in RCA: 758] [Impact Index Per Article: 58.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Accepted: 10/13/2011] [Indexed: 11/09/2022]
Abstract
Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Myr ago). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species. Medicago truncatula is a long-established model for the study of legume biology. Here we describe the draft sequence of the M. truncatula euchromatin based on a recently completed BAC assembly supplemented with Illumina shotgun sequence, together capturing ∼94% of all M. truncatula genes. A whole-genome duplication (WGD) approximately 58 Myr ago had a major role in shaping the M. truncatula genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the M. truncatula genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max and Lotus japonicus. M. truncatula is a close relative of alfalfa (Medicago sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.
Collapse
Affiliation(s)
- Nevin D Young
- Department of Plant Pathology, University of Minnesota, St Paul, Minnesota 55108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Thudi M, Bohra A, Nayak SN, Varghese N, Shah TM, Penmetsa RV, Thirunavukkarasu N, Gudipati S, Gaur PM, Kulwal PL, Upadhyaya HD, KaviKishor PB, Winter P, Kahl G, Town CD, Kilian A, Cook DR, Varshney RK. Novel SSR markers from BAC-end sequences, DArT arrays and a comprehensive genetic map with 1,291 marker loci for chickpea (Cicer arietinum L.). PLoS One 2011; 6:e27275. [PMID: 22102885 PMCID: PMC3216927 DOI: 10.1371/journal.pone.0027275] [Citation(s) in RCA: 141] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2011] [Accepted: 10/12/2011] [Indexed: 12/17/2022] Open
Abstract
Chickpea (Cicer arietinum L.) is the third most important cool season food legume, cultivated in arid and semi-arid regions of the world. The goal of this study was to develop novel molecular markers such as microsatellite or simple sequence repeat (SSR) markers from bacterial artificial chromosome (BAC)-end sequences (BESs) and diversity arrays technology (DArT) markers, and to construct a high-density genetic map based on recombinant inbred line (RIL) population ICC 4958 (C. arietinum)×PI 489777 (C. reticulatum). A BAC-library comprising 55,680 clones was constructed and 46,270 BESs were generated. Mining of these BESs provided 6,845 SSRs, and primer pairs were designed for 1,344 SSRs. In parallel, DArT arrays with ca. 15,000 clones were developed, and 5,397 clones were found polymorphic among 94 genotypes tested. Screening of newly developed BES-SSR markers and DArT arrays on the parental genotypes of the RIL mapping population showed polymorphism with 253 BES-SSR markers and 675 DArT markers. Segregation data obtained for these polymorphic markers and 494 markers data compiled from published reports or collaborators were used for constructing the genetic map. As a result, a comprehensive genetic map comprising 1,291 markers on eight linkage groups (LGs) spanning a total of 845.56 cM distance was developed (http://cmap.icrisat.ac.in/cmap/sm/cp/thudi/). The number of markers per linkage group ranged from 68 (LG 8) to 218 (LG 3) with an average inter-marker distance of 0.65 cM. While the developed resource of molecular markers will be useful for genetic diversity, genetic mapping and molecular breeding applications, the comprehensive genetic map with integrated BES-SSR markers will facilitate its anchoring to the physical map (under construction) to accelerate map-based cloning of genes in chickpea and comparative genome evolution studies in legumes.
Collapse
Affiliation(s)
- Mahendar Thudi
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Abhishek Bohra
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
- Department of Genetics, Osmania University, Hyderabad, India
| | - Spurthi N. Nayak
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
- Department of Genetics, Osmania University, Hyderabad, India
| | - Nicy Varghese
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Trushar M. Shah
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - R. Varma Penmetsa
- Department of Plant Pathology, University of California Davis, Davis, California, United States of America
| | | | - Srivani Gudipati
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Pooran M. Gaur
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Pawan L. Kulwal
- State Level Biotechnology Centre, Mahatma Phule Agricultural University, Ahmednagar, India
| | - Hari D. Upadhyaya
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | | | | | - Günter Kahl
- Molecular BioSciences, University of Frankfurt, Frankfurt am Main, Germany
| | - Christopher D. Town
- J. Craig Venter Institute (JCVI), Rockville, Maryland, United States of America
| | | | - Douglas R. Cook
- Department of Plant Pathology, University of California Davis, Davis, California, United States of America
| | - Rajeev K. Varshney
- Grain Legumes Research Program, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
- CGIAR Generation Challenge Programme (GCP), CIMMYT, Mexico DF, Mexico
- * E-mail:
| |
Collapse
|
28
|
Hiremath PJ, Farmer A, Cannon SB, Woodward J, Kudapa H, Tuteja R, Kumar A, BhanuPrakash A, Mulaosmanovic B, Gujaria N, Krishnamurthy L, Gaur PM, KaviKishor PB, Shah T, Srinivasan R, Lohse M, Xiao Y, Town CD, Cook DR, May GD, Varshney RK. Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa. Plant Biotechnol J 2011; 9:922-31. [PMID: 21615673 PMCID: PMC3437486 DOI: 10.1111/j.1467-7652.2011.00625.x] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) have been produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea.
Collapse
Affiliation(s)
- Pavana J Hiremath
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
- Osmania University (OU)Hyderabad, India
| | - Andrew Farmer
- National Centre for Genome Resources (NCGR)Santa Fe, NM, USA
| | - Steven B Cannon
- United States Department of Agriculture-Agricultural Research Service, Corn Insects and Crop Genetics Research Unit (USDA-ARS-CICGRU)Ames, IA, USA
- Department of Agronomy, Iowa State UniversityAmes, IA, USA
| | - Jimmy Woodward
- National Centre for Genome Resources (NCGR)Santa Fe, NM, USA
| | - Himabindu Kudapa
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | - Reetu Tuteja
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | - Ashish Kumar
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | - Amindala BhanuPrakash
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | | | - Neha Gujaria
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | - Laxmanan Krishnamurthy
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | - Pooran M Gaur
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | | | - Trushar Shah
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
| | - Ramamurthy Srinivasan
- National Research Centre on Plant Biotechnology (NRCPB), IARI CampusNew Delhi, India
| | - Marc Lohse
- Max Planck Institute for Molecular Plant Physiology (MPIMPP)Am Muehlenberg, Potsdam-Golm, Germany
| | - Yongli Xiao
- J. Craig Venter Institute (JCVI)Rockville, MD, USA
| | | | | | - Gregory D May
- National Centre for Genome Resources (NCGR)Santa Fe, NM, USA
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT)Patancheru, India
- Generation Challenge Program (GCP)c/o CIMMYT, Mexico DF, Mexico
- *Correspondence (Tel +91 40 30713305; fax +91 40 30713074/3075; email )
| |
Collapse
|
29
|
Han Y, Kang Y, Torres-Jerez I, Cheung F, Town CD, Zhao PX, Udvardi MK, Monteros MJ. Genome-wide SNP discovery in tetraploid alfalfa using 454 sequencing and high resolution melting analysis. BMC Genomics 2011; 12:1-11. [PMID: 21733171 PMCID: PMC3154875 DOI: 10.1186/1471-2164-12-350] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2011] [Accepted: 07/06/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Single nucleotide polymorphisms (SNPs) are the most common type of sequence variation among plants and are often functionally important. We describe the use of 454 technology and high resolution melting analysis (HRM) for high throughput SNP discovery in tetraploid alfalfa (Medicago sativa L.), a species with high economic value but limited genomic resources. RESULTS The alfalfa genotypes selected from M. sativa subsp. sativa var. 'Chilean' and M. sativa subsp. falcata var. 'Wisfal', which differ in water stress sensitivity, were used to prepare cDNA from tissue of clonally-propagated plants grown under either well-watered or water-stressed conditions, and then pooled for 454 sequencing. Based on 125.2 Mb of raw sequence, a total of 54,216 unique sequences were obtained including 24,144 tentative consensus (TCs) sequences and 30,072 singletons, ranging from 100 bp to 6,662 bp in length, with an average length of 541 bp. We identified 40,661 candidate SNPs distributed throughout the genome. A sample of candidate SNPs were evaluated and validated using high resolution melting (HRM) analysis. A total of 3,491 TCs harboring 20,270 candidate SNPs were located on the M. truncatula (MT 3.5.1) chromosomes. Gene Ontology assignments indicate that sequences obtained cover a broad range of GO categories. CONCLUSIONS We describe an efficient method to identify thousands of SNPs distributed throughout the alfalfa genome covering a broad range of GO categories. Validated SNPs represent valuable molecular marker resources that can be used to enhance marker density in linkage maps, identify potential factors involved in heterosis and genetic variation, and as tools for association mapping and genomic selection in alfalfa.
Collapse
Affiliation(s)
- Yuanhong Han
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Bohra A, Dubey A, Saxena RK, Penmetsa RV, Poornima KN, Kumar N, Farmer AD, Srivani G, Upadhyaya HD, Gothalwal R, Ramesh S, Singh D, Saxena K, Kishor PBK, Singh NK, Town CD, May GD, Cook DR, Varshney RK. Analysis of BAC-end sequences (BESs) and development of BES-SSR markers for genetic mapping and hybrid purity assessment in pigeonpea (Cajanus spp.). BMC Plant Biol 2011; 11:56. [PMID: 21447154 PMCID: PMC3079640 DOI: 10.1186/1471-2229-11-56] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2010] [Accepted: 03/29/2011] [Indexed: 05/20/2023]
Abstract
BACKGROUND Pigeonpea [Cajanus cajan (L.) Millsp.] is an important legume crop of rainfed agriculture. Despite of concerted research efforts directed to pigeonpea improvement, stagnated productivity of pigeonpea during last several decades may be accounted to prevalence of various biotic and abiotic constraints and the situation is exacerbated by availability of inadequate genomic resources to undertake any molecular breeding programme for accelerated crop improvement. With the objective of enhancing genomic resources for pigeonpea, this study reports for the first time, large scale development of SSR markers from BAC-end sequences and their subsequent use for genetic mapping and hybridity testing in pigeonpea. RESULTS A set of 88,860 BAC (bacterial artificial chromosome)-end sequences (BESs) were generated after constructing two BAC libraries by using HindIII (34,560 clones) and BamHI (34,560 clones) restriction enzymes. Clustering based on sequence identity of BESs yielded a set of >52K non-redundant sequences, comprising 35 Mbp or >4% of the pigeonpea genome. These sequences were analyzed to develop annotation lists and subdivide the BESs into genome fractions (e.g., genes, retroelements, transpons and non-annotated sequences). Parallel analysis of BESs for microsatellites or simple sequence repeats (SSRs) identified 18,149 SSRs, from which a set of 6,212 SSRs were selected for further analysis. A total of 3,072 novel SSR primer pairs were synthesized and tested for length polymorphism on a set of 22 parental genotypes of 13 mapping populations segregating for traits of interest. In total, we identified 842 polymorphic SSR markers that will have utility in pigeonpea improvement. Based on these markers, the first SSR-based genetic map comprising of 239 loci was developed for this previously uncharacterized genome. Utility of developed SSR markers was also demonstrated by identifying a set of 42 markers each for two hybrids (ICPH 2671 and ICPH 2438) for genetic purity assessment in commercial hybrid breeding programme. CONCLUSION In summary, while BAC libraries and BESs should be useful for genomics studies, BES-SSR markers, and the genetic map should be very useful for linking the genetic map with a future physical map as well as for molecular breeding in pigeonpea.
Collapse
Affiliation(s)
- Abhishek Bohra
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Genetics, Osmania University, Hyderabad 500007, India
| | - Anuja Dubey
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Biotechnology and Bioinformatics Centre, Barkatullah University, Bhopal 462026, India
| | - Rachit K Saxena
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Genetics, Osmania University, Hyderabad 500007, India
| | - R Varma Penmetsa
- Department of Plant Pathology, University of California, Davis, CA 95616, USA
| | - KN Poornima
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Biotechnology, University of Agricultural Sciences (UAS), Bangalore 560065, India
| | - Naresh Kumar
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Department of Plant Breeding and Genetics, CCS Haryana Agricultural University (CCSHAU), Hisar 125004, India
| | - Andrew D Farmer
- National Center for Genome Resources (NCGR), Santa Fe, N M 87505, USA
| | - Gudipati Srivani
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
| | - Hari D Upadhyaya
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
| | - Ragini Gothalwal
- Department of Biotechnology and Bioinformatics Centre, Barkatullah University, Bhopal 462026, India
| | - S Ramesh
- Department of Biotechnology, University of Agricultural Sciences (UAS), Bangalore 560065, India
| | - Dhiraj Singh
- Department of Plant Breeding and Genetics, CCS Haryana Agricultural University (CCSHAU), Hisar 125004, India
| | - Kulbhushan Saxena
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
| | - PB Kavi Kishor
- Department of Genetics, Osmania University, Hyderabad 500007, India
| | - Nagendra K Singh
- National Research Center on Plant Biotechnology (NRCPB), New Delhi 110012, India
| | | | - Gregory D May
- National Center for Genome Resources (NCGR), Santa Fe, N M 87505, USA
| | - Douglas R Cook
- Department of Plant Pathology, University of California, Davis, CA 95616, USA
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502324, India
- Generation Challenge Programme (GCP), c/o CIMMYT, 06600 Mexico DF, Mexico
| |
Collapse
|
31
|
Xiao YL, Redman JC, Monaghan EL, Zhuang J, Underwood BA, Moskal WA, Wang W, Wu HC, Town CD. High throughput generation of promoter reporter (GFP) transgenic lines of low expressing genes in Arabidopsis and analysis of their expression patterns. Plant Methods 2010; 6:18. [PMID: 20687964 PMCID: PMC2927586 DOI: 10.1186/1746-4811-6-18] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2009] [Accepted: 08/06/2010] [Indexed: 05/20/2023]
Abstract
BACKGROUND Although the complete genome sequence and annotation of Arabidopsis were released at the end of year 2000, it is still a great challenge to understand the function of each gene in the Arabidopsis genome. One way to understand the function of genes on a genome-wide scale is expression profiling by microarrays. However, the expression level of many genes in Arabidopsis genome cannot be detected by microarray experiments. In addition, there are many more novel genes that have been discovered by experiments or predicted by new gene prediction programs. Another way to understand the function of individual genes is to investigate their in vivo expression patterns by reporter constructs in transgenic plants which can provide basic information on the patterns of gene expression. RESULTS A high throughput pipeline was developed to generate promoter-reporter (GFP) transgenic lines for Arabidopsis genes expressed at very low levels and to examine their expression patterns in vivo. The promoter region from a total of 627 non- or low-expressed genes in Arabidopsis based on Arabidopsis annotation release 5 were amplified and cloned into a Gateway vector. A total of 353 promoter-reporter (GFP) constructs were successfully transferred into Agrobacterium (GV3101) by triparental mating and subsequently used for Arabidopsis transformation. Kanamycin-resistant transgenic lines were obtained from 266 constructs and among them positive GFP expression was detected from 150 constructs. Of these 150 constructs, multiple transgenic lines exhibiting consistent expression patterns were obtained for 112 constructs. A total 81 different regions of expression were discovered during our screening of positive transgenic plants and assigned Plant Ontology (PO) codes. CONCLUSIONS Many of the genes tested for which expression data were lacking previously are indeed expressed in Arabidopsis during the developmental stages screened. More importantly, our study provides plant researchers with another resource of gene expression information in Arabidopsis. The results of this study are captured in a MySQL database and can be searched at http://www.jcvi.org/arabidopsis/qpcr/index.shtml. Transgenic seeds and constructs are also available for the research community.
Collapse
Affiliation(s)
- Yong-Li Xiao
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Julia C Redman
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Erin L Monaghan
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Jun Zhuang
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Beverly A Underwood
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - William A Moskal
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Wei Wang
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Hank C Wu
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Christopher D Town
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| |
Collapse
|
32
|
Fonseca JP, Menossi M, Thibaud-Nissen F, Town CD. Functional analysis of a TGA factor-binding site located in the promoter region controlling salicylic acid-induced NIMIN-1 expression in Arabidopsis. Genet Mol Res 2010; 9:167-75. [PMID: 20198573 DOI: 10.4238/vol9-1gmr704] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
TGA factors play a key role in plant defense by binding to the promoter region of defense genes, inducing expression. Salicylic acid (SA) induces the expression of the gene encoding NIMIN-1, which interacts with NPR1/NIM1, a key regulator of systemic acquired resistance. We investigated whether the TGA2-binding motif TGACG located upstream of the NIMIN-1 gene is necessary for SA induction of NIMIN-1 expression. A mutated version of the NIMIN-1 promoter was created by site-directed mutagenesis. We generated T-DNA constructs in which native NIMIN-1 and mutated promoters were fused to green fluorescent protein and beta-glucuronidase reporters. We produced transgenic Arabidopsis plants and observed NIMIN-1 promoter-driven green fluorescent protein expression in the roots, petiole and leaves. Constructs were agroinfiltrated into the leaves for transient quantitative assays of gene expression. Using quantitative real-time RT-PCR, we characterized the normal gene response to SA and compared it to the response of the mutant version of the NIMIN-1 promoter. Both the native NIMIN-1 construct and an endogenous copy of NIMIN-1 were induced by SA. However, the mutated promoter construct was much less sensitive to SA than the native NIMIN-1 promoter, indicating that this TGA2-binding motif is directly involved in the modulation of SA-induced NIMIN-1 expression in Arabidopsis.
Collapse
Affiliation(s)
- J P Fonseca
- Departamento de Genética, Evolução e Bioagentes, Universidade Estadual de Campinas, Campinas, SP, Brasil.
| | | | | | | |
Collapse
|
33
|
Trick M, Kwon SJ, Choi SR, Fraser F, Soumpourou E, Drou N, Wang Z, Lee SY, Yang TJ, Mun JH, Paterson AH, Town CD, Pires JC, Pyo Lim Y, Park BS, Bancroft I. Complexity of genome evolution by segmental rearrangement in Brassica rapa revealed by sequence-level analysis. BMC Genomics 2009; 10:539. [PMID: 19922648 PMCID: PMC2783169 DOI: 10.1186/1471-2164-10-539] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2009] [Accepted: 11/18/2009] [Indexed: 11/21/2022] Open
Abstract
Background The Brassica species, related to Arabidopsis thaliana, include an important group of crops and represent an excellent system for studying the evolutionary consequences of polyploidy. Previous studies have led to a proposed structure for an ancestral karyotype and models for the evolution of the B. rapa genome by triplication and segmental rearrangement, but these have not been validated at the sequence level. Results We developed computational tools to analyse the public collection of B. rapa BAC end sequence, in order to identify candidates for representing collinearity discontinuities between the genomes of B. rapa and A. thaliana. For each putative discontinuity, one of the BACs was sequenced and analysed for collinearity with the genome of A. thaliana. Additional BAC clones were identified and sequenced as part of ongoing efforts to sequence four chromosomes of B. rapa. Strikingly few of the 19 inter-chromosomal rearrangements corresponded to the set of collinearity discontinuities anticipated on the basis of previous studies. Our analyses revealed numerous instances of newly detected collinearity blocks. For B. rapa linkage group A8, we were able to develop a model for the derivation of the chromosome from the ancestral karyotype. We were also able to identify a rearrangement event in the ancestor of B. rapa that was not shared with the ancestor of A. thaliana, and is represented in triplicate in the B. rapa genome. In addition to inter-chromosomal rearrangements, we identified and analysed 32 BACs containing the end points of segmental inversion events. Conclusion Our results show that previous studies of segmental collinearity between the A. thaliana, Brassica and ancestral karyotype genomes, although very useful, represent over-simplifications of their true relationships. The presence of numerous cryptic collinear genome segments and the frequent occurrence of segmental inversions mean that inference of the positions of genes in B. rapa based on the locations of orthologues in A. thaliana can be misleading. Our results will be of relevance to a wide range of plants that have polyploid genomes, many of which are being considered according to a paradigm of comprising conserved synteny blocks with respect to sequenced, related genomes.
Collapse
Affiliation(s)
- Martin Trick
- John Innes Centre, Norwich Research Park, Colney, Norwich, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Varshney RK, Hiremath PJ, Lekha P, Kashiwagi J, Balaji J, Deokar AA, Vadez V, Xiao Y, Srinivasan R, Gaur PM, Siddique KHM, Town CD, Hoisington DA. A comprehensive resource of drought- and salinity- responsive ESTs for gene discovery and marker development in chickpea (Cicer arietinum L.). BMC Genomics 2009; 10:523. [PMID: 19912666 PMCID: PMC2784481 DOI: 10.1186/1471-2164-10-523] [Citation(s) in RCA: 172] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2009] [Accepted: 11/15/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Chickpea (Cicer arietinum L.), an important grain legume crop of the world is seriously challenged by terminal drought and salinity stresses. However, very limited number of molecular markers and candidate genes are available for undertaking molecular breeding in chickpea to tackle these stresses. This study reports generation and analysis of comprehensive resource of drought- and salinity-responsive expressed sequence tags (ESTs) and gene-based markers. RESULTS A total of 20,162 (18,435 high quality) drought- and salinity- responsive ESTs were generated from ten different root tissue cDNA libraries of chickpea. Sequence editing, clustering and assembly analysis resulted in 6,404 unigenes (1,590 contigs and 4,814 singletons). Functional annotation of unigenes based on BLASTX analysis showed that 46.3% (2,965) had significant similarity (< or =1E-05) to sequences in the non-redundant UniProt database. BLASTN analysis of unique sequences with ESTs of four legume species (Medicago, Lotus, soybean and groundnut) and three model plant species (rice, Arabidopsis and poplar) provided insights on conserved genes across legumes as well as novel transcripts for chickpea. Of 2,965 (46.3%) significant unigenes, only 2,071 (32.3%) unigenes could be functionally categorised according to Gene Ontology (GO) descriptions. A total of 2,029 sequences containing 3,728 simple sequence repeats (SSRs) were identified and 177 new EST-SSR markers were developed. Experimental validation of a set of 77 SSR markers on 24 genotypes revealed 230 alleles with an average of 4.6 alleles per marker and average polymorphism information content (PIC) value of 0.43. Besides SSR markers, 21,405 high confidence single nucleotide polymorphisms (SNPs) in 742 contigs (with > or = 5 ESTs) were also identified. Recognition sites for restriction enzymes were identified for 7,884 SNPs in 240 contigs. Hierarchical clustering of 105 selected contigs provided clues about stress- responsive candidate genes and their expression profile showed predominance in specific stress-challenged libraries. CONCLUSION Generated set of chickpea ESTs serves as a resource of high quality transcripts for gene discovery and development of functional markers associated with abiotic stress tolerance that will be helpful to facilitate chickpea breeding. Mapping of gene-based markers in chickpea will also add more anchoring points to align genomes of chickpea and other legume species.
Collapse
Affiliation(s)
- Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
- Genomics Towards Gene Discovery Sub Programme, Generation Challenge Programme (GCP), c/o CIMMYT, Int. Apartado Postal 6-641, 06600, Mexico, D. F., Mexico
| | - Pavana J Hiremath
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
| | - Pazhamala Lekha
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
| | - Junichi Kashiwagi
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
- Graduate School of Agriculture, Hokkaido University, Kita 9 Nishi 9, Kita-ku, Sapporo, 060-8589, Japan
| | - Jayashree Balaji
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
| | - Amit A Deokar
- National Research Centre on Plant Biotechnology (NRCPB), IARI Campus, New Delhi-110012, India
| | - Vincent Vadez
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
| | - Yongli Xiao
- J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Ramamurthy Srinivasan
- National Research Centre on Plant Biotechnology (NRCPB), IARI Campus, New Delhi-110012, India
| | - Pooran M Gaur
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
| | - Kadambot HM Siddique
- Institute of Agriculture, The University of Western Australia (UWA) (M082), 35 Stirling Highway, Crawley WA 6009, Australia
| | - Christopher D Town
- J. Craig Venter Institute (JCVI), 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - David A Hoisington
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Greater Hyderabad 502 324, AP, India
| |
Collapse
|
35
|
Trick M, Cheung F, Drou N, Fraser F, Lobenhofer EK, Hurban P, Magusin A, Town CD, Bancroft I. A newly-developed community microarray resource for transcriptome profiling in Brassica species enables the confirmation of Brassica-specific expressed sequences. BMC Plant Biol 2009; 9:50. [PMID: 19426481 PMCID: PMC2685394 DOI: 10.1186/1471-2229-9-50] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2008] [Accepted: 05/08/2009] [Indexed: 05/21/2023]
Abstract
BACKGROUND The Brassica species include an important group of crops and provide opportunities for studying the evolutionary consequences of polyploidy. They are related to Arabidopsis thaliana, for which the first complete plant genome sequence was obtained and their genomes show extensive, although imperfect, conserved synteny with that of A. thaliana. A large number of EST sequences, derived from a range of different Brassica species, are available in the public database, but no public microarray resource has so far been developed for these species. RESULTS We assembled unigenes using approximately 800,000 EST sequences, mainly from three species: B. napus, B. rapa and B. oleracea. The assembly was conducted with the aim of co-assembling ESTs of orthologous genes (including homoeologous pairs of genes in B. napus from each of the A and C genomes), but resolving assemblies of paralogous, or paleo-homoeologous, genes (i.e. the genes related by the ancestral genome triplication observed in diploid Brassica species). 90,864 unique sequence assemblies were developed. These were incorporated into the BAC sequence annotation for the Brassica rapa Genome Sequencing Project, enabling the identification of cognate genomic sequences for a proportion of them. A 60-mer oligo microarray comprising 94,558 probes was developed using the unigene sequences. Gene expression was analysed in reciprocal resynthesised B. napus lines and the B. oleracea and B. rapa lines used to produce them. The analysis showed that significant expression could consistently be detected in leaf tissue for 35,386 unigenes. Expression was detected across all four genotypes for 27,355 unigenes, genome-specific expression patterns were observed for 7,851 unigenes and 180 unigenes displayed other classes of expression pattern. Principal component analysis (PCA) clearly resolved the individual microarray datasets for B. rapa, B. oleracea and resynthesised B. napus. Quantitative differences in expression were observed between the resynthesised B. napus lines for 98 unigenes, most of which could be classified into non-additive expression patterns, including 17 that showed cytoplasm-specific patterns. We further characterized the unigenes for which A genome-specific expression was observed and cognate genomic sequences could be identified. Ten of these unigenes were found to be Brassica-specific sequences, including two that originate from complex loci comprising gene clusters. CONCLUSION We succeeded in developing a Brassica community microarray resource. Although expression can be measured for the majority of unigenes across species, there were numerous probes that reported in a genome-specific manner. We anticipate that some proportion of these will represent species-specific transcripts and the remainder will be the consequence of variation of sequences within the regions represented by the array probes. Our studies demonstrated that the datasets obtained from the arrays can be used for typical analyses, including PCA and the analysis of differential expression. We have also demonstrated that Brassica-specific transcripts identified in silico in the sequence assembly of public EST database accessions are indeed reported by the array. These would not be detectable using arrays designed using A. thaliana sequences.
Collapse
Affiliation(s)
- Martin Trick
- John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
| | - Foo Cheung
- The J Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Nizar Drou
- John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
| | - Fiona Fraser
- John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
| | - Edward K Lobenhofer
- Cogenics, A Division of Clinical Data, Inc, 100 Perimeter Park Drive, Suite C, Morrisville, NC 27560, USA
- Current address : Amgen Inc, 1 Amgen Center Drive, Thousand Oaks, CA 91320, USA
| | - Patrick Hurban
- Cogenics, A Division of Clinical Data, Inc, 100 Perimeter Park Drive, Suite C, Morrisville, NC 27560, USA
| | - Andreas Magusin
- John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
| | - Christopher D Town
- The J Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA
| | - Ian Bancroft
- John Innes Centre, Norwich Research Park, Colney, Norwich, NR4 7UH, UK
| |
Collapse
|
36
|
Bartoš J, Paux E, Kofler R, Havránková M, Kopecký D, Suchánková P, Šafář J, Šimková H, Town CD, Lelley T, Feuillet C, Doležel J. A first survey of the rye (Secale cereale) genome composition through BAC end sequencing of the short arm of chromosome 1R. BMC Plant Biol 2008; 8:95. [PMID: 18803819 PMCID: PMC2565679 DOI: 10.1186/1471-2229-8-95] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2008] [Accepted: 09/19/2008] [Indexed: 05/02/2023]
Abstract
BACKGROUND Rye (Secale cereale L.) belongs to tribe Triticeae and is an important temperate cereal. It is one of the parents of man-made species Triticale and has been used as a source of agronomically important genes for wheat improvement. The short arm of rye chromosome 1 (1RS), in particular is rich in useful genes, and as it may increase yield, protein content and resistance to biotic and abiotic stress, it has been introgressed into wheat as the 1BL.1RS translocation. A better knowledge of the rye genome could facilitate rye improvement and increase the efficiency of utilizing rye genes in wheat breeding. RESULTS Here, we report on BAC end sequencing of 1,536 clones from two 1RS-specific BAC libraries. We obtained 2,778 (90.4%) useful sequences with a cumulative length of 2,032,538 bp and an average read length of 732 bp. These sequences represent 0.5% of 1RS arm. The GC content of the sequenced fraction of 1RS is 45.9%, and at least 84% of the 1RS arm consists of repetitive DNA. We identified transposable element junctions in BESs and developed insertion site based polymorphism markers (ISBP). Out of the 64 primer pairs tested, 17 (26.6%) were specific for 1RS. We also identified BESs carrying microsatellites suitable for development of 1RS-specific SSR markers. CONCLUSION This work demonstrates the utility of chromosome arm-specific BAC libraries for targeted analysis of large Triticeae genomes and provides new sequence data from the rye genome and molecular markers for the short arm of rye chromosome 1.
Collapse
Affiliation(s)
- Jan Bartoš
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
| | - Etienne Paux
- INRA- Université Blaise Pascal, UMR GDEC 1095, 234 Avenue du Brezet, F-63100 Clermont-Ferrand, France
| | - Robert Kofler
- University of Natural Resources and Applied Life Sciences, Department for Agrobiotechnology, Institute for Plant Production Biotechnology, Konrad Lorenz Str. 20, A-3430 Tulln, Austria
| | - Miroslava Havránková
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
| | - David Kopecký
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
| | - Pavla Suchánková
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
| | - Jan Šafář
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
| | - Hana Šimková
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
| | - Christopher D Town
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville MD 20850, USA
| | - Tamas Lelley
- University of Natural Resources and Applied Life Sciences, Department for Agrobiotechnology, Institute for Plant Production Biotechnology, Konrad Lorenz Str. 20, A-3430 Tulln, Austria
| | - Catherine Feuillet
- INRA- Université Blaise Pascal, UMR GDEC 1095, 234 Avenue du Brezet, F-63100 Clermont-Ferrand, France
| | - Jaroslav Doležel
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Sokolovská 6, CZ-77200 Olomouc, Czech Republic
- Department of Cell Biology and Genetics, Palacký University, Šlechtitelù 11, CZ-78371 Olomouc, Czech Republic
| |
Collapse
|
37
|
Verdier J, Kakar K, Gallardo K, Le Signor C, Aubert G, Schlereth A, Town CD, Udvardi MK, Thompson RD. Gene expression profiling of M. truncatula transcription factors identifies putative regulators of grain legume seed filling. Plant Mol Biol 2008; 67:567-80. [PMID: 18528765 DOI: 10.1007/s11103-008-9320-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2007] [Accepted: 03/13/2008] [Indexed: 05/23/2023]
Abstract
Legume seeds represent a major source of proteins for human and livestock diets. The model legume Medicago truncatula is characterized by a process of seed development very similar to that of other legumes, involving the interplay of sets of transcription factors (TFs). Here, we report the first expression profiling of over 700 M. truncatula genes encoding putative TFs throughout seven stages of seed development, obtained using real-time quantitative RT-PCR. A total of 169 TFs were selected which were expressed at late embryogenesis, seed filling or desiccation. The site of expression within the seed was examined for 41 highly expressed transcription factors out of the 169. To identify possible target genes for these TFs, the data were combined with a microarray-derived transcriptome dataset. This study identified 17 TFs preferentially expressed in individual seed tissues and 135 corresponding co-expressed genes, including possible targets. Certain of the TFs co-expressed with storage protein mRNAs correspond to those already known to regulate seed storage protein synthesis in Arabidopsis, whereas the timing of expression of others may be more specifically related to the delayed expression of the legumin-class storage proteins observed in legumes.
Collapse
Affiliation(s)
- Jérôme Verdier
- Unité Mixte de Recherche en Génétique et Ecophysiologie des Légumineuses à Graines (UMR-LEG), Institut National de la Recherche Agronomique (INRA), Domaine d'Epoisses, 21110, Bretenieres, France
| | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Jakše J, Meyer JDF, Suzuki G, McCallum J, Cheung F, Town CD, Havey MJ. Pilot sequencing of onion genomic DNA reveals fragments of transposable elements, low gene densities, and significant gene enrichment after methyl filtration. Mol Genet Genomics 2008; 280:287-92. [DOI: 10.1007/s00438-008-0364-z] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Accepted: 06/22/2008] [Indexed: 10/21/2022]
|
39
|
Kakar K, Wandrey M, Czechowski T, Gaertner T, Scheible WR, Stitt M, Torres-Jerez I, Xiao Y, Redman JC, Wu HC, Cheung F, Town CD, Udvardi MK. A community resource for high-throughput quantitative RT-PCR analysis of transcription factor gene expression in Medicago truncatula. Plant Methods 2008; 4:18. [PMID: 18611268 PMCID: PMC2490690 DOI: 10.1186/1746-4811-4-18] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2008] [Accepted: 07/08/2008] [Indexed: 05/18/2023]
Abstract
BACKGROUND Medicago truncatula is a model legume species that is currently the focus of an international genome sequencing effort. Although several different oligonucleotide and cDNA arrays have been produced for genome-wide transcript analysis of this species, intrinsic limitations in the sensitivity of hybridization-based technologies mean that transcripts of genes expressed at low-levels cannot be measured accurately with these tools. Amongst such genes are many encoding transcription factors (TFs), which are arguably the most important class of regulatory proteins. Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) is the most sensitive method currently available for transcript quantification, and one that can be scaled up to analyze transcripts of thousands of genes in parallel. Thus, qRT-PCR is an ideal method to tackle the problem of TF transcript quantification in Medicago and other plants. RESULTS We established a bioinformatics pipeline to identify putative TF genes in Medicago truncatula and to design gene-specific oligonucleotide primers for qRT-PCR analysis of TF transcripts. We validated the efficacy and gene-specificity of over 1000 TF primer pairs and utilized these to identify sets of organ-enhanced TF genes that may play important roles in organ development or differentiation in this species. This community resource will be developed further as more genome sequence becomes available, with the ultimate goal of producing validated, gene-specific primers for all Medicago TF genes. CONCLUSION High-throughput qRT-PCR using a 384-well plate format enables rapid, flexible, and sensitive quantification of all predicted Medicago transcription factor mRNAs. This resource has been utilized recently by several groups in Europe, Australia, and the USA, and we expect that it will become the 'gold-standard' for TF transcript profiling in Medicago truncatula.
Collapse
Affiliation(s)
- Klementina Kakar
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Maren Wandrey
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Tomasz Czechowski
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Tanja Gaertner
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Wolf-Rüdiger Scheible
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Mark Stitt
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Ivone Torres-Jerez
- The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA
| | - Yongli Xiao
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, 20850, USA
| | - Julia C Redman
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, 20850, USA
| | - Hank C Wu
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, 20850, USA
| | - Foo Cheung
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, 20850, USA
| | - Christopher D Town
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD, 20850, USA
| | - Michael K Udvardi
- Max-Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
- The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA
| |
Collapse
|
40
|
Mian MAR, Zhang Y, Wang ZY, Zhang JY, Cheng X, Chen L, Chekhovskiy K, Dai X, Mao C, Cheung F, Zhao X, He J, Scott AD, Town CD, May GD. Analysis of tall fescue ESTs representing different abiotic stresses, tissue types and developmental stages. BMC Plant Biol 2008; 8:27. [PMID: 18318913 PMCID: PMC2323379 DOI: 10.1186/1471-2229-8-27] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2007] [Accepted: 03/04/2008] [Indexed: 05/02/2023]
Abstract
BACKGROUND Tall fescue (Festuca arundinacea Schreb) is a major cool season forage and turf grass species grown in the temperate regions of the world. In this paper we report the generation of a tall fescue expressed sequence tag (EST) database developed from nine cDNA libraries representing tissues from different plant organs, developmental stages, and abiotic stress factors. The results of inter-library and library-specific in silico expression analyses of these ESTs are also reported. RESULTS A total of 41,516 ESTs were generated from nine cDNA libraries of tall fescue representing tissues from different plant organs, developmental stages, and abiotic stress conditions. The Festuca Gene Index (FaGI) has been established. To date, this represents the first publicly available tall fescue EST database. In silico gene expression studies using these ESTs were performed to understand stress responses in tall fescue. A large number of ESTs of known stress response gene were identified from stressed tissue libraries. These ESTs represent gene homologues of heat-shock and oxidative stress proteins, and various transcription factor protein families. Highly expressed ESTs representing genes of unknown functions were also identified in the stressed tissue libraries. CONCLUSION FaGI provides a useful resource for genomics studies of tall fescue and other closely related forage and turf grass species. Comparative genomic analyses between tall fescue and other grass species, including ryegrasses (Lolium sp.), meadow fescue (F. pratensis) and tetraploid fescue (F. arundinacea var glaucescens) will benefit from this database. These ESTs are an excellent resource for the development of simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) PCR-based molecular markers.
Collapse
Affiliation(s)
- MA Rouf Mian
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
- USDA-ARS, The Ohio State University & OARDC, 1680 Madison Avenue, Wooster, OH 44691, USA
| | - Yan Zhang
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Zeng-Yu Wang
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Ji-Yi Zhang
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Xiaofei Cheng
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Lei Chen
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Konstantin Chekhovskiy
- Forage Improvement Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Xinbin Dai
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Chunhong Mao
- Virginia Bioinformatics Institute, 1750 Kraft Drive Suite 1400, Virginia Tech, Blacksburg, VA 24061, USA
| | - Foo Cheung
- The J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850, USA
| | - Xuechun Zhao
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Ji He
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Angela D Scott
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
| | - Christopher D Town
- The J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850, USA
| | - Gregory D May
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73402, USA
- National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM 87505, USA
| |
Collapse
|
41
|
Timko MP, Rushton PJ, Laudeman TW, Bokowiec MT, Chipumuro E, Cheung F, Town CD, Chen X. Sequencing and analysis of the gene-rich space of cowpea. BMC Genomics 2008; 9:103. [PMID: 18304330 PMCID: PMC2279124 DOI: 10.1186/1471-2164-9-103] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2007] [Accepted: 02/27/2008] [Indexed: 11/16/2022] Open
Abstract
Background Cowpea, Vigna unguiculata (L.) Walp., is one of the most important food and forage legumes in the semi-arid tropics because of its drought tolerance and ability to grow on poor quality soils. Approximately 80% of cowpea production takes place in the dry savannahs of tropical West and Central Africa, mostly by poor subsistence farmers. Despite its economic and social importance in the developing world, cowpea remains to a large extent an underexploited crop. Among the major goals of cowpea breeding and improvement programs is the stacking of desirable agronomic traits, such as disease and pest resistance and response to abiotic stresses. Implementation of marker-assisted selection and breeding programs is severely limited by a paucity of trait-linked markers and a general lack of information on gene structure and organization. With a nuclear genome size estimated at ~620 Mb, the cowpea genome is an ideal target for reduced representation sequencing. Results We report here the sequencing and analysis of the gene-rich, hypomethylated portion of the cowpea genome selectively cloned by methylation filtration (MF) technology. Over 250,000 gene-space sequence reads (GSRs) with an average length of 610 bp were generated, yielding ~160 Mb of sequence information. The GSRs were assembled, annotated by BLAST homology searches of four public protein annotation databases and four plant proteomes (A. thaliana, M. truncatula, O. sativa, and P. trichocarpa), and analyzed using various domain and gene modeling tools. A total of 41,260 GSR assemblies and singletons were annotated, of which 19,786 have unique GenBank accession numbers. Within the GSR dataset, 29% of the sequences were annotated using the Arabidopsis Gene Ontology (GO) with the largest categories of assigned function being catalytic activity and metabolic processes, groups that include the majority of cellular enzymes and components of amino acid, carbohydrate and lipid metabolism. A total of 5,888 GSRs had homology to genes encoding transcription factors (TFs) and transcription associated factors (TAFs) representing about 5% of the total annotated sequences in the dataset. Sixty-two (62) of the 64 well-characterized plant transcription factor (TF) gene families are represented in the cowpea GSRs, and these families are of similar size and phylogenetic organization to those characterized in other plants. The cowpea GSRs also provides a rich source of genes involved in photoperiodic control, symbiosis, and defense-related responses. Comparisons to available databases revealed that about 74% of cowpea ESTs and 70% of all legume ESTs were represented in the GSR dataset. As approximately 12% of all GSRs contain an identifiable simple-sequence repeat, the dataset is a powerful resource for the design of microsatellite markers. Conclusion The availability of extensive publicly available genomic data for cowpea, a non-model legume with significant importance in the developing world, represents a significant step forward in legume research. Not only does the gene space sequence enable the detailed analysis of gene structure, gene family organization and phylogenetic relationships within cowpea, but it also facilitates the characterization of syntenic relationships with other cultivated and model legumes, and will contribute to determining patterns of chromosomal evolution in the Leguminosae. The micro and macrosyntenic relationships detected between cowpea and other cultivated and model legumes should simplify the identification of informative markers for marker-assisted trait selection and map-based gene isolation necessary for cowpea improvement.
Collapse
Affiliation(s)
- Michael P Timko
- Department of Biology, University of Virginia, Charlottesville, Virginia 22903, USA.
| | | | | | | | | | | | | | | |
Collapse
|
42
|
Hribová E, Dolezelová M, Town CD, Macas J, Dolezel J. Isolation and characterization of the highly repeated fraction of the banana genome. Cytogenet Genome Res 2008; 119:268-74. [PMID: 18253041 DOI: 10.1159/000112073] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/14/2007] [Indexed: 01/04/2023] Open
Abstract
Although the nuclear genome of banana (Musa spp.) is relatively small (1C approximately 610 Mbp for M. acuminata), the results obtained from other sequenced genomes suggest that more than half of the banana genome may be composed of repetitive and non-coding DNA sequences. Knowledge of repetitive DNA can facilitate mapping of important traits, phylogenetic studies, BAC-based physical mapping, and genome sequencing/annotation. However, only a few repetitive DNA sequences have been characterized in banana. In this work, we used DNA reassociation kinetics to isolate the highly repeated fraction of the banana genome (M. acuminata 'Calcutta 4'). Two libraries, one prepared from Cot </=0.05 DNA (2,688 clones) and one from Cot </=0.1 sequences (4,608 clones), were constructed, and 614 DNA clones were chosen randomly for sequencing and further characterization. Dot-plot analysis revealed that 14% of the sequenced clones contained various semi-tandem and palindromic repeated sequences. 'BLAST' homology searches showed that, in addition to tandem repeats, the Cot libraries were composed mainly of different types of retrotransposons, the most frequent being the Ty3/gypsy type monkey retrotransposon. Selected sequences displaying tandem organization properties were mapped by PRimed IN Situ DNA labeling (PRINS) to the secondary constriction on metaphase chromosomes of M. acuminata 'Calcutta 4'. Southern hybridization with selected BAC clones carrying 45S rDNA confirmed the presence of the tandem repeats in the 45S rDNA unit. This work significantly expands the knowledge of the repetitive fraction of the Musa genome and organization of its chromosomes.
Collapse
Affiliation(s)
- E Hribová
- Laboratory of Molecular Cytogenetics and Cytometry, Institute of Experimental Botany, Olomouc, Czech Republic
| | | | | | | | | |
Collapse
|
43
|
Lescot M, Piffanelli P, Ciampi AY, Ruiz M, Blanc G, Leebens-Mack J, da Silva FR, Santos CMR, D'Hont A, Garsmeur O, Vilarinhos AD, Kanamori H, Matsumoto T, Ronning CM, Cheung F, Haas BJ, Althoff R, Arbogast T, Hine E, Pappas GJ, Sasaki T, Souza MT, Miller RNG, Glaszmann JC, Town CD. Insights into the Musa genome: syntenic relationships to rice and between Musa species. BMC Genomics 2008; 9:58. [PMID: 18234080 PMCID: PMC2270835 DOI: 10.1186/1471-2164-9-58] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2007] [Accepted: 01/30/2008] [Indexed: 01/10/2023] Open
Abstract
Background Musa species (Zingiberaceae, Zingiberales) including bananas and plantains are collectively the fourth most important crop in developing countries. Knowledge concerning Musa genome structure and the origin of distinct cultivars has greatly increased over the last few years. Until now, however, no large-scale analyses of Musa genomic sequence have been conducted. This study compares genomic sequence in two Musa species with orthologous regions in the rice genome. Results We produced 1.4 Mb of Musa sequence from 13 BAC clones, annotated and analyzed them along with 4 previously sequenced BACs. The 443 predicted genes revealed that Zingiberales genes share GC content and distribution characteristics with eudicot and Poaceae genomes. Comparison with rice revealed microsynteny regions that have persisted since the divergence of the Commelinid orders Poales and Zingiberales at least 117 Mya. The previously hypothesized large-scale duplication event in the common ancestor of major cereal lineages within the Poaceae was verified. The divergence time distributions for Musa-Zingiber (Zingiberaceae, Zingiberales) orthologs and paralogs provide strong evidence for a large-scale duplication event in the Musa lineage after its divergence from the Zingiberaceae approximately 61 Mya. Comparisons of genomic regions from M. acuminata and M. balbisiana revealed highly conserved genome structure, and indicated that these genomes diverged circa 4.6 Mya. Conclusion These results point to the utility of comparative analyses between distantly-related monocot species such as rice and Musa for improving our understanding of monocot genome evolution. Sequencing the genome of M. acuminata would provide a strong foundation for comparative genomics in the monocots. In addition a genome sequence would aid genomic and genetic analyses of cultivated Musa polyploid genotypes in research aimed at localizing and cloning genes controlling important agronomic traits for breeding purposes.
Collapse
Affiliation(s)
- Magali Lescot
- French Agricultural Research Center for International Development, UMR 1096, Avenue Agropolis, TA40/03, FR-34398, Montpellier, Cedex 5, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Chen ZJ, Scheffler BE, Dennis E, Triplett BA, Zhang T, Guo W, Chen X, Stelly DM, Rabinowicz PD, Town CD, Arioli T, Brubaker C, Cantrell RG, Lacape JM, Ulloa M, Chee P, Gingle AR, Haigler CH, Percy R, Saha S, Wilkins T, Wright RJ, Van Deynze A, Zhu Y, Yu S, Abdurakhmonov I, Katageri I, Kumar PA, Mehboob-Ur-Rahman, Zafar Y, Yu JZ, Kohel RJ, Wendel JF, Paterson AH. Toward sequencing cotton (Gossypium) genomes. Plant Physiol 2007; 145:1303-10. [PMID: 18056866 PMCID: PMC2151711 DOI: 10.1104/pp.107.107672] [Citation(s) in RCA: 177] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
45
|
Febrer M, Cheung F, Town CD, Cannon SB, Young ND, Abberton MT, Jenkins G, Milbourne D. Construction, characterization, and preliminary BAC-end sequencing analysis of a bacterial artificial chromosome library of white clover (Trifolium repens L.). Genome 2007; 50:412-21. [PMID: 17546099 DOI: 10.1139/g07-013] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
White clover (Trifolium repens L.) is a forage legume widely used in combination with grass in pastures because of its ability to fix nitrogen. We have constructed a bacterial artificial chromosome (BAC) library of an advanced breeding line of white clover. The library contains 37 248 clones with an average insert size of approximately 85 kb, representing an approximate 3-fold coverage of the white clover genome based on an estimated genome size of 960 Mb. The BAC library was pooled and screened by polymerase chain reaction (PCR) amplification using both white clover microsatellites and PCR-based markers derived from Medicago truncatula, resulting in an average of 6 hits per marker; this supports the estimated 3-fold genome coverage in this allotetraploid species. PCR-based screening of 766 clones with a multiplex set of chloroplast primers showed that only 0.5% of BAC clones contained chloroplast-derived inserts. The library was further evaluated by sequencing both ends of 724 of the clover BACs. These were analysed with respect to their sequence content and their homology to the contents of a range of plant gene, expressed sequence tag, and repeat element databases. Forty-three microsatellites were discovered in the BAC-end sequences (BESs) and investigated as potential genetic markers in white clover. The BESs were also compared with the partially sequenced genome of the model legume M. truncatula with the specific intention of identifying putative comparative-tile BACs, which represent potential regions of microsynteny between the 2 species; 14 such BACs were discovered. The results suggest that a large-scale BAC-end sequencing strategy has the potential to anchor a significant proportion of the genome of white clover onto the gene-space sequence of M. truncatula.
Collapse
Affiliation(s)
- Melanie Febrer
- Teagasc, Crops Research Centre, Oak Park, Carlow, Ireland
| | | | | | | | | | | | | | | |
Collapse
|
46
|
Silverstein KAT, Moskal WA, Wu HC, Underwood BA, Graham MA, Town CD, VandenBosch KA. Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants. Plant J 2007; 51:262-80. [PMID: 17565583 DOI: 10.1111/j.1365-313x.2007.03136.x] [Citation(s) in RCA: 301] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Multicellular organisms produce small cysteine-rich antimicrobial peptides as an innate defense against pathogens. While defensins, a well-known class of such peptides, are common among eukaryotes, there are other classes restricted to the plant kingdom. These include thionins, lipid transfer proteins and snakins. In earlier work, we identified several divergent classes of small putatively secreted cysteine-rich peptides (CRPs) in legumes [Graham et al. (2004)Plant Physiol. 135, 1179-97]. Here, we built sequence motif models for each of these classes of peptides, and iteratively searched for related sequences within the comprehensive UniProt protein dataset, the Institute for Genomic Research's 33 plant gene indices, and the entire genomes of the model dicot, Arabidopsis thaliana, and the model monocot and crop species, Oryza sativa (rice). Using this search strategy, we identified approximately 13,000 plant genes encoding peptides with common features: (i) an N-terminal signal peptide, (ii) a small divergent charged or polar mature peptide with conserved cysteines, (iii) a similar intron/exon structure, (iv) spatial clustering in the genomes studied, and (v) overrepresentation in expressed sequences from reproductive structures of specific taxa. The identified genes include classes of defensins, thionins, lipid transfer proteins, and snakins, plus other protease inhibitors, pollen allergens, and uncharacterized gene families. We estimate that these classes of genes account for approximately 2-3% of the gene repertoire of each model species. Although 24% of the genes identified were not annotated in the latest Arabidopsis genome releases (TIGR5, TAIR6), we confirmed expression via RT-PCR for 59% of the sequences attempted. These findings highlight limitations in current annotation procedures for small divergent peptide classes.
Collapse
|
47
|
Abstract
BACKGROUND Musa species contain the fourth most important crop in developing countries. Here, we report the analysis of 6,252 BAC end-sequences, in order to view the sequence composition of the Musa acuminata genome in a cost effective and efficient manner. RESULTS BAC end sequencing generated 6,252 reads representing 4,420,944 bp, including 2,979 clone pairs with an average read length after cleaning and filtering of 707 bp. All sequences have been submitted to GenBank, with the accession numbers DX451975-DX458350. The BAC end-sequences, were searched against several databases and significant homology was found to mitochondria and chloroplast (2.6%), transposons and repetitive sequences (36%) and proteins (11%). Functional interpretation of the protein matches was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of categories. From protein matching regions of Musa BAC end-sequences, it was determined that the GC content of coding regions was 47%. Where protein matches encompassed a start codon, GC content as a function of position (5' to 3') across 129 bp sliding windows generates a "rice-like" gradient. A total of 352 potential SSR markers were discovered. The most abundant simple sequence repeats in four size categories were AT-rich. After filtering mitochondria and chloroplast matches, thousands of BAC end-sequences had a significant BLASTN match to the Oryza sativa and Arabidopsis genome sequence. Of these, a small number of BAC end-sequence pairs were shown to map to neighboring regions of the Oryza sativa genome representing regions of potential microsynteny. CONCLUSION Database searches with the BAC end-sequences and ab initio analysis identified those reads likely to contain transposons, repeat sequences, proteins and simple sequence repeats. Approximately 600 BAC end-sequences contained protein sequences that were not found in the existing available Musa expressed sequence tags, repeat or transposon databases. In addition, gene statistics, GC content and profile could also be estimated based on the region matching the top protein hit. A small number of BAC end pair sequences can be mapped to neighboring regions of the Oryza sativa representing regions of potential microsynteny. These results suggest that a large-scale BAC end sequencing strategy has the potential to anchor a small proportion of the genome of Musa acuminata to the genomes of Oryza sativa and possibly Arabidopsis.
Collapse
Affiliation(s)
- Foo Cheung
- J Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
| | - Christopher D Town
- J Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850 USA
| |
Collapse
|
48
|
Udvardi MK, Kakar K, Wandrey M, Montanari O, Murray J, Andriankaja A, Zhang JY, Benedito V, Hofer JMI, Chueng F, Town CD. Legume transcription factors: global regulators of plant development and response to the environment. Plant Physiol 2007; 144:538-49. [PMID: 17556517 PMCID: PMC1914172 DOI: 10.1104/pp.107.098061] [Citation(s) in RCA: 161] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2007] [Accepted: 03/24/2007] [Indexed: 05/15/2023]
|
49
|
Hernández G, Ramírez M, Valdés-López O, Tesfaye M, Graham MA, Czechowski T, Schlereth A, Wandrey M, Erban A, Cheung F, Wu HC, Lara M, Town CD, Kopka J, Udvardi MK, Vance CP. Phosphorus stress in common bean: root transcript and metabolic responses. Plant Physiol 2007; 144:752-67. [PMID: 17449651 PMCID: PMC1914166 DOI: 10.1104/pp.107.096958] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2007] [Accepted: 04/09/2007] [Indexed: 05/15/2023]
Abstract
Phosphorus (P) is an essential element for plant growth. Crop production of common bean (Phaseolus vulgaris), the most important legume for human consumption, is often limited by low P in the soil. Functional genomics were used to investigate global gene expression and metabolic responses of bean plants grown under P-deficient and P-sufficient conditions. P-deficient plants showed enhanced root to shoot ratio accompanied by reduced leaf area and net photosynthesis rates. Transcript profiling was performed through hybridization of nylon filter arrays spotted with cDNAs of 2,212 unigenes from a P deficiency root cDNA library. A total of 126 genes, representing different functional categories, showed significant differential expression in response to P: 62% of these were induced in P-deficient roots. A set of 372 bean transcription factor (TF) genes, coding for proteins with Inter-Pro domains characteristic or diagnostic for TF, were identified from The Institute of Genomic Research/Dana Farber Cancer Institute Common Bean Gene Index. Using real-time reverse transcription-polymerase chain reaction analysis, 17 TF genes were differentially expressed in P-deficient roots; four TF genes, including MYB TFs, were induced. Nonbiased metabolite profiling was used to assess the degree to which changes in gene expression in P-deficient roots affect overall metabolism. Stress-related metabolites such as polyols accumulated in P-deficient roots as well as sugars, which are known to be essential for P stress gene induction. Candidate genes have been identified that may contribute to root adaptation to P deficiency and be useful for improvement of common bean.
Collapse
Affiliation(s)
- Georgina Hernández
- Centro de Ciencias Genómicas-Universidad Nacional Autónoma de México, 66210 Cuernavaca, Mor., Mexico.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Liu J, Maldonado-Mendoza I, Lopez-Meyer M, Cheung F, Town CD, Harrison MJ. Arbuscular mycorrhizal symbiosis is accompanied by local and systemic alterations in gene expression and an increase in disease resistance in the shoots. Plant J 2007; 50:529-44. [PMID: 17419842 DOI: 10.1111/j.1365-313x.2007.03069.x] [Citation(s) in RCA: 241] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
In natural ecosystems, the roots of many plants exist in association with arbuscular mycorrhizal (AM) fungi, and the resulting symbiosis has profound effects on the plant. The most frequently documented response is an increase in phosphorus nutrition; however, other effects have been noted, including increased resistance to abiotic and biotic stresses. Here we used a 16,000-feature oligonucleotide array and real-time quantitative RT-PCR to explore transcriptional changes triggered in Medicago truncatula roots and shoots as a result of AM symbiosis. By controlling the experimental conditions, phosphorus-related effects were minimized, and both local and systemic transcriptional responses to the AM fungus were revealed. The transcriptional response of the roots and shoots differed in both the magnitude of gene induction and the predicted functional categories of the mycorrhiza-regulated genes. In the roots, genes regulated in response to three different AM fungi were identified, and, through split-root experiments, an additional layer of regulation, in the colonized or non-colonized sections of the mycorrhizal root system, was uncovered. Transcript profiles of the shoots of mycorrhizal plants indicated the systemic induction of many genes predicted to be involved in stress or defense responses, and suggested that mycorrhizal plants might display enhanced disease resistance. Experimental evidence supports this prediction, and mycorrhizal M. truncatula plants showed increased resistance to a virulent bacterial pathogen, Xanthomonas campestris. Thus, the symbiosis is accompanied by a complex pattern of local and systemic changes in gene expression, including the induction of a functional defense response.
Collapse
Affiliation(s)
- Jinyuan Liu
- Boyce Thompson Institute for Plant Research, Cornell University, Tower Road, Ithaca, NY14853, USA
| | | | | | | | | | | |
Collapse
|