1
|
Prajapati MR, Singh J, Kumar P, Dixit R. De novo transcriptome analysis and identification of defensive genes in garlic (Allium sativum L.) using high-throughput sequencing. J Genet Eng Biotechnol 2023; 21:56. [PMID: 37162611 PMCID: PMC10172436 DOI: 10.1186/s43141-023-00499-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 04/20/2023] [Indexed: 05/11/2023]
Abstract
BACKGROUND Garlic (Allium sativum L.) is the second most widely cultivated Allium which is mainly grown in temperate regions and used as a flavoring agent in a wide variety of foods. Garlic contains various bioactive compounds whose metabolic pathways, plant-pathogen interactions, defensive genes, identify interaction networks, and functional genomics were not previously predicted in the garlic at the genomic level. To address this issue, we constructed two garlic Illumina 2000 libraries from tissues of garlic clove and leaf. RESULTS Approximately 43 million 125 bp paired-end reads were obtained in the two libraries. A total of 239,973 contigs were generated by de novo assembly of both samples and were compared with the sequences in the NCBI non-redundant protein database (Nr). In total, 42% of contigs were matched to known proteins in public databases including Nr, Gene Ontology (GO), and Cluster Orthologous Gene Database (COG), and then, contigs were mapped to 138 via functional annotation against the Kyoto Encyclopedia of Genes and Genomes pathway database (KEGG). In addition, a number of regulatory genes including the CCHC (Zn) family, followed by WD40, bromodomain, bZIP, AP2-EREBP, BED-type (Zn) proteins, and defense response proteins related to different conserved domains, such as RGA3, NBS-LRR, TIR-NBS-LRR, LRR, NBS-ARC, and CC-NBS-LRR were discovered based on the transcriptome dataset. We compared the ortholog gene family of the A. sativum transcriptome to A. thaliana, O. sativa, and Z. mays and found that 12,077 orthologous gene families are specific to A. sativum L. Furthermore, we identified genes involved in plant defense mechanisms, their protein-protein interaction network, and plant-pathogen interaction pathways. CONCLUSIONS Our study contains an extensive sequencing and functional gene-annotation analysis of A. sativum L. The findings provide insights into the molecular basis of TFs, defensive genes, and a reference for future studies on the genetics and breeding of A. sativum L.
Collapse
Affiliation(s)
- Malyaj R Prajapati
- Division of Microbial and Environmental Biotechnology, College of Biotechnology, Sardar Vallabhbhai Patel University of Agriculture and Technology, Meerut, Uttar Pradesh, 250110, India
| | - Jitender Singh
- Department of Microbiology, Chaudhary Charan Singh University, Meerut, Uttar Pradesh, 250004, India.
| | - Pankaj Kumar
- Division of Microbial and Environmental Biotechnology, College of Biotechnology, Sardar Vallabhbhai Patel University of Agriculture and Technology, Meerut, Uttar Pradesh, 250110, India.
| | - Rekha Dixit
- Division of Microbial and Environmental Biotechnology, College of Biotechnology, Sardar Vallabhbhai Patel University of Agriculture and Technology, Meerut, Uttar Pradesh, 250110, India
| |
Collapse
|
2
|
Szukala A, Lovegrove‐Walsh J, Luqman H, Fior S, Wolfe TM, Frajman B, Schönswetter P, Paun O. Polygenic routes lead to parallel altitudinal adaptation in Heliosperma pusillum (Caryophyllaceae). Mol Ecol 2023; 32:1832-1847. [PMID: 35152499 PMCID: PMC10946620 DOI: 10.1111/mec.16393] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 12/29/2021] [Accepted: 02/02/2022] [Indexed: 11/28/2022]
Abstract
Understanding how organisms adapt to the environment is a major goal of modern biology. Parallel evolution-the independent evolution of similar phenotypes in different populations-provides a powerful framework to investigate the evolutionary potential of populations, the constraints of evolution, its repeatability and therefore its predictability. Here, we quantified the degree of gene expression and functional parallelism across replicated ecotype formation in Heliosperma pusillum (Caryophyllaceae), and gained insights into the architecture of adaptive traits. Population structure analyses and demographic modelling support a previously formulated hypothesis of parallel polytopic divergence of montane and alpine ecotypes. We detect a large proportion of differentially expressed genes (DEGs) underlying divergence within each replicate ecotype pair, with a strikingly low number of shared DEGs across pairs. Functional enrichment of DEGs reveals that the traits affected by significant expression divergence are largely consistent across ecotype pairs, in strong contrast to the nonshared genetic basis. The remarkable redundancy of differential gene expression indicates a polygenic architecture for the diverged adaptive traits. We conclude that polygenic traits appear key to opening multiple routes for adaptation, widening the adaptive potential of organisms.
Collapse
Affiliation(s)
- Aglaia Szukala
- Department of Botany and Biodiversity ResearchUniversity of ViennaViennaAustria
- Vienna Graduate School of Population GeneticsViennaAustria
| | | | - Hirzi Luqman
- Department of Environmental System ScienceETH ZürichZürichSwitzerland
| | - Simone Fior
- Department of Environmental System ScienceETH ZürichZürichSwitzerland
| | - Thomas M. Wolfe
- Institute for Forest EntomologyForest Pathology and Forest Protection, BOKUViennaAustria
| | - Božo Frajman
- Department of BotanyUniversity of InnsbruckInnsbruckAustria
| | | | - Ovidiu Paun
- Department of Botany and Biodiversity ResearchUniversity of ViennaViennaAustria
| |
Collapse
|
3
|
Williams AM, Itgen MW, Broz AK, Carter OG, Sloan DB. Long-read transcriptome and other genomic resources for the angiosperm Silene noctiflora. G3 (BETHESDA, MD.) 2021; 11:jkab189. [PMID: 34849814 PMCID: PMC8496259 DOI: 10.1093/g3journal/jkab189] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 05/20/2021] [Indexed: 01/04/2023]
Abstract
The angiosperm genus Silene is a model system for several traits of ecological and evolutionary significance in plants, including breeding system and sex chromosome evolution, host-pathogen interactions, invasive species biology, heavy metal tolerance, and cytonuclear interactions. Despite its importance, genomic resources for this large genus of approximately 850 species are scarce, with only one published whole-genome sequence (from the dioecious species Silene latifolia). Here, we provide genomic and transcriptomic resources for a hermaphroditic representative of this genus (S. noctiflora), including a PacBio Iso-Seq transcriptome, which uses long-read, single-molecule sequencing technology to analyze full-length mRNA transcripts. Using these data, we have assembled and annotated high-quality full-length cDNA sequences for approximately 14,126 S. noctiflora genes and 25,317 isoforms. We demonstrated the utility of these data to distinguish between recent and highly similar gene duplicates by identifying novel paralogous genes in an essential protease complex. Furthermore, we provide a draft assembly for the approximately 2.7-Gb genome of this species, which is near the upper range of genome-size values reported for diploids in this genus and threefold larger than the 0.9-Gb genome of Silene conica, another species in the same subgenus. Karyotyping confirmed that S. noctiflora is a diploid, indicating that its large genome size is not due to polyploidization. These resources should facilitate further study and development of this genus as a model in plant ecology and evolution.
Collapse
Affiliation(s)
- Alissa M Williams
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
- Cell and Molecular Biology Graduate Program, Colorado State University, Fort Collins, CO 80523, USA
| | - Michael W Itgen
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
| | - Amanda K Broz
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
| | - Olivia G Carter
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA
| |
Collapse
|
4
|
Krüger M, Abeyawardana OAJ, Krüger C, Juříček M, Štorchová H. Differentially Expressed Genes Shared by Two Distinct Cytoplasmic Male Sterility (CMS) Types of Silene vulgaris Suggest the Importance of Oxidative Stress in Pollen Abortion. Cells 2020; 9:cells9122700. [PMID: 33339225 PMCID: PMC7766179 DOI: 10.3390/cells9122700] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Accepted: 12/14/2020] [Indexed: 11/25/2022] Open
Abstract
Cytoplasmic male sterility (CMS), encoded by the interacting mitochondrial and nuclear genes, causes pollen abortion or non-viability. CMS is widely used in agriculture and extensively studied in crops. Much less is known about CMS in wild species. We performed a comparative transcriptomic analysis of male sterile and fertile individuals of Silene vulgaris, a model plant for the study of gynodioecy, to reveal the genes responsible for pollen abortion in this species. We used RNA-seq datasets previously employed for the analysis of mitochondrial and plastid transcriptomes of female and hermaphrodite flower buds, making it possible to compare the transcriptomes derived from three genomes in the same RNA specimen. We assembled de novo transcriptomes for two haplotypes of S. vulgaris and identified differentially expressed genes between the females and hermaphrodites, associated with stress response or pollen development. The gene for alternative oxidase was downregulated in females. The genetic pathways controlling CMS in S. vulgaris are similar to those in crops. The high number of the differentially expressed nuclear genes contrasts with the uniformity of organellar transcriptomes across genders, which suggests these pathways are evolutionarily conserved and that selective mechanisms may shield organellar transcription against changes in the cytoplasmic transcriptome.
Collapse
Affiliation(s)
- Manuela Krüger
- Institute of Experimental Botany, Czech Academy of Sciences, Rozvojová 263, 16502 Prague, Czech Republic; (M.K.); (O.A.J.A.); (C.K.); (M.J.)
| | - Oushadee A. J. Abeyawardana
- Institute of Experimental Botany, Czech Academy of Sciences, Rozvojová 263, 16502 Prague, Czech Republic; (M.K.); (O.A.J.A.); (C.K.); (M.J.)
- Department of Horticulture, Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, Kamýcká 129, 16500 Prague 6-Suchdol, Czech Republic
| | - Claudia Krüger
- Institute of Experimental Botany, Czech Academy of Sciences, Rozvojová 263, 16502 Prague, Czech Republic; (M.K.); (O.A.J.A.); (C.K.); (M.J.)
| | - Miloslav Juříček
- Institute of Experimental Botany, Czech Academy of Sciences, Rozvojová 263, 16502 Prague, Czech Republic; (M.K.); (O.A.J.A.); (C.K.); (M.J.)
| | - Helena Štorchová
- Institute of Experimental Botany, Czech Academy of Sciences, Rozvojová 263, 16502 Prague, Czech Republic; (M.K.); (O.A.J.A.); (C.K.); (M.J.)
- Correspondence: ; Tel.: +420-225-106-828
| |
Collapse
|
5
|
Park S, Son S, Shin M, Fujii N, Hoshino T, Park S. Transcriptome-wide mining, characterization, and development of microsatellite markers in Lychnis kiusiana (Caryophyllaceae). BMC PLANT BIOLOGY 2019; 19:14. [PMID: 30621589 PMCID: PMC6325733 DOI: 10.1186/s12870-018-1621-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2017] [Accepted: 12/27/2018] [Indexed: 05/23/2023]
Abstract
BACKGROUND Lychnis kiusiana Makino is an endangered perennial herb native to wetland areas in Korea and Japan. Despite its conservational and evolutionary significance, population genetic resources are lacking for this species. Next-generation sequencing has been accepted as a rapid and cost-effective solution for the identification of microsatellite markers in nonmodel plants. RESULTS Using Illumina HiSeq 2000 sequencing technology, we assembled 67,498,600 reads into 91,900 contigs and identified 11,403 microsatellite repeat motifs in 9563 contigs. A total of 4510 microsatellite-containing transcripts had Gene Ontology (GO) annotations, and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis identified 124 pathways with significant scores. Many microsatellites in the L. kiusiana leaf transcriptome were linked to genes involved in the plant response to light intensity, salt stress, temperature stimulus, and nutrient and water deprivation. A total of 12,486 single-nucleotide polymorphisms (SNPs) were identified on transcripts harboring microsatellites. The analysis of nucleotide substitution rates for 2389 unigenes indicated that 39 genes were under strong positive selection. The primers of 6911 microsatellites were designed, and 40 of 50 selected primer pairs were consistently and successfully amplified from 51 individuals. Twenty-five of these were polymorphic, and the average number of alleles per SSR locus was 6.96, with a range from 2 to 15. The observed and expected heterozygosities ranged from 0.137 to 0.902 and 0.131 to 0.827, respectively, and locus-specific FIS estimates ranged from - 0.116 to 0.290. Eleven of the 25 primer pairs were successfully amplified in three additional species of Lychnis: 56% in L. wilfordii, 64% in L. cognata and 80% in L. fulgens. CONCLUSIONS The transcriptomic SSR markers of Lychnis kiusiana provide a valuable resource for understanding the population genetics, evolutionary history, and effective conservation management of this species. Furthermore, the identified microsatellite loci linked to the annotated genes should be useful for developing functional markers of L. kiusiana. The developed markers represent a potentially valuable source of transcriptomic SSR markers for population genetic analyses with moderate levels of cross-taxon portability.
Collapse
Affiliation(s)
- Seongjun Park
- Institute of Natural Science, Yeungnam University, Gyeongsan, Gyeongbuk 38541 South Korea
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongbuk 38541 South Korea
| | - Sungwon Son
- Plant Conservation Division, Korea National Arboretum, Pocheon, Gyeonggi 11186 South Korea
| | - Myungju Shin
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongbuk 38541 South Korea
| | - Noriyuki Fujii
- Department of Biological Science, Graduate School of Science and Technology, Kumamoto University, Chuo-ku, Kumamoto, 860-8555 Japan
| | - Takuji Hoshino
- Faculty of Biosphere-Geosphere Science, Okayama University of Science, Kita-ku, Okayama, 700-0005 Japan
| | - SeonJoo Park
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongbuk 38541 South Korea
| |
Collapse
|
6
|
Sun XY, Wang YH, Dong ZE, Wu HY, Chen PP, Xie Q. Identifying Differential Gene Expression in Wing Polymorphism of Adult Males of the Largest Water Strider: De novo Transcriptome Assembly for Gigantometra gigas (Hemiptera: Gerridae). JOURNAL OF INSECT SCIENCE (ONLINE) 2018; 18:5236978. [PMID: 30535417 PMCID: PMC6287054 DOI: 10.1093/jisesa/iey114] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Indexed: 05/25/2023]
Abstract
Wing polymorphism is common in a wide variety of insect species. However, few studies have reported on adaptations in the wing polymorphism of insects at molecular level, in particular for males. Thus, the adaptive mechanisms need to be explored. The remarkable variability in wing morphs of insects is well represented in the water striders (Hemiptera: Gerridae). Within this family, Gigantometra gigas (China, 1925), the largest water strider known worldwide, displays macropterous and apterous males. In the present study, we used de novo transcriptome assembly to obtain gene expression information and compared body and leg-component lengths of adult males in different wing morphs. The analyses in both gene expression and phenotype levels were used for exploring the adaptive mechanism in wing polymorphism of G. gigas. After checking, a series of highly expressed structural genes were found in macropterous morphs, which were related to the maintenance of flight muscles and the enhancement of flight capacity, whereas in the apterous morphs, the imaginal morphogenesis protein-Late 2 (Imp-L2), which might inhibit wing development and increase the body size of insects, was still highly expressed in the adult stage. Moreover, body and leg-component lengths were significantly larger in apterous than in macropterous morphs. The larger size of the apterous morphs and the differences in highly expressed genes between the two wing morphs consistently demonstrate the adaptive significance of wing polymorphism in G. gigas. These results shed light on the future loss-of-function research of wing polymorphism in G. gigas.
Collapse
Affiliation(s)
- Xiao-ya Sun
- Institute of Entomology, College of Life Sciences, Nankai University, Tianjin, China
- State Key Laboratory of Biocontrol, Sun Yat-sen University, Guangzhou, Guangdong, China
- Department of Ecology and Evolution, College of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Yan-hui Wang
- State Key Laboratory of Biocontrol, Sun Yat-sen University, Guangzhou, Guangdong, China
- Department of Ecology and Evolution, College of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Zhuo-er Dong
- Institute of Entomology, College of Life Sciences, Nankai University, Tianjin, China
| | - Hao-yang Wu
- State Key Laboratory of Biocontrol, Sun Yat-sen University, Guangzhou, Guangdong, China
- Department of Ecology and Evolution, College of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Ping-ping Chen
- National Reference Centre (NRC), Netherlands Plant Protection Organization (NPPO), Wageningen, the Netherlands
| | - Qiang Xie
- State Key Laboratory of Biocontrol, Sun Yat-sen University, Guangzhou, Guangdong, China
- Department of Ecology and Evolution, College of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| |
Collapse
|
7
|
Edera AA, Gandini CL, Sanchez-Puerta MV. Towards a comprehensive picture of C-to-U RNA editing sites in angiosperm mitochondria. PLANT MOLECULAR BIOLOGY 2018; 97:215-231. [PMID: 29761268 DOI: 10.1007/s11103-018-0734-9] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Accepted: 05/02/2018] [Indexed: 06/08/2023]
Abstract
Our understanding of the dynamic and evolution of RNA editing in angiosperms is in part limited by the few editing sites identified to date. This study identified 10,217 editing sites from 17 diverse angiosperms. Our analyses confirmed the universality of certain features of RNA editing, and offer new evidence behind the loss of editing sites in angiosperms. RNA editing is a post-transcriptional process that substitutes cytidines (C) for uridines (U) in organellar transcripts of angiosperms. These substitutions mostly take place in mitochondrial messenger RNAs at specific positions called editing sites. By means of publicly available RNA-seq data, this study identified 10,217 editing sites in mitochondrial protein-coding genes of 17 diverse angiosperms. Even though other types of mismatches were also identified, we did not find evidence of non-canonical editing processes. The results showed an uneven distribution of editing sites among species, genes, and codon positions. The analyses revealed that editing sites were conserved across angiosperms but there were some species-specific sites. Non-synonymous editing sites were particularly highly conserved (~ 80%) across the plant species and were efficiently edited (80% editing extent). In contrast, editing sites at third codon positions were poorly conserved (~ 30%) and only partially edited (~ 40% editing extent). We found that the loss of editing sites along angiosperm evolution is mainly occurring by replacing editing sites with thymidines, instead of a degradation of the editing recognition motif around editing sites. Consecutive and highly conserved editing sites had been replaced by thymidines as result of retroprocessing, by which edited transcripts are reverse transcribed to cDNA and then integrated into the genome by homologous recombination. This phenomenon was more pronounced in eudicots, and in the gene cox1. These results suggest that retroprocessing is a widespread driving force underlying the loss of editing sites in angiosperm mitochondria.
Collapse
Affiliation(s)
- Alejandro A Edera
- IBAM, Facultad de Ciencias Agrarias, CONICET, Universidad Nacional de Cuyo, M5528AHB, Chacras de Coria, Argentina.
| | - Carolina L Gandini
- IBAM, Facultad de Ciencias Agrarias, CONICET, Universidad Nacional de Cuyo, M5528AHB, Chacras de Coria, Argentina
| | - M Virginia Sanchez-Puerta
- IBAM, Facultad de Ciencias Agrarias, CONICET, Universidad Nacional de Cuyo, M5528AHB, Chacras de Coria, Argentina
- Facultad de Ciencias Exactas y Naturales, Universidad Nacional de Cuyo, 5500, Mendoza, Argentina
| |
Collapse
|
8
|
García-Gonzalo P, Del Real AEP, Lobo MC, Pérez-Sanz A. Different genotypes of Silene vulgaris (Moench) Garcke grown on chromium-contaminated soils influence root organic acid composition and rhizosphere bacterial communities. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2017; 24:25713-25724. [PMID: 27151239 DOI: 10.1007/s11356-016-6667-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 04/10/2016] [Indexed: 05/13/2023]
Abstract
Plant-microbe interactions are considered to be important processes determining the efficiency of phytoremediation of heavy metal-contaminated soils. However, relatively little is known about how these interactions are influenced by chromium (Cr) contamination. The effect of Cr stress on metal uptake, root organic acid composition, and rhizosphere bacterial communities was studied using two genotypes of the metallophyte Silene vulgaris, which have shown different tolerance to Cr(VI). The results indicated that root biomass and shoot biomass were not significantly influenced by Cr treatment, but metal uptake in shoots and roots was significantly impacted by the genotype. Principal component analyses (PCA) showed that variation in organic acids oxalic, citric, malic, formic, lactic, acetic, and succinic differed between genotypes. Changes in root organic acid contents in response to Cr revealed a significant increase of oxalic acid in genotype SV-21. The denaturing gradient gel electrophoresis (DGGE) cluster analysis showed that the community structure (determined by PCR-DGGE) was affected by plant genotype and, to a lesser extent, by Cr contamination, the first being the most influential factor shaping the rhizosphere microbiome. Under Cr pollution, a shift in the relative abundance of specific taxa was found and dominant phylotypes were identified as Variovorax in SV-21 and Chitinophaga niastensis, Pontibacter sp., and Ramlibacter sp. in SV-38. These results provided the basis for further studies aimed at the combined use of plants and soil microorganisms in the remediation of Cr-polluted soils.
Collapse
Affiliation(s)
- P García-Gonzalo
- Department of Agro-Environmental Research, IMIDRA, Alcalá de Henares, 28800, Madrid, Spain.
| | - A E Pradas Del Real
- Department of Agro-Environmental Research, IMIDRA, Alcalá de Henares, 28800, Madrid, Spain
- Geochimie 4D Group, ISTerre, Université Grenoble I, 38041, Grenoble Cedex 9, France
| | - M C Lobo
- Department of Agro-Environmental Research, IMIDRA, Alcalá de Henares, 28800, Madrid, Spain
| | - A Pérez-Sanz
- Department of Agro-Environmental Research, IMIDRA, Alcalá de Henares, 28800, Madrid, Spain
- Department of Earth Sciences, Natural History Museum, Cromwell Road, London, SW7 5BD, UK
| |
Collapse
|
9
|
Trucchi E, Frajman B, Haverkamp THA, Schönswetter P, Paun O. Genomic analyses suggest parallel ecological divergence in Heliosperma pusillum (Caryophyllaceae). THE NEW PHYTOLOGIST 2017; 216:267-278. [PMID: 28782803 PMCID: PMC5601199 DOI: 10.1111/nph.14722] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Accepted: 06/20/2017] [Indexed: 05/24/2023]
Abstract
The mosaic distribution of interbreeding taxa with contrasting ecology and morphology offers an opportunity to study microevolutionary dynamics during ecological divergence. We investigate here the evolutionary history of an alpine and a montane ecotype of Heliosperma pusillum (Caryophyllaceae) in the south-eastern Alps. From six pairs of geographically close populations of the two ecotypes (120 individuals) we obtained a high-coverage restriction site associated DNA sequencing (RADseq) dataset that was used for demographic inference to test the hypothesis of parallel evolution of the two ecotypes. The data are consistent with repeated ecological divergence in H. pusillum, uncovering up to five polytopic origins of one ecotype from the other. A complex evolutionary history is evidenced, with local isolation-with-migration in two population pairs and intra-ecotype migration in two others. In all cases, the time of divergence or secondary contact was inferred as postglacial. A metagenomic analysis on exogenous contaminant RAD sequences suggests divergent microbial communities between the ecotypes. The lack of shared genomic regions of high divergence across population pairs illustrates the action of drift and/or local selection in shaping genetic divergence across repeated cases of ecological divergence.
Collapse
Affiliation(s)
- Emiliano Trucchi
- Department of Botany and Biodiversity ResearchUniversity of ViennaRennweg 14Vienna1030Austria
- Department of Life Sciences and BiotechnologyUniversity of FerraraVia L. Borsari 46Ferrara44121Italy
| | - Božo Frajman
- Institute of BotanyUniversity of InnsbruckSternwartestraße 15Innsbruck6020Austria
| | - Thomas H. A. Haverkamp
- Department of BiosciencesCentre for Ecological and Evolutionary SynthesisUniversity of OsloPO Box, 1066 BlindernOslo0316Norway
| | - Peter Schönswetter
- Institute of BotanyUniversity of InnsbruckSternwartestraße 15Innsbruck6020Austria
| | - Ovidiu Paun
- Department of Botany and Biodiversity ResearchUniversity of ViennaRennweg 14Vienna1030Austria
| |
Collapse
|
10
|
Evaluation of reference genes for reverse transcription quantitative real-time PCR (RT-qPCR) studies in Silene vulgaris considering the method of cDNA preparation. PLoS One 2017; 12:e0183470. [PMID: 28817728 PMCID: PMC5560574 DOI: 10.1371/journal.pone.0183470] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 08/04/2017] [Indexed: 01/05/2023] Open
Abstract
Accurate gene expression measurements are essential in studies of both crop and wild plants. Reverse transcription quantitative real-time PCR (RT-qPCR) has become a preferred tool for gene expression estimation. A selection of suitable reference genes for the normalization of transcript levels is an essential prerequisite of accurate RT-qPCR results. We evaluated the expression stability of eight candidate reference genes across roots, leaves, flower buds and pollen of Silene vulgaris (bladder campion), a model plant for the study of gynodioecy. As random priming of cDNA is recommended for the study of organellar transcripts and poly(A) selection is indicated for nuclear transcripts, we estimated gene expression with both random-primed and oligo(dT)-primed cDNA. Accordingly, we determined reference genes that perform well with oligo(dT)- and random-primed cDNA, making it possible to estimate levels of nucleus-derived transcripts in the same cDNA samples as used for organellar transcripts, a key benefit in studies of cyto-nuclear interactions. Gene expression variance was estimated by RefFinder, which integrates four different analytical tools. The SvACT and SvGAPDH genes were the most stable candidates across various organs of S. vulgaris, regardless of whether pollen was included or not.
Collapse
|
11
|
Bazakos C, Hanemian M, Trontin C, Jiménez-Gómez JM, Loudet O. New Strategies and Tools in Quantitative Genetics: How to Go from the Phenotype to the Genotype. ANNUAL REVIEW OF PLANT BIOLOGY 2017; 68:435-455. [PMID: 28226236 DOI: 10.1146/annurev-arplant-042916-040820] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Quantitative genetics has a long history in plants: It has been used to study specific biological processes, identify the factors important for trait evolution, and breed new crop varieties. These classical approaches to quantitative trait locus mapping have naturally improved with technology. In this review, we show how quantitative genetics has evolved recently in plants and how new developments in phenotyping, population generation, sequencing, gene manipulation, and statistics are rejuvenating both the classical linkage mapping approaches (for example, through nested association mapping) as well as the more recently developed genome-wide association studies. These strategies are complementary in most instances, and indeed, one is often used to confirm the results of the other. Despite significant advances, an emerging trend is that the outcome and efficiency of the different approaches depend greatly on the genetic architecture of the trait in the genetic material under study.
Collapse
Affiliation(s)
- Christos Bazakos
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, 78026 Versailles Cedex, France;
| | - Mathieu Hanemian
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, 78026 Versailles Cedex, France;
| | - Charlotte Trontin
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, 78026 Versailles Cedex, France;
| | - José M Jiménez-Gómez
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, 78026 Versailles Cedex, France;
| | - Olivier Loudet
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, 78026 Versailles Cedex, France;
| |
Collapse
|
12
|
Slavokhotova AA, Shelenkov AA, Korostyleva TV, Rogozhin EA, Melnikova NV, Kudryavtseva AV, Odintsova TI. Defense peptide repertoire of Stellaria media predicted by high throughput next generation sequencing. Biochimie 2016; 135:15-27. [PMID: 28038935 DOI: 10.1016/j.biochi.2016.12.017] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 12/26/2016] [Indexed: 12/16/2022]
Abstract
Being perfectly adapted to diverse environments, chickweed (Stellaria media (L.) Vill), a ubiquitous garden weed, grows widely in Europe and North America. As opposed to the model plants, many weeds, and S. media in particular, have been poorly studied, although they are likely to contain promising components of immunity and novel resistance genes. In this study, for the first time RNA-seq analysis of healthy and infected with Fusarium oxysporum chickweed seedlings, as well as de novo transcriptome assembly and annotation, are presented. Note, this research is focused on antimicrobial peptides (AMPs), the major components of plant immune system. Using custom software developed earlier, 145 unique putative AMPs (pAMPs) including defensins, thionins, hevein-like peptides, snakins, alpha-hairpinins, LTPs, and cysteine-rich peptides with novel cysteine motifs were predicted. Furthermore, changes in AMP expression profile in response to fungal infection were traced. In addition, the comparison of chickweed AMP repertoire with those of other Caryophyllaceae plants whose transcriptomes are presently available is made. As a result, alpha-hairpinins and hevein-like peptides which display characteristic modular structure appear to be specific AMPs distinguishing S. media from Dianthus caryophyllus, Silene vulgaris, and Silene latifolia. Finally, revealing several AMPs with proven antimicrobial activity gives opportunity to conclude that the presented method of AMP repertoire analysis reveals highly active AMPs playing vital role in plant immunity.
Collapse
Affiliation(s)
- Anna A Slavokhotova
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 3 Gubkina Str., 119991 Moscow, Russian Federation.
| | - Andrey A Shelenkov
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 3 Gubkina Str., 119991 Moscow, Russian Federation.
| | - Tatyana V Korostyleva
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 3 Gubkina Str., 119991 Moscow, Russian Federation.
| | - Eugene A Rogozhin
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 16/10 Miklukho-Maklaya Str., 117997 Moscow, Russian Federation.
| | - Nataliya V Melnikova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilova Str., Moscow 119991, Russian Federation.
| | - Anna V Kudryavtseva
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilova Str., Moscow 119991, Russian Federation.
| | - Tatyana I Odintsova
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 3 Gubkina Str., 119991 Moscow, Russian Federation.
| |
Collapse
|
13
|
Liu Q, Sun Y, Chen J, Li P, Li C, Niu G, Jiang L. Transcriptome analysis revealed the dynamic oil accumulation in Symplocos paniculata fruit. BMC Genomics 2016; 17:929. [PMID: 27852215 PMCID: PMC5112726 DOI: 10.1186/s12864-016-3275-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 11/09/2016] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Symplocos paniculata, asiatic sweetleaf or sapphire berry, is a widespread shrub or small tree from Symplocaceae with high oil content and excellent fatty acid composition in fruit. It has been used as feedstocks for biodiesel and cooking oil production in China. Little transcriptome information is available on the regulatory molecular mechanism of oil accumulation at different fruit development stages. RESULTS The transcriptome at four different stages of fruit development (10, 80,140, and 170 days after flowering) of S. paniculata were analyzed. Approximately 28 million high quality clean reads were generated. These reads were trimmed and assembled into 182,904 non-redundant putative transcripts with a mean length of 592.91 bp and N50 length of 785 bp, respectively. Based on the functional annotation through Basic Local Alignment Search Tool (BLAST) with public protein database, the key enzymes involved in lipid metabolism were identified, and a schematic diagram of the pathway and temporal expression patterns of lipid metabolism was established. About 13,939 differentially expressed unigenes (DEGs) were screened out using differentially expressed sequencing (DESeq) method. The transcriptional regulatory patterns of the identified enzymes were highly related to the dynamic oil accumulation along with the fruit development of S. paniculata. In addition, quantitative real-time PCR (qRT-PCR) of six vital genes was significantly correlated with DESeq data. CONCLUSIONS The transcriptome sequences obtained and deposited in NCBI would enrich the public database and provide an unprecedented resource for the discovery of the genes associated with lipid metabolism pathway in S. paniculata. Results in this study will lay the foundation for exploring transcriptional regulatory profiles, elucidating molecular regulatory mechanisms, and accelerating genetic engineering process to improve the yield and quality of seed oil of S. paniculata.
Collapse
Affiliation(s)
- Qiang Liu
- Central South University of Forestry and Technology, 498 South Shaoshan Rd., Changsha, Hunan, 410004, China.,Texas A&M AgriLife Research Center at El Paso, 1380 A&M Circle, El Paso, TX, 79927, USA
| | - Youping Sun
- Texas A&M AgriLife Research Center at El Paso, 1380 A&M Circle, El Paso, TX, 79927, USA
| | - Jinzheng Chen
- Central South University of Forestry and Technology, 498 South Shaoshan Rd., Changsha, Hunan, 410004, China.,Hunan Academy of Forestry, 658 South Shaoshan Rd., Changsha, Hunan, 410004, China
| | - Peiwang Li
- Hunan Academy of Forestry, 658 South Shaoshan Rd., Changsha, Hunan, 410004, China
| | - Changzhu Li
- Hunan Academy of Forestry, 658 South Shaoshan Rd., Changsha, Hunan, 410004, China
| | - Genhua Niu
- Texas A&M AgriLife Research Center at El Paso, 1380 A&M Circle, El Paso, TX, 79927, USA
| | - Lijuan Jiang
- Central South University of Forestry and Technology, 498 South Shaoshan Rd., Changsha, Hunan, 410004, China.
| |
Collapse
|
14
|
Durai DA, Schulz MH. Informed kmer selection for de novo transcriptome assembly. Bioinformatics 2016; 32:1670-7. [PMID: 27153653 PMCID: PMC4892416 DOI: 10.1093/bioinformatics/btw217] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Accepted: 04/17/2016] [Indexed: 11/23/2022] Open
Abstract
Motivation:De novo transcriptome assembly is an integral part for many RNA-seq workflows. Common applications include sequencing of non-model organisms, cancer or meta transcriptomes. Most de novo transcriptome assemblers use the de Bruijn graph (DBG) as the underlying data structure. The quality of the assemblies produced by such assemblers is highly influenced by the exact word length k. As such no single kmer value leads to optimal results. Instead, DBGs over different kmer values are built and the assemblies are merged to improve sensitivity. However, no studies have investigated thoroughly the problem of automatically learning at which kmer value to stop the assembly. Instead a suboptimal selection of kmer values is often used in practice. Results: Here we investigate the contribution of a single kmer value in a multi-kmer based assembly approach. We find that a comparative clustering of related assemblies can be used to estimate the importance of an additional kmer assembly. Using a model fit based algorithm we predict the kmer value at which no further assemblies are necessary. Our approach is tested with different de novo assemblers for datasets with different coverage values and read lengths. Further, we suggest a simple post processing step that significantly improves the quality of multi-kmer assemblies. Conclusion: We provide an automatic method for limiting the number of kmer values without a significant loss in assembly quality but with savings in assembly time. This is a step forward to making multi-kmer methods more reliable and easier to use. Availability and Implementation:A general implementation of our approach can be found under: https://github.com/SchulzLab/KREATION. Supplementary information:Supplementary data are available at Bioinformatics online. Contact:mschulz@mmci.uni-saarland.de
Collapse
Affiliation(s)
- Dilip A Durai
- Cluster of Excellence on Multimodal Computing and Interaction, Saarland University, Saarbrücken, 66123, Germany Department for Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, 66123, Germany
| | - Marcel H Schulz
- Cluster of Excellence on Multimodal Computing and Interaction, Saarland University, Saarbrücken, 66123, Germany Department for Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, 66123, Germany
| |
Collapse
|
15
|
Trucchi E, Mazzarella AB, Gilfillan GD, Lorenzo MT, Schönswetter P, Paun O. BsRADseq: screening DNA methylation in natural populations of non-model species. Mol Ecol 2016; 25:1697-713. [PMID: 26818626 PMCID: PMC4949719 DOI: 10.1111/mec.13550] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 12/01/2015] [Accepted: 01/15/2016] [Indexed: 12/24/2022]
Abstract
Epigenetic modifications are expected to occur at a much faster rate than genetic mutations, potentially causing isolated populations to stochastically drift apart, or if they are subjected to different selective regimes, to directionally diverge. A high level of genome‐wide epigenetic divergence between individuals occupying distinct habitats is therefore predicted. Here, we introduce bisulfite‐converted restriction site associated DNA sequencing (bsRADseq), an approach to quantify the level of DNA methylation differentiation across multiple individuals. This reduced representation method is flexible in the extent of DNA sequence interrogated. We showcase its applicability in three natural systems, each comprising individuals adapted to divergent environments: a diploid plant (Heliosperma, Caryophyllaceae), a tetraploid plant (Dactylorhiza, Orchidaceae) and an animal (Gasterosteusaculeatus, Gasterosteidae). We present a robust bioinformatic pipeline, combining tools for RAD locus assembly, SNP calling, bisulfite‐converted read mapping and DNA methylation calling to analyse bsRADseq data with or without a reference genome. Importantly, our approach accurately distinguishes between SNPs and methylation polymorphism (SMPs). Although DNA methylation frequency between different positions of a genome varies widely, we find a surprisingly high consistency in the methylation profile between individuals thriving in divergent ecological conditions, particularly in Heliosperma. This constitutive stability points to significant molecular or developmental constraints acting on DNA methylation variation. Altogether, by combining the flexibility of RADseq with the accuracy of bisulfite sequencing in quantifying DNA methylation, the bsRADseq methodology and our bioinformatic pipeline open up the opportunity for genome‐wide epigenetic investigations of evolutionary and ecological relevance in non‐model species, independent of their genomic features.
Collapse
Affiliation(s)
- Emiliano Trucchi
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, 1030, Vienna, Austria.,Department of Biosciences, Centre for Ecological and Evolutionary Synthesis, University of Oslo, PO Box 1066 Blindern, N-0316, Oslo, Norway
| | - Anna B Mazzarella
- Department of Biosciences, Centre for Ecological and Evolutionary Synthesis, University of Oslo, PO Box 1066 Blindern, N-0316, Oslo, Norway
| | - Gregor D Gilfillan
- Department of Medical Genetics, Oslo University Hospital, PO Box 4590 Nydalen, N-0420, Oslo, Norway
| | - Maria T Lorenzo
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, 1030, Vienna, Austria
| | - Peter Schönswetter
- Institute of Botany, University of Innsbruck, Sternwartestrasse 15, A-6020, Innsbruck, Austria
| | - Ovidiu Paun
- Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, 1030, Vienna, Austria
| |
Collapse
|
16
|
Casimiro-Soriguer I, Narbona E, Buide ML, del Valle JC, Whittall JB. Transcriptome and Biochemical Analysis of a Flower Color Polymorphism in Silene littorea (Caryophyllaceae). FRONTIERS IN PLANT SCIENCE 2016; 7:204. [PMID: 26973662 PMCID: PMC4770042 DOI: 10.3389/fpls.2016.00204] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 02/06/2016] [Indexed: 05/23/2023]
Abstract
Flower color polymorphisms are widely used as model traits from genetics to ecology, yet determining the biochemical and molecular basis can be challenging. Anthocyanin-based flower color variations can be caused by at least 12 structural and three regulatory genes in the anthocyanin biosynthetic pathway (ABP). We use mRNA-Seq to simultaneously sequence and estimate expression of these candidate genes in nine samples of Silene littorea representing three color morphs (dark pink, light pink and white) across three developmental stages in hopes of identifying the cause of flower color variation. We identified 29 putative paralogs for the 15 candidate genes in the ABP. We assembled complete coding sequences for 16 structural loci and nine of ten regulatory loci. Among these 29 putative paralogs, we identified 622 SNPs, yet only nine synonymous SNPs in Ans had allele frequencies that differentiated pigmented petals (dark pink and light pink) from white petals. These Ans allele frequency differences were further investigated with an expanded sequencing survey of 38 individuals, yet no SNPs consistently differentiated the color morphs. We also found one locus, F3h1, with strong differential expression between pigmented and white samples (>42x). This may be caused by decreased expression of Myb1a in white petal buds. Myb1a in S. littorea is a regulatory locus closely related to Subgroup 7 Mybs known to regulate F3h and other loci in the first half of the ABP in model species. We then compare the mRNA-Seq results with petal biochemistry which revealed cyanidin as the primary anthocyanin and five flavonoid intermediates. Concentrations of three of the flavonoid intermediates were significantly lower in white petals than in pigmented petals (rutin, quercetin and isovitexin). The biochemistry results for rutin, quercetin, luteolin and apigenin are consistent with the transcriptome results suggesting a blockage at F3h, possibly caused by downregulation of Myb1a.
Collapse
Affiliation(s)
- Inés Casimiro-Soriguer
- Department of Molecular Biology and Biochemical Engineering, Pablo de Olavide UniversitySeville, Spain
- Department of Plant Biology and Ecology, University of SevilleSeville, Spain
| | - Eduardo Narbona
- Department of Molecular Biology and Biochemical Engineering, Pablo de Olavide UniversitySeville, Spain
| | - M. L. Buide
- Department of Molecular Biology and Biochemical Engineering, Pablo de Olavide UniversitySeville, Spain
| | - José C. del Valle
- Department of Molecular Biology and Biochemical Engineering, Pablo de Olavide UniversitySeville, Spain
| | - Justen B. Whittall
- Department of Biology, Santa Clara University, College of Arts and SciencesSanta Clara, CA, USA
| |
Collapse
|
17
|
De novo assembly and characterization of the leaf, bud, and fruit transcriptome from the vulnerable tree Juglans mandshurica for the development of 20 new microsatellite markers using Illumina sequencing. Mol Genet Genomics 2015; 291:849-62. [DOI: 10.1007/s00438-015-1147-y] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Accepted: 11/13/2015] [Indexed: 01/08/2023]
|
18
|
Christmas MJ, Biffin E, Lowe AJ. Transcriptome sequencing, annotation and polymorphism detection in the hop bush, Dodonaea viscosa. BMC Genomics 2015; 16:803. [PMID: 26474753 PMCID: PMC4609105 DOI: 10.1186/s12864-015-1987-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Accepted: 10/06/2015] [Indexed: 01/09/2023] Open
Abstract
Background The hop bush, Dodonaea viscosa, is a trans-oceanic species distributed oversix continents. It evolved in Australia where it is found over a wide range of habitat types and is an ecologically important species. Limited genomic resources are currently available for this species, thus our understanding of its evolutionary history and ecological adaptation is restricted. Here, we present a comprehensive transcriptome dataset for future genomic studies into this species. Methods We performed Illumina sequencing of cDNA prepared from leaf tissue collected from seven populations of D. viscosa ssp. angustissima and spatulata distributed along an environmental gradient in South Australia. Sequenced reads were assembled to provide a transcriptome resource. Contiguous sequences (contigs) were annotated using BLAST searches against the NCBI non-redundant database and gene ontology definitions were assigned. Single nucleotide polymorphisms were detected for the establishment of a genetic marker set. A comparison between the two subspecies was also carried out. Results Illumina sequencing returned 268,672,818 sequence reads, which were de novoassembled into 105,125 contigs. Contigs with significant BLAST alignments (E value < 1e-5)numbered at 44,191, with 38,311 of these having their most significant hits to sequences from land plant species. Gene Ontology terms were assigned to 28,440 contigs and KEGG analysis identified 146 pathways that the gene products from 5,070 contigs are potentially involved in. The subspecies comparison identified 8,494 fixed SNP differences across 3,979 contiguous sequences, indicating a level of genetic differentiation between them. Across all samples, 248,235 SNPs were detected. Conclusions We have established a significant genomic data resource for D. viscosa,providing a comprehensive transcriptomic reference. Genetic differences among morphologically distinct subspecies were found. A wide range of putative gene regions were identified along with a large set of variable SNP markers, providing a basis for studies into the evolution and ecological adaptation of D. viscosa. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1987-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Matthew J Christmas
- Environment Institute and School of Biological Sciences, The University of Adelaide, North Terrace, Adelaide, 5005, SA, Australia.
| | - Ed Biffin
- Environment Institute and School of Biological Sciences, The University of Adelaide, North Terrace, Adelaide, 5005, SA, Australia.
| | - Andrew J Lowe
- Environment Institute and School of Biological Sciences, The University of Adelaide, North Terrace, Adelaide, 5005, SA, Australia.
| |
Collapse
|
19
|
Ma Z, Tan Y, Cui G, Feng Y, Cui Q, Song X. Transcriptome and gene expression analysis of DHA producer Aurantiochytrium under low temperature conditions. Sci Rep 2015; 5:14446. [PMID: 26403200 PMCID: PMC4585886 DOI: 10.1038/srep14446] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 08/28/2015] [Indexed: 01/05/2023] Open
Abstract
Aurantiochytrium is a promising docosahexaenoic acid (DHA) production candidate due to its fast growth rate and high proportions of lipid and DHA content. In this study, high-throughput RNA sequencing technology was employed to explore the acclimatization of this DHA producer under cold stress at the transcriptional level. The overall de novo assembly of the cDNA sequence data generated 29,783 unigenes, with an average length of 1,200 bp. In total, 13,245 unigenes were annotated in at least one database. A comparative genomic analysis between normal conditions and cold stress revealed that 2,013 genes were differentially expressed during the growth stage, while 2,071 genes were differentially expressed during the lipid accumulation stage. Further functional categorization and analyses showed some differentially expressed genes were involved in processes crucial to cold acclimation, such as signal transduction, cellular component biogenesis, and carbohydrate and lipid metabolism. A brief survey of the transcripts obtained in response to cold stress underlines the survival strategy of Aurantiochytrium; of these transcripts, many directly or indirectly influence the lipid composition. This is the first study to perform a transcriptomic analysis of the Aurantiochytrium under low temperature conditions. Our results will help to enhance DHA production by Aurantiochytrium in the future.
Collapse
Affiliation(s)
- Zengxin Ma
- Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qingdao Engineering Laboratory of Single Cell Oil, Qingdao, Shandong, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yanzhen Tan
- Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qingdao Engineering Laboratory of Single Cell Oil, Qingdao, Shandong, China
| | - Guzhen Cui
- Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qingdao Engineering Laboratory of Single Cell Oil, Qingdao, Shandong, China
| | - Yingang Feng
- Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qingdao Engineering Laboratory of Single Cell Oil, Qingdao, Shandong, China
| | - Qiu Cui
- Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Key Laboratory of Biofuels, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qingdao Engineering Laboratory of Single Cell Oil, Qingdao, Shandong, China
| | - Xiaojin Song
- Shandong Provincial Key Laboratory of Energy Genetics, Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao, Shandong, China
- Qingdao Engineering Laboratory of Single Cell Oil, Qingdao, Shandong, China
| |
Collapse
|
20
|
Qiu S, Bergero R, Guirao-Rico S, Campos JL, Cezard T, Gharbi K, Charlesworth D. RAD mapping reveals an evolving, polymorphic and fuzzy boundary of a plant pseudoautosomal region. Mol Ecol 2015; 25:414-30. [PMID: 26139514 DOI: 10.1111/mec.13297] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2015] [Revised: 06/23/2015] [Accepted: 06/25/2015] [Indexed: 01/10/2023]
Abstract
How loss of genetic exchanges (recombination) evolves between sex chromosomes is a long-standing question. Suppressed recombination may evolve when a sexually antagonistic (SA) polymorphism occurs in a partially sex-linked 'pseudoautosomal' region (or 'PAR'), maintaining allele frequency differences between the two sexes, and creating selection for closer linkage with the fully sex-linked region of the Y chromosome in XY systems, or the W in ZW sex chromosome systems. Most evidence consistent with the SA polymorphism hypothesis is currently indirect, and more studies of the genetics and population genetics of PAR genes are clearly needed. The sex chromosomes of the plant Silene latifolia are suitable for such studies, as they evolved recently and the loss of recombination could still be ongoing. Here, we used RAD sequencing to genetically map sequences in this plant, which has a large genome (c. 3 gigabases) and no available whole-genome sequence. We mapped 83 genes on the sex chromosomes, and comparative mapping in the related species S. vulgaris supports previous evidence for additions to an ancestral PAR and identified at least 12 PAR genes. We describe evidence that recombination rates have been reduced in meiosis of both sexes, and differences in recombination between S. latifolia families suggest ongoing recombination suppression. Large allele frequency differences between the sexes were found at several loci closely linked to the PAR boundary, and genes in different regions of the PAR showed striking sequence diversity patterns that help illuminate the evolution of the PAR.
Collapse
Affiliation(s)
- S Qiu
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - R Bergero
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - S Guirao-Rico
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - J L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - T Cezard
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - K Gharbi
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - D Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
21
|
Abstract
Background Silene latifolia represents one of the best-studied plant sex chromosome systems. A new approach using RNA-seq data has recently identified hundreds of new sex-linked genes in this species. However, this approach is expected to miss genes that are either not expressed or are expressed at low levels in the tissue(s) used for RNA-seq. Therefore other independent approaches are needed to discover such sex-linked genes. Results Here we used 10 well-characterized S. latifolia sex-linked genes and their homologs in Silene vulgaris, a species without sex chromosomes, to screen BAC libraries of both species. We isolated and sequenced 4 Mb of BAC clones of S. latifolia X and Y and S. vulgaris genomic regions, which yielded 59 new sex-linked genes (with S. vulgaris homologs for some of them). We assembled sequences that we believe represent the tip of the Xq arm. These sequences are clearly not pseudoautosomal, so we infer that the S. latifolia X has a single pseudoautosomal region (PAR) on the Xp arm. The estimated mean gene density in X BACs is 2.2 times lower than that in S. vulgaris BACs, agreeing with the genome size difference between these species. Gene density was estimated to be extremely low in the Y BAC clones. We compared our BAC-located genes with the sex-linked genes identified in previous RNA-seq studies, and found that about half of them (those with low expression in flower buds) were not identified as sex-linked in previous RNA-seq studies. We compiled a set of ~70 validated X/Y genes and X-hemizygous genes (without Y copies) from the literature, and used these genes to show that X-hemizygous genes have a higher probability of being undetected by the RNA-seq approach, compared with X/Y genes; we used this to estimate that about 30 % of our BAC-located genes must be X-hemizygous. The estimate is similar when we use BAC-located genes that have S. vulgaris homologs, which excludes genes that were gained by the X chromosome. Conclusions Our BAC sequencing identified 59 new sex-linked genes, and our analysis of these BAC-located genes, in combination with RNA-seq data suggests that gene losses from the S. latifolia Y chromosome could be as high as 30 %, higher than previous estimates of 10-20 %. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1698-7) contains supplementary material, which is available to authorized users.
Collapse
|
22
|
Chung HH, Schwinn KE, Ngo HM, Lewis DH, Massey B, Calcott KE, Crowhurst R, Joyce DC, Gould KS, Davies KM, Harrison DK. Characterisation of betalain biosynthesis in Parakeelya flowers identifies the key biosynthetic gene DOD as belonging to an expanded LigB gene family that is conserved in betalain-producing species. FRONTIERS IN PLANT SCIENCE 2015; 6:499. [PMID: 26217353 PMCID: PMC4493658 DOI: 10.3389/fpls.2015.00499] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 06/22/2015] [Indexed: 05/18/2023]
Abstract
Plant betalain pigments are intriguing because they are restricted to the Caryophyllales and are mutually exclusive with the more common anthocyanins. However, betalain biosynthesis is poorly understood compared to that of anthocyanins. In this study, betalain production and betalain-related genes were characterized in Parakeelya mirabilis (Montiaceae). RT-PCR and transcriptomics identified three sequences related to the key biosynthetic enzyme Dopa 4,5-dioxgenase (DOD). In addition to a LigB gene similar to that of non-Caryophyllales species (Class I genes), two other P. mirabilis LigB genes were found (DOD and DOD-like, termed Class II). PmDOD and PmDOD-like had 70% amino acid identity. Only PmDOD was implicated in betalain synthesis based on transient assays of enzyme activity and correlation of transcript abundance to spatio-temporal betalain accumulation. The role of PmDOD-like remains unknown. The striking pigment patterning of the flowers was due to distinct zones of red betacyanin and yellow betaxanthin production. The major betacyanin was the unglycosylated betanidin rather than the commonly found glycosides, an occurrence for which there are a few previous reports. The white petal zones lacked pigment but had DOD activity suggesting alternate regulation of the pathway in this tissue. DOD and DOD-like sequences were also identified in other betalain-producing species but not in examples of anthocyanin-producing Caryophyllales or non-Caryophyllales species. A Class I LigB sequence from the anthocyanin-producing Caryophyllaceae species Dianthus superbus and two DOD-like sequences from the Amaranthaceae species Beta vulgaris and Ptilotus spp. did not show DOD activity in the transient assay. The additional sequences suggests that DOD is part of a larger LigB gene family in betalain-producing Caryophyllales taxa, and the tandem genomic arrangement of two of the three B. vulgaris LigB genes suggests the involvement of duplication in the gene family evolution.
Collapse
Affiliation(s)
- Hsiao-Hang Chung
- Centre for Native Floriculture, School of Agriculture and Food Sciences, The University of Queensland, GattonQLD, Australia
| | - Kathy E. Schwinn
- New Zealand Institute for Plant & Food Research LimitedPalmerston North, New Zealand
| | - Hanh M. Ngo
- New Zealand Institute for Plant & Food Research LimitedPalmerston North, New Zealand
| | - David H. Lewis
- New Zealand Institute for Plant & Food Research LimitedPalmerston North, New Zealand
| | - Baxter Massey
- Centre for Native Floriculture, School of Agriculture and Food Sciences, The University of Queensland, GattonQLD, Australia
| | - Kate E. Calcott
- New Zealand Institute for Plant & Food Research LimitedPalmerston North, New Zealand
- Victoria University of WellingtonWellington, New Zealand
| | - Ross Crowhurst
- New Zealand Institute for Plant & Food Research LimitedAuckland, New Zealand
| | - Daryl C. Joyce
- Centre for Native Floriculture, School of Agriculture and Food Sciences, The University of Queensland, GattonQLD, Australia
| | - Kevin S. Gould
- Victoria University of WellingtonWellington, New Zealand
| | - Kevin M. Davies
- New Zealand Institute for Plant & Food Research LimitedPalmerston North, New Zealand
| | - Dion K. Harrison
- Centre for Native Floriculture, School of Agriculture and Food Sciences, The University of Queensland, GattonQLD, Australia
| |
Collapse
|
23
|
Wei C, Tao X, Li M, He B, Yan L, Tan X, Zhang Y. De novo transcriptome assembly of Ipomoea nil using Illumina sequencing for gene discovery and SSR marker identification. Mol Genet Genomics 2015; 290:1873-84. [PMID: 25877516 DOI: 10.1007/s00438-015-1034-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 03/23/2015] [Indexed: 01/14/2023]
Abstract
Ipomoea nil is widely used as an ornamental plant due to its abundance of flower color, but the limited transcriptome and genomic data hinder research on it. Using illumina platform, transcriptome profiling of I. nil was performed through high-throughput sequencing, which was proven to be a rapid and cost-effective means to characterize gene content. Our goal is to use the resulting information to facilitate the relevant research on flowering and flower color formation in I. nil. In total, 268 million unique illumina RNA-Seq reads were produced and used in the transcriptome assembly. These reads were assembled into 220,117 contigs, of which 137,307 contigs were annotated using the GO and KEGG database. Based on the result of functional annotations, a total of 89,781 contigs were assigned 455,335 GO term annotations. Meanwhile, 17,418 contigs were identified with pathway annotation and they were functionally assigned to 144 KEGG pathways. Our transcriptome revealed at least 55 contigs as probably flowering-related genes in I. nil, and we also identified 25 contigs that encode key enzymes in the phenylpropanoid biosynthesis pathway. Based on the analysis relating to gene expression profiles, in the phenylpropanoid biosynthesis pathway of I. nil, the repression of lignin biosynthesis might lead to the redirection of the metabolic flux into anthocyanin biosynthesis. This may be the most likely reason that I. nil has high anthocyanins content, especially in its flowers. Additionally, 15,537 simple sequence repeats (SSRs) were detected using the MISA software, and these SSRs will undoubtedly benefit future breeding work. Moreover, the information uncovered in this study will also serve as a valuable resource for understanding the flowering and flower color formation mechanisms in I. nil.
Collapse
Affiliation(s)
- Changhe Wei
- Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Xiang Tao
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China
| | - Ming Li
- Institute of Biotechnology and Nuclear Technology, Sichuan Academy of Agricultural Sciences, Chengdu, 610061, China
| | - Bin He
- Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Lang Yan
- Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Xuemei Tan
- Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, College of Life Sciences, Sichuan University, Chengdu, 610064, China
| | - Yizheng Zhang
- Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, College of Life Sciences, Sichuan University, Chengdu, 610064, China.
| |
Collapse
|
24
|
Guzman F, Kulcheski FR, Turchetto-Zolet AC, Margis R. De novo assembly of Eugenia uniflora L. transcriptome and identification of genes from the terpenoid biosynthesis pathway. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2014; 229:238-246. [PMID: 25443850 DOI: 10.1016/j.plantsci.2014.10.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Revised: 10/07/2014] [Accepted: 10/10/2014] [Indexed: 05/06/2023]
Abstract
Pitanga (Eugenia uniflora L.) is a member of the Myrtaceae family and is of particular interest due to its medicinal properties that are attributed to specialized metabolites with known biological activities. Among these molecules, terpenoids are the most abundant in essential oils that are found in the leaves and represent compounds with potential pharmacological benefits. The terpene diversity observed in Myrtaceae is determined by the activity of different members of the terpene synthase and oxidosqualene cyclase families. Therefore, the aim of this study was to perform a de novo assembly of transcripts from E. uniflora leaves and to annotation to identify the genes potentially involved in the terpenoid biosynthesis pathway and terpene diversity. In total, 72,742 unigenes with a mean length of 1048bp were identified. Of these, 43,631 and 36,289 were annotated with the NCBI non-redundant protein and Swiss-Prot databases, respectively. The gene ontology categorized the sequences into 53 functional groups. A metabolic pathway analysis with KEGG revealed 8,625 unigenes assigned to 141 metabolic pathways and 40 unigenes predicted to be associated with the biosynthesis of terpenoids. Furthermore, we identified four putative full-length terpene synthase genes involved in sesquiterpenes and monoterpenes biosynthesis, and three putative full-length oxidosqualene cyclase genes involved in the triterpenes biosynthesis. The expression of these genes was validated in different E. uniflora tissues.
Collapse
Affiliation(s)
- Frank Guzman
- PPGGBM, Departamento de Genética, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil; PPGBCM, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil
| | - Franceli Rodrigues Kulcheski
- PPGGBM, Departamento de Genética, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil; PPGBCM, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil
| | | | - Rogerio Margis
- PPGGBM, Departamento de Genética, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil; PPGBCM, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil; Departamento de Biofisica, Universidade Federal do Rio Grande do Sul - UFRGS, Porto Alegre, RS, Brazil.
| |
Collapse
|
25
|
Sequencing pools of individuals — mining genome-wide polymorphism data without big funding. Nat Rev Genet 2014; 15:749-63. [DOI: 10.1038/nrg3803] [Citation(s) in RCA: 512] [Impact Index Per Article: 51.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
26
|
Baloun J, Nevrtalova E, Kovacova V, Hudzieczek V, Cegan R, Vyskot B, Hobza R. Characterization of the HMA7 gene and transcriptomic analysis of candidate genes for copper tolerance in two Silene vulgaris ecotypes. JOURNAL OF PLANT PHYSIOLOGY 2014; 171:1188-96. [PMID: 24973591 DOI: 10.1016/j.jplph.2014.04.014] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2014] [Revised: 04/07/2014] [Accepted: 04/07/2014] [Indexed: 05/25/2023]
Abstract
Silene vulgaris possesses ecotype-specific tolerance to high levels of copper in the soil. Although this was reported a few decades ago, little is known about this trait on a molecular level. The aim of this study was to analyze the transcription response to elevated copper concentrations in two S. vulgaris ecotypes originating from copper-contrasting soil types - copper-tolerant Lubietova and copper-sensitive Stranska skala. To reveal if plants are transcriptionally affected, we first analyzed the HMA7 gene, a known key player in copper metabolism. Based on BAC library screening, we identified a BAC clone containing a SvHMA7 sequence with all the structural properties specific for plant copper-transporting ATPases. The functionality of the gene was tested using heterologous complementation in yeast mutants. Analyses of SvHMA7 transcription patterns showed that both ecotypes studied up-regulated SvHMA7 transcription after the copper treatment. Our data are supported by analysis of appropriate reference genes based on RNA-Seq databases. To identify genes specifically involved in copper response in the studied ecotypes, we analyzed transcription profiles of genes coding Cu-transporting proteins and genes involved in the prevention of copper-induced oxidative stress in both ecotypes. Our data show that three genes (APx, POD and COPT5) differ in their transcription pattern between the ecotypes with constitutively increased transcription in Lubietova. Taken together, we have identified transcription differences between metallifferous and non-metalliferous ecotypes of S. vulgaris, and we have suggested candidate genes participating in metal tolerance in this species.
Collapse
Affiliation(s)
- Jiri Baloun
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic.
| | - Eva Nevrtalova
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic; Department of Plant Biology, Faculty of Agronomy, Mendel University in Brno, Zemedelska 1, CZ-613 00 Brno, Czech Republic
| | - Viera Kovacova
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic
| | - Vojtech Hudzieczek
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic
| | - Radim Cegan
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic
| | - Boris Vyskot
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic
| | - Roman Hobza
- Department of Plant Developmental Genetics, Institute of Biophysics, Academy of Sciences of the Czech Republic, v.v.i., Kralovopolska 135, CZ-612 00 Brno, Czech Republic; Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Slechtitelu 31, CZ-78371 Olomouc, Czech Republic
| |
Collapse
|
27
|
Dong S, Liu Y, Niu J, Ning Y, Lin S, Zhang Z. De novo transcriptome analysis of the Siberian apricot (Prunus sibirica L.) and search for potential SSR markers by 454 pyrosequencing. Gene 2014; 544:220-7. [PMID: 24746601 DOI: 10.1016/j.gene.2014.04.031] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Revised: 04/10/2014] [Accepted: 04/17/2014] [Indexed: 11/29/2022]
Abstract
The Siberian apricot, an economically and ecologically important plant in China, contains seeds high in oil and can grow on marginal land. Although this species has multiple purposes and may be a feedstock of biofuel in China, transcriptome information and molecular research on this species remain limited. RNA-Seq technology has been widely applied to transcriptomics, genomics and the development of molecular markers, and functional gene studies. In this study, we obtained 1,243,067 high-quality reads with a mean size of 425 bp in a single run, totaling 528.4 Mb of sequence data using 454 GS FLX Titanium sequencing. All reads were assembled de novo into 46,940 unigenes with a mean size of 651 bp (range: 45-5566 bp). Assembled unigenes were annotated in multiple public databases based on similarity alignments to genes and proteins. 191 unigenes involving in lipid biosynthesis and metabolism were found, among them, expression patterns of two desaturase enzymes were analyzed by quantitative real-time polymerase chain reaction (qRT-PCR), based on six tissues from Siberian apricot, the seeds had the highest expression. 7304 simple sequence repeats (SSR) were identified from 6509 unigenes, a total of 9930 primer pairs were designed, 50 primer pairs were randomly selected to validate of the usefulness, and 24 (48%) primer pairs produced bands of the expected size. These data provide a base of sequence information to improve agronomic characters and molecular marker-assisted breeding to alter the composition of fatty acids in seeds from this plant, and hence, facilitate its utilization as a future biodiesel feedstock.
Collapse
Affiliation(s)
- Shubin Dong
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Yulin Liu
- Key Laboratory for Genetics and Breeding of Forest Trees and Ornamental Plants of Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Jun Niu
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Yu Ning
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Shanzhi Lin
- Key Laboratory for Genetics and Breeding of Forest Trees and Ornamental Plants of Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Zhixiang Zhang
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China.
| |
Collapse
|
28
|
Brousseau L, Tinaut A, Duret C, Lang T, Garnier-Gere P, Scotti I. High-throughput transcriptome sequencing and preliminary functional analysis in four Neotropical tree species. BMC Genomics 2014; 15:238. [PMID: 24673733 PMCID: PMC3986928 DOI: 10.1186/1471-2164-15-238] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2014] [Accepted: 03/13/2014] [Indexed: 12/30/2022] Open
Abstract
Background The Amazonian rainforest is predicted to suffer from ongoing environmental changes. Despite the need to evaluate the impact of such changes on tree genetic diversity, we almost entirely lack genomic resources. Results In this study, we analysed the transcriptome of four tropical tree species (Carapa guianensis, Eperua falcata, Symphonia globulifera and Virola michelii) with contrasting ecological features, belonging to four widespread botanical families (respectively Meliaceae, Fabaceae, Clusiaceae and Myristicaceae). We sequenced cDNA libraries from three organs (leaves, stems, and roots) using 454 pyrosequencing. We have developed an R and bioperl-based bioinformatic procedure for de novo assembly, gene functional annotation and marker discovery. Mismatch identification takes into account single-base quality values as well as the likelihood of false variants as a function of contig depth and number of sequenced chromosomes. Between 17103 (for Symphonia globulifera) and 23390 (for Eperua falcata) contigs were assembled. Organs varied in the numbers of unigenes they apparently express, with higher number in roots. Patterns of gene expression were similar across species, with metabolism of aromatic compounds standing out as an overrepresented gene function. Transcripts corresponding to several gene functions were found to be over- or underrepresented in each organ. We identified between 4434 (for Symphonia globulifera) and 9076 (for Virola surinamensis) well-supported mismatches. The resulting overall mismatch density was comprised between 0.89 (S. globulifera) and 1.05 (V. surinamensis) mismatches/100 bp in variation-containing contigs. Conclusion The relative representation of gene functions in the four transcriptomes suggests that secondary metabolism may be particularly important in tropical trees. The differential representation of transcripts among tissues suggests differential gene expression, which opens the way to functional studies in these non-model, ecologically important species. We found substantial amounts of mismatches in the four species. These newly identified putative variants are a first step towards acquiring much needed genomic resources for tropical tree species. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-238) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | - Ivan Scotti
- INRA, UMR 0745 EcoFoG, Campus agronomique BP 709, F-97387 Cedex, France.
| |
Collapse
|
29
|
Melicher D, Torson AS, Dworkin I, Bowsher JH. A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach. BMC Genomics 2014; 15:188. [PMID: 24621177 PMCID: PMC4008362 DOI: 10.1186/1471-2164-15-188] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Accepted: 03/03/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. RESULTS Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. CONCLUSIONS The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.
Collapse
Affiliation(s)
- Dacotah Melicher
- />Department of Biological Sciences, North Dakota State University, 1340 Bolley Drive, 218 Stevens Hall, Fargo, ND 58102 USA
| | - Alex S Torson
- />Department of Biological Sciences, North Dakota State University, 1340 Bolley Drive, 218 Stevens Hall, Fargo, ND 58102 USA
| | - Ian Dworkin
- />Department of Zoology, Michigan State University, 328 Giltner Hall, East Lansing, MI 48823 USA
| | - Julia H Bowsher
- />Department of Biological Sciences, North Dakota State University, 1340 Bolley Drive, 218 Stevens Hall, Fargo, ND 58102 USA
| |
Collapse
|
30
|
Kawakami T, Darby BJ, Ungerer MC. Transcriptome resources for the perennial sunflowerHelianthus maximilianiobtained from ecologically divergent populations. Mol Ecol Resour 2014; 14:812-9. [DOI: 10.1111/1755-0998.12227] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2013] [Revised: 01/07/2014] [Accepted: 01/08/2014] [Indexed: 11/30/2022]
Affiliation(s)
- Takeshi Kawakami
- Division of Biology; Kansas State University; Manhattan KS 66506 USA
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| | - Brian J. Darby
- Department of Biology; University of North Dakota; Grand Forks ND 58202 USA
| | - Mark C. Ungerer
- Division of Biology; Kansas State University; Manhattan KS 66506 USA
| |
Collapse
|
31
|
Chow KS, Ghazali AK, Hoh CC, Mohd-Zainuddin Z. RNA sequencing read depth requirement for optimal transcriptome coverage in Hevea brasiliensis. BMC Res Notes 2014; 7:69. [PMID: 24484543 PMCID: PMC3926681 DOI: 10.1186/1756-0500-7-69] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Accepted: 01/17/2014] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND One of the concerns of assembling de novo transcriptomes is determining the amount of read sequences required to ensure a comprehensive coverage of genes expressed in a particular sample. In this report, we describe the use of Illumina paired-end RNA-Seq (PE RNA-Seq) reads from Hevea brasiliensis (rubber tree) bark to devise a transcript mapping approach for the estimation of the read amount needed for deep transcriptome coverage. FINDINGS We optimized the assembly of a Hevea bark transcriptome based on 16 Gb Illumina PE RNA-Seq reads using the Oases assembler across a range of k-mer sizes. We then assessed assembly quality based on transcript N50 length and transcript mapping statistics in relation to (a) known Hevea cDNAs with complete open reading frames, (b) a set of core eukaryotic genes and (c) Hevea genome scaffolds. This was followed by a systematic transcript mapping process where sub-assemblies from a series of incremental amounts of bark transcripts were aligned to transcripts from the entire bark transcriptome assembly. The exercise served to relate read amounts to the degree of transcript mapping level, the latter being an indicator of the coverage of gene transcripts expressed in the sample. As read amounts or datasize increased toward 16 Gb, the number of transcripts mapped to the entire bark assembly approached saturation. A colour matrix was subsequently generated to illustrate sequencing depth requirement in relation to the degree of coverage of total sample transcripts. CONCLUSIONS We devised a procedure, the "transcript mapping saturation test", to estimate the amount of RNA-Seq reads needed for deep coverage of transcriptomes. For Hevea de novo assembly, we propose generating between 5-8 Gb reads, whereby around 90% transcript coverage could be achieved with optimized k-mers and transcript N50 length. The principle behind this methodology may also be applied to other non-model plants, or with reads from other second generation sequencing platforms.
Collapse
MESH Headings
- Databases, Genetic
- Gene Expression Profiling/methods
- Gene Expression Regulation, Plant
- Gene Library
- Genes, Plant
- Hevea/chemistry
- Hevea/genetics
- High-Throughput Nucleotide Sequencing
- Open Reading Frames
- Plant Bark/metabolism
- Plant Leaves/metabolism
- Plant Proteins/genetics
- RNA, Messenger/biosynthesis
- RNA, Messenger/chemistry
- RNA, Messenger/genetics
- RNA, Messenger/isolation & purification
- RNA, Plant/biosynthesis
- RNA, Plant/chemistry
- RNA, Plant/genetics
- RNA, Plant/isolation & purification
- Reproducibility of Results
- Transcriptome
Collapse
Affiliation(s)
- Keng-See Chow
- Biotechnology Unit, Malaysian Rubber Board, Rubber Research Institute of Malaysia, Experiment Station, Kuala Lumpur 47000, Sungai Buloh, Selangor, Malaysia
| | - Ahmad-Kamal Ghazali
- Codon Genomics SB, No. 26, Jalan Dutamas 7, Taman Dutamas, Balakong 43200, Seri Kembangan Balakong, Selangor, Malaysia
| | - Chee-Choong Hoh
- Codon Genomics SB, No. 26, Jalan Dutamas 7, Taman Dutamas, Balakong 43200, Seri Kembangan Balakong, Selangor, Malaysia
| | - Zainorlina Mohd-Zainuddin
- Biotechnology Unit, Malaysian Rubber Board, Rubber Research Institute of Malaysia, Experiment Station, Kuala Lumpur 47000, Sungai Buloh, Selangor, Malaysia
| |
Collapse
|
32
|
Pannell JR, Fields PD. Evolution in subdivided plant populations: concepts, recent advances and future directions. THE NEW PHYTOLOGIST 2014; 201:417-432. [PMID: 24111698 DOI: 10.1111/nph.12495] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 08/07/2013] [Indexed: 05/14/2023]
Abstract
Research into the evolution of subdivided plant populations has long involved the study of phenotypic variation across plant geographic ranges and the genetic details underlying that variation. Genetic polymorphism at different marker loci has also allowed us to infer the long- and short-term histories of gene flow within and among populations, including range expansions and colonization-extinction dynamics. However, the advent of affordable genome-wide sequences for large numbers of individuals is opening up new possibilities for the study of subdivided populations. In this review, we consider what the new tools and technologies may allow us to do. In particular, we encourage researchers to look beyond the description of variation and to use genomic tools to address new hypotheses, or old ones afresh. Because subdivided plant populations are complex structures, we caution researchers away from adopting simplistic interpretations of their data, and to consider the patterns they observe in terms of the population genetic processes that have given rise to them; here, the genealogical framework of the coalescent will continue to be conceptually and analytically useful.
Collapse
Affiliation(s)
- John R Pannell
- Department of Ecology and Evolution, University of Lausanne, 1015, Lausanne, Switzerland
| | - Peter D Fields
- Department of Biology, University of Virginia, PO Box 400328, Charlottesville, VA, 22904-4328, USA
| |
Collapse
|
33
|
Ahn YK, Tripathi S, Cho YI, Kim JH, Lee HE, Kim DS, Woo JG, Cho MC. De novo transcriptome assembly and novel microsatellite marker information in Capsicum annuum varieties Saengryeg 211 and Saengryeg 213. BOTANICAL STUDIES 2013; 54:58. [PMID: 28510893 PMCID: PMC5430321 DOI: 10.1186/1999-3110-54-58] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2013] [Accepted: 11/02/2013] [Indexed: 05/31/2023]
Abstract
BACKGROUND Pepper, Capsicum annuum L., Solanaceae, is a major staple economically important vegetable crop worldwide. Limited functional genomics resources and whole genome association studies could be substantially improved through the application of molecular approach for the characterization of gene content and identification of molecular markers. The massive parallel pyrosequencing of two pepper varieties, the highly pungent, Saengryeg 211, and the non-pungent, Saengryeg 213, including de novo transcriptome assembly, functional annotation, and in silico discovery of potential molecular markers is described. We performed 454 GS-FLX Titanium sequencing of polyA-selected and normalized cDNA libraries generated from a single pool of transcripts obtained from mature fruits of two pepper varieties. RESULTS A single 454 pyrosequencing run generated 361,671 and 274,269 reads totaling 164.49 and 124.60 Mb of sequence data (average read length of 454 nucleotides), which assembled into 23,821 and 17,813 isotigs and 18,147 and 15,129 singletons for both varieties, respectively. These reads were organized into 20,352 and 15,781 'isogroups' for both varieties. Assembled sequences were functionally annotated based on homology to genes in multiple public databases and assigned with Gene Ontology (GO) terms. Sequence variants analyses identified a total of 3,766 and 2,431 potential (Simple Sequence Repeat) SSR motifs for microsatellite analysis for both varieties, where trinucleotide was the most common repeat unit (84%), followed by di (9.9%), hexa (4.1%) and pentanucleotide repeats (2.1%). GAA repeat (8.6%) was the most frequent repeat motif, followed by TGG (7.2%), TTC (6.5%), and CAG (6.2%). CONCLUSIONS High-throughput transcriptome assembly, annotation and large scale of SSR marker discovery has been achieved using next generation sequencing (NGS) of two pepper varieties. These valuable informations for functional genomics resource shall help to further improve the pepper breeding efforts with respect to genetic linkage maps, QTL mapping and marker-assisted trait selection.
Collapse
Affiliation(s)
- Yul-Kyun Ahn
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Swati Tripathi
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Young-Il Cho
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Jeong-Ho Kim
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Hye-Eun Lee
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Do-Sun Kim
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Jong-Gyu Woo
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| | - Myeong-Cheoul Cho
- Vegetable Research Division, National Institute of Horticultural & Herbal Science, Rural Development Administration, Suwon, 440-706 Republic of Korea
| |
Collapse
|
34
|
Agunbiade TA, Sun W, Coates BS, Djouaka R, Tamò M, Ba MN, Binso-Dabire C, Baoua I, Olds BP, Pittendrigh BR. Development of reference transcriptomes for the major field insect pests of cowpea: a toolbox for insect pest management approaches in west Africa. PLoS One 2013; 8:e79929. [PMID: 24278221 PMCID: PMC3838393 DOI: 10.1371/journal.pone.0079929] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 09/27/2013] [Indexed: 11/19/2022] Open
Abstract
Cowpea is a widely cultivated and major nutritional source of protein for many people that live in West Africa. Annual yields and longevity of grain storage is greatly reduced by feeding damage caused by a complex of insect pests that include the pod sucking bugs, Anoplocnemis curvipes Fabricius (Hemiptera: Coreidae) and Clavigralla tomentosicollis Stål (Hemiptera: Coreidae); as well as phloem-feeding cowpea aphids, Aphis craccivora Koch (Hemiptera: Aphididae) and flower thrips, Megalurothrips sjostedti Trybom (Thysanoptera: Thripidae). Efforts to control these pests remain a challenge and there is a need to understand the structure and movement of these pest populations in order to facilitate the development of integrated pest management strategies (IPM). Molecular tools have the potential to help facilitate a better understanding of pest populations. Towards this goal, we used 454 pyrosequencing technology to generate 319,126, 176,262, 320,722 and 227,882 raw reads from A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti, respectively. The reads were de novo assembled into 11,687, 7,647, 10,652 and 7,348 transcripts for A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti, respectively. Functional annotation of the resulting transcripts identified genes putatively involved in insecticide resistance, pathogen defense and immunity. Additionally, sequences that matched the primary aphid endosymbiont, Buchnera aphidicola, were identified among A. craccivora transcripts. Furthermore, 742, 97, 607 and 180 single nucleotide polymorphisms (SNPs) were respectively predicted among A. curvipes, A. craccivora, C. tomentosicollis and M. sjostedti transcripts, and will likely be valuable tools for future molecular genetic marker development. These results demonstrate that Roche 454-based transcriptome sequencing could be useful for the development of genomic resources for cowpea pest insects in West Africa.
Collapse
Affiliation(s)
- Tolulope A. Agunbiade
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail:
| | - Weilin Sun
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Brad S. Coates
- Corn Insects and Crop Genetics Research Unit, United States Department of Agriculture, Agricultural Research Service, Ames, Iowa, United States of America
| | | | - Manuele Tamò
- International Institute of Tropical Agriculture, Cotonou, Benin
| | - Malick N. Ba
- Institut de l’Environnement et de Recherches Agricoles, Ouagadougou, Burkina Faso
| | | | - Ibrahim Baoua
- Institut National de la Recherche Agronomique du Niger, Maradi, Niger
| | - Brett P. Olds
- Department of Animal Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Barry R. Pittendrigh
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| |
Collapse
|
35
|
Transcriptome analysis of Capsicum annuum varieties Mandarin and Blackcluster: assembly, annotation and molecular marker discovery. Gene 2013; 533:494-9. [PMID: 24125952 DOI: 10.1016/j.gene.2013.09.095] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2013] [Accepted: 09/26/2013] [Indexed: 11/22/2022]
Abstract
Next generation sequencing technologies have proven to be a rapid and cost-effective means to assemble and characterize gene content and identify molecular markers in various organisms. Pepper (Capsicum annuum L., Solanaceae) is a major staple vegetable crop, which is economically important and has worldwide distribution. High-throughput transcriptome profiling of two pepper cultivars, Mandarin and Blackcluster, using 454 GS-FLX pyrosequencing yielded 279,221 and 316,357 sequenced reads with a total 120.44 and 142.54Mb of sequence data (average read length of 431 and 450 nucleotides). These reads resulted from 17,525 and 16,341 'isogroups' and were assembled into 19,388 and 18,057 isotigs, and 22,217 and 13,153 singletons for both the cultivars, respectively. Assembled sequences were annotated functionally based on homology to genes in multiple public databases. Detailed sequence variant analysis identified a total of 9701 and 12,741 potential SNPs which eventually resulted in 1025 and 1059 genotype specific SNPs, for both the varieties, respectively, after examining SNP frequency distribution for each mapped unigenes. These markers for pepper will be highly valuable for marker-assisted breeding and other genetic studies.
Collapse
|
36
|
Liu Y, Huang Z, Ao Y, Li W, Zhang Z. Transcriptome analysis of yellow horn (Xanthoceras sorbifolia Bunge): a potential oil-rich seed tree for biodiesel in China. PLoS One 2013; 8:e74441. [PMID: 24040247 PMCID: PMC3770547 DOI: 10.1371/journal.pone.0074441] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2013] [Accepted: 08/02/2013] [Indexed: 11/27/2022] Open
Abstract
Background Yellow horn (Xanthoceras sorbifolia Bunge) is an oil-rich seed shrub that grows well in cold, barren environments and has great potential for biodiesel production in China. However, the limited genetic data means that little information about the key genes involved in oil biosynthesis is available, which limits further improvement of this species. In this study, we describe sequencing and de novo transcriptome assembly to produce the first comprehensive and integrated genomic resource for yellow horn and identify the pathways and key genes related to oil accumulation. In addition, potential molecular markers were identified and compiled. Methodology/Principal Findings Total RNA was isolated from 30 plants from two regions, including buds, leaves, flowers and seeds. Equal quantities of RNA from these tissues were pooled to construct a cDNA library for 454 pyrosequencing. A total of 1,147,624 high-quality reads with total and average lengths of 530.6 Mb and 462 bp, respectively, were generated. These reads were assembled into 51,867 unigenes, corresponding to a total of 36.1 Mb with a mean length, N50 and median of 696, 928 and 570 bp, respectively. Of the unigenes, 17,541 (33.82%) were unmatched in any public protein databases. We identified 281 unigenes that may be involved in de novo fatty acid (FA) and triacylglycerol (TAG) biosynthesis and metabolism. Furthermore, 6,707 SSRs, 16,925 SNPs and 6,201 InDels with high-confidence were also identified in this study. Conclusions This transcriptome represents a new functional genomics resource and a foundation for further studies on the metabolic engineering of yellow horn to increase oil content and modify oil composition. The potential molecular markers identified in this study provide a basis for polymorphism analysis of Xanthoceras, and even Sapindaceae; they will also accelerate the process of breeding new varieties with better agronomic characteristics.
Collapse
Affiliation(s)
- Yulin Liu
- College of Biological Science and Biotechnology, Beijing Forest University, Beijing, China
| | - Zhedong Huang
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forest University, Beijing, China
| | - Yan Ao
- Academy of Forest, Beijing Forest University, Beijing, China
| | - Wei Li
- College of Biological Science and Biotechnology, Beijing Forest University, Beijing, China
- * E-mail: (WL); (ZXZ)
| | - Zhixiang Zhang
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forest University, Beijing, China
- * E-mail: (WL); (ZXZ)
| |
Collapse
|
37
|
Harris SE, Munshi-South J, Obergfell C, O’Neill R. Signatures of rapid evolution in urban and rural transcriptomes of white-footed mice (Peromyscus leucopus) in the New York metropolitan area. PLoS One 2013; 8:e74938. [PMID: 24015321 PMCID: PMC3756007 DOI: 10.1371/journal.pone.0074938] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2013] [Accepted: 08/06/2013] [Indexed: 12/16/2022] Open
Abstract
Urbanization is a major cause of ecological degradation around the world, and human settlement in large cities is accelerating. New York City (NYC) is one of the oldest and most urbanized cities in North America, but still maintains 20% vegetation cover and substantial populations of some native wildlife. The white-footed mouse, Peromyscusleucopus, is a common resident of NYC's forest fragments and an emerging model system for examining the evolutionary consequences of urbanization. In this study, we developed transcriptomic resources for urban P. leucopus to examine evolutionary changes in protein-coding regions for an exemplar "urban adapter." We used Roche 454 GS FLX+ high throughput sequencing to derive transcriptomes from multiple tissues from individuals across both urban and rural populations. From these data, we identified 31,015 SNPs and several candidate genes potentially experiencing positive selection in urban populations of P. leucopus. These candidate genes are involved in xenobiotic metabolism, innate immune response, demethylation activity, and other important biological phenomena in novel urban environments. This study is one of the first to report candidate genes exhibiting signatures of directional selection in divergent urban ecosystems.
Collapse
Affiliation(s)
- Stephen E. Harris
- Program in Ecology, Evolutionary Biology, & Behavior, The Graduate Center, City University of New York (CUNY), New York, New York, United States of America
| | - Jason Munshi-South
- Louis Calder Center, Fordham University, Armonk, New York, United States of America
| | - Craig Obergfell
- Molecular & Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Rachel O’Neill
- Molecular & Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| |
Collapse
|
38
|
Zavodna M, Grueber CE, Gemmell NJ. Parallel tagged next-generation sequencing on pooled samples - a new approach for population genetics in ecology and conservation. PLoS One 2013; 8:e61471. [PMID: 23637841 PMCID: PMC3630221 DOI: 10.1371/journal.pone.0061471] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Accepted: 03/08/2013] [Indexed: 12/02/2022] Open
Abstract
Next-generation sequencing (NGS) on pooled samples has already been broadly applied in human medical diagnostics and plant and animal breeding. However, thus far it has been only sparingly employed in ecology and conservation, where it may serve as a useful diagnostic tool for rapid assessment of species genetic diversity and structure at the population level. Here we undertake a comprehensive evaluation of the accuracy, practicality and limitations of parallel tagged amplicon NGS on pooled population samples for estimating species population diversity and structure. We obtained 16S and Cyt b data from 20 populations of Leiopelma hochstetteri, a frog species of conservation concern in New Zealand, using two approaches - parallel tagged NGS on pooled population samples and individual Sanger sequenced samples. Data from each approach were then used to estimate two standard population genetic parameters, nucleotide diversity (π) and population differentiation (FST), that enable population genetic inference in a species conservation context. We found a positive correlation between our two approaches for population genetic estimates, showing that the pooled population NGS approach is a reliable, rapid and appropriate method for population genetic inference in an ecological and conservation context. Our experimental design also allowed us to identify both the strengths and weaknesses of the pooled population NGS approach and outline some guidelines and suggestions that might be considered when planning future projects.
Collapse
Affiliation(s)
- Monika Zavodna
- Centre for Reproduction and Genomics, Department of Anatomy, University of Otago, Dunedin, New Zealand
| | - Catherine E. Grueber
- Centre for Reproduction and Genomics, Department of Anatomy, University of Otago, Dunedin, New Zealand
- Department of Zoology, University of Otago, Dunedin, New Zealand
- Allan Wilson Centre for Molecular Ecology and Evolution, University of Otago, Dunedin, New Zealand
| | - Neil J. Gemmell
- Centre for Reproduction and Genomics, Department of Anatomy, University of Otago, Dunedin, New Zealand
- Allan Wilson Centre for Molecular Ecology and Evolution, University of Otago, Dunedin, New Zealand
| |
Collapse
|
39
|
Müller K, Storchova H. Transcription of atp1 is influenced by both genomic configuration and nuclear background in the highly rearranged mitochondrial genomes of Silene vulgaris. PLANT MOLECULAR BIOLOGY 2013; 81:495-505. [PMID: 23361622 DOI: 10.1007/s11103-013-0018-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2012] [Accepted: 01/19/2013] [Indexed: 06/01/2023]
Abstract
An extraordinary variation in mitochondrial DNA sequence exists in angiosperm Silene vulgaris. The atp1 gene is flanked by very variable regions, as deduced from four completely sequenced mitochondrial genomes of this species. This diversity contributed to a highly variable transcript profile of this gene observed across S. vulgaris populations. We examined the atp1 transcript in the KOV mitochondrial genome and found three 5' ends, created most likely by the combination of transcription initiation and RNA processing. Most atp1 transcripts terminated about 70 bp upstream of the translation stop codon, which was present in only 10 % of them. Controlled crosses between a KOV mother and a geographically distant pollen donor (Krasnoyarsk, Russia) showed that nuclear background also affected atp1 transcription. The distant pollen donor introduced the factor(s) preventing the formation of a long 2,100 nt-transcript, because this long atp1 transcript reappeared in the progeny from self-crosses. The highly rearranged mitochondrial genomes with a variation in gene flanking regions make S. vulgaris an excellent model for the study of mitochondrial gene expression in plants.
Collapse
Affiliation(s)
- Karel Müller
- Institute of Experimental Botany, Academy of Sciences of the Czech Republic, Prague, Lysolaje, 16502, Czech Republic
| | | |
Collapse
|
40
|
Käfer J, Talianová M, Bigot T, Michu E, Guéguen L, Widmer A, Žlůvová J, Glémin S, Marais GAB. Patterns of molecular evolution in dioecious and non-dioecious Silene. J Evol Biol 2012. [PMID: 23206219 DOI: 10.1111/jeb.12052] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Dioecy (i.e. having separate sexes) is a rather rare breeding system in flowering plants. Such rareness may result from a high probability of extinction in dioecious species because of less efficient dispersal and the costs of sexual selection, which are expected to harm dioecious species' survival on the long term. These handicaps should decrease the effective population size (Ne) of dioecious species, which in turn should reduce the efficacy of selection. Moreover, sexual selection in dioecious species is expected to specifically affect some genes, which will evolve under positive selection. The relative contribution of these effects is currently unknown and we tried to disentangle them by comparing sequence evolution between dioecious and non-dioecious species in the Silene genus (Caryophyllaceae), where dioecy has evolved at least twice. For the dioecious species in the section Melandrium, where dioecy is the oldest, we found a global reduction of purifying selection, while on some, male-biased genes, positive selection was found. For section Otites, where dioecy evolved more recently, we found no significant differences between dioecious and non-dioecious species. Our results are consistent with the view that dioecy is an evolutionary dead end in flowering plants, although other scenarios for explaining reduced Ne cannot be ruled out. Our results also show that contrasting forces act on the genomes of dioecious plants, and suggest that some time is required before the genome of such plants bears the footprints of dioecy.
Collapse
Affiliation(s)
- J Käfer
- Univ Lyon 1, Centre National de la Recherche Scientifique, UMR5558, Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France.
| | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Sloan DB, Müller K, McCauley DE, Taylor DR, Štorchová H. Intraspecific variation in mitochondrial genome sequence, structure, and gene content in Silene vulgaris, an angiosperm with pervasive cytoplasmic male sterility. THE NEW PHYTOLOGIST 2012; 196:1228-1239. [PMID: 23009072 DOI: 10.1111/j.1469-8137.2012.04340.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2012] [Accepted: 08/17/2012] [Indexed: 05/04/2023]
Abstract
In angiosperms, mitochondrial-encoded genes can cause cytoplasmic male sterility (CMS), resulting in the coexistence of female and hermaphroditic individuals (gynodioecy). We compared four complete mitochondrial genomes from the gynodioecious species Silene vulgaris and found unprecedented amounts of intraspecific diversity for plant mitochondrial DNA (mtDNA). Remarkably, only about half of overall sequence content is shared between any pair of genomes. The four mtDNAs range in size from 361 to 429 kb and differ in gene complement, with rpl5 and rps13 being intact in some genomes but absent or pseudogenized in others. The genomes exhibit essentially no conservation of synteny and are highly repetitive, with evidence of reciprocal recombination occurring even across short repeats (< 250 bp). Some mitochondrial genes exhibit atypically high degrees of nucleotide polymorphism, while others are invariant. The genomes also contain a variable number of small autonomously mapping chromosomes, which have only recently been identified in angiosperm mtDNA. Southern blot analysis of one of these chromosomes indicated a complex in vivo structure consisting of both monomeric circles and multimeric forms. We conclude that S. vulgaris harbors an unusually large degree of variation in mtDNA sequence and structure and discuss the extent to which this variation might be related to CMS.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, University of Virginia, Charlottesville, VA, 22904, USA
| | - Karel Müller
- Institute of Experimental Botany, Academy of Sciences of the Czech Republic, Prague, Lysolaje, 16502, Czech Republic
| | - David E McCauley
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Douglas R Taylor
- Department of Biology, University of Virginia, Charlottesville, VA, 22904, USA
| | - Helena Štorchová
- Institute of Experimental Botany, Academy of Sciences of the Czech Republic, Prague, Lysolaje, 16502, Czech Republic
| |
Collapse
|
42
|
Zhang Y, Zhang S, Han S, Li X, Qi L. Transcriptome profiling and in silico analysis of somatic embryos in Japanese larch (Larix leptolepis). PLANT CELL REPORTS 2012; 31:1637-57. [PMID: 22622308 DOI: 10.1007/s00299-012-1277-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Revised: 04/19/2012] [Accepted: 04/20/2012] [Indexed: 05/13/2023]
Abstract
UNLABELLED Japanese larch (Larix leptolepis) is an ecologically and economically important species mainly grown in northeastern China, Japan and Europe. However, erratic flowering and poor germplasm resources caused by high embryo abortion rates have hampered breeding of Larix species. Somatic embryogenesis (SE) is an effective tool for the production of L. leptolepis with desirable characteristics, such as expression of totipotency, preparation of synthetic seeds, and genetic transformation. However, public genomic resources for this species are limited. We sequenced 591,759 raw expressed sequence tags (ESTs) from a 454 sequencing cDNA library of L. leptolepis somatic embryos, resulting in 572,403 high-quality reads. These reads were assembled into 70,927 unique sequences (UniGenes), including 32,321 contigs and 38,606 singletons. After removal of low-quality sequences, 65,115 UniGenes were annotated using the UniProtKB program. Based on their sequence similarity with known proteins, the matched 30,372 sequences from 664 species were estimated to represent approximately 19,000 unique genes. Gene ontology analysis revealed 21,324 UniGenes assigned to 51 categories. By Kyoto Encyclopedia of Genes and Genomes mapping, 25,773 transcripts were associated with 160 biochemical pathways. Further analysis screened four signal transduction pathways represented by 337 enzymes and 17 secondary metabolites. In silico analysis reveals that 207 UniESTs in Larix are homologous to MAPKs genes identified from other model plants, which may be involved in regulating SE development. This study provides an initial insight into the Larix transcriptomes of the pro-embryogenic mass and is a sound basis for future studies. KEY MESSAGE We constructed a large, full-length 454 sequencing cDNA library of Larix leptolepis during somatic embryogenesis. More than 590,000 sequences were obtained and a deep-coverage EST database was constructed.
Collapse
Affiliation(s)
- Yuan Zhang
- Laboratory of Cell Biology, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | | | | | | | | |
Collapse
|
43
|
De novo transcriptome assembly and SNP discovery in the wing polymorphic salt marsh beetle Pogonus chalceus (Coleoptera, Carabidae). PLoS One 2012; 7:e42605. [PMID: 22870338 PMCID: PMC3411629 DOI: 10.1371/journal.pone.0042605] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2012] [Accepted: 07/09/2012] [Indexed: 11/29/2022] Open
Abstract
Background The salt marsh beetle Pogonus chalceus represents a unique opportunity to understand and study the origin and evolution of dispersal polymorphisms as remarkable inter-population divergence in dispersal related traits (e.g. wing development, body size and metabolism) has been shown to persist in face of strong homogenizing gene flow. Sequencing and assembling the transcriptome of P. chalceus is a first step in developing large scale genetic information that will allow us to further study the recurrent phenotypic evolution in dispersal traits in these natural populations. Methodology/Results We used the Illumina HiSeq2000 to sequence 37 Gbases of the transcriptome and performed de novo transcriptome assembly with the Trinity short read assembler. This resulted in 65,766 contigs, clustering into 39,393 unique transcripts (unigenes). A subset of 12,987 show similarity (BLAST) to known proteins in the NCBI database and 7,589 are assigned Gene Ontology (GO). Using homology searches we identified all reported genes involved in wing development, juvenile- and ecdysteroid hormone pathways in Tribolium castaneum. About half (56.7%) of the unique assembled genes are shared among three life stages (third-instar larva, pupa, and imago). We identified 38,141 single nucleotide polymorphisms (SNPs) in these unigenes. Of these SNPs, 26,823 (70.3%) were found in a predicted open reading frame (ORF) and 6,998 (18.3%) were nonsynonymous. Conclusions The assembled transcriptome and SNP data are essential genomic resources for further study of the developmental pathways, genetic mechanisms and metabolic consequences of adaptive divergence in dispersal power in natural populations.
Collapse
|
44
|
Sloan DB, Alverson AJ, Chuckalovcak JP, Wu M, McCauley DE, Palmer JD, Taylor DR. Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates. PLoS Biol 2012; 10:e1001241. [PMID: 22272183 PMCID: PMC3260318 DOI: 10.1371/journal.pbio.1001241] [Citation(s) in RCA: 373] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 11/30/2011] [Indexed: 11/28/2022] Open
Abstract
A pair of species within the genus Silene have evolved the largest known mitochondrial genomes, coinciding with extreme changes in mutation rate, recombination activity, and genome structure. Genome size and complexity vary tremendously among eukaryotic species and their organelles. Comparisons across deeply divergent eukaryotic lineages have suggested that variation in mutation rates may explain this diversity, with increased mutational burdens favoring reduced genome size and complexity. The discovery that mitochondrial mutation rates can differ by orders of magnitude among closely related angiosperm species presents a unique opportunity to test this hypothesis. We sequenced the mitochondrial genomes from two species in the angiosperm genus Silene with recent and dramatic accelerations in their mitochondrial mutation rates. Contrary to theoretical predictions, these genomes have experienced a massive proliferation of noncoding content. At 6.7 and 11.3 Mb, they are by far the largest known mitochondrial genomes, larger than most bacterial genomes and even some nuclear genomes. In contrast, two slowly evolving Silene mitochondrial genomes are smaller than average for angiosperms. Consequently, this genus captures approximately 98% of known variation in organelle genome size. The expanded genomes reveal several architectural changes, including the evolution of complex multichromosomal structures (with 59 and 128 circular-mapping chromosomes, ranging in size from 44 to 192 kb). They also exhibit a substantial reduction in recombination and gene conversion activity as measured by the relative frequency of alternative genome conformations and the level of sequence divergence between repeat copies. The evolution of mutation rate, genome size, and chromosome structure can therefore be extremely rapid and interrelated in ways not predicted by current evolutionary theories. Our results raise the hypothesis that changes in recombinational processes, including gene conversion, may be a central force driving the evolution of both mutation rate and genome structure. A fundamental challenge in evolutionary biology is to explain why organisms exhibit dramatic variation in genome size and complexity. One hypothesis predicts that high rates of mutation in DNA sequence create selection against large and complex genomes, which are more susceptible to mutational disruption. Species of flowering plants in the genus Silene vary by approximately 100-fold in the rates of mutation in their mitochondrial DNA, providing an excellent opportunity to test the predicted effects of high mutation rates on genome evolution. Contrary to expectation, Silene species with elevated mutation rates have experienced dramatic expansions in mitochondrial genome size compared to their slowly evolving relatives, resulting in the largest known mitochondrial genomes. In addition to the increases in size and mutation rate, these genomes also reveal a history of rapid change in genome structure. They have been fragmented into dozens of chromosomes and appear to have experienced major reductions in recombination activity. All of these changes have occurred in just the past few million years. This mitochondrial genome diversity within the genus Silene provides a striking example of rapid genomic change and raises new hypotheses regarding the relationship between mutation rate and genome evolution.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America.
| | | | | | | | | | | | | |
Collapse
|