1
|
Yadav R, Lone SA, Gaikwad K, Singh NK, Padaria JC. Transcriptome sequence analysis and mining of SSRs in Jhar Ber (Ziziphus nummularia (Burm.f.) Wight & Arn) under drought stress. Sci Rep 2018; 8:2406. [PMID: 29402924 PMCID: PMC5799245 DOI: 10.1038/s41598-018-20548-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2017] [Accepted: 01/02/2018] [Indexed: 11/09/2022] Open
Abstract
Ziziphus nummularia (Burm.f.) Wight & Arn., a perennial shrub that thrives in the arid regions, is naturally tolerant to drought. However, there are limited studies on the genomics of drought tolerance in Ziziphus sp. In this study, RNA-sequencing of one month old seedlings treated with PEG 6000 was performed using Roche GS-FLX454 Titanium pyrosequencing. A total of 367,176 raw sequence reads were generated, and upon adapter trimming and quality filtration 351,872 reads were assembled de novo into 32,739 unigenes. Further characterization of the unigenes indicated that 73.25% had significant hits in the protein database. Kyoto encyclopedia of genes and genomes database (KEGG) identified 113 metabolic pathways from the obtained unigenes. A large number of drought-responsive genes were obtained and among them differential gene expression of 16 highly induced genes was validated by qRT-PCR analysis. To develop genic-markers, 3,425 simple sequence repeats (SSRs) were identified in 2,813 unigene sequences. The data generated shall serve as an important reservoir for the identification and characterization of drought stress responsive genes for development of drought tolerant crops.
Collapse
Affiliation(s)
- Radha Yadav
- Biotechnology and Climate Change Group, National Research Centre on Plant Biotechnology (ICAR-NRCPB), New Delhi, 110012, India
| | - Showkat Ahmad Lone
- Biotechnology and Climate Change Group, National Research Centre on Plant Biotechnology (ICAR-NRCPB), New Delhi, 110012, India
- Department of Agricultural Microbiology, Faculty of Agricultural Sciences, Aligarh Muslim University, Aligarh, 202002, India
| | - Kishor Gaikwad
- Biotechnology and Climate Change Group, National Research Centre on Plant Biotechnology (ICAR-NRCPB), New Delhi, 110012, India
| | - Nagendra Kumar Singh
- Biotechnology and Climate Change Group, National Research Centre on Plant Biotechnology (ICAR-NRCPB), New Delhi, 110012, India
| | - Jasdeep Chatrath Padaria
- Biotechnology and Climate Change Group, National Research Centre on Plant Biotechnology (ICAR-NRCPB), New Delhi, 110012, India.
| |
Collapse
|
2
|
Huang J, Zhang C, Zhao X, Fei Z, Wan K, Zhang Z, Pang X, Yin X, Bai Y, Sun X, Gao L, Li R, Zhang J, Li X. The Jujube Genome Provides Insights into Genome Evolution and the Domestication of Sweetness/Acidity Taste in Fruit Trees. PLoS Genet 2016; 12:e1006433. [PMID: 28005948 PMCID: PMC5179053 DOI: 10.1371/journal.pgen.1006433] [Citation(s) in RCA: 89] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Accepted: 10/20/2016] [Indexed: 12/26/2022] Open
Abstract
Jujube (Ziziphus jujuba Mill.) belongs to the Rhamnaceae family and is a popular fruit tree species with immense economic and nutritional value. Here, we report a draft genome of the dry jujube cultivar 'Junzao' and the genome resequencing of 31 geographically diverse accessions of cultivated and wild jujubes (Ziziphus jujuba var. spinosa). Comparative analysis revealed that the genome of 'Dongzao', a fresh jujube, was ~86.5 Mb larger than that of the 'Junzao', partially due to the recent insertions of transposable elements in the 'Dongzao' genome. We constructed eight proto-chromosomes of the common ancestor of Rhamnaceae and Rosaceae, two sister families in the order Rosales, and elucidated the evolutionary processes that have shaped the genome structures of modern jujubes. Population structure analysis revealed the complex genetic background of jujubes resulting from extensive hybridizations between jujube and its wild relatives. Notably, several key genes that control fruit organic acid metabolism and sugar content were identified in the selective sweep regions. We also identified S-locus genes controlling gametophytic self-incompatibility and investigated haplotype patterns of the S locus in the jujube genomes, which would provide a guideline for parent selection for jujube crossbreeding. This study provides valuable genomic resources for jujube improvement, and offers insights into jujube genome evolution and its population structure and domestication.
Collapse
Affiliation(s)
- Jian Huang
- College of Forestry, Northwest A&F University, Yangling, China
- Center for Jujube Engineering and Technology of State Forestry Administration, Northwest A&F University, Yangling, China
| | - Chunmei Zhang
- College of Forestry, Northwest A&F University, Yangling, China
- Center for Jujube Engineering and Technology of State Forestry Administration, Northwest A&F University, Yangling, China
| | - Xing Zhao
- Novogene Bioinformatics Institute, Beijing, China
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, New York, United States of America
| | - KangKang Wan
- Novogene Bioinformatics Institute, Beijing, China
| | - Zhong Zhang
- College of Forestry, Northwest A&F University, Yangling, China
- Center for Jujube Engineering and Technology of State Forestry Administration, Northwest A&F University, Yangling, China
| | - Xiaoming Pang
- College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Xiao Yin
- College of Forestry, Northwest A&F University, Yangling, China
| | - Yang Bai
- Boyce Thompson Institute, Cornell University, Ithaca, New York, United States of America
| | - Xiaoqing Sun
- Novogene Bioinformatics Institute, Beijing, China
| | - Lizhi Gao
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Ruiqiang Li
- Novogene Bioinformatics Institute, Beijing, China
| | - Jinbo Zhang
- Novogene Bioinformatics Institute, Beijing, China
| | - Xingang Li
- College of Forestry, Northwest A&F University, Yangling, China
- Center for Jujube Engineering and Technology of State Forestry Administration, Northwest A&F University, Yangling, China
| |
Collapse
|
3
|
Zeng T, Zhang L, Li J, Wang D, Tian Y, Lu L. De novo assembly and characterization of Muscovy duck liver transcriptome and analysis of differentially regulated genes in response to heat stress. Cell Stress Chaperones 2015; 20:483-93. [PMID: 25663538 PMCID: PMC4406934 DOI: 10.1007/s12192-015-0573-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2014] [Revised: 12/28/2014] [Accepted: 01/14/2015] [Indexed: 01/20/2023] Open
Abstract
High temperature is a major abiotic stress limiting animal growth and productivity worldwide. The Muscovy duck (Cairina moschata), sometimes called the Barbary drake, is a type of duck with a fairly unusual domestication history. In Southeast Asia, duck meat is one of the top meats consumed, and as such, the production of the meat is an important topic of research. The transcriptomic and genomic data presently available are insufficient to understanding the molecular mechanism underlying the heat tolerance of Muscovy ducks. Thus, transcriptome and expression profiling data for this species are required as important resource for identifying genes and developing molecular marker. In this study, de novo transcriptome assembly and gene expression analysis using Illumina sequencing technology were performed. More than 225 million clean reads were generated and assembled into 36,903 unique transcripts with an average length of 1,135 bp. A total of 21,221 (57.50 %) unigenes were annotated. Gene Ontology (GO) analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with transcription, signal transduction, and apoptosis. We also performed gene expression profiling analysis upon heat treatment in Muscovy ducks and identified 470 heat-response unique transcripts. GO term enrichment showed that protein folding and chaperone binding were significant enrichment, whereas KEGG pathway analyses showed that Ras and MAPKs were activated after heat stress in Muscovy ducks. Our research enriched sequences information of Muscovy duck, provided novel insights into responses to heat stress in these ducks, and serve as candidate genes or markers that can be used to guide future efforts to breed heat-tolerant duck strains.
Collapse
Affiliation(s)
- Tao Zeng
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Liping Zhang
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
- />College of Chemistry and Life Sciences, Zhejiang Normal University, Jinhua, 321004 China
| | - Jinjun Li
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Deqian Wang
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Yong Tian
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Lizhi Lu
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| |
Collapse
|
4
|
Addisalem AB, Esselink GD, Bongers F, Smulders MJM. Genomic sequencing and microsatellite marker development for Boswellia papyrifera, an economically important but threatened tree native to dry tropical forests. AOB PLANTS 2015; 7:plu086. [PMID: 25573702 PMCID: PMC4433549 DOI: 10.1093/aobpla/plu086] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2014] [Accepted: 12/08/2014] [Indexed: 06/01/2023]
Abstract
Microsatellite (or simple sequence repeat, SSR) markers are highly informative DNA markers often used in conservation genetic research. Next-generation sequencing enables efficient development of large numbers of SSR markers at lower costs. Boswellia papyrifera is an economically important tree species used for frankincense production, an aromatic resinous gum exudate from bark. It grows in dry tropical forests in Africa and is threatened by a lack of rejuvenation. To help guide conservation efforts for this endangered species, we conducted an analysis of its genomic DNA sequences using Illumina paired-end sequencing. The genome size was estimated at 705 Mb per haploid genome. The reads contained one microsatellite repeat per 5.7 kb. Based on a subset of these repeats, we developed 46 polymorphic SSR markers that amplified 2-12 alleles in 10 genotypes. This set included 30 trinucleotide repeat markers, four tetranucleotide repeat markers, six pentanucleotide markers and six hexanucleotide repeat markers. Several markers were cross-transferable to Boswellia pirrotae and B. popoviana. In addition, retrotransposons were identified, the reads were assembled and several contigs were identified with similarity to genes of the terpene and terpenoid backbone synthesis pathways, which form the major constituents of the bark resin.
Collapse
Affiliation(s)
- A B Addisalem
- Wageningen UR Plant Breeding, Wageningen University and Research Center, PO Box 386, NL-6700 AJ Wageningen, The Netherlands Center for Ecosystem Studies, Forest Ecology and Forest Management Group, Wageningen University and Research Center, PO Box 47, NL-6700 AA Wageningen, The Netherlands Wondo Genet College of Forestry and Natural Resources, PO Box 128, Shashemene, Ethiopia
| | - G Danny Esselink
- Wageningen UR Plant Breeding, Wageningen University and Research Center, PO Box 386, NL-6700 AJ Wageningen, The Netherlands
| | - F Bongers
- Center for Ecosystem Studies, Forest Ecology and Forest Management Group, Wageningen University and Research Center, PO Box 47, NL-6700 AA Wageningen, The Netherlands
| | - M J M Smulders
- Wageningen UR Plant Breeding, Wageningen University and Research Center, PO Box 386, NL-6700 AJ Wageningen, The Netherlands
| |
Collapse
|
5
|
Pan BZ, Chen MS, Ni J, Xu ZF. Transcriptome of the inflorescence meristems of the biofuel plant Jatropha curcas treated with cytokinin. BMC Genomics 2014; 15:974. [PMID: 25400171 PMCID: PMC4246439 DOI: 10.1186/1471-2164-15-974] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2014] [Accepted: 10/29/2014] [Indexed: 12/21/2022] Open
Abstract
Background Jatropha curcas, whose seed content is approximately 30–40% oil, is an ideal feedstock for producing biodiesel and bio-jet fuels. However, Jatropha plants have a low number of female flowers, which results in low seed yield that cannot meet the needs of the biofuel industry. Thus, increasing the number of female flowers is critical for the improvement of Jatropha seed yield. Our previous findings showed that cytokinin treatment can increase the flower number and female to male ratio and also induce bisexual flowers in Jatropha. The mechanisms underlying the influence of cytokinin on Jatropha flower development and sex determination, however, have not been clarified. Results This study examined the transcriptional levels of genes involved in the response to cytokinin in Jatropha inflorescence meristems at different time points after cytokinin treatment by 454 sequencing, which gave rise to a total of 294.6 Mb of transcript sequences. Up-regulated and down-regulated annotated and novel genes were identified, and the expression levels of the genes of interest were confirmed by qRT-PCR. The identified transcripts include those encoding genes involved in the biosynthesis, metabolism, and signaling of cytokinin and other plant hormones, flower development and cell division, which may be related to phenotypic changes of Jatropha in response to cytokinin treatment. Our analysis indicated that Jatropha orthologs of the floral organ identity genes known as ABCE model genes, JcAP1,2, JcPI, JcAG, and JcSEP1,2,3, were all significantly repressed, with an exception of one B-function gene JcAP3 that was shown to be up-regulated by BA treatment, indicating different mechanisms to be involved in the floral organ development of unisexual flowers of Jatropha and bisexual flowers of Arabidopsis. Several cell division-related genes, including JcCycA3;2, JcCycD3;1, JcCycD3;2 and JcTSO1, were up-regulated, which may contribute to the increased flower number after cytokinin treatment. Conclusions This study presents the first report of global expression patterns of cytokinin-regulated transcripts in Jatropha inflorescence meristems. This report laid the foundation for further mechanistic studies on Jatropha and other non-model plants responding to cytokinin. Moreover, the identification of functional candidate genes will be useful for generating superior varieties of high-yielding transgenic Jatropha. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-974) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | - Zeng-Fu Xu
- Key Laboratory of Tropical Plant Resources and Sustainable Use, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Yunnan, Menglun 666303, People's Republic of China.
| |
Collapse
|
6
|
De novo assembly and characterization of the fruit transcriptome of Chinese jujube (Ziziphus jujuba Mill.) Using 454 pyrosequencing and the development of novel tri-nucleotide SSR markers. PLoS One 2014; 9:e106438. [PMID: 25184704 PMCID: PMC4153635 DOI: 10.1371/journal.pone.0106438] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Accepted: 07/29/2014] [Indexed: 12/14/2022] Open
Abstract
Chinese jujube (Ziziphus jujuba Mill.) is an economically important deciduous tree that has high therapeutic value and health benefits. However, a lack of sequence data and molecular markers have constrained genetic and breeding studies for better fruit quality and other traits in Chinese jujube. In this study, two combined cDNA libraries of ‘Dongzao’ fruit representing the early and late stages of fruit development were constructed and sequenced on the 454 GS FLX Titanium platform. In total, 1,124,197 reads were generated and then de novo assembled into 97,479 unigenes. A total of 52,938 unigenes were homologous to genes in the NCBI non-redundant sequence database. A total of 33,123 unigenes were assigned to one or more Gene Ontology terms, and 16,693 unigenes were classified into 319 Kyoto Encyclopedia of Genes and Genomes pathways. The results showed that the Smirnoff-Wheeler pathway was the main pathway for the biosynthesis of ascorbic acid in Chinese jujube. The number of differentially expressed genes between the two stages of fruit development was 1,764, among which 974 and 790 genes were up-regulated and down-regulated, respectively. Furthermore, 9,893 sequences were identified containing SSRs. 93 primer pairs designed from the sequences with a tri-nucleotide repeat showed successful PCR amplification and could be validated in Chinese jujube accessions and Z. mauritiana Lam and Z. acidojujuba as well, of which 71 primer pairs were polymorphic. The obtained transcriptome provides a most comprehensive resource currently available for gene discovery and the development of functional markers in Z. jujuba. The newly developed microsatellite markers could be used in applications such as genetic linkage analysis and association studies, diversity analysis, and marker-assisted selection in Chinese jujube and related species.
Collapse
|
7
|
Qualitative de novo analysis of full length cDNA and quantitative analysis of gene expression for common marmoset (Callithrix jacchus) transcriptomes using parallel long-read technology and short-read sequencing. PLoS One 2014; 9:e100936. [PMID: 24977701 PMCID: PMC4076266 DOI: 10.1371/journal.pone.0100936] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Accepted: 05/28/2014] [Indexed: 12/24/2022] Open
Abstract
The common marmoset (Callithrix jacchus) is a non-human primate that could prove useful as human pharmacokinetic and biomedical research models. The cytochromes P450 (P450s) are a superfamily of enzymes that have critical roles in drug metabolism and disposition via monooxygenation of a broad range of xenobiotics; however, information on some marmoset P450s is currently limited. Therefore, identification and quantitative analysis of tissue-specific mRNA transcripts, including those of P450s and flavin-containing monooxygenases (FMO, another monooxygenase family), need to be carried out in detail before the marmoset can be used as an animal model in drug development. De novo assembly and expression analysis of marmoset transcripts were conducted with pooled liver, intestine, kidney, and brain samples from three male and three female marmosets. After unique sequences were automatically aligned by assembling software, the mean contig length was 718 bp (with a standard deviation of 457 bp) among a total of 47,883 transcripts. Approximately 30% of the total transcripts were matched to known marmoset sequences. Gene expression in 18 marmoset P450- and 4 FMO-like genes displayed some tissue-specific patterns. Of these, the three most highly expressed in marmoset liver were P450 2D-, 2E-, and 3A-like genes. In extrahepatic tissues, including brain, gene expressions of these monooxygenases were lower than those in liver, although P450 3A4 (previously P450 3A21) in intestine and P450 4A11- and FMO1-like genes in kidney were relatively highly expressed. By means of massive parallel long-read sequencing and short-read technology applied to marmoset liver, intestine, kidney, and brain, the combined next-generation sequencing analyses reported here were able to identify novel marmoset drug-metabolizing P450 transcripts that have until now been little reported. These results provide a foundation for mechanistic studies and pave the way for the use of marmosets as model animals for drug development in the future.
Collapse
|
8
|
Dong S, Liu Y, Niu J, Ning Y, Lin S, Zhang Z. De novo transcriptome analysis of the Siberian apricot (Prunus sibirica L.) and search for potential SSR markers by 454 pyrosequencing. Gene 2014; 544:220-7. [PMID: 24746601 DOI: 10.1016/j.gene.2014.04.031] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Revised: 04/10/2014] [Accepted: 04/17/2014] [Indexed: 11/29/2022]
Abstract
The Siberian apricot, an economically and ecologically important plant in China, contains seeds high in oil and can grow on marginal land. Although this species has multiple purposes and may be a feedstock of biofuel in China, transcriptome information and molecular research on this species remain limited. RNA-Seq technology has been widely applied to transcriptomics, genomics and the development of molecular markers, and functional gene studies. In this study, we obtained 1,243,067 high-quality reads with a mean size of 425 bp in a single run, totaling 528.4 Mb of sequence data using 454 GS FLX Titanium sequencing. All reads were assembled de novo into 46,940 unigenes with a mean size of 651 bp (range: 45-5566 bp). Assembled unigenes were annotated in multiple public databases based on similarity alignments to genes and proteins. 191 unigenes involving in lipid biosynthesis and metabolism were found, among them, expression patterns of two desaturase enzymes were analyzed by quantitative real-time polymerase chain reaction (qRT-PCR), based on six tissues from Siberian apricot, the seeds had the highest expression. 7304 simple sequence repeats (SSR) were identified from 6509 unigenes, a total of 9930 primer pairs were designed, 50 primer pairs were randomly selected to validate of the usefulness, and 24 (48%) primer pairs produced bands of the expected size. These data provide a base of sequence information to improve agronomic characters and molecular marker-assisted breeding to alter the composition of fatty acids in seeds from this plant, and hence, facilitate its utilization as a future biodiesel feedstock.
Collapse
Affiliation(s)
- Shubin Dong
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Yulin Liu
- Key Laboratory for Genetics and Breeding of Forest Trees and Ornamental Plants of Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Jun Niu
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Yu Ning
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Shanzhi Lin
- Key Laboratory for Genetics and Breeding of Forest Trees and Ornamental Plants of Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Zhixiang Zhang
- Lab of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forestry University, Beijing 100083, China.
| |
Collapse
|
9
|
Nagpure NS, Rashid I, Pathak AK, Singh M, Singh SP, Sarkar UK. In silico analysis of SSRs in mitochondrial genomes of fishes. ACTA ACUST UNITED AC 2014; 26:195-201. [PMID: 24660911 DOI: 10.3109/19401736.2014.892075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The availability of fish mitochondrial (mt) genomes provides an opportunity to explore the simple sequence repeats. In the present study, mt genomes of 85 fish species reported from Indian subcontinent were downloaded from NCBI and computationally analysed for finding SSRs types, frequency of occurrence, mutation and evolutionary adaptation across species. A total of 92 microsatellites in different nucleotide combinations were detected in 59 species. 26 interspersed SSRs, mostly poly (AT)n were found in the D-loop regions in the species of Cyprinidae. Fifty-six SSRs of 12 bp fixed length were observed in eight genes only. Further, identical repeat motifs were found on the same location in ATP6 and ND4 genes, which were biased towards particular habitat. The comparison of ATP6 and ND4 gene sets to other homologous sequences showed point mutations. This study explores the SSRs discovery and their utility as marker for species and population identification.
Collapse
|
10
|
Izzah NK, Lee J, Jayakodi M, Perumal S, Jin M, Park BS, Ahn K, Yang TJ. Transcriptome sequencing of two parental lines of cabbage (Brassica oleracea L. var. capitata L.) and construction of an EST-based genetic map. BMC Genomics 2014; 15:149. [PMID: 24559437 PMCID: PMC3936860 DOI: 10.1186/1471-2164-15-149] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2013] [Accepted: 02/17/2014] [Indexed: 12/27/2022] Open
Abstract
Background Expressed sequence tag (EST)-based markers are preferred because they reflect transcribed portions of the genome. We report the development of simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers derived from transcriptome sequences in cabbage, and their utility for map construction. Results Transcriptome sequences were obtained from two cabbage parental lines, C1184 and C1234, which are susceptible and resistant to black rot disease, respectively, using the 454 platform. A total of 92,255 and 127,522 reads were generated and clustered into 34,688 and 40,947 unigenes, respectively. We identified 2,405 SSR motifs from the unigenes of the black rot-resistant parent C1234. Trinucleotide motifs were the most abundant (66.15%) among the repeat motifs. In addition, 1,167 SNPs were detected between the two parental lines. A total of 937 EST-based SSR and 97 SNP-based dCAPS markers were designed and used for detection of polymorphism between parents. Using an F2 population, we built a genetic map comprising 265 loci, and consisting of 98 EST-based SSRs, 21 SNP-based dCAPS, 55 IBP markers derived from B. rapa genome sequence and 91 public SSRs, distributed on nine linkage groups spanning a total of 1,331.88 cM with an average distance of 5.03 cM between adjacent loci. The parental lines used in this study are elite breeding lines with little genetic diversity; therefore, the markers that mapped in our genetic map will have broad spectrum utility. Conclusions This genetic map provides additional genetic information to the existing B. oleracea map. Moreover, the new set of EST-based SSR and dCAPS markers developed herein is a valuable resource for genetic studies and will facilitate cabbage breeding. Additionally, this study demonstrates the usefulness of NGS transcriptomes for the development of genetic maps even with little genetic diversity in the mapping population.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Tae-Jin Yang
- Department of Plant Science, Plant Genomics and Breeding Institute, and Research Institute for Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University, Seoul 151-921, Republic of Korea.
| |
Collapse
|
11
|
Luo C, Zhang Q, Luo Z. Genome-wide transcriptome analysis of Chinese pollination-constant nonastringent persimmon fruit treated with ethanol. BMC Genomics 2014; 15:112. [PMID: 24507483 PMCID: PMC3922992 DOI: 10.1186/1471-2164-15-112] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2013] [Accepted: 02/04/2014] [Indexed: 11/10/2022] Open
Abstract
Background The persimmon Diospyros kaki Thunb. is an important commercial and deciduous fruit tree. The fruits have proanthocyanidin (PA) content of >25% of the dry weight and are astringent. PAs cause astringency that is often undesirable for human consumption; thus, the removal of astringency is an important practice in the persimmon industry. Soluble PAs can be converted to insoluble PAs by enclosing the fruit in a polyethylene bag containing diluted ethanol. The genomic resource development of the persimmon is delayed because of its large and complex genome. Second-generation sequencing is an efficient technique for generating huge sequences that can represent a large number of genes and their expression levels. Results We used 454 sequencing for the de novo transcriptome assembly of persimmon fruit treated with 5% ethanol (Tr library) and without treatment as the control (Co library) to investigate the genes and pathways that control PA biosynthesis and other secondary metabolites. We obtained 374.6 Mb in clean nucleotides comprising 624,690 and 626,203 clean sequencing reads from the Tr and Co libraries, respectively. We also identified 83,898 unigenes; 54,719 (~65.2%) unigenes were annotated based on similarity searches with known proteins. Up to 14,954 of the unigenes were assigned to the protein database Clusters of Orthologous Groups (COG), 24,337 were assigned to the term annotation database of Gene Ontology (GO), and 45,506 were assigned to 200 pathways in the database of Kyoto Encyclopedia of Genes and Genomes (KEGG). The two libraries were compared to identify the differentially expressed unigenes. The expression levels of genes involved in PA biosynthesis and tannin coagulation were analysed, and some of them were verified using quantitative real time PCR (qRT-PCR). Conclusions This study provides abundant genomic data for persimmon and offers comprehensive sequence resources for persimmon research. The transcriptome dataset will improve our understanding of the molecular mechanisms of tannin coagulation and other biochemical processes in persimmons.
Collapse
Affiliation(s)
| | | | - Zhengrong Luo
- Key Laboratory of Horticultural Plant Biology (MOE), Huazhong Agricultural University, 430070 Wuhan, China.
| |
Collapse
|
12
|
Ma Q, Feng K, Yang W, Chen Y, Yu F, Yin T. Identification and characterization of nucleotide variations in the genome of Ziziphus jujuba (Rhamnaceae) by next generation sequencing. Mol Biol Rep 2014; 41:3219-23. [PMID: 24477589 DOI: 10.1007/s11033-014-3184-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2013] [Accepted: 01/18/2014] [Indexed: 10/25/2022]
Abstract
In this study, single-nucleotide polymorphisms (SNPs) and insertions/deletions (InDels) in the genome of Ziziphus jujuba were identified using sequences generated by the Roche 454 GS-FLX sequencer. A total of, 573,141 reads were produced with an average read length of 360 bp. After quality control, 258,754 of the filtered reads were assembled into 23,864 contigs, and 293,458 remained as singletons. Using the contig assemblies as a reference, 17,160 SNPs and 478 InDels were identified. Among the SNPs, transitions occurred three times more frequently than transversions. In transitions, the number of C/T and G/A transitions was similar. Among the transversions, A/T was the most abundant, and C/G was much rarer than any of the other types of transversions, accounting for only about half the numbers of A/C, A/T and G/T transversions. For the InDels, mononucleotide changes amounted to 64.4% of the total number of InDels. In general, the frequency of detected InDels decreased as the length of the InDels increased. This study provides valuable marker resources for future genetic studies of Ziziphus spp.
Collapse
Affiliation(s)
- Qiuyue Ma
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, 159 Longpan Road, Nanjing, 210037, China
| | | | | | | | | | | |
Collapse
|
13
|
Hackenberg M, Shi BJ, Gustafson P, Langridge P. Characterization of phosphorus-regulated miR399 and miR827 and their isomirs in barley under phosphorus-sufficient and phosphorus-deficient conditions. BMC PLANT BIOLOGY 2013; 13:214. [PMID: 24330740 PMCID: PMC3878733 DOI: 10.1186/1471-2229-13-214] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Accepted: 11/16/2013] [Indexed: 05/20/2023]
Abstract
BACKGROUND miR399 and miR827 are both involved in conserved phosphorus (P) deficiency signalling pathways. miR399 targets the PHO2 gene encoding E2 enzyme that negatively regulates phosphate uptake and root-to-shoot allocation, while miR827 targets SPX-domain-containing genes that negatively regulate other P-responsive genes. However, the response of miR399 and miR827 to P conditions in barley has not been investigated. RESULTS In this study, we investigated the expression profiles of miR399 and miR827 in barley (Hordeum vulagre L.) under P-deficient and P-sufficient conditions. We identified 10 members of the miR399 family and one miR827 gene in barley, all of which were significantly up-regulated under deficient P. In addition, we found many isomirs of the miR399 family and miR827, most of which were also significantly up-regulated under deficient P. Several isomirs of miR399 members were found to be able to cleave their predicted targets in vivo. Surprisingly, a few small RNAs (sRNAs) derived from the single-stranded loops of the hairpin structures of MIR399b and MIR399e-1 were also found to be able to cleave their predicted targets in vivo. Many antisense sRNAs of miR399 and a few for miR827 were also detected, but they did not seem to be regulated by P. Intriguingly, the lowest expressed member, hvu-miR399k, had four-fold more antisense sRNAs than sense sRNAs, and furthermore under P sufficiency, the antisense sRNAs are more frequent than the sense sRNAs. We identified a potential regulatory network among miR399, its target HvPHO2 and target mimics HvIPS1 and HvIPS2 in barley under P-deficient and P-sufficient conditions. CONCLUSIONS Our data provide an important insight into the mechanistic regulation and function of miR399, miR827 and their isomirs in barley under different P conditions.
Collapse
Affiliation(s)
- Michael Hackenberg
- Computational Genomics and Bioinformatics Group, Genetics Department, University of Granada, 18071 Granada, Spain
| | - Bu-Jun Shi
- Australian Centre for Plant Functional Genomics, The University of Adelaide, Urrbrae, South Australia 5064, Australia
| | - Perry Gustafson
- USDA-ARS, 206 Curtis Hall, University of Missouri, Columbia, MO 65211-7020, USA
| | - Peter Langridge
- Australian Centre for Plant Functional Genomics, The University of Adelaide, Urrbrae, South Australia 5064, Australia
| |
Collapse
|
14
|
Huang D, Zhang Y, Jin M, Li H, Song Z, Wang Y, Chen J. Characterization and high cross-species transferability of microsatellite markers from the floral transcriptome of Aspidistra saxicola (Asparagaceae). Mol Ecol Resour 2013; 14:569-77. [PMID: 24286608 DOI: 10.1111/1755-0998.12197] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2012] [Revised: 10/28/2013] [Accepted: 10/28/2013] [Indexed: 11/29/2022]
Abstract
Recent studies utilizing transcriptome sequences generated by next-generation sequencing (NGS) technologies have demonstrated the ability to rapidly detect and characterize thousands of gene-based microsatellites from different plants. However, these simple sequence repeats (SSRs) were seldom used directly to test interspecific transferability in the populations of closely related species. Aspidistra Ker-Gawl. is a monocot genus with high species richness and diversity in flower structure, but its fresh floral materials are not easy to obtain. Until now, little is known about genetic background in the species of Aspidistra, quite apart from the fearful reduction of their natural habitats. In this study, the floral transcriptome of Aspidistra saxicola was obtained using NGS. Based on these data, a total of 5527 SSRs were identified in the unigenes. Among these SSRs, the proportions of di- and tri-nucleotide repeats were quite close (49.6% verse 46.8%), and the most tri-nucleotide repeats were AGG/CCT followed by AAG/CTT and AGC/GCT in A. saxicola, showing distinct differences with other angiosperm species. To assess genetic diversity in the species of Aspidistra, 48 SSR loci were tested in four available populations of A. elatior. The results revealed that more than a third of the loci were polymorphic. The majority of these primers could be amplified in 24 species representing the main clades of Aspidistra. The primer subsets from transcriptome data proved highly useful for detecting polymorphisms in the related species, supporting the finding that NGS is an efficient approach to molecular marker development at both intra- and interspecies levels, especially in endangered nonmodel species.
Collapse
Affiliation(s)
- Danni Huang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, Institute of Biodiversity Science, School of Life Sciences, Fudan University, Shanghai 200433, China
| | | | | | | | | | | | | |
Collapse
|
15
|
Xia JH, Liu P, Liu F, Lin G, Sun F, Tu R, Yue GH. Analysis of stress-responsive transcriptome in the intestine of Asian seabass (Lates calcarifer) using RNA-seq. DNA Res 2013; 20:449-60. [PMID: 23761194 PMCID: PMC3789556 DOI: 10.1093/dnares/dst022] [Citation(s) in RCA: 77] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2012] [Accepted: 05/08/2013] [Indexed: 12/20/2022] Open
Abstract
Identification of differentially expressed genes (DEGs) and regulated pathways in response to stressors using a whole-genome approach is critical to understanding the mechanisms underlying stress responses. We challenged Asian seabass with lipopolysaccharide (LPS), Vibrio harveyi, high salinity and fasting, and sequenced six cDNA libraries of intestine samples using Roche 454 RNA-seq. Over 1 million reads (average size: 516 bp) were obtained. The de novo assembly obtained 83 911 unisequences with an average length of 747 bp. In total, 62.3% of the unisequences were annotated. We observed overall similar expression profiles among different challenges, while a number of DEGs and regulated pathways were identified under specific challenges. More than 1000 DEGs and over 200 regulated pathways for each stressor were identified. Thirty-seven genes were differentially expressed in response to all challenges. Our data suggest that there is a global coordination and fine-tuning of gene regulation during different challenges. In addition, we detected dramatic immune responses in intestines under different stressors. This study is the first step towards the comprehensive understanding of the mechanisms underlying stress responses and supplies significant transcriptome resources for studying biological questions in non-model fish species.
Collapse
Affiliation(s)
- Jun Hong Xia
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
| | - Peng Liu
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
| | - Feng Liu
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
| | - Grace Lin
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
| | - Fei Sun
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
| | - Rongjian Tu
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
| | - Gen Hua Yue
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore117604, Singapore
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117543, Singapore
| |
Collapse
|
16
|
Bhardwaj J, Chauhan R, Swarnkar MK, Chahota RK, Singh AK, Shankar R, Yadav SK. Comprehensive transcriptomic study on horse gram (Macrotyloma uniflorum): De novo assembly, functional characterization and comparative analysis in relation to drought stress. BMC Genomics 2013; 14:647. [PMID: 24059455 PMCID: PMC3853109 DOI: 10.1186/1471-2164-14-647] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 09/13/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Drought tolerance is an attribute maintained in plants by cross-talk between multiple and cascading metabolic pathways. Without a sequenced genome available for horse gram, it is difficult to comprehend such complex networks and intercalated genes associated with drought tolerance of horse gram (Macrotyloma uniflorum). Therefore, de novo transcriptome discovery and associated analyses was done for this highly drought tolerant yet under exploited legume to decipher its genetic makeup. RESULTS Eight samples comprising of shoot and root tissues of two horse gram genotypes (drought-sensitive; M-191 and drought-tolerant; M-249) were used for comparison under control and polyethylene glycol-induced drought stress conditions. Using Illumina sequencing technology, a total of 229,297,896 paired end read pairs were generated and utilized for de novo assembly of horse gram. Significant BLAST hits were obtained for 26,045 transcripts while, 3,558 transcripts had no hits but contained important conserved domains. A total of 21,887 unigenes were identified. SSRs containing sequences covered 16.25% of the transcriptome with predominant tri- and mono-nucleotides (43%). The total GC content of the transcriptome was found to be 43.44%. Under Gene Ontology response to stimulus, DNA binding and catalytic activity was highly expressed during drought stress conditions. Serine/threonine protein kinase was found to dominate in Enzyme Classification while pathways belonging to ribosome metabolism followed by plant pathogen interaction and plant hormone signal transduction were predominant in Kyoto Encyclopedia of Genes and Genomes analysis. Independent search on plant metabolic network pathways suggested valine degradation, gluconeogenesis and purine nucleotide degradation to be highly influenced under drought stress in horse gram. Transcription factors belonging to NAC, MYB-related, and WRKY families were found highly represented under drought stress. qRT-PCR validated the expression profile for 9 out of 10 genes analyzed in response to drought stress. CONCLUSIONS De novo transcriptome discovery and analysis has generated enormous information over horse gram genomics. The genes and pathways identified suggest efficient regulation leading to active adaptation as a basal defense response against drought stress by horse gram. The knowledge generated can be further utilized for exploring other underexploited plants for stress responsive genes and improving plant tolerance.
Collapse
Affiliation(s)
- Jyoti Bhardwaj
- Plant Metabolic Engineering Laboratory, Council of Scientific and Industrial Research-Institute of Himalayan Bioresource Technology, Palampur 176061, HP, India.
| | | | | | | | | | | | | |
Collapse
|
17
|
Liu Y, Huang Z, Ao Y, Li W, Zhang Z. Transcriptome analysis of yellow horn (Xanthoceras sorbifolia Bunge): a potential oil-rich seed tree for biodiesel in China. PLoS One 2013; 8:e74441. [PMID: 24040247 PMCID: PMC3770547 DOI: 10.1371/journal.pone.0074441] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2013] [Accepted: 08/02/2013] [Indexed: 11/27/2022] Open
Abstract
Background Yellow horn (Xanthoceras sorbifolia Bunge) is an oil-rich seed shrub that grows well in cold, barren environments and has great potential for biodiesel production in China. However, the limited genetic data means that little information about the key genes involved in oil biosynthesis is available, which limits further improvement of this species. In this study, we describe sequencing and de novo transcriptome assembly to produce the first comprehensive and integrated genomic resource for yellow horn and identify the pathways and key genes related to oil accumulation. In addition, potential molecular markers were identified and compiled. Methodology/Principal Findings Total RNA was isolated from 30 plants from two regions, including buds, leaves, flowers and seeds. Equal quantities of RNA from these tissues were pooled to construct a cDNA library for 454 pyrosequencing. A total of 1,147,624 high-quality reads with total and average lengths of 530.6 Mb and 462 bp, respectively, were generated. These reads were assembled into 51,867 unigenes, corresponding to a total of 36.1 Mb with a mean length, N50 and median of 696, 928 and 570 bp, respectively. Of the unigenes, 17,541 (33.82%) were unmatched in any public protein databases. We identified 281 unigenes that may be involved in de novo fatty acid (FA) and triacylglycerol (TAG) biosynthesis and metabolism. Furthermore, 6,707 SSRs, 16,925 SNPs and 6,201 InDels with high-confidence were also identified in this study. Conclusions This transcriptome represents a new functional genomics resource and a foundation for further studies on the metabolic engineering of yellow horn to increase oil content and modify oil composition. The potential molecular markers identified in this study provide a basis for polymorphism analysis of Xanthoceras, and even Sapindaceae; they will also accelerate the process of breeding new varieties with better agronomic characteristics.
Collapse
Affiliation(s)
- Yulin Liu
- College of Biological Science and Biotechnology, Beijing Forest University, Beijing, China
| | - Zhedong Huang
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forest University, Beijing, China
| | - Yan Ao
- Academy of Forest, Beijing Forest University, Beijing, China
| | - Wei Li
- College of Biological Science and Biotechnology, Beijing Forest University, Beijing, China
- * E-mail: (WL); (ZXZ)
| | - Zhixiang Zhang
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, College of Nature Conservation, Beijing Forest University, Beijing, China
- * E-mail: (WL); (ZXZ)
| |
Collapse
|
18
|
Qi J, Zheng N, Zhang B, Sun P, Hu S, Xu W, Ma Q, Zhao T, Zhou L, Qin M, Li X. Mining genes involved in the stratification of Paris polyphylla seeds using high-throughput embryo transcriptome sequencing. BMC Genomics 2013; 14:358. [PMID: 23718911 PMCID: PMC3679829 DOI: 10.1186/1471-2164-14-358] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2012] [Accepted: 05/27/2013] [Indexed: 12/31/2022] Open
Abstract
Background Paris polyphylla var. yunnanensis is an important medicinal plant. Seed dormancy is one of the main factors restricting artificial cultivation. The molecular mechanisms of seed dormancy remain unclear, and little genomic or transcriptome data are available for this plant. Results In this study, massive parallel pyrosequencing on the Roche 454-GS FLX Titanium platform was used to generate a substantial sequence dataset for the P. polyphylla embryo. 369,496 high quality reads were obtained, ranging from 50 to 1146 bp, with a mean of 219 bp. These reads were assembled into 47,768 unigenes, which included 16,069 contigs and 31,699 singletons. Using BLASTX searches of public databases, 15,757 (32.3%) unique transcripts were identified. Gene Ontology and Cluster of Orthologous Groups of proteins annotations revealed that these transcripts were broadly representative of the P. polyphylla embryo transcriptome. The Kyoto Encyclopedia of Genes and Genomes assigned 5961 of the unique sequences to specific metabolic pathways. Relative expression levels analysis showed that eleven phytohormone-related genes and five other genes have different expression patterns in the embryo and endosperm in the seed stratification process. Conclusions Gene annotation and quantitative RT-PCR expression analysis identified 464 transcripts that may be involved in phytohormone catabolism and biosynthesis, hormone signal, seed dormancy, seed maturation, cell wall growth and circadian rhythms. In particular, the relative expression analysis of sixteen genes (CYP707A, NCED, GA20ox2, GA20ox3, ABI2, PP2C, ARP3, ARP7, IAAH, IAAS, BRRK, DRM, ELF1, ELF2, SFR6, and SUS) in embryo and endosperm and at two temperatures indicated that these related genes may be candidates for clarifying the molecular basis of seed dormancy in P. polyphlla var. yunnanensis.
Collapse
|
19
|
Niu SH, Li ZX, Yuan HW, Chen XY, Li Y, Li W. Transcriptome characterisation of Pinus tabuliformis and evolution of genes in the Pinus phylogeny. BMC Genomics 2013; 14:263. [PMID: 23597112 PMCID: PMC3640921 DOI: 10.1186/1471-2164-14-263] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2012] [Accepted: 04/15/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Chinese pine (Pinus tabuliformis) is an indigenous conifer species in northern China but is relatively underdeveloped as a genomic resource; thus, limiting gene discovery and breeding. Large-scale transcriptome data were obtained using a next-generation sequencing platform to compensate for the lack of P. tabuliformis genomic information. RESULTS The increasing amount of transcriptome data on Pinus provides an excellent resource for multi-gene phylogenetic analysis and studies on how conserved genes and functions are maintained in the face of species divergence. The first P. tabuliformis transcriptome from a normalised cDNA library of multiple tissues and individuals was sequenced in a full 454 GS-FLX run, producing 911,302 sequencing reads. The high quality overlapping expressed sequence tags (ESTs) were assembled into 46,584 putative transcripts, and more than 700 SSRs and 92,000 SNPs/InDels were characterised. Comparative analysis of the transcriptome of six conifer species yielded 191 orthologues, from which we inferred a phylogenetic tree, evolutionary patterns and calculated rates of gene diversion. We also identified 938 fast evolving sequences that may be useful for identifying genes that perhaps evolved in response to positive selection and might be responsible for speciation in the Pinus lineage. CONCLUSIONS A large collection of high-quality ESTs was obtained, de novo assembled and characterised, which represents a dramatic expansion of the current transcript catalogues of P. tabuliformis and which will gradually be applied in breeding programs of P. tabuliformis. Furthermore, these data will facilitate future studies of the comparative genomics of P. tabuliformis and other related species.
Collapse
Affiliation(s)
- Shi-Hui Niu
- National Engineering Laboratory for Forest Tree Breeding, College of Biological Science and Technology, Beijing Forestry University, Beijing 100083, People's Republic of China
| | | | | | | | | | | |
Collapse
|
20
|
Martin LBB, Fei Z, Giovannoni JJ, Rose JKC. Catalyzing plant science research with RNA-seq. FRONTIERS IN PLANT SCIENCE 2013; 4:66. [PMID: 23554602 PMCID: PMC3612697 DOI: 10.3389/fpls.2013.00066] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Accepted: 03/10/2013] [Indexed: 05/18/2023]
Abstract
Next generation DNA sequencing technologies are driving increasingly rapid, affordable and high resolution analyses of plant transcriptomes through sequencing of their associated cDNA (complementary DNA) populations; an analytical platform commonly referred to as RNA-sequencing (RNA-seq). Since entering the arena of whole genome profiling technologies only a few years ago, RNA-seq has proven itself to be a powerful tool with a remarkably diverse range of applications, from detailed studies of biological processes at the cell type-specific level, to providing insights into fundamental questions in plant biology on an evolutionary time scale. Applications include generating genomic data for heretofore unsequenced species, thus expanding the boundaries of what had been considered "model organisms," elucidating structural and regulatory gene networks, revealing how plants respond to developmental cues and their environment, allowing a better understanding of the relationships between genes and their products, and uniting the "omics" fields of transcriptomics, proteomics, and metabolomics into a now common systems biology paradigm. We provide an overview of the breadth of such studies and summarize the range of RNA-seq protocols that have been developed to address questions spanning cell type-specific-based transcriptomics, transcript secondary structure and gene mapping.
Collapse
Affiliation(s)
| | - Zhangjun Fei
- Boyce Thompson Institute for Plant ResearchIthaca, NY, USA
- Robert W. Holly Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research ServiceIthaca, NY, USA
| | - James J. Giovannoni
- Boyce Thompson Institute for Plant ResearchIthaca, NY, USA
- Robert W. Holly Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research ServiceIthaca, NY, USA
| | | |
Collapse
|
21
|
Fu N, Wang Q, Shen HL. De novo assembly, gene annotation and marker development using Illumina paired-end transcriptome sequences in celery (Apium graveolens L.). PLoS One 2013; 8:e57686. [PMID: 23469050 PMCID: PMC3585167 DOI: 10.1371/journal.pone.0057686] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2012] [Accepted: 01/23/2013] [Indexed: 12/11/2022] Open
Abstract
Background Celery is an increasing popular vegetable species, but limited transcriptome and genomic data hinder the research to it. In addition, a lack of celery molecular markers limits the process of molecular genetic breeding. High-throughput transcriptome sequencing is an efficient method to generate a large transcriptome sequence dataset for gene discovery, molecular marker development and marker-assisted selection breeding. Principal Findings Celery transcriptomes from four tissues were sequenced using Illumina paired-end sequencing technology. De novo assembling was performed to generate a collection of 42,280 unigenes (average length of 502.6 bp) that represent the first transcriptome of the species. 78.43% and 48.93% of the unigenes had significant similarity with proteins in the National Center for Biotechnology Information (NCBI) non-redundant protein database (Nr) and Swiss-Prot database respectively, and 10,473 (24.77%) unigenes were assigned to Clusters of Orthologous Groups (COG). 21,126 (49.97%) unigenes harboring Interpro domains were annotated, in which 15,409 (36.45%) were assigned to Gene Ontology(GO) categories. Additionally, 7,478 unigenes were mapped onto 228 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Large numbers of simple sequence repeats (SSRs) were indentified, and then the rate of successful amplication and polymorphism were investigated among 31 celery accessions. Conclusions This study demonstrates the feasibility of generating a large scale of sequence information by Illumina paired-end sequencing and efficient assembling. Our results provide a valuable resource for celery research. The developed molecular markers are the foundation of further genetic linkage analysis and gene localization, and they will be essential to accelerate the process of breeding.
Collapse
Affiliation(s)
- Nan Fu
- College of Agriculture and Biotechnology, China Agricultural University, Beijing, China
| | - Qian Wang
- College of Agriculture and Biotechnology, China Agricultural University, Beijing, China
| | - Huo-Lin Shen
- College of Agriculture and Biotechnology, China Agricultural University, Beijing, China
- * E-mail:
| |
Collapse
|
22
|
Loss-Morais G, Waterhouse PM, Margis R. Description of plant tRNA-derived RNA fragments (tRFs) associated with argonaute and identification of their putative targets. Biol Direct 2013; 8:6. [PMID: 23402430 PMCID: PMC3574835 DOI: 10.1186/1745-6150-8-6] [Citation(s) in RCA: 107] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2012] [Accepted: 02/07/2013] [Indexed: 01/25/2023] Open
Abstract
tRNA-derived RNA fragments (tRFs) are 19mer small RNAs that associate with Argonaute (AGO) proteins in humans. However, in plants, it is unknown if tRFs bind with AGO proteins. Here, using public deep sequencing libraries of immunoprecipitated Argonaute proteins (AGO-IP) and bioinformatics approaches, we identified the Arabidopsis thaliana AGO-IP tRFs. Moreover, using three degradome deep sequencing libraries, we identified four putative tRF targets. The expression pattern of tRFs, based on deep sequencing data, was also analyzed under abiotic and biotic stresses. The results obtained here represent a useful starting point for future studies on tRFs in plants.
Collapse
Affiliation(s)
- Guilherme Loss-Morais
- Universidade Federal do Rio Grande do Sul, Centro de Biotecnologia, Predio 43431, Sala 213, POBox 15005, Porto Alegre, RS, Brazil
| | | | | |
Collapse
|
23
|
Zhou Y, Gao F, Liu R, Feng J, Li H. De novo sequencing and analysis of root transcriptome using 454 pyrosequencing to discover putative genes associated with drought tolerance in Ammopiptanthus mongolicus. BMC Genomics 2012; 13:266. [PMID: 22721448 PMCID: PMC3407029 DOI: 10.1186/1471-2164-13-266] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Accepted: 06/21/2012] [Indexed: 11/10/2022] Open
Abstract
Background De novo assembly of transcript sequences produced by next-generation sequencing technologies offers a rapid approach to obtain expressed gene sequences for non-model organisms. Ammopiptanthus mongolicus, a super-xerophytic broadleaf evergreen wood, is an ecologically important foundation species in desert ecosystems and exhibits substantial drought tolerance in Mid-Asia desert. Root plays an important role in water absorption of plant. There are insufficient transcriptomic and genomic data in public databases for understanding of the molecular mechanism underlying the drought tolerance of A. mongolicus. Thus, high throughput transcriptome sequencing from A. mongolicus root is helpful to generate a large amount of transcript sequences for gene discovery and molecular marker development. Results A total of 672,002 sequencing reads were obtained from a 454 GS XLR70 Titanium pyrosequencer with a mean length of 279 bp. These reads were assembled into 29,056 unique sequences including 15,173 contigs and 13,883 singlets. In our assembled sequences, 1,827 potential simple sequence repeats (SSR) molecular markers were discovered. Based on sequence similarity with known plant proteins, the assembled sequences represent approximately 9,771 proteins in PlantGDB. Based on the Gene ontology (GO) analysis, hundreds of drought stress-related genes were found. We further analyzed the gene expression profiles of 27 putative genes involved in drought tolerance using quantitative real-time PCR (qRT-PCR) assay. Conclusions Our sequence collection represents a major transcriptomic resource for A. mongolicus, and the large number of genetic markers predicted should contribute to future research in Ammopiptanthus genus. The potential drought stress related transcripts identified in this study provide a good start for further investigation into the drought adaptation in Ammopiptanthus.
Collapse
Affiliation(s)
- Yijun Zhou
- College of Life and Environmental Sciences, Minzu University of China, Beijing, 100081, China
| | | | | | | | | |
Collapse
|
24
|
Huang J, Lu X, Yan H, Chen S, Zhang W, Huang R, Zheng Y. Transcriptome characterization and sequencing-based identification of salt-responsive genes in Millettia pinnata, a semi-mangrove plant. DNA Res 2012; 19:195-207. [PMID: 22351699 PMCID: PMC3325082 DOI: 10.1093/dnares/dss004] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Semi-mangroves form a group of transitional species between glycophytes and halophytes, and hold unique potential for learning molecular mechanisms underlying plant salt tolerance. Millettia pinnata is a semi-mangrove plant that can survive a wide range of saline conditions in the absence of specialized morphological and physiological traits. By employing the Illumina sequencing platform, we generated ~192 million short reads from four cDNA libraries of M. pinnata and processed them into 108,598 unisequences with a high depth of coverage. The mean length and total length of these unisequences were 606 bp and 65.8 Mb, respectively. A total of 54,596 (50.3%) unisequences were assigned Nr annotations. Functional classification revealed the involvement of unisequences in various biological processes related to metabolism and environmental adaptation. We identified 23,815 candidate salt-responsive genes with significantly differential expression under seawater and freshwater treatments. Based on the reverse transcription-polymerase chain reaction (RT-PCR) and real-time PCR analyses, we verified the changes in expression levels for a number of candidate genes. The functional enrichment analyses for the candidate genes showed tissue-specific patterns of transcriptome remodelling upon salt stress in the roots and the leaves. The transcriptome of M. pinnata will provide valuable gene resources for future application in crop improvement. In addition, this study sets a good example for large-scale identification of salt-responsive genes in non-model organisms using the sequencing-based approach.
Collapse
Affiliation(s)
- Jianzi Huang
- College of Life Science, Shenzhen University, Shenzhen, China
| | | | | | | | | | | | | |
Collapse
|
25
|
Schliesky S, Gowik U, Weber APM, Bräutigam A. RNA-Seq Assembly - Are We There Yet? FRONTIERS IN PLANT SCIENCE 2012; 3:220. [PMID: 23056003 PMCID: PMC3457010 DOI: 10.3389/fpls.2012.00220] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Accepted: 09/05/2012] [Indexed: 05/20/2023]
Abstract
Transcriptomic sequence resources represent invaluable assets for research, in particular for non-model species without a sequenced genome. To date, the Next Generation Sequencing technologies 454/Roche and Illumina have been used to generate transcriptome sequence databases by mRNA-Seq for more than fifty different plant species. While some of the databases were successfully used for downstream applications, such as proteomics, the assembly parameters indicate that the assemblies do not yet accurately reflect the actual plant transcriptomes. Two different assembly strategies have been used, overlap consensus based assemblers for long reads and Eulerian path/de Bruijn graph assembler for short reads. In this review, we discuss the challenges and solutions to the transcriptome assembly problem. A list of quality control parameters and the necessary scripts to produce them are provided.
Collapse
Affiliation(s)
- Simon Schliesky
- Center of Excellence on Plant Sciences (CEPLAS), Institute for Plant Biochemistry, Heinrich Heine UniversityDüsseldorf, Germany
| | - Udo Gowik
- Center of Excellence on Plant Sciences (CEPLAS), Institute for Plant Developmental and Molecular Biology, Heinrich Heine UniversityDüsseldorf, Germany
| | - Andreas P. M. Weber
- Center of Excellence on Plant Sciences (CEPLAS), Institute for Plant Biochemistry, Heinrich Heine UniversityDüsseldorf, Germany
| | - Andrea Bräutigam
- Center of Excellence on Plant Sciences (CEPLAS), Institute for Plant Biochemistry, Heinrich Heine UniversityDüsseldorf, Germany
- *Correspondence: Andrea Bräutigam, Institute for Plant Biochemistry, 26.03.01.Room 32, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany. e-mail:
| |
Collapse
|