1
|
Development of Genomic Resources in Mexican Bursera (Section: Bullockia: Burseraceae): Genome Assembly, Annotation, and Marker Discovery for Three Copal Species. Genes (Basel) 2022; 13:genes13101741. [PMID: 36292626 PMCID: PMC9601875 DOI: 10.3390/genes13101741] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 09/21/2022] [Accepted: 09/23/2022] [Indexed: 11/17/2022] Open
Abstract
Bursera comprises ~100 tropical shrub and tree species, with the center of the species diversification in Mexico. The genomic resources developed for the genus are scarce, and this has limited the study of the gene flow, local adaptation, and hybridization dynamics. In this study, based on ~155 million Illumina paired-end reads per species, we performed a de novo genome assembly and annotation of three Bursera species of the Bullockia section: Bursera bipinnata, Bursera cuneata, and Bursera palmeri. The total lengths of the genome assemblies were 253, 237, and 229 Mb for B. cuneata, B. palmeri, and B. bipinnata, respectively. The assembly of B. palmeri retrieved the most complete and single-copy BUSCOs (87.3%) relative to B. cuneata (86.5%) and B. bipinnata (76.6%). The ab initio gene prediction recognized between 21,000 and 32,000 protein-coding genes. Other genomic features, such as simple sequence repeats (SSRs), were also detected. Using the de novo genome assemblies as a reference, we identified single-nucleotide polymorphisms (SNPs) for a set of 43 Bursera individuals. Moreover, we mapped the filtered reads of each Bursera species against the chloroplast genomes of five Burseraceae species, obtaining consensus sequences ranging from 156 to 160 kb in length. Our work contributes to the generation of genomic resources for an important but understudied genus of tropical-dry-forest species.
Collapse
|
2
|
Tyrka M, Bakera B, Szeliga M, Święcicka M, Krajewski P, Mokrzycka M, Rakoczy-Trojanowska M. Identification of Rf Genes in Hexaploid Wheat ( Triticumaestivum L.) by RNA-Seq and Paralog Analyses. Int J Mol Sci 2021; 22:ijms22179146. [PMID: 34502055 PMCID: PMC8431562 DOI: 10.3390/ijms22179146] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 08/19/2021] [Accepted: 08/20/2021] [Indexed: 11/16/2022] Open
Abstract
Among the natural mechanisms used for wheat hybrid breeding, the most desirable is the system combining the cytoplasmic male sterility (cms) of the female parent with the fertility-restoring genes (Rf) of the male parent. The objective of this study was to identify Rf candidate genes in the wheat genome on the basis of transcriptome sequencing (RNA-seq) and paralog analysis data. Total RNA was isolated from the anthers of two fertility-restorer (Primépi and Patras) and two non-restorer (Astoria and Grana) varieties at the tetrad and late uninucleate microspore stages. Of 36,912 differentially expressed genes (DEGs), 21 encoding domains in known fertility-restoring proteins were selected. To enrich the pool of Rf candidates, 52 paralogs (PAGs) of the 21 selected DEGs were included in the analyses. The expression profiles of most of the DEGs and PAGs determined bioinformatically were as expected (i.e., they were overexpressed in at least one fertility-restorer variety). However, these results were only partially consistent with the quantitative real-time PCR data. The DEG and PAG promoters included cis-regulatory elements common among PPR-encoding genes. On the basis of the obtained results, we designated seven genes as Rf candidate genes, six of which were identified for the first time in this study.
Collapse
Affiliation(s)
- Mirosław Tyrka
- Department of Biotechnology and Bioinformatics, Rzeszów University of Technology, Powstańców Warszawy 12, 35-959 Rzeszów, Poland; (M.T.); (M.S.)
| | - Beata Bakera
- Department of Plant Genetics, Breeding and Biotechnology, Warsaw University of Life Sciences, Nowoursynowska 166, 02-787 Warszawa, Poland; (B.B.); (M.Ś.)
| | - Magdalena Szeliga
- Department of Biotechnology and Bioinformatics, Rzeszów University of Technology, Powstańców Warszawy 12, 35-959 Rzeszów, Poland; (M.T.); (M.S.)
| | - Magdalena Święcicka
- Department of Plant Genetics, Breeding and Biotechnology, Warsaw University of Life Sciences, Nowoursynowska 166, 02-787 Warszawa, Poland; (B.B.); (M.Ś.)
| | - Paweł Krajewski
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznań, Poland; (P.K.); (M.M.)
| | - Monika Mokrzycka
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479 Poznań, Poland; (P.K.); (M.M.)
| | - Monika Rakoczy-Trojanowska
- Department of Plant Genetics, Breeding and Biotechnology, Warsaw University of Life Sciences, Nowoursynowska 166, 02-787 Warszawa, Poland; (B.B.); (M.Ś.)
- Correspondence: ; Tel./Fax: +48-22-59-32152
| |
Collapse
|
3
|
Su H, Fan J, Ma D, Zhu H. Identification and Characterization of Osmoregulation Related MicroRNAs in Gills of Hybrid Tilapia Under Three Types of Osmotic Stress. Front Genet 2021; 12:526277. [PMID: 33889171 PMCID: PMC8056028 DOI: 10.3389/fgene.2021.526277] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Accepted: 02/24/2021] [Indexed: 11/13/2022] Open
Abstract
Researchers have increasingly suggested that microRNAs (miRNAs) are small non-coding RNAs that post-transcriptionally regulate gene expression and protein translation in organs and respond to abiotic and biotic stressors. To understand the function of miRNAs in osmotic stress regulation of the gills of hybrid tilapia (Oreochromis mossambicus ♀ × Oreochromis urolepis hornorum ♂), high-throughput Illumina deep sequencing technology was used to investigate the expression profiles of miRNAs under salinity stress (S, 25‰), alkalinity stress (A, 4‰) and salinity-alkalinity stress (SA, S: 15‰, A: 4‰) challenges. The results showed that 31, 41, and 27 upregulated and 33, 42, and 40 downregulated miRNAs (P < 0.05) were identified in the salt stress, alkali stress, and saline-alkali stress group, respectively, which were compared with those in the control group (C). Fourteen significantly differently expressed miRNAs were selected randomly and then validated by a quantitative polymerase chain reaction. On the basis of Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analysis, genes related to osmoregulation and biosynthesis were enriched in the three types of osmotic stress. In addition, three miRNAs and three predicted target genes were chosen to conduct a quantitative polymerase chain reaction in the hybrid tilapia and its parents during 96-h osmotic stress. Differential expression patterns of miRNAs provided the basis for research data to further investigate the miRNA-modulating networks in osmoregulation of teleost.
Collapse
Affiliation(s)
- Huanhuan Su
- Key Laboratory of Tropical & Subtropical Fishery Resource Application & Cultivation, Ministry of Agriculture and Rural Affairs, Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, China
- College of Fisheries and Life Science, Shanghai Ocean University, Shanghai, China
| | - Jiajia Fan
- Key Laboratory of Tropical & Subtropical Fishery Resource Application & Cultivation, Ministry of Agriculture and Rural Affairs, Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, China
| | - Dongmei Ma
- Key Laboratory of Tropical & Subtropical Fishery Resource Application & Cultivation, Ministry of Agriculture and Rural Affairs, Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, China
| | - Huaping Zhu
- Key Laboratory of Tropical & Subtropical Fishery Resource Application & Cultivation, Ministry of Agriculture and Rural Affairs, Pearl River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou, China
| |
Collapse
|
4
|
Barbanti A, Torrado H, Macpherson E, Bargelloni L, Franch R, Carreras C, Pascual M. Helping decision making for reliable and cost-effective 2b-RAD sequencing and genotyping analyses in non-model species. Mol Ecol Resour 2020; 20. [PMID: 32061018 DOI: 10.1111/1755-0998.13144] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 02/04/2020] [Accepted: 02/10/2020] [Indexed: 12/18/2022]
Abstract
High-throughput sequencing has revolutionized population and conservation genetics. RAD sequencing methods, such as 2b-RAD, can be used on species lacking a reference genome. However, transferring protocols across taxa can potentially lead to poor results. We tested two different IIB enzymes (AlfI and CspCI) on two species with different genome sizes (the loggerhead turtle Caretta caretta and the sharpsnout seabream Diplodus puntazzo) to build a set of guidelines to improve 2b-RAD protocols on non-model organisms while optimising costs. Good results were obtained even with degraded samples, showing the value of 2b-RAD in studies with poor DNA quality. However, library quality was found to be a critical parameter on the number of reads and loci obtained for genotyping. Resampling analyses with different number of reads per individual showed a trade-off between number of loci and number of reads per sample. The resulting accumulation curves can be used as a tool to calculate the number of sequences per individual needed to reach a mean depth ≥20 reads to acquire good genotyping results. Finally, we demonstrated that selective-base ligation does not affect genomic differentiation between individuals, indicating that this technique can be used in species with large genome sizes to adjust the number of loci to the study scope, to reduce sequencing costs and to maintain suitable sequencing depth for a reliable genotyping without compromising the results. Here, we provide a set of guidelines to improve 2b-RAD protocols on non-model organisms with different genome sizes, helping decision-making for a reliable and cost-effective genotyping.
Collapse
Affiliation(s)
- Anna Barbanti
- Department of Genetics, Microbiology and Statistics and IRBio, University of Barcelona, Barcelona, Spain
| | - Hector Torrado
- Department of Genetics, Microbiology and Statistics and IRBio, University of Barcelona, Barcelona, Spain.,Center for Advanced Studies of Blanes (CEAB-CSIC), Blanes, Girona, Spain
| | - Enrique Macpherson
- Center for Advanced Studies of Blanes (CEAB-CSIC), Blanes, Girona, Spain
| | - Luca Bargelloni
- Department of Comparative Biomedicine and Food Science, University of Padova, Legnaro, Italy
| | - Rafaella Franch
- Department of Comparative Biomedicine and Food Science, University of Padova, Legnaro, Italy
| | - Carlos Carreras
- Department of Genetics, Microbiology and Statistics and IRBio, University of Barcelona, Barcelona, Spain
| | - Marta Pascual
- Department of Genetics, Microbiology and Statistics and IRBio, University of Barcelona, Barcelona, Spain
| |
Collapse
|
5
|
Wang S, Li X, Zhang W, Gao Y, Zhang K, Hao Q, Li W, Wang Z, Li M, Zhang W, Zhang Y, Zhang C. Genome-Wide Investigation of Genes Regulated by ERα in Breast Cancer Cells. Molecules 2018; 23:molecules23102543. [PMID: 30301189 PMCID: PMC6222792 DOI: 10.3390/molecules23102543] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Revised: 09/26/2018] [Accepted: 10/03/2018] [Indexed: 01/13/2023] Open
Abstract
Estrogen receptor alpha (ERα), which has been detected in over 70% of breast cancer cases, is a driving factor for breast cancer growth. For investigating the underlying genes and networks regulated by ERα in breast cancer, RNA-seq was performed between ERα transgenic MDA-MB-231 cells and wild type MDA-MB-231 cells. A total of 267 differentially expressed genes (DEGs) were identified. Then bioinformatics analyses were performed to illustrate the mechanism of ERα. Besides, by comparison of RNA-seq data obtained from MDA-MB-231 cells and microarray dataset obtained from estrogen (E2) stimulated MCF-7 cells, an overlap of 126 DEGs was screened. The expression level of ERα was negatively associated with metastasis and EMT in breast cancer. We further verified that ERα might inhibit metastasis by regulating of VCL and TNFRSF12A, and suppress EMT by the regulating of JUNB and ID3. And the relationship between ERα and these genes were validated by RT-PCR and correlation analysis based on TCGA database. By PPI network analysis, we identified TOP5 hub genes, FOS, SP1, CDKN1A, CALCR and JUNB, which were involved in cell proliferation and invasion. Taken together, the whole-genome insights carried in this work can help fully understanding biological roles of ERα in breast cancer.
Collapse
Affiliation(s)
- Shuning Wang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Xiaoju Li
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Wangqian Zhang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Yuan Gao
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Kuo Zhang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Qiang Hao
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Weina Li
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Zhaowei Wang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Meng Li
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Wei Zhang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Yingqi Zhang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| | - Cun Zhang
- State Key Laboratory of Cancer Biology, Biotechnology Center, School of Pharmacy, The Fourth Military Medical University, Xi'an 710032, China.
| |
Collapse
|
6
|
Zhang B, Chen X, Lu X, Shu N, Wang X, Yang X, Wang S, Wang J, Guo L, Wang D, Ye W. Transcriptome Analysis of Gossypium hirsutum L. Reveals Different Mechanisms among NaCl, NaOH and Na 2CO 3 Stress Tolerance. Sci Rep 2018; 8:13527. [PMID: 30202076 PMCID: PMC6131252 DOI: 10.1038/s41598-018-31668-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 08/23/2018] [Indexed: 11/29/2022] Open
Abstract
As an important source of fiber and edible oil, cotton has great economic value. In comparison to their individual studies, association and differentiation between salt and alkaline tolerance has not been focused yet by scientists. We have used next-generation RNA-Seq technique to analyze transcriptional changes under salt and alkaline stresses in cotton. Overall, 25,929 and 6,564 differentially expressed genes (DEGs) were identified in roots and leaves, respectively. Gene functional annotation showed that genes involving ionic homeostasis were significantly up-regulated under NaCl stress and Na2CO3 stress, and genes enriched in starch and sucrose metabolism were up-regulated under NaOH stress and Na2CO3 stress. Furthermore, a synergistic enhancing effect between NaCl and NaOH stress was also observed in this study. Likewise, our studies indicate further that genes related with starch and sucrose metabolism were regulated to respond to the high pH under Na2CO3 stress, inducing plant hormone signal transduction and key enzyme reactive oxygen species (ROS) activity to respond to ionic toxicity and intracellular ionic homeostasis. By analyzing the expression profiles of diverse tissues under different salt and alkaline stresses, this study provides valuable ideas for genetic improvements of cotton tolerance to salt-alkaline stress.
Collapse
Affiliation(s)
- Binglei Zhang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Xiugui Chen
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Xuke Lu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Na Shu
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Xiaoge Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Xiaomin Yang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Shuai Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Junjuan Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Lixue Guo
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Delong Wang
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China
| | - Wuwei Ye
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Key Laboratory for Cotton Genetic Improvement, Anyang, 455000, Henan, China.
| |
Collapse
|
7
|
Yang L, Xu Y, Zhang R, Wang X, Yang C. Comprehensive transcriptome profiling of soybean leaves in response to simulated acid rain. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2018; 158:18-27. [PMID: 29656160 DOI: 10.1016/j.ecoenv.2018.04.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 04/04/2018] [Accepted: 04/06/2018] [Indexed: 06/08/2023]
Abstract
As a source of edible oil and protein, soybean is a major globally important economic crop; Improving its production has been an important objective of soybean breeding. Acid rain has been shown to influence soybean growth and productivity, with consequent adverse impacts on its production for use by human populations. In this study, RNA sequencing technology was utilized to examine changes in gene expression when soybean was exposed to simulated acid rain (SAR). We sampled soybean leaves at five time intervals (0, 6, 30, 54, 78, and 102 h), and built the cDNA library. In total, 54,175 expression genes were found, including 2016 genes with differential expression. A total of 416 genes were considered, as they were closely related to the response to SAR. Genes related to the regulation of sulfur and nitrogen metabolism, carbohydrate metabolism, photosynthesis, and reactive oxygen species were among those differentially expressed in response to SAR. In this study, we examined the response mechanisms of soybean under SAR exposure. Our findings will improve our understanding of the molecular mechanisms employed by soybean in responding to abiotic stress, and therefore provides important information in developing soybean breeding to improve tolerance to these stresses.
Collapse
Affiliation(s)
- Long Yang
- State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an, China
| | - Yongchao Xu
- State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an, China
| | - Rui Zhang
- State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an, China
| | - Xiaotong Wang
- State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an, China
| | - Chao Yang
- State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai'an, China.
| |
Collapse
|
8
|
Pan J, Chen H, Guo B, Liu C. Understanding the molecular mechanisms underlying the effects of light intensity on flavonoid production by RNA-seq analysis in Epimedium pseudowushanense B.L.Guo. PLoS One 2017; 12:e0182348. [PMID: 28786984 PMCID: PMC5546586 DOI: 10.1371/journal.pone.0182348] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2017] [Accepted: 07/17/2017] [Indexed: 02/02/2023] Open
Abstract
Epimedium pseudowushanense B.L.Guo, a light-demanding shade herb, is used in traditional medicine to increase libido and strengthen muscles and bones. The recognition of the health benefits of Epimedium has increased its market demand. However, its resource recycling rate is low and environmentally dependent. Furthermore, its natural sources are endangered, further increasing prices. Commercial culture can address resource constraints of it.Understanding the effects of environmental factors on the production of its active components would improve the technology for cultivation and germplasm conservation. Here, we studied the effects of light intensities on the flavonoid production and revealed the molecular mechanism using RNA-seq analysis. Plants were exposed to five levels of light intensity through the periods of germination to flowering, the flavonoid contents were measured using HPLC. Quantification of epimedin A, epimedin B, epimedin C, and icariin showed that the flavonoid contents varied with different light intensity levels. And the largest amount of epimedin C was produced at light intensity level 4 (I4). Next, the leaves under the treatment of three light intensity levels ("L", "M" and "H") with the largest differences in the flavonoid content, were subjected to RNA-seq analysis. Transcriptome reconstruction identified 43,657 unigenes. All unigene sequences were annotated by searching against the Nr, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. In total, 4008, 5260, and 3591 significant differentially expressed genes (DEGs) were identified between the groups L vs. M, M vs. H and L vs. H. Particularly, twenty-one full-length genes involved in flavonoid biosynthesis were identified. The expression levels of the flavonol synthase, chalcone synthase genes were strongly associated with light-induced flavonoid abundance with the highest expression levels found in the H group. Furthermore, 65 transcription factors, including 31 FAR1, 17 MYB-related, 12 bHLH, and 5 WRKY, were differentially expressed after light induction. Finally, a model was proposed to explain the light-induced flavonoid production. This study provided valuable information to improve cultivation practices and produced the first comprehensive resource for E. pseudowushanense transcriptomes.
Collapse
Affiliation(s)
- Junqian Pan
- Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine from Ministry of Education, Beijing, P.R. China
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, P.R. China
| | - Haimei Chen
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, P.R. China
| | - Baolin Guo
- Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine from Ministry of Education, Beijing, P.R. China
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, P.R. China
| | - Chang Liu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, P.R. China
| |
Collapse
|
9
|
Shaina H, UlAbdin Z, Webb BA, Arif MJ, Jamil A. De novo sequencing and transcriptome analysis of venom glands of endoparasitoid Aenasius arizonensis (Girault) (=Aenasius bambawalei Hayat) (Hymenoptera, Encyrtidae). Toxicon 2016; 121:134-144. [PMID: 27594666 DOI: 10.1016/j.toxicon.2016.08.022] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Revised: 08/11/2016] [Accepted: 08/31/2016] [Indexed: 12/25/2022]
Abstract
Aenasius bambawalei Hayat (Encyrtidae: Hymenoptera) has been synonymized with Aenasius arizonensis (Girault) is a small, newly discovered endoparasitoid of the cotton mealybug Phenacoccuss solenopsis Tinsley (Pseudococcidae: Hemiptera), which completes its life cycle inside the body of its host and it is a potential insect control tool. Despite the acquired knowledge regarding host-parasitoid interaction, little information is available on the factors of parasitoid origin able to modulate mealybug physiology. The components of A. arizonensis venom have not been well studied but venom from other parasitoids and wasps contain biologically active proteins that have potential applications in pest management or may be of medicinal importance. To provide an insight into the transcripts expressed in the venom gland of A. arizonensis, a transcriptomic database was developed utilizing high throughput RNA sequencing approaches to analyze the genes expressed in venom glands of this endoparasitic wasp. The resulting A. arizonensis RNA sequences were assembled de-novo with contigs then blasted against the NCBI non-redundant sequence database. Contigs which matched database sequences were mostly homologous to genes from hymenopteran parasitoids such as Nasonia vitripennis, Copidosoma floridanum, Fopius arsenus and Pteromalas puparium. Further analysis of the A. arizonensis database was then performed which focused on selected genes encoding proteins potentially involved in host developmental arrest, disrupting the host immune system, host paralysis, and transcripts that support these functions. Sequenced mRNAS predicted to encode full length ORFs of Calreticulin, Serine Protease Precursor and Arginine kinase proteins were identified and the tissue specific expression of these putative venom genes was analyzed by RT-PCR. In addition, results also demonstrate that de novo transcriptome assembly allows useful venom gene expression analysis in a species lacking a genome sequence database and may provide useful information for devising control tools for insect pests and other applications.
Collapse
Affiliation(s)
- Hoor Shaina
- Department of Entomology, University of Agriculture Faisalabad, Pakistan
| | - Zain UlAbdin
- Department of Entomology, University of Agriculture Faisalabad, Pakistan.
| | - Bruce A Webb
- Department of Entomology, University of Kentucky, Lexington, USA.
| | | | - Amer Jamil
- Department of Biochemistry, University of Agriculture Faisalabad, Pakistan
| |
Collapse
|
10
|
High throughput sequencing of small RNAs transcriptomes in two Crassostrea oysters identifies microRNAs involved in osmotic stress response. Sci Rep 2016; 6:22687. [PMID: 26940974 PMCID: PMC4778033 DOI: 10.1038/srep22687] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 02/17/2016] [Indexed: 11/21/2022] Open
Abstract
Increasing evidence suggests that microRNAs post-transcriptionally regulate gene expression and are involved in responses to biotic and abiotic stress. However, the role of miRNAs involved in osmotic plasticity remains largely unknown in marine bivalves. In the present study, we performed low salinity challenge with two Crassostrea species (C. gigas and C. hongkongensis), and conducted high-throughput sequencing of four small RNA libraries constructed from the gill tissues. A total of 202 and 87 miRNAs were identified from C. gigas and C. hongkongensis, respectively. Six miRNAs in C. gigas and two in C. hongkongensis were differentially expressed in response to osmotic stress. The expression profiles of these eight miRNAs were validated by qRT-PCR. Based on GO enrichment and KEGG pathway analysis, genes associated with microtubule-based process and cellular component movement were enriched in both species. In addition, five miRNA-mRNA interaction pairs that showed opposite expression patterns were identified in the C. hongkongensis, Differential expression analysis identified the miRNAs that play important regulatory roles in response to low salinity stress, providing insights into molecular mechanisms that are essential for salinity tolerance in marine bivalves.
Collapse
|
11
|
De Novo Sequencing and Analysis of the Safflower Transcriptome to Discover Putative Genes Associated with Safflor Yellow in Carthamus tinctorius L. Int J Mol Sci 2015; 16:25657-77. [PMID: 26516840 PMCID: PMC4632820 DOI: 10.3390/ijms161025657] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Revised: 10/13/2015] [Accepted: 10/20/2015] [Indexed: 02/06/2023] Open
Abstract
Safflower (Carthamus tinctorius L.), an important traditional Chinese medicine, is cultured widely for its pharmacological effects, but little is known regarding the genes related to the metabolic regulation of the safflower’s yellow pigment. To investigate genes related to safflor yellow biosynthesis, 454 pyrosequencing of flower RNA at different developmental stages was performed, generating large databases.In this study, we analyzed 454 sequencing data from different flowering stages in safflower. In total, 1,151,324 raw reads and 1,140,594 clean reads were produced, which were assembled into 51,591 unigenes with an average length of 679 bp and a maximum length of 5109 bp. Among the unigenes, 40,139 were in the early group, 39,768 were obtained from the full group and 28,316 were detected in both samples. With the threshold of “log2 ratio ≥ 1”, there were 34,464 differentially expressed genes, of which 18,043 were up-regulated and 16,421 were down-regulated in the early flower library. Based on the annotations of the unigenes, 281 pathways were predicted. We selected 12 putative genes and analyzed their expression levels using quantitative real time-PCR. The results were consistent with the 454 sequencing results. In addition, the expression of chalcone synthase, chalcone isomerase and anthocyanidin synthase, which are involved in safflor yellow biosynthesis and safflower yellow pigment (SYP) content, were analyzed in different flowering periods, indicating that their expression levels were related to SYP synthesis. Moreover, to further confirm the results of the 454 pyrosequencing, full-length cDNA of chalcone isomerase (CHI) and anthocyanidin synthase (ANS) were cloned from safflower petal by RACE (Rapid-amplification of cDNA ends) method according to fragment of the transcriptome.
Collapse
|
12
|
Callol A, Reyes-López FE, Roig FJ, Goetz G, Goetz FW, Amaro C, MacKenzie SA. An Enriched European Eel Transcriptome Sheds Light upon Host-Pathogen Interactions with Vibrio vulnificus. PLoS One 2015. [PMID: 26207370 PMCID: PMC4514713 DOI: 10.1371/journal.pone.0133328] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Infectious diseases are one of the principal bottlenecks for the European eel recovery. The aim of this study was to develop a new molecular tool to be used in host-pathogen interaction experiments in the eel. To this end, we first stimulated adult eels with different pathogen-associated molecular patterns (PAMPs), extracted RNA from the immune-related tissues and sequenced the transcriptome. We obtained more than 2x106 reads that were assembled and annotated into 45,067 new descriptions with a notable representation of novel transcripts related with pathogen recognition, signal transduction and the immune response. Then, we designed a DNA-microarray that was used to analyze the early immune response against Vibrio vulnificus, a septicemic pathogen that uses the gills as the portal of entry into the blood, as well as the role of the main toxin of this species (RtxA13) on this early interaction. The gill transcriptomic profiles obtained after bath infecting eels with the wild type strain or with a mutant deficient in rtxA13 were analyzed and compared. Results demonstrate that eels react rapidly and locally against the pathogen and that this immune-response is rtxA13-dependent as transcripts related with cell destruction were highly up-regulated only in the gills from eels infected with the wild-type strain. Furthermore, significant differences in the immune response against the wild type and the mutant strain also suggest that host survival after V. vulnificus infection could depend on an efficient local phagocytic activity. Finally, we also found evidence of the presence of an interbranchial lymphoid tissue in European eel gills although further experiments will be necessary to identify such tissue.
Collapse
Affiliation(s)
- Agnès Callol
- Departament de Microbiologia i Ecologia, Universitat de Valencia, Burjassot, Spain
- Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Spain
| | - Felipe E. Reyes-López
- Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Department de Biologia cel·lular, Fisiologia Animal i Immunologia, Universitat Autònoma de Barcelona, Bellaterra, Spain
| | - Francisco J. Roig
- Departament de Microbiologia i Ecologia, Universitat de Valencia, Burjassot, Spain
| | - Giles Goetz
- Northwest Fisheries Science Center, Seattle, United States of America
| | | | - Carmen Amaro
- Departament de Microbiologia i Ecologia, Universitat de Valencia, Burjassot, Spain
| | - Simon A. MacKenzie
- Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Institute of Aquaculture, University of Stirling, Stirling, United Kingdom
- * E-mail:
| |
Collapse
|
13
|
Abstract
The searching of human housekeeping (HK) genes has been a long quest since the emergence of transcriptomics, and is instrumental for us to understand the structure of genome and the fundamentals of biological processes. The resolved genes are frequently used in evolution studies and as normalization standards in quantitative gene-expression analysis. Within the past 20 years, more than a dozen HK-gene studies have been conducted, yet none of them sampled human tissues completely. We believe an integration of these results will help remove false positive genes owing to the inadequate sampling. Surprisingly, we only find one common gene across 15 examined HK-gene datasets comprising 187 different tissue and cell types. Our subsequent analyses suggest that it might not be appropriate to rigidly define HK genes as expressed in all tissue types that have diverse developmental, physiological, and pathological states. It might be beneficial to use more robustly identified HK functions for filtering criteria, in which the representing genes can be a subset of genome. These genes are not necessarily the same, and perhaps need not to be the same, everywhere in our body.
Collapse
Affiliation(s)
- Yijuan Zhang
- Department of Chemistry and Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Ding Li
- Department of Chemistry and Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Bingyun Sun
- Department of Chemistry and Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| |
Collapse
|
14
|
Siozios S, Tosi L, Ferrarini A, Ferrari A, Tononi P, Bellin D, Maurhofer M, Gessler C, Delledonne M, Pertot I. Transcriptional Reprogramming of the Mycoparasitic Fungus Ampelomyces quisqualis During the Powdery Mildew Host-Induced Germination. PHYTOPATHOLOGY 2015; 105:199-209. [PMID: 25185010 DOI: 10.1094/phyto-01-14-0013-r] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Ampelomyces quisqualis is a mycoparasite of a diverse range of phytopathogenic fungi associated with the powdery mildew disease. Among them are several Erysiphaceae species with great economic impact on high-value crops such as grape. Due to its ability to parasitize and prevent the spread of powdery mildews, A. quisqualis has received considerable attention for its biocontrol potential. However, and in sharp contrast to the extensively studied biocontrol species belonging to the genus Trichoderma, little is known about the biology of A. quisqualis at the molecular and genetic levels. We present the first genome-wide transcription profiling in A. quisqualis during host-induced germination. A total of 1,536 putative genes showed significant changes in transcription during the germination of A. quisqualis. This finding denotes an extensive transcriptional reprogramming of A. quisqualis induced by the presence of the host. Several upregulated genes were predicted to encode for putative mycoparasitism-related proteins such as secreted proteases, virulence factors, and proteins related to toxin biosynthesis. Our data provide the most comprehensive sequence resource currently available for A. quisqualis in addition to offering valuable insights into the biology of A. quisqualis and its mycoparasitic lifestyle. Eventually, this may improve the biocontrol capacity of this mycoparasite.
Collapse
|
15
|
Sundaresan S, Philosoph-Hadas S, Riov J, Mugasimangalam R, Kuravadi NA, Kochanek B, Salim S, Tucker ML, Meir S. De novo Transcriptome Sequencing and Development of Abscission Zone-Specific Microarray as a New Molecular Tool for Analysis of Tomato Organ Abscission. FRONTIERS IN PLANT SCIENCE 2015; 6:1258. [PMID: 26834766 PMCID: PMC4712312 DOI: 10.3389/fpls.2015.01258] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 12/24/2015] [Indexed: 05/19/2023]
Abstract
Abscission of flower pedicels and leaf petioles of tomato (Solanum lycopersicum) can be induced by flower removal or leaf deblading, respectively, which leads to auxin depletion, resulting in increased sensitivity of the abscission zone (AZ) to ethylene. However, the molecular mechanisms that drive the acquisition of abscission competence and its modulation by auxin gradients are not yet known. We used RNA-Sequencing (RNA-Seq) to obtain a comprehensive transcriptome of tomato flower AZ (FAZ) and leaf AZ (LAZ) during abscission. RNA-Seq was performed on a pool of total RNA extracted from tomato FAZ and LAZ, at different abscission stages, followed by de novo assembly. The assembled clusters contained transcripts that are already known in the Solanaceae (SOL) genomics and NCBI databases, and over 8823 identified novel tomato transcripts of varying sizes. An AZ-specific microarray, encompassing the novel transcripts identified in this study and all known transcripts from the SOL genomics and NCBI databases, was constructed to study the abscission process. Multiple probes for longer genes and key AZ-specific genes, including antisense probes for all transcripts, make this array a unique tool for studying abscission with a comprehensive set of transcripts, and for mining for naturally occurring antisense transcripts. We focused on comparing the global transcriptomes generated from the FAZ and the LAZ to establish the divergences and similarities in their transcriptional networks, and particularly to characterize the processes and transcriptional regulators enriched in gene clusters that are differentially regulated in these two AZs. This study is the first attempt to analyze the global gene expression in different AZs in tomato by combining the RNA-Seq technique with oligonucleotide microarrays. Our AZ-specific microarray chip provides a cost-effective approach for expression profiling and robust analysis of multiple samples in a rapid succession.
Collapse
Affiliation(s)
- Srivignesh Sundaresan
- Department of Postharvest Science of Fresh Produce, Agricultural Research Organization, The Volcani CenterBet-Dagan, Israel
- The Robert H. Smith Faculty of Agriculture, Food and Environment, The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of JerusalemRehovot, Israel
| | - Sonia Philosoph-Hadas
- Department of Postharvest Science of Fresh Produce, Agricultural Research Organization, The Volcani CenterBet-Dagan, Israel
| | - Joseph Riov
- The Robert H. Smith Faculty of Agriculture, Food and Environment, The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, The Hebrew University of JerusalemRehovot, Israel
| | - Raja Mugasimangalam
- Department of Bioinformatics, QTLomics Technologies Pvt. LtdBangalore, India
| | - Nagesh A. Kuravadi
- Department of Bioinformatics, QTLomics Technologies Pvt. LtdBangalore, India
| | - Bettina Kochanek
- Department of Postharvest Science of Fresh Produce, Agricultural Research Organization, The Volcani CenterBet-Dagan, Israel
| | - Shoshana Salim
- Department of Postharvest Science of Fresh Produce, Agricultural Research Organization, The Volcani CenterBet-Dagan, Israel
| | - Mark L. Tucker
- Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research ServiceBeltsville, MD, USA
| | - Shimon Meir
- Department of Postharvest Science of Fresh Produce, Agricultural Research Organization, The Volcani CenterBet-Dagan, Israel
- *Correspondence: Shimon Meir
| |
Collapse
|
16
|
Luo H, Zhu Y, Song J, Xu L, Sun C, Zhang X, Xu Y, He L, Sun W, Xu H, Wang B, Li X, Li C, Liu J, Chen S. Transcriptional data mining of Salvia miltiorrhiza in response to methyl jasmonate to examine the mechanism of bioactive compound biosynthesis and regulation. PHYSIOLOGIA PLANTARUM 2014; 152:241-55. [PMID: 24660670 DOI: 10.1111/ppl.12193] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2013] [Accepted: 02/13/2014] [Indexed: 05/25/2023]
Abstract
Salvia miltiorrhiza is a Chinese herb with significant pharmacologic effects because of the bioactive compounds of tanshinones and phenolic acids. Methyl jasmonate (MeJA) has been used as an effective elicitor to enhance the production of these compounds. However, the molecular mechanism of MeJA-mediated tanshinone and salvianolic acid biosynthesis remains unclear. The transcriptional profiles of S. miltiorrhiza leaves at 12 h (T12) after MeJA elicitation and mock-treated leaves (T0) were generated using the Illumina deep RNA sequencing (RNA-seq) strategy to detect the changes in gene expression in response to MeJA. In total, 37 647 unique sequences were obtained from about 21 million reads, and 25 641 (71.53%) of these sequences were annotated based on the blast searches against the public databases. A total of 5287 unique sequences were expressed differentially between the samples of T0 and T12, which covered almost all the known genes involved in tanshinone and phenolic acid biosynthesis in S. miltiorrhiza. Many of the transcription factors (e.g. MYB, bHLH and WRKY) and genes involved in plant hormone biosynthesis and signal transduction were expressed differentially in response to the MeJA induction. Importantly, three and four candidate cytochrome P450s (P450s) that could be involved in the tanshinone and phenolic acid biosynthesis, respectively, were selected from the RNA-seq data based on co-expressed pattern analysis with SmCPS1/SmKSL1 and SmRAS, which are the key genes responsible for biosynthesis. This comprehensive investigation of MeJA-induced gene expression profiles can shed light on the molecular mechanisms of the MeJA-mediated bioactive compound biosynthesis and regulation in S. miltiorrhiza.
Collapse
Affiliation(s)
- Hongmei Luo
- The National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Xia EH, Jiang JJ, Huang H, Zhang LP, Zhang HB, Gao LZ. Transcriptome analysis of the oil-rich tea plant, Camellia oleifera, reveals candidate genes related to lipid metabolism. PLoS One 2014; 9:e104150. [PMID: 25136805 PMCID: PMC4138098 DOI: 10.1371/journal.pone.0104150] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2014] [Accepted: 07/09/2014] [Indexed: 11/18/2022] Open
Abstract
Background Rapidly driven by the need for developing sustainable sources of nutritionally important fatty acids and the rising concerns about environmental impacts after using fossil oil, oil-plants have received increasing awareness nowadays. As an important oil-rich plant in China, Camellia oleifera has played a vital role in providing nutritional applications, biofuel productions and chemical feedstocks. However, the lack of C. oleifera genome sequences and little genetic information have largely hampered the urgent needs for efficient utilization of the abundant germplasms towards modern breeding efforts of this woody oil-plant. Results Here, using the 454 GS-FLX sequencing platform, we generated approximately 600,000 RNA-Seq reads from four tissues of C. oleifera. These reads were trimmed and assembled into 104,842 non-redundant putative transcripts with a total length of ∼38.9 Mb, representing more than 218-fold of all the C. oleifera sequences currently deposited in the GenBank (as of March 2014). Based on the BLAST similarity searches, nearly 42.6% transcripts could be annotated with known genes, conserved domains, or Gene Ontology (GO) terms. Comparisons with the cultivated tea tree, C. sinensis, identified 3,022 pairs of orthologs, of which 211 exhibited the evidence under positive selection. Pathway analysis detected the majority of genes potentially related to lipid metabolism. Evolutionary analysis of omega-6 fatty acid desaturase (FAD2) genes among 20 oil-plants unexpectedly suggests that a parallel evolution may occur between C. oleifera and Olea oleifera. Additionally, more than 2,300 simple sequence repeats (SSRs) and 20,200 single-nucleotide polymorphisms (SNPs) were detected in the C. oleifera transcriptome. Conclusions The generated transcriptome represents a considerable increase in the number of sequences deposited in the public databases, providing an unprecedented opportunity to discover all related-genes associated with lipid metabolic pathway in C. oleifera. It will greatly enhance the generation of new varieties of C. oleifera with increased yields and high quality.
Collapse
Affiliation(s)
- En-Hua Xia
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Jian-Jun Jiang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
| | - Hui Huang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
| | - Li-Ping Zhang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
| | - Hai-Bin Zhang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Li-Zhi Gao
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
- * E-mail:
| |
Collapse
|
18
|
Hook SE, Twine NA, Simpson SL, Spadaro DA, Moncuquet P, Wilkins MR. 454 pyrosequencing-based analysis of gene expression profiles in the amphipod Melita plumulosa: transcriptome assembly and toxicant induced changes. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2014; 153:73-88. [PMID: 24434169 DOI: 10.1016/j.aquatox.2013.11.022] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2013] [Revised: 11/26/2013] [Accepted: 11/28/2013] [Indexed: 05/20/2023]
Abstract
Next generation sequencing using Roche's 454 pyrosequencing platform can be used to generate genomic information for non-model organisms, although there are bioinformatic challenges associated with these studies. These challenges are compounded by a lack of a standardized protocol to either assemble data or to evaluate the quality of a de novo transcriptome. This study presents an assembly of the control and toxicant responsive transcriptome of Melita plumulosa, an Australian amphipod commonly used in ecotoxicological studies. RNA was harvested from control amphipods, juvenile amphipods, and from amphipods exposed to either metal or diesel contaminated sediments. This RNA was used as the basis for a 454 based transcriptome sequencing effort. Sequencing generated 1.3 million reads from control, juvenile, metal-exposed and diesel-exposed amphipods. Different read filtering and assembly protocols were evaluated to generate an assembly that (i) had an optimal number of contigs; (ii) had long contigs; (iii) contained a suitable representation of conserved genes; and (iv) had long ortholog alignment lengths relative to the length of each contig. A final assembly, generated using fixed-length trimming based on the sequence quality scores, followed by assembly using the MIRA algorithm, produced the best results. The 26,625 contigs generated via this approach were annotated using Blast2GO, and the differential expression between treatments and control was determined by mapping with BWA followed by DESeq. Although the mapping generated low coverage, many differentially expressed contigs, including some with known developmental or toxicological function, were identified. This study demonstrated that 454 pyrosequencing is an effective means of generating reference transcriptome information for organisms, such as the amphipod M. plumulosa, that have no genomic information available in databases or in closely related sequenced species. It also demonstrated how optimization of read filtering protocols and assembly approaches changes the utility of results obtained from next generation sequencing studies, and establishes criteria to determine the quality of a de novo assembly in species lacking a reference genome. This new transcriptomic knowledge provides the genomic foundation for the creation of microarray and qPCR assays, serving as a reference transcriptome in future RNAseq studies, and allowing both the biology and ecotoxicology of this organism to be better understood. This approach will allow genomics-based methodology to be applied to a wider range of environmentally relevant species.
Collapse
Affiliation(s)
- Sharon E Hook
- CSIRO Land and Water, Locked Bag 2007, Kirrawee, NSW 2232, Australia.
| | - Natalie A Twine
- NSW Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW 2052, Australia
| | - Stuart L Simpson
- CSIRO Land and Water, Locked Bag 2007, Kirrawee, NSW 2232, Australia
| | - David A Spadaro
- CSIRO Land and Water, Locked Bag 2007, Kirrawee, NSW 2232, Australia
| | - Philippe Moncuquet
- CSIRO Mathematics, Informatics, and Statistics, Acton, ACT, 2601, Australia
| | - Marc R Wilkins
- NSW Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW 2052, Australia
| |
Collapse
|
19
|
Hook SE, Osborn HL, Gissi F, Moncuquet P, Twine NA, Wilkins MR, Adams MS. RNA-Seq analysis of the toxicant-induced transcriptome of the marine diatom, Ceratoneis closterium. Mar Genomics 2014; 16:45-53. [DOI: 10.1016/j.margen.2013.12.004] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2013] [Revised: 11/29/2013] [Accepted: 12/18/2013] [Indexed: 01/17/2023]
|
20
|
Gallardo-Escárate C, Valenzuela-Muñoz V, Nuñez-Acuña G. RNA-Seq analysis using de novo transcriptome assembly as a reference for the salmon louse Caligus rogercresseyi. PLoS One 2014; 9:e92239. [PMID: 24691066 PMCID: PMC3972170 DOI: 10.1371/journal.pone.0092239] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Accepted: 02/19/2014] [Indexed: 11/22/2022] Open
Abstract
Despite the economic and environmental impacts that sea lice infestations have on salmon farming worldwide, genomic data generated by high-throughput transcriptome sequencing for different developmental stages, sexes, and strains of sea lice is still limited or unknown. In this study, RNA-seq analysis was performed using de novo transcriptome assembly as a reference for evidenced transcriptional changes from six developmental stages of the salmon louse Caligus rogercresseyi. EST-datasets were generated from the nauplius I, nauplius II, copepodid and chalimus stages and from female and male adults using MiSeq Illumina sequencing. A total of 151,788,682 transcripts were yielded, which were assembled into 83,444 high quality contigs and subsequently annotated into roughly 24,000 genes based on known proteins. To identify differential transcription patterns among salmon louse stages, cluster analyses were performed using normalized gene expression values. Herein, four clusters were differentially expressed between nauplius I–II and copepodid stages (604 transcripts), five clusters between copepodid and chalimus stages (2,426 transcripts), and six clusters between female and male adults (2,478 transcripts). Gene ontology analysis revealed that the nauplius I–II, copepodid and chalimus stages are mainly annotated to aminoacid transfer/repair/breakdown, metabolism, molting cycle, and nervous system development. Additionally, genes showing differential transcription in female and male adults were highly related to cytoskeletal and contractile elements, reproduction, cell development, morphogenesis, and transcription-translation processes. The data presented in this study provides the most comprehensive transcriptome resource available for C. rogercresseyi, which should be used for future genomic studies linked to host-parasite interactions.
Collapse
Affiliation(s)
- Cristian Gallardo-Escárate
- Laboratory of Biotechnology and Aquatic Genomics, Interdisciplinary Center for Aquaculture Research (INCAR), University of Concepción, Concepción, Chile
| | - Valentina Valenzuela-Muñoz
- Laboratory of Biotechnology and Aquatic Genomics, Interdisciplinary Center for Aquaculture Research (INCAR), University of Concepción, Concepción, Chile
| | - Gustavo Nuñez-Acuña
- Laboratory of Biotechnology and Aquatic Genomics, Interdisciplinary Center for Aquaculture Research (INCAR), University of Concepción, Concepción, Chile
| |
Collapse
|
21
|
Mishima K, Fujiwara T, Iki T, Kuroda K, Yamashita K, Tamura M, Fujisawa Y, Watanabe A. Transcriptome sequencing and profiling of expressed genes in cambial zone and differentiating xylem of Japanese cedar (Cryptomeria japonica). BMC Genomics 2014; 15:219. [PMID: 24649833 PMCID: PMC3999911 DOI: 10.1186/1471-2164-15-219] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 03/07/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Forest trees have ecological and economic importance, and Japanese cedar has highly valued wood attributes. Thus, studies of molecular aspects of wood formation offer practical information that may be used for screening and forward genetics approaches to improving wood quality. RESULTS After identifying expressed sequence tags in Japanese cedar tissue undergoing xylogenesis, we designed a custom cDNA microarray to compare expression of highly regulated genes throughout a growing season. This led to identification of candidate genes involved both in wood formation and later cessation of growth and dormancy. Based on homology to orthologous protein groups, the genes were assigned to functional classes. A high proportion of sequences fell into functional classes related to posttranscriptional modification and signal transduction, while transcription factors and genes involved in the metabolism of sugars, cell-wall synthesis and lignification, and cold hardiness were among other classes of genes identified as having a potential role in xylem formation and seasonal wood formation. CONCLUSIONS We obtained 55,051 unique sequences by next-generation sequencing of a cDNA library prepared from cambial meristem and derivative cells. Previous studies on conifers have identified unique sequences expressed in developing xylem, but this is the first comprehensive study utilizing a collection of expressed sequence tags for expression studies related to xylem formation in Japanese cedar, which belongs to a different lineage than the Pinaceae. Our characterization of these sequences should allow comparative studies of genome evolution and functional genetics of wood species.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Atsushi Watanabe
- Department of Forest Environmental Sciences, Faculty of Agriculture, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka 812-8581, Japan.
| |
Collapse
|
22
|
Huang J, Liu X, Wang J, Lü Y. Transcriptomic analysis of Asiatic lily in the process of vernalization via RNA-seq. Mol Biol Rep 2014; 41:3839-52. [PMID: 24557890 DOI: 10.1007/s11033-014-3250-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 02/07/2014] [Indexed: 01/05/2023]
Abstract
The new transcriptomes provided comprehensive sequence profiling data of transcriptomic variation during vernalization in Lily Asiatic Hybrids 'Tiny ghost'. A number of 52,277,184 sequencing raw reads totaling 5.11 Gbp of the chilling treatment (4 °C) sample and 39,466,176 sequencing raw reads totaling 3.85 Gbp of room temperature control (25 °C) sample were assembled de novo into 68,718 unigenes with a mean length of 984 bp, and a total of 33,208 (45.6%) unigenes were annotated by using public protein databases with a cut-off E value about 10(-5). There are 6,153 unigenes of which were assigned to specific metabolic pathways by the Kyoto encyclopedia of genes and genomes. Gene Ontology analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with signal transduction mechanisms, posttranslational modification, protein turnover and chaperones. In addition, the genes expression levels were compared just after vernalization completion between the cold treatment and room temperature control. There are 68,116 unigenes were differentially expressed, and hierarchical clustering analysis arranged 7,301 significantly differentially-expressed unigenes into 56 groups. Six genes related to the vernalization were selected to confirm their expression levels by using quantitative real-time polymerase chain reaction. Furthermore, typical vernalization unigenes VRN1 and VRN2 were identified, and also some vernalization-associated unigenes, such as CBF, SOC, TaAGL, AP2, LEA, LIM et al. were also annotated in the present study. As for VRN1 and VRN2, their expressions were consistent with some previous related studies. Also, this was the first time the vernalization genes VRN1 and VRN2 were founded in lily. According to the results of the present studies, we predicted that they would play an important role during vernalization in Lily Asiatic Hybrids; these data provided the foundation for the future studies of metabolism during vernalization of Asiatic lily.
Collapse
Affiliation(s)
- Jie Huang
- College of Landscape Architecture, Beijing Forestry University, No. 35 Qinghua East Road, Beijing, 100083, People's Republic of China
| | | | | | | |
Collapse
|
23
|
Cavallini E, Zenoni S, Finezzo L, Guzzo F, Zamboni A, Avesani L, Tornielli GB. Functional Diversification of Grapevine MYB5a and MYB5b in the Control of Flavonoid Biosynthesis in a Petunia Anthocyanin Regulatory Mutant. ACTA ACUST UNITED AC 2014; 55:517-34. [DOI: 10.1093/pcp/pct190] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
|
24
|
Wang L, Zhang XW, Pan LL, Liu WF, Wang DP, Zhang GY, Yin YX, Yin A, Jia SG, Yu XG, Sun GY, Hu SN, Al-Mssallem IS, Yu J. A large-scale gene discovery for the red palm weevil Rhynchophorus ferrugineus (Coleoptera: Curculionidae). INSECT SCIENCE 2013; 20:689-702. [PMID: 23955844 DOI: 10.1111/j.1744-7917.2012.01561.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 06/05/2012] [Indexed: 06/02/2023]
Abstract
The red palm weevil (RPW; Rhynchophorus ferrugineus) is a devastating pest of palms, prevalent in the Middle East as well as many other regions of the world. Here, we report a large-scale de novo complementary DNA (cDNA) sequencing effort that acquired ∼5 million reads and assembled them into 26 765 contigs from 12 libraries made from samples of different RPW developmental stages based on the Roche/454 GS FLX platform. We annotated these contigs based on the publically available known insect genes and the Tribolium castaneum genome assembly. We find that over 80% of coding sequences (CDS) from the RPW contigs have high-identity homologs to known proteins with complete CDS. Gene expression analysis shows that the pupa and larval stages have the highest and lowest expression levels, respectively. In addition, we also identified more than 60 000 single nucleotide polymorphisms and 1 200 simple sequence repeat markers. This study provides the first large-scale cDNA dataset for RPW, a much-needed resource for future molecular studies.
Collapse
Affiliation(s)
- Lei Wang
- Joint Center for Genomics Research (JCGR), King Abdulaziz City for Science and Technology (KACST) and Chinese Academy of Sciences (CAS), Riyadh, Kingdom of Saudi Arabia; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing; Graduate University of Chinese Academy of Sciences, Beijing, China
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Bagnaresi P, Sala T, Irdani T, Scotto C, Lamontanara A, Beretta M, Rotino GL, Sestili S, Cattivelli L, Sabatini E. Solanum torvum responses to the root-knot nematode Meloidogyne incognita. BMC Genomics 2013; 14:540. [PMID: 23937585 PMCID: PMC3750854 DOI: 10.1186/1471-2164-14-540] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2013] [Accepted: 08/05/2013] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Solanum torvum Sw is worldwide employed as rootstock for eggplant cultivation because of its vigour and resistance/tolerance to the most serious soil-borne diseases as bacterial, fungal wilts and root-knot nematodes. The little information on Solanum torvum (hereafter Torvum) resistance mechanisms, is mostly attributable to the lack of genomic tools (e.g. dedicated microarray) as well as to the paucity of database information limiting high-throughput expression studies in Torvum. RESULTS As a first step towards transcriptome profiling of Torvum inoculated with the nematode M. incognita, we built a Torvum 3' transcript catalogue. One-quarter of a 454 full run resulted in 205,591 quality-filtered reads. De novo assembly yielded 24,922 contigs and 11,875 singletons. Similarity searches of the S. torvum transcript tags catalogue produced 12,344 annotations. A 30,0000 features custom combimatrix chip was then designed and microarray hybridizations were conducted for both control and 14 dpi (day post inoculation) with Meloidogyne incognita-infected roots samples resulting in 390 differentially expressed genes (DEG). We also tested the chip with samples from the phylogenetically-related nematode-susceptible eggplant species Solanum melongena. An in-silico validation strategy was developed based on assessment of sequence similarity among Torvum probes and eggplant expressed sequences available in public repositories. GO term enrichment analyses with the 390 Torvum DEG revealed enhancement of several processes as chitin catabolism and sesquiterpenoids biosynthesis, while no GO term enrichment was found with eggplant DEG.The genes identified from S. torvum catalogue, bearing high similarity to known nematode resistance genes, were further investigated in view of their potential role in the nematode resistance mechanism. CONCLUSIONS By combining 454 pyrosequencing and microarray technology we were able to conduct a cost-effective global transcriptome profiling in a non-model species. In addition, the development of an in silico validation strategy allowed to further extend the use of the custom chip to a related species and to assess by comparison the expression of selected genes without major concerns of artifacts. The expression profiling of S. torvum responses to nematode infection points to sesquiterpenoids and chitinases as major effectors of nematode resistance. The availability of the long sequence tags in S. torvum catalogue will allow precise identification of active nematocide/nematostatic compounds and associated enzymes posing the basis for exploitation of these resistance mechanisms in other species.
Collapse
Affiliation(s)
- Paolo Bagnaresi
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Genomics Research Centre, via S Protaso 302, I-29107, Fiorenzuola d’Arda (PC), Italy
| | - Tea Sala
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Unità di Ricerca per l’Orticoltura, Montanaso Lombardo (LO), Italy
| | - Tiziana Irdani
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Centro di ricerca per l’agrobiologia e la pedologia, Cascine del Riccio, 50125, Firenze, Italy
| | - Cristina Scotto
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Centro di ricerca per l’agrobiologia e la pedologia, Cascine del Riccio, 50125, Firenze, Italy
| | - Antonella Lamontanara
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Genomics Research Centre, via S Protaso 302, I-29107, Fiorenzuola d’Arda (PC), Italy
| | - Massimiliano Beretta
- UNIMORE, Scienze Agrarie e degli Alimenti, Università degli Studi di Modena e Reggio Emilia, via Giovanni Amendola 2, Padiglione Besta, Reggio Emilia 42122, Italy
| | - Giuseppe Leonardo Rotino
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Unità di Ricerca per l’Orticoltura, Montanaso Lombardo (LO), Italy
| | - Sara Sestili
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Unità di Ricerca per l’Orticoltura, Monsampolo del Tronto, AP, Italy
| | - Luigi Cattivelli
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Genomics Research Centre, via S Protaso 302, I-29107, Fiorenzuola d’Arda (PC), Italy
| | - Emidio Sabatini
- Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Unità di Ricerca per l’Orticoltura, Monsampolo del Tronto, AP, Italy
| |
Collapse
|
26
|
Functional Annotation and Comparative Analysis of a Zygopteran Transcriptome. G3-GENES GENOMES GENETICS 2013; 3:763-770. [PMID: 23550132 PMCID: PMC3618363 DOI: 10.1534/g3.113.005637] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In this paper we present a de novo assembly of the transcriptome of the damselfly (Enallagma hageni) through the use of 454 pyrosequencing. E. hageni is a member of the suborder Zygoptera, in the order Odonata, and Odonata organisms form the basal lineage of the winged insects (Pterygota). To date, sequence data used in phylogenetic analysis of Enallagma species have been derived from either mitochondrial DNA or ribosomal nuclear DNA. This Enallagma transcriptome contained 31,661 contigs that were assembled and translated into 14,813 individual open reading frames. Using these data, we constructed an extensive dataset of 634 orthologous nuclear protein-encoding genes across 11 species of Arthropoda and used Bayesian techniques to elucidate the position of Enallagma in the arthropod phylogenetic tree. Additionally, we demonstrated that the Enallagma transcriptome contains 169 genes that are evolving at rates that differ relative to those of the rest of the transcriptome (29 accelerated and 140 decreased), and, through multiple Gene Ontology searches and clustering methods, we present the first functional annotation of any palaeopteran’s transcriptome in the literature.
Collapse
|
27
|
Malenke JR, Milash B, Miller AW, Dearing MD. Transcriptome sequencing and microarray development for the woodrat (Neotoma spp.): custom genetic tools for exploring herbivore ecology. Mol Ecol Resour 2013; 13:674-87. [PMID: 23496907 DOI: 10.1111/1755-0998.12092] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2012] [Revised: 01/31/2013] [Accepted: 02/05/2013] [Indexed: 01/08/2023]
Abstract
Massively parallel sequencing has enabled the creation of novel, in-depth genetic tools for nonmodel, ecologically important organisms. We present the de novo transcriptome sequencing, analysis and microarray development for a vertebrate herbivore, the woodrat (Neotoma spp.). This genus is of ecological and evolutionary interest, especially with respect to ingestion and hepatic metabolism of potentially toxic plant secondary compounds. We generated a liver transcriptome of the desert woodrat (Neotoma lepida) using the Roche 454 platform. The assembled contigs were well annotated using rodent references (99.7% annotation), and biotransformation function was reflected in the gene ontology. The transcriptome was used to develop a custom microarray (eArray, Agilent). We tested the microarray with three experiments: one across species with similar habitat (thus, dietary) niches, one across species with different habitat niches and one across populations within a species. The resulting one-colour arrays had high technical and biological quality. Probes designed from the woodrat transcriptome performed significantly better than functionally similar probes from the Norway rat (Rattus norvegicus). There were a multitude of expression differences across the woodrat treatments, many of which related to biotransformation processes and activities. The pattern and function of the differences indicate shared ecological pressures, and not merely phylogenetic distance, play an important role in shaping gene expression profiles of woodrat species and populations. The quality and functionality of the woodrat transcriptome and custom microarray suggest these tools will be valuable for expanding the scope of herbivore biology, as well as the exploration of conceptual topics in ecology.
Collapse
Affiliation(s)
- J R Malenke
- Department of Biology, University of Utah, Salt Lake City, UT 84112, USA.
| | | | | | | |
Collapse
|
28
|
Gao Y, Xu H, Shen Y, Wang J. Transcriptomic analysis of rice (Oryza sativa) endosperm using the RNA-Seq technique. PLANT MOLECULAR BIOLOGY 2013; 81:363-78. [PMID: 23322175 DOI: 10.1007/s11103-013-0009-4] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2012] [Accepted: 12/31/2012] [Indexed: 05/11/2023]
Abstract
The endosperm plays an important role in seed formation and germination, especially in rice (Oryza sativa). We used a high-throughput sequencing technique (RNA-Seq) to reveal the molecular mechanisms involved in rice endosperm development. Three cDNA libraries were taken from rice endosperm at 3, 6 and 10 days after pollination (DAP), which resulted in the detection of 21,596, 20,910 and 19,459 expressed gens, respectively. By ERANGE, we identified 10,371 differentially expressed genes (log(2)Ratio ≥1, FDR ≤0.001). The results were compared against three public databases (Gene Ontology, Kyoto Encyclopedia of Genes and Genomes and MapMan) in order to annotate the gene descriptions, associate them with gene ontology terms and to assign each to pathways. A large number of genes related to ribosomes, the spliceosome and oxidative phosphorylation were found to be expressed in the early and middle stages. Plant hormone, galactose metabolism and carbon fixation related genes showed a significant increase in expression at the middle stage, whereas genes for defense against disease or response to stress as well as genes for starch/sucrose metabolism were strongly expressed during the later stages of endosperm development. Interestingly, most metabolic pathways were down-regulated between 3 and 10 DAP except for those involved in the accumulation of material, such as starch/sucrose and protein metabolism. We also identified the expression of 1,118 putative transcription factor genes in endosperm development. The RNA-Seq results provide further systematic understanding of rice endosperm development at a fine scale and a foundation for future studies.
Collapse
Affiliation(s)
- Yi Gao
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
| | | | | | | |
Collapse
|
29
|
Venturini L, Ferrarini A, Zenoni S, Tornielli GB, Fasoli M, Santo SD, Minio A, Buson G, Tononi P, Zago ED, Zamperin G, Bellin D, Pezzotti M, Delledonne M. De novo transcriptome characterization of Vitis vinifera cv. Corvina unveils varietal diversity. BMC Genomics 2013; 14:41. [PMID: 23331995 PMCID: PMC3556335 DOI: 10.1186/1471-2164-14-41] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2012] [Accepted: 01/11/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Plants such as grapevine (Vitis spp.) display significant inter-cultivar genetic and phenotypic variation. The genetic components underlying phenotypic diversity in grapevine must be understood in order to disentangle genetic and environmental factors. RESULTS We have shown that cDNA sequencing by RNA-seq is a robust approach for the characterization of varietal diversity between a local grapevine cultivar (Corvina) and the PN40024 reference genome. We detected 15,161 known genes including 9463 with novel splice isoforms, and identified 2321 potentially novel protein-coding genes in non-annotated or unassembled regions of the reference genome. We also discovered 180 apparent private genes in the Corvina genome which were missing from the reference genome. CONCLUSIONS The de novo assembly approach allowed a substantial amount of the Corvina transcriptome to be reconstructed, improving known gene annotations by robustly defining gene structures, annotating splice isoforms and detecting genes without annotations. The private genes we discovered are likely to be nonessential but could influence certain cultivar-specific characteristics. Therefore, the application of de novo transcriptome assembly should not be restricted to species lacking a reference genome because it can also improve existing reference genome annotations and identify novel, cultivar-specific genes.
Collapse
Affiliation(s)
- Luca Venturini
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Alberto Ferrarini
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Sara Zenoni
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | | | - Marianna Fasoli
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Silvia Dal Santo
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Andrea Minio
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Genny Buson
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Paola Tononi
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Elisa Debora Zago
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Gianpiero Zamperin
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Diana Bellin
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Mario Pezzotti
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| | - Massimo Delledonne
- Biotechnology Department, University of Verona, Strada Le Grazie 15, I-37134, Verona, Italy
| |
Collapse
|
30
|
Bougas B, Normandeau E, Audet C, Bernatchez L. Linking transcriptomic and genomic variation to growth in brook charr hybrids (Salvelinus fontinalis, Mitchill). Heredity (Edinb) 2013; 110:492-500. [PMID: 23321707 DOI: 10.1038/hdy.2012.117] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Hybridization can lead to phenotypic differences arising from changes in gene expression patterns or new allele combinations. Variation in gene expression is thought to be controlled by differences in transcription regulation of parental alleles, either through cis- or trans-regulatory elements. A previous study among brook charr hybrids from different populations (Rupert, Laval, and domestic) showing distinct length at age during early life stages also revealed different patterns in transcription regulation inheritance of transcript abundance. In the present study, transcript abundance using RNA-sequencing and quantitative real-time PCR, single-nucleotide polymorphism (SNP) genotypes and allelic imbalance were assessed in order to understand the molecular mechanisms underlying the observed transcriptomic and differences in length at age among domestic × Rupert hybrids and Laval × domestic hybrids. We found 198 differentially expressed genes between the two hybrid crosses, and allelic imbalance could be analyzed for 69 of them. Among these 69 genes, 36 genes exhibited cis-acting regulatory effects in both of the two crosses, thus confirming the prevalent role of cis-acting regulatory elements in the regulation of differentially expressed genes among intraspecific hybrids. In addition, we detected a significant association between SNP genotypes of three genes and length at age. Our study is thus one of the few that have highlighted some of the molecular mechanisms potentially involved in the differential phenotypic expression in intraspecific hybrids for nonmodel species.
Collapse
Affiliation(s)
- B Bougas
- Département de biologie, Institut de Biologie Intégrative et des Systèmes IBIS, Université Laval, Québec, Canada.
| | | | | | | |
Collapse
|
31
|
Sweetman C, Wong DC, Ford CM, Drew DP. Transcriptome analysis at four developmental stages of grape berry (Vitis vinifera cv. Shiraz) provides insights into regulated and coordinated gene expression. BMC Genomics 2012; 13:691. [PMID: 23227855 PMCID: PMC3545830 DOI: 10.1186/1471-2164-13-691] [Citation(s) in RCA: 100] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2012] [Accepted: 12/04/2012] [Indexed: 11/21/2022] Open
Abstract
Background Vitis vinifera berry development is characterised by an initial phase where the fruit is small, hard and acidic, followed by a lag phase known as veraison. In the final phase, berries become larger, softer and sweeter and accumulate an array of organoleptic compounds. Since the physiological and biochemical makeup of grape berries at harvest has a profound impact on the characteristics of wine, there is great interest in characterising the molecular and biophysical changes that occur from flowering through veraison and ripening, including the coordination and temporal regulation of metabolic gene pathways. Advances in deep-sequencing technologies, combined with the availability of increasingly accurate V. vinifera genomic and transcriptomic data, have enabled us to carry out RNA-transcript expression analysis on a global scale at key points during berry development. Results A total of 162 million 100-base pair reads were generated from pooled Vitis vinifera (cv. Shiraz) berries sampled at 3-weeks post-anthesis, 10- and 11-weeks post-anthesis (corresponding to early and late veraison) and at 17-weeks post-anthesis (harvest). Mapping reads from each developmental stage (36-45 million) onto the NCBI RefSeq transcriptome of 23,720 V. vinifera mRNAs revealed that at least 75% of these transcripts were detected in each sample. RNA-Seq analysis uncovered 4,185 transcripts that were significantly upregulated at a single developmental stage, including 161 transcription factors. Clustering transcripts according to distinct patterns of transcription revealed coordination in metabolic pathways such as organic acid, stilbene and terpenoid metabolism. From the phenylpropanoid/stilbene biosynthetic pathway at least 46 transcripts were upregulated in ripe berries when compared to veraison and immature berries, and 12 terpene synthases were predominantly detected only in a single sample. Quantitative real-time PCR was used to validate the expression pattern of 12 differentially expressed genes from primary and secondary metabolic pathways. Conclusions In this study we report the global transcriptional profile of Shiraz grapes at key stages of development. We have undertaken a comprehensive analysis of gene families contributing to commercially important berry characteristics and present examples of co-regulation and differential gene expression. The data reported here will provide an invaluable resource for the on-going molecular investigation of wine grapes.
Collapse
Affiliation(s)
- Crystal Sweetman
- Wine Science and Business, School of Agriculture Food and Wine, University of Adelaide, Waite Campus, Urrbrae, South Australia, 5064, Australia
| | | | | | | |
Collapse
|
32
|
Zeng V, Extavour CG. ASGARD: an open-access database of annotated transcriptomes for emerging model arthropod species. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2012. [PMID: 23180770 PMCID: PMC3504982 DOI: 10.1093/database/bas048] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The increased throughput and decreased cost of next-generation sequencing (NGS) have shifted the bottleneck genomic research from sequencing to annotation, analysis and accessibility. This is particularly challenging for research communities working on organisms that lack the basic infrastructure of a sequenced genome, or an efficient way to utilize whatever sequence data may be available. Here we present a new database, the Assembled Searchable Giant Arthropod Read Database (ASGARD). This database is a repository and search engine for transcriptomic data from arthropods that are of high interest to multiple research communities but currently lack sequenced genomes. We demonstrate the functionality and utility of ASGARD using de novo assembled transcriptomes from the milkweed bug Oncopeltus fasciatus, the cricket Gryllus bimaculatus and the amphipod crustacean Parhyale hawaiensis. We have annotated these transcriptomes to assign putative orthology, coding region determination, protein domain identification and Gene Ontology (GO) term annotation to all possible assembly products. ASGARD allows users to search all assemblies by orthology annotation, GO term annotation or Basic Local Alignment Search Tool. User-friendly features of ASGARD include search term auto-completion suggestions based on database content, the ability to download assembly product sequences in FASTA format, direct links to NCBI data for predicted orthologs and graphical representation of the location of protein domains and matches to similar sequences from the NCBI non-redundant database. ASGARD will be a useful repository for transcriptome data from future NGS studies on these and other emerging model arthropods, regardless of sequencing platform, assembly or annotation status. This database thus provides easy, one-stop access to multi-species annotated transcriptome information. We anticipate that this database will be useful for members of multiple research communities, including developmental biology, physiology, evolutionary biology, ecology, comparative genomics and phylogenomics. Database URL:asgard.rc.fas.harvard.edu
Collapse
Affiliation(s)
- Victor Zeng
- Department of Organismic and Evolutionary Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA
| | | |
Collapse
|
33
|
Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae). Heredity (Edinb) 2012; 110:181-93. [PMID: 23149455 DOI: 10.1038/hdy.2012.76] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations.
Collapse
|
34
|
Marinković M, de Leeuw WC, de Jong M, Kraak MHS, Admiraal W, Breit TM, Jonker MJ. Combining next-generation sequencing and microarray technology into a transcriptomics approach for the non-model organism Chironomus riparius. PLoS One 2012; 7:e48096. [PMID: 23133553 PMCID: PMC3485019 DOI: 10.1371/journal.pone.0048096] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2012] [Accepted: 09/19/2012] [Indexed: 12/22/2022] Open
Abstract
Whole-transcriptome gene-expression analyses are commonly performed in species that have a sequenced genome and for which microarrays are commercially available. To do such analyses in species with no or limited genome data, i.e. non-model organisms, necessary transcriptomics resources, i.e. an annotated transcriptome and a validated gene-expression microarray, must first be developed. The aim of the present study was to establish an advanced approach for developing transcriptomics resources for non-model organisms by combining next-generation sequencing (NGS) and microarray technology. We applied our approach to the non-biting midge Chironomus riparius, an ecologically relevant species that is widely used in sediment ecotoxicity testing. We sampled extensively covering all C. riparius developmental stages as well as toxicant exposed larvae and obtained from a normalized cDNA library 1.5 M NGS reads totalling 501 Mbp. Using the NGS data we developed transcriptomics resources in several steps. First, we designed 844 k probes directly on the NGS reads, as well as 76 k probes targeting expressed sequence tags of related species. These probes were tested for their affinity to C. riparius DNA and mRNA, by performing two biological experiments with a 1 M probe-selection microarray that contained the entire probe-library. Subsequently, the 1.5 M NGS reads were assembled into 23,709 isotigs and 135,082 singletons, which were associated to ∼55 k, respectively, ∼61 k gene ontology terms and which corresponded together to 22,593 unique protein accessions. An algorithm was developed that took the assembly and the probe affinities to DNA and mRNA into account, what resulted in 59 k highly-reliable probes that targeted uniquely 95% of the isotigs and 18% of the singletons. Concluding, our approach allowed the development of high-quality transcriptomics resources for C. riparius, and is applicable to any non-model organism. It is expected, that these resources will advance ecotoxicity testing with C. riparius as whole-transcriptome gene-expression analysis are now possible with this species.
Collapse
Affiliation(s)
- Marino Marinković
- Microarray Department and Integrative Bioinformatics Unit, Swammerdam Institute for Life Sciences (SILS), University of Amsterdam, Amsterdam, The Netherlands
- Department of Aquatic Ecology and Ecotoxicology, Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, The Netherlands
| | - Wim C. de Leeuw
- Microarray Department and Integrative Bioinformatics Unit, Swammerdam Institute for Life Sciences (SILS), University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Bioinformatics Centre (NBIC), Nijmegen, The Netherlands
| | - Mark de Jong
- Microarray Department and Integrative Bioinformatics Unit, Swammerdam Institute for Life Sciences (SILS), University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Bioinformatics Centre (NBIC), Nijmegen, The Netherlands
| | - Michiel H. S. Kraak
- Department of Aquatic Ecology and Ecotoxicology, Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, The Netherlands
| | - Wim Admiraal
- Department of Aquatic Ecology and Ecotoxicology, Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, The Netherlands
| | - Timo M. Breit
- Microarray Department and Integrative Bioinformatics Unit, Swammerdam Institute for Life Sciences (SILS), University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Bioinformatics Centre (NBIC), Nijmegen, The Netherlands
| | - Martijs J. Jonker
- Microarray Department and Integrative Bioinformatics Unit, Swammerdam Institute for Life Sciences (SILS), University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Bioinformatics Centre (NBIC), Nijmegen, The Netherlands
- * E-mail:
| |
Collapse
|
35
|
Zhang Y, Pei X, Zhang C, Lu Z, Wang Z, Jia S, Li W. De novo foliar transcriptome of Chenopodium amaranticolor and analysis of its gene expression during virus-induced hypersensitive response. PLoS One 2012; 7:e45953. [PMID: 23029338 PMCID: PMC3461033 DOI: 10.1371/journal.pone.0045953] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2012] [Accepted: 08/23/2012] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND The hypersensitive response (HR) system of Chenopodium spp. confers broad-spectrum virus resistance. However, little knowledge exists at the genomic level for Chenopodium, thus impeding the advanced molecular research of this attractive feature. Hence, we took advantage of RNA-seq to survey the foliar transcriptome of C. amaranticolor, a Chenopodium species widely used as laboratory indicator for pathogenic viruses, in order to facilitate the characterization of the HR-type of virus resistance. METHODOLOGY AND PRINCIPAL FINDINGS Using Illumina HiSeq™ 2000 platform, we obtained 39,868,984 reads with 3,588,208,560 bp, which were assembled into 112,452 unigenes (3,847 clusters and 108,605 singletons). BlastX search against the NCBI NR database identified 61,698 sequences with a cut-off E-value above 10(-5). Assembled sequences were annotated with gene descriptions, GO, COG and KEGG terms, respectively. A total number of 738 resistance gene analogs (RGAs) and homology sequences of 6 key signaling proteins within the R proteins-directed signaling pathway were identified. Based on this transcriptome data, we investigated the gene expression profiles over the stage of HR induced by Tobacco mosaic virus and Cucumber mosaic virus by using digital gene expression analysis. Numerous candidate genes specifically or commonly regulated by these two distinct viruses at early and late stages of the HR were identified, and the dynamic changes of the differently expressed genes enriched in the pathway of plant-pathogen interaction were particularly emphasized. CONCLUSIONS To our knowledge, this study is the first description of the genetic makeup of C. amaranticolor, providing deep insight into the comprehensive gene expression information at transcriptional level in this species. The 738 RGAs as well as the differentially regulated genes, particularly the common genes regulated by both TMV and CMV, are suitable candidates which merit further functional characterization to dissect the molecular mechanisms and regulatory pathways of the HR-type of virus resistance in Chenopodium.
Collapse
Affiliation(s)
- Yongqiang Zhang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
| | - Xinwu Pei
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
| | - Chao Zhang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
| | - Zifeng Lu
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, People's Republic of China
| | - Zhixing Wang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
| | - Shirong Jia
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
| | - Weimin Li
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, People's Republic of China
- * E-mail:
| |
Collapse
|
36
|
Park W, Scheffler BE, Bauer PJ, Campbell BT. Genome-wide identification of differentially expressed genes under water deficit stress in upland cotton (Gossypium hirsutum L.). BMC PLANT BIOLOGY 2012; 12:90. [PMID: 22703539 PMCID: PMC3438127 DOI: 10.1186/1471-2229-12-90] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2011] [Accepted: 05/10/2012] [Indexed: 05/17/2023]
Abstract
BACKGROUND Cotton is the world's primary fiber crop and is a major agricultural commodity in over 30 countries. Like many other global commodities, sustainable cotton production is challenged by restricted natural resources. In response to the anticipated increase of agricultural water demand, a major research direction involves developing crops that use less water or that use water more efficiently. In this study, our objective was to identify differentially expressed genes in response to water deficit stress in cotton. A global expression analysis using cDNA-Amplified Fragment Length Polymorphism was conducted to compare root and leaf gene expression profiles from a putative drought resistant cotton cultivar grown under water deficit stressed and well watered field conditions. RESULTS We identified a total of 519 differentially expressed transcript derived fragments. Of these, 147 transcript derived fragment sequences were functionally annotated according to their gene ontology. Nearly 70 percent of transcript derived fragments belonged to four major categories: 1) unclassified, 2) stress/defense, 3) metabolism, and 4) gene regulation. We found heat shock protein-related and reactive oxygen species-related transcript derived fragments to be among the major parts of functional pathways induced by water deficit stress. Also, twelve novel transcripts were identified as both water deficit responsive and cotton specific. A subset of differentially expressed transcript derived fragments was verified using reverse transcription-polymerase chain reaction. Differential expression analysis also identified five pairs of duplicated transcript derived fragments in which four pairs responded differentially between each of their two homologues under water deficit stress. CONCLUSIONS In this study, we detected differentially expressed transcript derived fragments from water deficit stressed root and leaf tissues in tetraploid cotton and provided their gene ontology, functional/biological distribution, and possible roles of gene duplication. This discovery demonstrates complex mechanisms involved with polyploid cotton's transcriptome response to naturally occurring field water deficit stress. The genes identified in this study will provide candidate targets to manipulate the water use characteristics of cotton at the molecular level.
Collapse
Affiliation(s)
- Wonkeun Park
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, SC, USA
| | | | - Philip J Bauer
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, SC, USA
| | - B Todd Campbell
- USDA-ARS, Coastal Plains Soil, Water and Plant Research Center, Florence, SC, USA
| |
Collapse
|
37
|
|
38
|
Strickler SR, Bombarely A, Mueller LA. Designing a transcriptome next-generation sequencing project for a nonmodel plant species. AMERICAN JOURNAL OF BOTANY 2012; 99:257-66. [PMID: 22268224 DOI: 10.3732/ajb.1100292] [Citation(s) in RCA: 133] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
The application of next-generation sequencing (NGS) to transcriptomics, commonly called RNA-seq, allows the nearly complete characterization of transcriptomic events occurring in a specific tissue. It has proven particularly useful in nonmodel species, which often lack the resources available for sequenced organisms. Mainly, RNA-seq does not require a reference genome to gain useful transcriptomic information. In this review, the application of RNA-seq to nonmodel plant species will be addressed. Important experimental considerations from presequencing issues to postsequencing analysis, including sample and platform selection, and useful bioinformatics tools for assembly and data analysis, are covered. Methods of assembling RNA-seq data and analyses commonly performed with RNA-seq data, including single nucleotide polymorphism detection and analysis of differential expression, are explored. In addition, studies that have used RNA-seq to elucidate nonmodel plant transcriptomics are highlighted.
Collapse
Affiliation(s)
- Susan R Strickler
- Boyce Thompson Institute for Plant Research, Tower Road, Ithaca, New York 14853, USA
| | | | | |
Collapse
|
39
|
Habu T, Yamane H, Igarashi K, Hamada K, Yano K, Tao R. 454-Pyrosequencing of the Transcriptome in Leaf and Flower Buds of Japanese Apricot (Prunus mume Sieb. et Zucc.) at Different Dormant Stages. ACTA ACUST UNITED AC 2012. [DOI: 10.2503/jjshs1.81.239] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
40
|
iAssembler: a package for de novo assembly of Roche-454/Sanger transcriptome sequences. BMC Bioinformatics 2011; 12:453. [PMID: 22111509 PMCID: PMC3233632 DOI: 10.1186/1471-2105-12-453] [Citation(s) in RCA: 114] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2011] [Accepted: 11/23/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Expressed Sequence Tags (ESTs) have played significant roles in gene discovery and gene functional analysis, especially for non-model organisms. For organisms with no full genome sequences available, ESTs are normally assembled into longer consensus sequences for further downstream analysis. However current de novo EST assembly programs often generate large number of assembly errors that will negatively affect the downstream analysis. In order to generate more accurate consensus sequences from ESTs, tools are needed to reduce or eliminate errors from de novo assemblies. RESULTS We present iAssembler, a pipeline that can assemble large-scale ESTs into consensus sequences with significantly higher accuracy than current existing assemblers. iAssembler employs MIRA and CAP3 assemblers to generate initial assemblies, followed by identifying and correcting two common types of transcriptome assembly errors: 1) ESTs from different transcripts (mainly alternatively spliced transcripts or paralogs) are incorrectly assembled into same contigs; and 2) ESTs from same transcripts fail to be assembled together. iAssembler can be used to assemble ESTs generated using the traditional Sanger method and/or the Roche-454 massive parallel pyrosequencing technology. CONCLUSION We compared performances of iAssembler and several other de novo EST assembly programs using both Roche-454 and Sanger EST datasets. It demonstrated that iAssembler generated significantly more accurate consensus sequences than other assembly programs.
Collapse
|
41
|
Russell JR, Bayer M, Booth C, Cardle L, Hackett CA, Hedley PE, Jorgensen L, Morris JA, Brennan RM. Identification, utilisation and mapping of novel transcriptome-based markers from blackcurrant (Ribes nigrum). BMC PLANT BIOLOGY 2011; 11:147. [PMID: 22035129 PMCID: PMC3217869 DOI: 10.1186/1471-2229-11-147] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Accepted: 10/28/2011] [Indexed: 05/27/2023]
Abstract
BACKGROUND Deep-level second generation sequencing (2GS) technologies are now being applied to non-model species as a viable and favourable alternative to Sanger sequencing. Large-scale SNP discovery was undertaken in blackcurrant (Ribes nigrum L.) using transcriptome-based 2GS 454 sequencing on the parental genotypes of a reference mapping population, to generate large numbers of novel markers for the construction of a high-density linkage map. RESULTS Over 700,000 reads were produced, from which a total of 7,000 SNPs were found. A subset of polymorphic SNPs was selected to develop a 384-SNP OPA assay using the Illumina BeadXpress platform. Additionally, the data enabled identification of 3,000 novel EST-SSRs. The selected SNPs and SSRs were validated across diverse Ribes germplasm, including mapping populations and other selected Ribes species.SNP-based maps were developed from two blackcurrant mapping populations, incorporating 48% and 27% of assayed SNPs respectively. A relatively high proportion of visually monomorphic SNPs were investigated further by quantitative trait mapping of theta score outputs from BeadStudio analysis, and this enabled additional SNPs to be placed on the two maps. CONCLUSIONS The use of 2GS technology for the development of markers is superior to previously described methods, in both numbers of markers and biological informativeness of those markers. Whilst the numbers of reads and assembled contigs were comparable to similar sized studies of other non-model species, here a high proportion of novel genes were discovered across a wide range of putative function and localisation. The potential utility of markers developed using the 2GS approach in downstream breeding applications is discussed.
Collapse
Affiliation(s)
- Joanne R Russell
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Micha Bayer
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Clare Booth
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Linda Cardle
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Christine A Hackett
- Biomathematics and Statistics Scotland, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Pete E Hedley
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Linzi Jorgensen
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Jenny A Morris
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Rex M Brennan
- Cell & Molecular Sciences, James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| |
Collapse
|
42
|
Sloan DB, Keller SR, Berardi AE, Sanderson BJ, Karpovich JF, Taylor DR. De novo transcriptome assembly and polymorphism detection in the flowering plant Silene vulgaris (Caryophyllaceae). Mol Ecol Resour 2011; 12:333-43. [PMID: 21999839 DOI: 10.1111/j.1755-0998.2011.03079.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Members of the angiosperm genus Silene are widely used in studies of ecology and evolution, but available genomic and population genetic resources within Silene remain limited. Deep transcriptome (i.e. expressed sequence tag or EST) sequencing has proven to be a rapid and cost-effective means to characterize gene content and identify polymorphic markers in non-model organisms. In this study, we report the results of 454 GS-FLX Titanium sequencing of a polyA-selected and normalized cDNA library from Silene vulgaris. The library was generated from a single pool of transcripts, combining RNA from leaf, root and floral tissue from three genetically divergent European subpopulations of S. vulgaris. A single full-plate 454 run produced 959,520 reads totalling 363.6 Mb of sequence data with an average read length of 379.0 bp after quality trimming and removal of custom library adaptors. We assembled 832,251 (86.7%) of these reads into 40,964 contigs, which have a total length of 25.4 Mb and can be organized into 18,178 graph-based clusters or 'isogroups'. Assembled sequences were annotated based on homology to genes in multiple public databases. Analysis of sequence variants identified 13,432 putative single-nucleotide polymorphisms (SNPs) and 1320 simple sequence repeats (SSRs) that are candidates for microsatellite analysis. Estimates of nucleotide diversity from 1577 contigs were used to generate genome-wide distributions that revealed several outliers with high diversity. All of these resources are publicly available through NCBI and/or our website (http://silenegenomics.biology.virginia.edu) and should provide valuable genomic and population genetic tools for the Silene research community.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, University of Virginia, Charlottesville, VA 22903, USA.
| | | | | | | | | | | |
Collapse
|
43
|
Zenoni S, D'Agostino N, Tornielli GB, Quattrocchio F, Chiusano ML, Koes R, Zethof J, Guzzo F, Delledonne M, Frusciante L, Gerats T, Pezzotti M. Revealing impaired pathways in the an11 mutant by high-throughput characterization of Petunia axillaris and Petunia inflata transcriptomes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2011; 68:11-27. [PMID: 21623977 DOI: 10.1111/j.1365-313x.2011.04661.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Petunia is an excellent model system, especially for genetic, physiological and molecular studies. Thus far, however, genome-wide expression analysis has been applied rarely because of the lack of sequence information. We applied next-generation sequencing to generate, through de novo read assembly, a large catalogue of transcripts for Petunia axillaris and Petunia inflata. On the basis of both transcriptomes, comprehensive microarray chips for gene expression analysis were established and used for the analysis of global- and organ-specific gene expression in Petunia axillaris and Petunia inflata and to explore the molecular basis of the seed coat defects in a Petunia hybrida mutant, anthocyanin 11 (an11), lacking a WD40-repeat (WDR) transcription regulator. Among the transcripts differentially expressed in an11 seeds compared with wild type, many expected targets of AN11 were found but also several interesting new candidates that might play a role in morphogenesis of the seed coat. Our results validate the combination of next-generation sequencing with microarray analyses strategies to identify the transcriptome of two petunia species without previous knowledge of their genome, and to develop comprehensive chips as useful tools for the analysis of gene expression in P. axillaris, P. inflata and P. hybrida.
Collapse
Affiliation(s)
- Sara Zenoni
- Department of Biotechnology, University of Verona, Strada Le Grazie 15, 37134 Verona, Italy
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Shimizu KK, Kudoh H, Kobayashi MJ. Plant sexual reproduction during climate change: gene function in natura studied by ecological and evolutionary systems biology. ANNALS OF BOTANY 2011; 108:777-87. [PMID: 21852275 PMCID: PMC3170158 DOI: 10.1093/aob/mcr180] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2010] [Accepted: 05/18/2011] [Indexed: 05/19/2023]
Abstract
BACKGROUND It is essential to understand and predict the effects of changing environments on plants. This review focuses on the sexual reproduction of plants, as previous studies have suggested that this trait is particularly vulnerable to climate change, and because a number of ecologically and evolutionarily relevant genes have been identified. SCOPE It is proposed that studying gene functions in naturally fluctuating conditions, or gene functions in natura, is important to predict responses to changing environments. First, we discuss flowering time, an extensively studied example of phenotypic plasticity. The quantitative approaches of ecological and evolutionary systems biology have been used to analyse the expression of a key flowering gene, FLC, of Arabidopsis halleri in naturally fluctuating environments. Modelling showed that FLC acts as a quantitative tracer of the temperature over the preceding 6 weeks. The predictions of this model were verified experimentally, confirming its applicability to future climate changes. Second, the evolution of self-compatibility as exemplifying an evolutionary response is discussed. Evolutionary genomic and functional analyses have indicated that A. thaliana became self-compatible via a loss-of-function mutation in the male specificity gene, SCR/SP11. Self-compatibility evolved during glacial-interglacial cycles, suggesting its association with mate limitation during migration. Although the evolution of self-compatibility may confer short-term advantages, it is predicted to increase the risk of extinction in the long term because loss-of-function mutations are virtually irreversible. CONCLUSIONS Recent studies of FLC and SCR have identified gene functions in natura that are unlikely to be found in laboratory experiments. The significance of epigenetic changes and the study of non-model species with next-generation DNA sequencers is also discussed.
Collapse
Affiliation(s)
- Kentaro K Shimizu
- Institute of Plant Biology, University Research Priority Program in Systems Biology/Functional Genomics & Zurich-Basel Plant Science Center, University of Zurich, Zollikerstrasse 107, CH-8008 Zurich, Switzerland.
| | | | | |
Collapse
|
45
|
Siebert S, Robinson MD, Tintori SC, Goetz F, Helm RR, Smith SA, Shaner N, Haddock SHD, Dunn CW. Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows. PLoS One 2011; 6:e22953. [PMID: 21829563 PMCID: PMC3146525 DOI: 10.1371/journal.pone.0022953] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2011] [Accepted: 07/01/2011] [Indexed: 02/02/2023] Open
Abstract
We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Collapse
Affiliation(s)
- Stefan Siebert
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
- * E-mail: (SS); (CWD)
| | - Mark D. Robinson
- Epigenetics Laboratory, Cancer Research Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
| | - Sophia C. Tintori
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
| | - Freya Goetz
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
| | - Rebecca R. Helm
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
| | - Stephen A. Smith
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Nathan Shaner
- Monterey Bay Aquarium Research Institute, Moss Landing, California, United States of America
| | - Steven H. D. Haddock
- Monterey Bay Aquarium Research Institute, Moss Landing, California, United States of America
| | - Casey W. Dunn
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, United States of America
- * E-mail: (SS); (CWD)
| |
Collapse
|
46
|
Hou R, Bao Z, Wang S, Su H, Li Y, Du H, Hu J, Wang S, Hu X. Transcriptome sequencing and de novo analysis for Yesso scallop (Patinopecten yessoensis) using 454 GS FLX. PLoS One 2011; 6:e21560. [PMID: 21720557 PMCID: PMC3123371 DOI: 10.1371/journal.pone.0021560] [Citation(s) in RCA: 192] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2010] [Accepted: 06/03/2011] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Bivalves comprise 30,000 extant species, constituting the second largest group of mollusks. However, limited genetic research has focused on this group of animals so far, which is, in part, due to the lack of genomic resources. The advent of high-throughput sequencing technologies enables generation of genomic resources in a short time and at a minimal cost, and therefore provides a turning point for bivalve research. In the present study, we performed de novo transcriptome sequencing to first produce a comprehensive expressed sequence tag (EST) dataset for the Yesso scallop (Patinopecten yessoensis). RESULTS In a single 454 sequencing run, 805,330 reads were produced and then assembled into 32,590 contigs, with about six-fold sequencing coverage. A total of 25,237 unique protein-coding genes were identified from a variety of developmental stages and adult tissues based on sequence similarities with known proteins. As determined by GO annotation and KEGG pathway mapping, functional annotation of the unigenes recovered diverse biological functions and processes. Transcripts putatively involved in growth, reproduction and stress/immune-response were identified. More than 49,000 single nucleotide polymorphisms (SNPs) and 2,700 simple sequence repeats (SSRs) were also detected. CONCLUSION Our data provide the most comprehensive transcriptomic resource currently available for P. yessoensis. Candidate genes potentially involved in growth, reproduction, and stress/immunity-response were identified, and are worthy of further investigation. A large number of SNPs and SSRs were also identified and ready for marker development. This resource should lay an important foundation for future genetic or genomic studies on this species.
Collapse
Affiliation(s)
- Rui Hou
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Zhenmin Bao
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Shan Wang
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Hailin Su
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Yan Li
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Huixia Du
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Jingjie Hu
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
| | - Shi Wang
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
- * E-mail: (SW); (XH)
| | - Xiaoli Hu
- Key Laboratory of Marine Genetics and Breeding (MGB), Ministry of Education, College of Marine Life Sciences. Ocean University of China, Qingdao, China
- * E-mail: (SW); (XH)
| |
Collapse
|
47
|
New approaches to Prunus transcriptome analysis. Genetica 2011; 139:755-69. [PMID: 21584650 DOI: 10.1007/s10709-011-9580-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2010] [Accepted: 04/26/2011] [Indexed: 12/11/2022]
Abstract
The recent sequencing of the complete genome of the peach offers new opportunities for further transcriptomic studies in Prunus species in the called post-genomics era. First works on transcriptome analysis in Prunus species started in the early 2000s with the development of ESTs (expressed sequence tags) and the analysis of several candidate genes. Later, new strategies of massive analysis (high throughput) of transcriptomes have been applied, producing larger amounts of data in terms of expression of a large number of genes in a single experiment. One of these systems is massive transcriptome analysis using cDNA biochips (microarrays) to analyze thousands of genes by hybridization of mRNA labelled with fluorescence. However, the recent emergence of a massive sequencing methodology ("deep-sequencing") of the transcriptome (RNA-Seq), based on lowering the costs of DNA (in this cases complementary, cDNA) sequencing, could be more suitable than the application of microarrays. Recent papers have described the tremendous power of this technology, both in terms of profiling coverage and quantitative accuracy in transcriptomic studies. Now this technology is being applied to plant species, including Prunus. In this work, we analyze the potential in using this RNA-Seq technology in the study of Prunus transcriptomes and the development of genomic tools. In addition, the strengths and limitations of RNA-Seq relative to microarray profiling have been discussed.
Collapse
|
48
|
Challenges in studies on flowering time: interfaces between phenological research and the molecular network of flowering genes. Ecol Res 2011. [DOI: 10.1007/s11284-011-0835-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
49
|
Yang SS, Tu ZJ, Cheung F, Xu WW, Lamb JFS, Jung HJG, Vance CP, Gronwald JW. Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems. BMC Genomics 2011; 12:199. [PMID: 21504589 DOI: 10.1186/1471-2164-12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2010] [Accepted: 04/19/2011] [Indexed: 05/23/2023] Open
Abstract
BACKGROUND Alfalfa, [Medicago sativa (L.) sativa], a widely-grown perennial forage has potential for development as a cellulosic ethanol feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. RESULTS Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expression sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. In addition, 341,984 ESTs were generated from ES and PES internodes of genotype 773 using the GS FLX Titanium platform. The first alfalfa (Medicago sativa) gene index (MSGI 1.0) was assembled using the Sanger ESTs available from GenBank, the GS FLX Titanium EST sequences, and the de novo assembled Illumina sequences. MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1,294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Out of 55 SNPs randomly selected for experimental validation, 47 (85%) were polymorphic between the two genotypes. We also identified numerous allelic variations within each genotype. Digital gene expression analysis identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes. CONCLUSIONS Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a forage crop and cellulosic feedstock.
Collapse
Affiliation(s)
- S Samuel Yang
- USDA-Agricultural Research Service, Plant Science Research Unit, St, Paul, MN 55108, USA.
| | | | | | | | | | | | | | | |
Collapse
|
50
|
Yang SS, Tu ZJ, Cheung F, Xu WW, Lamb JFS, Jung HJG, Vance CP, Gronwald JW. Using RNA-Seq for gene identification, polymorphism detection and transcript profiling in two alfalfa genotypes with divergent cell wall composition in stems. BMC Genomics 2011; 12:199. [PMID: 21504589 PMCID: PMC3112146 DOI: 10.1186/1471-2164-12-199] [Citation(s) in RCA: 105] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2010] [Accepted: 04/19/2011] [Indexed: 02/08/2023] Open
Abstract
Background Alfalfa, [Medicago sativa (L.) sativa], a widely-grown perennial forage has potential for development as a cellulosic ethanol feedstock. However, the genomics of alfalfa, a non-model species, is still in its infancy. The recent advent of RNA-Seq, a massively parallel sequencing method for transcriptome analysis, provides an opportunity to expand the identification of alfalfa genes and polymorphisms, and conduct in-depth transcript profiling. Results Cell walls in stems of alfalfa genotype 708 have higher cellulose and lower lignin concentrations compared to cell walls in stems of genotype 773. Using the Illumina GA-II platform, a total of 198,861,304 expression sequence tags (ESTs, 76 bp in length) were generated from cDNA libraries derived from elongating stem (ES) and post-elongation stem (PES) internodes of 708 and 773. In addition, 341,984 ESTs were generated from ES and PES internodes of genotype 773 using the GS FLX Titanium platform. The first alfalfa (Medicago sativa) gene index (MSGI 1.0) was assembled using the Sanger ESTs available from GenBank, the GS FLX Titanium EST sequences, and the de novo assembled Illumina sequences. MSGI 1.0 contains 124,025 unique sequences including 22,729 tentative consensus sequences (TCs), 22,315 singletons and 78,981 pseudo-singletons. We identified a total of 1,294 simple sequence repeats (SSR) among the sequences in MSGI 1.0. In addition, a total of 10,826 single nucleotide polymorphisms (SNPs) were predicted between the two genotypes. Out of 55 SNPs randomly selected for experimental validation, 47 (85%) were polymorphic between the two genotypes. We also identified numerous allelic variations within each genotype. Digital gene expression analysis identified numerous candidate genes that may play a role in stem development as well as candidate genes that may contribute to the differences in cell wall composition in stems of the two genotypes. Conclusions Our results demonstrate that RNA-Seq can be successfully used for gene identification, polymorphism detection and transcript profiling in alfalfa, a non-model, allogamous, autotetraploid species. The alfalfa gene index assembled in this study, and the SNPs, SSRs and candidate genes identified can be used to improve alfalfa as a forage crop and cellulosic feedstock.
Collapse
Affiliation(s)
- S Samuel Yang
- USDA-Agricultural Research Service, Plant Science Research Unit, St, Paul, MN 55108, USA.
| | | | | | | | | | | | | | | |
Collapse
|