1
|
Khelghatibana F, Javan-Nikkhah M, Safaie N, Sobhani A, Shams S, Sari E. A reference transcriptome for walnut anthracnose pathogen, Ophiognomonia leptostyla, guides the discovery of candidate virulence genes. Fungal Genet Biol 2023; 169:103828. [PMID: 37657751 DOI: 10.1016/j.fgb.2023.103828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 08/13/2023] [Accepted: 08/28/2023] [Indexed: 09/03/2023]
Abstract
Despite the economic losses due to the walnut anthracnose, Ophiognomonia leptostyla is an orphan fungus with respect to genomic resources. In the present study, the transcriptome of O. leptostyla was assembled for the first time. RNA sequencing was conducted for the fungal mycelia grown in a liquid media, and the inoculated leaf samples of walnut with the fungal conidia sampled at 48, 96 and 144 h post inoculation (hpi). The completeness, correctness, and contiguity of the de novo transcriptome assemblies generated with Trinity, Oases, SOAPdenovo-Trans and Bridger were compared to identify a single superior reference assembly. In most of the assessment criteria including N50, Transrate score, number of ORFs with known description in gene bank, the percentage of reads mapped back to the transcript (RMBT), BUSCO score, Swiss-Prot coverage bin and RESM-EVAL score, the Bridger assembly was the superior and thus used as a reference for profiling the O. leptostyla transcriptome in liquid media vs. during walnut infection. The k-means clustering of transcripts resulted in four distinct transcription patterns across the three sampling time points. Most of the detected CAZy transcripts had elevated transcription at 96 hpi that is hypothetically concurrent with the start of intracellular growth. The in-silico analysis revealed 103 candidate effectors of which six were members of Necrosis and Ethylene Inducing Like Protein (NLP) gene family belonging to three distinct k-means clusters. This study provided a complex and temporal pattern of the CAZys and candidate effectors transcription during six days post O. leptostyla inoculation on walnut leaves, introducing a list of candidate virulence genes for validation in future studies.
Collapse
Affiliation(s)
- Fatemeh Khelghatibana
- Department of Plant Pathology, Iranian Research Institute of Plant Protection, Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran.
| | - Mohammad Javan-Nikkhah
- Department of Plant Protection, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Naser Safaie
- Department of Plant Pathology, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
| | - Ahmad Sobhani
- Agricultural Biotechnology Research Institute of Iran - Isfahan Branch, Agricultural Research, Education and Extension Organization (AREEO), Isfahan, Iran
| | - Somayeh Shams
- Department of Plant Production and Genetic Engineering, Faculty of Agriculture, University of Lorestan, Khorramabad, Iran
| | - Ehsan Sari
- Department of Microbiology and Plant Pathology, University of California, Riverside, CA, USA.
| |
Collapse
|
2
|
Abstract
Polyploidizations, or whole-genome duplications (WGDs), in plants have increased biological complexity, facilitated evolutionary innovation, and likely enabled adaptation under harsh conditions. Besides genomic data, transcriptome data have been widely employed to detect WGDs, due to their efficient accessibility to the gene space of a species. Age distributions based on synonymous substitutions (so-called KS age distributions) for paralogs assembled from transcriptome data have identified numerous WGDs in plants, paving the way for further studies on the importance of WGDs for the evolution of seed and flowering plants. However, it is still unclear how transcriptome-based age distributions compare to those based on genomic data. In this chapter, we implemented three different de novo transcriptome assembly pipelines with two popular assemblers, i.e., Trinity and SOAPdenovo-Trans. We selected six plant species with published genomes and transcriptomes to evaluate how assembled transcripts from different pipelines perform when using KS distributions to detect previously documented WGDs in the six species. Further, using genes predicted in each genome as references, we evaluated the effects of missing genes, gene family clustering, and de novo assembled transcripts on the transcriptome-based KS distributions. Our results show that, although the transcriptome-based KS distributions differ from the genome-based ones with respect to their shapes and scales, they are still reasonably reliable for unveiling WGDs, except in species where most duplicates originated from a recent WGD. We also discuss how to overcome some possible pitfalls when using transcriptome data to identify WGDs.
Collapse
Affiliation(s)
- Jia Li
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.,VIB Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
| | - Zhen Li
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
| |
Collapse
|
3
|
Coate JE. Beyond Transcript Concentrations: Quantifying Polyploid Expression Responses per Biomass, per Genome, and per Cell with RNA-Seq. Methods Mol Biol 2023; 2545:227-250. [PMID: 36720816 DOI: 10.1007/978-1-0716-2561-3_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
RNA-seq has been used extensively to study expression responses to polyploidy. Most current methods for normalizing RNA-seq data yield estimates of transcript concentrations (transcripts per transcriptome). The implicit assumption of these normalization methods is that transcriptome size is equivalent between the samples being compared such that transcript concentrations are equivalent to transcripts per cell. In recent years, however, evidence has mounted that transcriptome size can vary dramatically in response to a range of factors including polyploidy and that such variation is ubiquitous. Where such variation exists, transcript concentration is often a poor or even misleading proxy for expression responses at other biologically relevant scales (e.g., expression per cell). Thus, it is important that transcriptomic studies of polyploids move beyond simply comparing transcript concentrations if we are to gain a complete understanding of how genome multiplication affects gene expression. I discuss this issue in more detail and summarize a suite of approaches that can leverage RNA-seq to quantify expression responses per genome, per cell, and per unit of biomass.
Collapse
|
4
|
Lei Y, Harris AJ, Wang A, Zhao L, Luo M, Li J, Chen H. Comparative transcriptomic analysis of genes in the triterpene saponin biosynthesis pathway in leaves and roots of
Ardisia kteniophylla
A. DC., a plant used in traditional Chinese medicine. Ecol Evol 2022; 12:e8920. [PMID: 35600685 PMCID: PMC9120893 DOI: 10.1002/ece3.8920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 04/08/2022] [Accepted: 04/25/2022] [Indexed: 11/28/2022] Open
Abstract
Ardisia kteniophylla (Primulaceae) is highly valued in traditional medicine due to its production of the pharmacologically active secondary metabolites, especially triterpenoid saponins in its roots. Although A. kteniophylla is very important in traditional medicine, the genetic basis for its production of triterpenoid saponins remains largely unknown. Therefore, we sequenced transcriptomes of A. kteniophylla to identify putative genes involved in production of triterpenoid saponins in both leaves and roots, and we used the transcriptomes to compare expression levels of these genes between the two organ systems. The production of triterpenoid saponins in plants is usually induced through hormonal signaling on account of the presence of pests. Thus, we treated plants with the hormones salicylic acid (SA) and methyl jasmonate (MeJA) and used quantitative real‐time PCR (qRT‐PCR) to investigate expression levels of genes involved in triterpenoid saponin biosynthesis. In total, we obtained transcriptomes for leaf and root tissues representing 52,454 unigenes. Compared with the leaf transcriptome, we found that 6092 unigenes were upregulated in the root, especially enzymes involved in the direct synthesis of triterpenoid saponins, while 6001 genes appeared downregulated, including those involved in precursory steps in the triterpenoid saponin biosynthesis pathway. Our results from qRT‐PCR indicate that genes within the upstream parts of the triterpenoid saponin biosynthesis pathway may be upregulated under exposure to the applied hormones, but downstream genes are downregulated. This suggests possible conflicting effects of SA and MeJA in promoting the production of secondary metabolites on the one hand, and, on the other, limiting plant growth processes to devote energy to combating pests. We also performed an analysis of transcription factors (TFs) and found 997 unique transcripts belonging to 16 TF families. Our data may help to facilitate future work on triterpene saponins biosynthesis in A. kteniophylla with potential pharmacological and molecular breeding applications.
Collapse
Affiliation(s)
- Yuyang Lei
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization South China Botanical Garden Chinese Academy of Sciences Guangzhou China
- Wuhan Guishan Mountain Scenic Management Office Wuhan China
| | - AJ Harris
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization South China Botanical Garden Chinese Academy of Sciences Guangzhou China
| | - Aihua Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization South China Botanical Garden Chinese Academy of Sciences Guangzhou China
- Key Laboratory of Environment Change and Resources Use in Beibu Gulf (Nanning Normal University) Ministry of Education Nanning China
| | - Liyun Zhao
- CAS Engineering Laboratory for Vegetation Ecosystem Restoration on Islands and Coastal Zones South China Botanical Garden Chinese Academy of Sciences Guangzhou China
| | - Ming Luo
- Key Laboratory of South China Agricultural Plant Molecular Analysis and Genetic Improvement South China Botanical Garden Chinese Academy of Sciences Guangzhou China
- Guangdong Provincial Key Laboratory of Applied Botany South China Botanical Garden Chinese Academy of Sciences Guangzhou China
| | - Ji Li
- Guangdong Provincial Key Laboratory of Applied Botany South China Botanical Garden Chinese Academy of Sciences Guangzhou China
| | - Hongfeng Chen
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization South China Botanical Garden Chinese Academy of Sciences Guangzhou China
| |
Collapse
|
5
|
RNA-seq for revealing the function of the transcriptome. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00002-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
6
|
Taheri-Dehkordi A, Naderi R, Martinelli F, Salami SA. Computational screening of miRNAs and their targets in saffron (Crocus sativus L.) by transcriptome mining. PLANTA 2021; 254:117. [PMID: 34751821 DOI: 10.1007/s00425-021-03761-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 10/18/2021] [Indexed: 06/13/2023]
Abstract
A robust workflow for the identification of miRNAs and their targets in saffron was developed. MicroRNA-mediated gene regulation in saffron is potentially involved in several biological processes, including the biosynthesis of highly valuable apocarotenoids. Saffron (Crocus sativus L.) is the most expensive spice in the world and a major source of apocarotenoids. Even though miRNAs (20-24 nt non-coding small RNAs) are important regulators of gene expression at transcriptional and post-transcriptional levels, their role in saffron has not been thoroughly investigated. As a result, a workflow for computational identification of miRNAs and their targets can be useful to uncover the regulatory networks underlying biological processes in this valuable plant. The efficiency of several assembly tools such as Trans-ABySS, Trinity, Bridger, rnaSPAdes, and EvidentialGene was evaluated based on both reference-based and reference-free metrics using transcriptome data. A reliable workflow for computational identification of miRNAs and their targets in saffron was described. The EvidentialGene was found to be the most efficient de novo transcriptome assembler for saffron as a complex triploid model, followed by the Trinity. In total, 66 miRNAs from 19 different families that target 2880 genes, including several transcription factors involved in the flowering transition, were identified. Three of the identified targets were involved in the terpenoids backbone biosynthesis. CsCCD and CsUGT genes involved in the apocarotenoids biosynthetic pathway were targeted by csa-miR156g and csa-miR156b-3p, revealing a unique post-transcriptional regulation dynamic in saffron. The identified miRNAs and their targets add to our understanding of the many biological roles of miRNAs in saffron and shed new light on the control of the apocarotenoid biosynthetic pathway in this valuable plant.
Collapse
Affiliation(s)
- Ayat Taheri-Dehkordi
- Department of Horticultural Science, Faculty of Agricultural Science and Engineering, University of Tehran, Karaj, Iran
| | - Roohangiz Naderi
- Department of Horticultural Science, Faculty of Agricultural Science and Engineering, University of Tehran, Karaj, Iran
| | | | - Seyed Alireza Salami
- Department of Horticultural Science, Faculty of Agricultural Science and Engineering, University of Tehran, Karaj, Iran.
| |
Collapse
|
7
|
Baranwal VK, Negi N, Khurana P. Comparative transcriptomics of leaves of five mulberry accessions and cataloguing structural and expression variants for future prospects. PLoS One 2021; 16:e0252246. [PMID: 34260613 PMCID: PMC8279327 DOI: 10.1371/journal.pone.0252246] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 05/12/2021] [Indexed: 12/13/2022] Open
Abstract
Bombyx mori, a monophagous insect, prefers leaves of the certain species of Morus more than others. The preference has been attributed to morphological and anatomical features and biochemical compounds. In the present manuscript a comparison has been made among the transcriptome of leaves of the two preferred cultivated varieties and three wild types species. While assembling, high quality transcriptomes of five genotypes were constructed with a total of 100930, 151245, 89724, 181761 and 102908 transcripts from ML, MN, MS, K2 and V1 samples respectively. Further, to compare them, orthologs were identified from these assembled transcriptome. A total of 22462, 23413, 23685, 24371, 18362, 22326, 20058, 18049, 17567 and 20518 clusters of orthologs were found in one to one comparison in KvsN, KvsL, KvsS, KvsV, LvsN, LvsS, LvsV, NvsS, NvsV, and SvsV respectively. 4236 orthologs with algebraic connectivity of 1.0 were then used to compare and to find out differentially expressed transcripts from all the genotypes. A total of 1037 transcripts expressed that include some of the important morphology, anatomy and biochemical pathways regulating transcription factors (AP2/ERFs and C2H2 Zinc fingers) and signalling components were identified to express differentially. Further, these transcriptomes were used find out markers (SSR) and variants and a total of 1101013, 537245, 970877, 310437, 675772, 338400, 581189, 751477, 514999 and 257107 variants including SNP, MNP, Insertions and deletions were found in one to one comparisons. Taken together, our data could be highly useful for mulberry community worldwide as it could be utilized in mulberry breeding programs.
Collapse
Affiliation(s)
- Vinay Kumar Baranwal
- Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi, India
- Department of Botany, Swami Devanand Post Graduate College, Math-Lar, Lar, Deoria, Uttar Pradesh, India
| | - Nisha Negi
- Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi, India
| | - Paramjit Khurana
- Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi, India
| |
Collapse
|
8
|
Chen X, Lai H, Li R, Yao Y, Liu J, Yuan S, Fu S, Hu X, Guo J. Character changes and Transcriptomic analysis of a cassava sexual Tetraploid. BMC PLANT BIOLOGY 2021; 21:188. [PMID: 33874893 PMCID: PMC8056498 DOI: 10.1186/s12870-021-02963-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Accepted: 04/07/2021] [Indexed: 06/12/2023]
Abstract
BACKGROUND Cassava (Manihot esculenta Crantz) is an important food crop known for its high starch content. Polyploid breeding is effective in its genetic improvement, and use of 2n gametes in sexual polyploid breeding is one of the potential methods for cassava breeding and improvement. In our study, the cassava sexual tetraploid (ST), which carries numerous valuable traits, was successfully generated by hybridizing 2n female gametes SC5 (♀) and 2n male gametes SC10 (♂). However, the molecular mechanisms remain unclear. To understand these underlying molecular mechanisms behind the phenotypic alterations and heterosis in ST plants, we investigated the differences in gene expression between polyploids and diploids by determining the transcriptomes of the ST plant and its parents during the tuber root enlargement period. We also compared the characters and transcriptomes of the ST plant with its parents. RESULTS The ST plant was superior in plant height, stem diameter, leaf area, petiole length, plant weight, and root weight than the parent plants, except the leaf number, which was lower. The number of starch granules was higher in the roots of ST plants than those in the parent plants after five months (tuber root enlargement period), which could be due to a higher leaf net photosynthetic rate leading to early filling of starch granules. Based on transcriptome analysis, we identified 2934 and 3171 differentially expressed genes (DEGs) in the ST plant as compared to its female and male parents, respectively. Pathway enrichment analyses revealed that flavonoid biosynthesis and glycolysis/gluconeogenesis were significantly enriched in the ST plants, which might contribute to the colors of petiole (purple-red), root epidermis (dark brown), and tuber starch accumulation, respectively. CONCLUSIONS After sexual polyploidization, the phenotype of ST has changed significantly in comparison to their diploid parents, mainly manifest as enlarged biomass, yield, early starch filling, deep colored petiole and root epidermis. The tetraploid plants were also mature early due to early starch grain filling. Owing to enriched flavonoid biosynthesis and glycolysis/gluconeogenesis, they are possibly resistant to adversity stresses and provide better yield, respectively.
Collapse
Affiliation(s)
- Xia Chen
- Agricultural College of Hainan University, Haikou, 571104 China
| | - Hanggui Lai
- Agricultural College of Hainan University, Haikou, 571104 China
| | - Ruimei Li
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Ministry of Agriculture, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101 China
| | - Yuan Yao
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Ministry of Agriculture, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101 China
| | - Jiao Liu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Ministry of Agriculture, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101 China
| | - Shuai Yuan
- Agricultural College of Hainan University, Haikou, 571104 China
| | - Shaoping Fu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Ministry of Agriculture, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101 China
| | - Xinwen Hu
- Agricultural College of Hainan University, Haikou, 571104 China
| | - Jianchun Guo
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Ministry of Agriculture, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, 571101 China
| |
Collapse
|
9
|
Madritsch S, Burg A, Sehr EM. Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species. BMC Bioinformatics 2021; 22:146. [PMID: 33752598 PMCID: PMC7986043 DOI: 10.1186/s12859-021-04078-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 03/15/2021] [Indexed: 01/15/2023] Open
Abstract
Background Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative splicing. Since fully sequenced genomes are not yet available for many plant species including crops, de novo transcriptome assembly is the basis to understand molecular and functional mechanisms. However, in complex polyploid plants, de novo transcriptome assembly is challenging, leading to increased rates of fused or redundant transcripts. Since assemblers were developed mainly for diploid organisms, they may not well suited for polyploids. Also, comparative evaluations of these tools on higher polyploid plants are extremely rare. Thus, our aim was to fill this gap and to provide a basic guideline for choosing the optimal de novo assembly strategy focusing on autotetraploids, as the scientific interest in this type of polyploidy is steadily increasing. Results We present a comparison of two common (SOAPdenovo-Trans, Trinity) and one recently published transcriptome assembler (TransLiG) on diploid and autotetraploid species of the genera Acer and Vaccinium using Arabidopsis thaliana as a reference. The number of assembled transcripts was up to 11 and 14 times higher with an increased number of short transcripts for Acer and Vaccinium, respectively, compared to A. thaliana. In diploid samples, Trinity and TransLiG performed similarly good while in autotetraploids, TransLiG assembled most complete transcriptomes with an average of 1916 assembled BUSCOs vs. 1705 BUSCOs for Trinity. Of all three assemblers, SOAPdenovo-Trans performed worst (1133 complete BUSCOs). Conclusion All three assembly tools produced complete assemblies when dealing with the model organism A. thaliana, independently of its ploidy level, but their performances differed extremely when it comes to non-model autotetraploids, where specifically TransLiG and Trinity produced a high number of redundant transcripts. The recently published assembler TransLiG has not been tested yet on any plant organism but showed highest completeness and full-length transcriptomes, especially in autotetraploids. Including such species during the development and testing of new assembly tools is highly appreciated and recommended as many important crops are polyploid. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04078-8.
Collapse
Affiliation(s)
- Silvia Madritsch
- AIT Austrian Institute of Technology, Center for Health and Bioresources, Tulln, Austria.,Center for Integrative Bioinformatics Vienna, Max Perutz Labs, University of Vienna, Medical University of Vienna, Vienna, Austria
| | - Agnes Burg
- AIT Austrian Institute of Technology, Center for Health and Bioresources, Tulln, Austria
| | - Eva M Sehr
- AIT Austrian Institute of Technology, Center for Health and Bioresources, Tulln, Austria.
| |
Collapse
|
10
|
Yang S, Zhang J, Geng Y, Tang Z, Wang J, Guo F, Meng J, Wang Q, Wan S, Li X. Transcriptome analysis reveals the mechanism of improving erect-plant-type peanut yield by single-seeding precision sowing. PeerJ 2021; 9:e10616. [PMID: 33614263 PMCID: PMC7879956 DOI: 10.7717/peerj.10616] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 11/30/2020] [Indexed: 01/11/2023] Open
Abstract
Background In China, double-seed (DS) sowing (i.e., sowing two seeds per hole) has been conventionally performed towards the erect-plant-type peanuts to increase the low germination rate due to poor seed preservation conditions. However, the corresponding within-hole plant competition usually limits the subsequent plant growth and the final yield. We developed a high-yield cultivation system of single-seed (SS) precision sowing to solve this paradox, saving 20% of seeds and increasing yields by more than 10% relative to the conventional DS sowing. Methods To explore the mechanisms of these two different cropping patterns in peanut yields, we conducted transcriptomic and physiological comparisons in the seeding plant leaf and root tissues between SS precision sowing and standard DS sowing treatments. Results After assembly, each library contained an average of 43 million reads and generated a total of 523,800, 338 clean reads. After GO and Kyoto Encyclopedia of Genes and Genomes pathway analysis, we found the key genes for biotic and abiotic stress showed higher expression in roots of plants grown under the SS precision sowing treatment, including genes encoding disease resistance, oxidation-reduction, hormone related, and stress response transcription factors and signaling regulation proteins. In particular, the resveratrol synthesis genes related to stress and disease resistance appeared induced in roots under the SS sowing treatment. Conclusion These data indicated that Aspergillus flavus resistance and stress tolerance in roots under SS precision sowing were enhanced compared with roots under the DS sowing treatment. This work benefits the development of underground pods and thus increasing peanut yields.
Collapse
Affiliation(s)
- Sha Yang
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Jialei Zhang
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Yun Geng
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Zhaohui Tang
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Jianguo Wang
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Feng Guo
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Jingjing Meng
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| | - Quan Wang
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,College of Life Sciences, Shandong Normal University, Ji'nan, China
| | - Shubo Wan
- Shandong Academy of Agricultural Sciences/Shandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and Physiology, Ji'nan, China
| | - Xinguo Li
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, China.,Scientific observation and experiment station of crop cultivation in east China, Ministry of Agriculture and Rural Affairs, Dongying, China
| |
Collapse
|
11
|
Modern Approaches for Transcriptome Analyses in Plants. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021; 1346:11-50. [DOI: 10.1007/978-3-030-80352-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
12
|
Vasav AP, Pable AA, Barvkar VT. Differential transcriptome and metabolome analysis of Plumbago zeylanica L. reveal putative genes involved in plumbagin biosynthesis. Fitoterapia 2020; 147:104761. [PMID: 33069837 DOI: 10.1016/j.fitote.2020.104761] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 09/24/2020] [Accepted: 10/05/2020] [Indexed: 11/20/2022]
Abstract
Plumbagin is a pharmacologically active naphthoquinone present in the Plumbago zeylanica L. having important medicinal properties. The root of P. zeylanica is rich and primary tissue of the plumbagin biosynthesis and accumulation. The complete biosynthetic pathway of plumbagin in plant is still obscure. The present study attempts to understand the plumbagin biosynthetic pathway with the help of differential transcriptome and metabolome analysis of P. zeylanica leaf and root. The transcriptome data showed co-expression of Aldo-keto reductase (PzAKR), Polyketide cyclase (Pzcyclase) and Cytochrome P450 (PzCYPs) transcripts along with the Polyketide synthase (PzPKS) transcripts. Their higher expression in root as compared to leaf supports their possible involvement in plumbagin biosynthesis. The metabolome data of leaf and root revealed naphthalene derivative isoshinanolone that could be potential precursor of plumbagin. Pathway elucidation and transcriptome data of P. zeylanica, will enable and accelerate research on naphthoquinone biosynthesis in plants.
Collapse
Affiliation(s)
- Arati P Vasav
- Department of Botany, Savitribai Phule Pune University, Pune 411007, India
| | - Anupama A Pable
- Department of Microbiology, Savitribai Phule Pune University, Pune 411007, India
| | - Vitthal T Barvkar
- Department of Botany, Savitribai Phule Pune University, Pune 411007, India.
| |
Collapse
|
13
|
Use of Targeted Amplicon Sequencing in Peanut to Generate Allele Information on Allotetraploid Sub-Genomes. Genes (Basel) 2020; 11:genes11101220. [PMID: 33080972 PMCID: PMC7650781 DOI: 10.3390/genes11101220] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 11/18/2022] Open
Abstract
The use of molecular markers in plant breeding has become a routine practice, but the cost per accession can be a hindrance to the routine use of Quantitative Trait Loci (QTL) identification in breeding programs. In this study, we demonstrate the use of targeted re-sequencing as a proof of concept of a cost-effective approach to retrieve highly informative allele information, as well as develop a bioinformatics strategy to capture the genome-specific information of a polyploid species. SNPs were identified from alignment of raw transcriptome reads (2 × 50 bp) to a synthetic tetraploid genome using BWA followed by a GATK pipeline. Regions containing high polymorphic SNPs in both A genome and B genomes were selected as targets for the resequencing study. Targets were amplified using multiplex PCR followed by sequencing on an Illumina HiSeq. Eighty-one percent of the SNP calls in diploids and 68% of the SNP calls in tetraploids were confirmed. These results were also confirmed by KASP validation. Based on this study, we find that targeted resequencing technologies have potential for obtaining maximum allele information in allopolyploids at reduced cost.
Collapse
|
14
|
Maldonado JA, Firneno TJ, Roelke CE, Rains ND, Mwgiri J, Fujita MK. Transcriptome sequencing reveals signatures of positive selection in the Spot-Tailed Earless Lizard. PLoS One 2020; 15:e0234504. [PMID: 32542006 PMCID: PMC7295237 DOI: 10.1371/journal.pone.0234504] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 05/26/2020] [Indexed: 11/21/2022] Open
Abstract
The continual loss of threatened biodiversity is occurring at an accelerated pace. High-throughput sequencing technologies are now providing opportunities to address this issue by aiding in the generation of molecular data for many understudied species of high conservation interest. Our overall goal of this study was to begin building the genomic resources to continue investigations and conservation of the Spot-Tailed Earless lizard. Here we leverage the power of high-throughput sequencing to generate the liver transcriptome for the Northern Spot-Tailed Earless Lizard (Holbrookia lacerata) and Southern Spot-Tailed Earless Lizard (Holbrookia subcaudalis), which have declined in abundance in the past decades, and their sister species, the Common Lesser Earless Lizard (Holbrookia maculata). Our efforts produced high quality and robust transcriptome assemblies validated by 1) quantifying the number of processed reads represented in the transcriptome assembly and 2) quantifying the number of highly conserved single-copy orthologs that are present in our transcript set using the BUSCO pipeline. We found 1,361 1-to-1 orthologs among the three Holbrookia species, Anolis carolinensis, and Sceloporus undulatus. We carried out dN/dS selection tests using a branch-sites model and identified a dozen genes that experienced positive selection in the Holbrookia lineage with functions in development, immunity, and metabolism. Our single-copy orthologous sequences additionally revealed significant pairwise sequence divergence (~.73%) between the Northern H. lacerata and Southern H. subcaudalis that further supports the recent elevation of the Southern Spot-Tailed Earless Lizard to full species.
Collapse
Affiliation(s)
- Jose A. Maldonado
- Department of Biology, Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, United States of America
| | - Thomas J. Firneno
- Department of Biology, Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, United States of America
| | - Corey E. Roelke
- Department of Biology, Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, United States of America
| | - Nathan D. Rains
- Texas Parks and Wildlife Department, Austin, Texas, TX, United States of America
| | - Juliet Mwgiri
- Department of Biology, Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, United States of America
| | - Matthew K. Fujita
- Department of Biology, Amphibian and Reptile Diversity Research Center, The University of Texas at Arlington, Arlington, TX, United States of America
- * E-mail:
| |
Collapse
|
15
|
Li H, Zhou R, Xu S, Chen X, Hong Y, Lu Q, Liu H, Zhou B, Liang X. Improving Gene Annotation of the Peanut Genome by Integrated Proteogenomics Workflow. J Proteome Res 2020; 19:2226-2235. [DOI: 10.1021/acs.jproteome.9b00723] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Haifen Li
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Ruo Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Shaohang Xu
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xiaoping Chen
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Yanbin Hong
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Qing Lu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Hao Liu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Baojin Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xuanqiang Liang
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| |
Collapse
|
16
|
Li FD, Tong W, Xia EH, Wei CL. Optimized sequencing depth and de novo assembler for deeply reconstructing the transcriptome of the tea plant, an economically important plant species. BMC Bioinformatics 2019; 20:553. [PMID: 31694521 PMCID: PMC6836513 DOI: 10.1186/s12859-019-3166-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 10/21/2019] [Indexed: 11/10/2022] Open
Abstract
Background Tea is the oldest and among the world’s most popular non-alcoholic beverages, which has important economic, health and cultural values. Tea is commonly produced from the leaves of tea plants (Camellia sinensis), which belong to the genus Camellia of family Theaceae. In the last decade, many studies have generated the transcriptomes of tea plants at different developmental stages or under abiotic and/or biotic stresses to investigate the genetic basis of secondary metabolites that determine tea quality. However, these results exhibited large differences, particularly in the total number of reconstructed transcripts and the quality of the assembled transcriptomes. These differences largely result from limited knowledge regarding the optimized sequencing depth and assembler for transcriptome assembly of structurally complex plant species genomes. Results We employed different amounts of RNA-sequencing data, ranging from 4 to 84 Gb, to assemble the tea plant transcriptome using five well-known and representative transcript assemblers. Although the total number of assembled transcripts increased with increasing sequencing data, the proportion of unassembled transcripts became saturated as revealed by plant BUSCO datasets. Among the five representative assemblers, the Bridger package shows the best performance in both assembly completeness and accuracy as evaluated by the BUSCO datasets and genome alignment. In addition, we showed that Bridger and BinPacker harbored the shortest runtimes followed by SOAPdenovo and Trans-ABySS. Conclusions The present study compares the performance of five representative transcript assemblers and investigates the key factors that affect the assembly quality of the transcriptome of the tea plants. This study will be of significance in helping the tea research community obtain better sequencing and assembly of tea plant transcriptomes under conditions of interest and may thus help to answer major biological questions currently facing the tea industry.
Collapse
Affiliation(s)
- Fang-Dong Li
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China.,School of Science, Anhui Agricultural University, Hefei, 230036, China
| | - Wei Tong
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China
| | - En-Hua Xia
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China.
| | - Chao-Ling Wei
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, China.
| |
Collapse
|
17
|
Transcriptome Landscape Variation in the Genus Thymus. Genes (Basel) 2019; 10:genes10080620. [PMID: 31426352 PMCID: PMC6723042 DOI: 10.3390/genes10080620] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 07/31/2019] [Accepted: 08/12/2019] [Indexed: 12/13/2022] Open
Abstract
Among the Lamiaceae family, the genus Thymus is an economically important genera due to its medicinal and aromatic properties. Most Thymus molecular research has focused on the determining the phylogenetic relationships between different species, but no published work has focused on the evolution of the transcriptome across the genus to elucidate genes involved in terpenoid biosynthesis. Hence, in this study, the transcriptomes of five different Thymus species were generated and analyzed to mine putative genes involved in thymol and carvacrol biosynthesis. High-throughput sequencing produced ~43 million high-quality reads per sample, which were assembled de novo using several tools, then further subjected to a quality evaluation. The best assembly for each species was used as queries to search within the UniProt, KEGG (Kyoto Encyclopedia of Genes and Genomes), COG (Clusters of Orthologous Groups) and TF (Transcription Factors) databases. Mining the transcriptomes resulted in the identification of 592 single-copy orthogroups used for phylogenetic analysis. The data showed strongly support a close genetic relationship between Thymus vulgaris and Thymus daenensis. Additionally, this study dates the speciation events between 1.5–2.1 and 9–10.2 MYA according to different methodologies. Our study provides a global overview of genes related to the terpenoid pathway in Thymus, and can help establish an understanding of the relationship that exists among Thymus species.
Collapse
|
18
|
Hamabata T, Kinoshita G, Kurita K, Cao PL, Ito M, Murata J, Komaki Y, Isagi Y, Makino T. Endangered island endemic plants have vulnerable genomes. Commun Biol 2019; 2:244. [PMID: 31263788 PMCID: PMC6597543 DOI: 10.1038/s42003-019-0490-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 06/03/2019] [Indexed: 11/29/2022] Open
Abstract
Loss of genetic diversity is known to decrease the fitness of species and is a critical factor that increases extinction risk. However, there is little evidence for higher vulnerability and extinction risk in endangered species based on genomic differences between endangered and non-endangered species. This is true even in the case of functional loci, which are more likely to relate to the fitness of species than neutral loci. Here, we compared the genome-wide genetic diversity, proportion of duplicated genes (PD), and accumulation of deleterious variations of endangered island endemic (EIE) plants from four genera with those of their non-endangered (NE) widespread congeners. We focused on exhaustive sequences of expressed genes obtained by RNA sequencing. Most EIE species exhibited significantly lower genetic diversity and PD than NE species. Additionally, all endangered species accumulated deleterious variations. Our findings provide new insights into the genomic traits of EIE species.
Collapse
Affiliation(s)
- Tomoko Hamabata
- Graduate School of Life Sciences, Tohoku University, Aoba-ku, Sendai 980-8578 Japan
| | - Gohta Kinoshita
- Graduate School of Agriculture, Kyoto University, Sakyo-ku, Kyoto 606-8502 Japan
| | - Kazuki Kurita
- Graduate School of Agriculture, Kyoto University, Sakyo-ku, Kyoto 606-8502 Japan
| | - Ping-Lin Cao
- Graduate School of Life Sciences, Tohoku University, Aoba-ku, Sendai 980-8578 Japan
| | - Motomi Ito
- Graduate School of Arts and Sciences, University of Tokyo, Meguro-ku, Tokyo 153-8902 Japan
| | - Jin Murata
- Koishikawa Botanical Garden, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo 112-0001 Japan
| | - Yoshiteru Komaki
- Koishikawa Botanical Garden, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo 112-0001 Japan
| | - Yuji Isagi
- Graduate School of Agriculture, Kyoto University, Sakyo-ku, Kyoto 606-8502 Japan
| | - Takashi Makino
- Graduate School of Life Sciences, Tohoku University, Aoba-ku, Sendai 980-8578 Japan
| |
Collapse
|
19
|
Voshall A, Moriyama EN. Next-generation transcriptome assembly and analysis: Impact of ploidy. Methods 2019; 176:14-24. [PMID: 31176772 DOI: 10.1016/j.ymeth.2019.06.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2019] [Revised: 05/30/2019] [Accepted: 06/01/2019] [Indexed: 10/26/2022] Open
Abstract
Whole genome duplications (WGD) occur widely in plants, but the effects of these events impact all branches of life. WGD events have major evolutionary impacts, often leading to major structural changes within the chromosomes and massive changes in gene expression that facilitate rapid speciation and gene diversification. Even for species that currently have diploid genomes, the impact of ancestral duplication events is still present in the genomes, especially in the context of highly similar gene families that are retained from WGD. However, the impact of these ploidies on various bioinformatics workflows has not been studied well. In this review, we overview biological significance of polyploidy in different organisms. We describe the impact of having polyploid transcriptomes on bioinformatics analyses, especially focusing on transcriptome assembly and transcript quantification. We discuss the benefits of using simulated benchmarking data when we examine the performance of various methods. We also present an example strategy to generate simulated allopolyploid transcriptomes and RNAseq datasets and how these benchmark datasets can be used to assess the performance of transcript assembly and quantification methods. Our benchmarking study shows that all transcriptome assembly methods are affected by having polyploid genomes. Quantification accuracy is also impacted by polyploidy depending on the method. These simulated datasets can be adapted for testing, such as, read mapping, variant calling, and differential expression using biologically realistic conditions.
Collapse
Affiliation(s)
- Adam Voshall
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE 68588, USA; School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, NE 68588, USA; Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, USA
| | - Etsuko N Moriyama
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE 68588, USA; School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, NE 68588, USA.
| |
Collapse
|
20
|
Hölzer M, Marz M. De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers. Gigascience 2019; 8:giz039. [PMID: 31077315 PMCID: PMC6511074 DOI: 10.1093/gigascience/giz039] [Citation(s) in RCA: 114] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 12/21/2018] [Accepted: 03/09/2019] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND In recent years, massively parallel complementary DNA sequencing (RNA sequencing [RNA-Seq]) has emerged as a fast, cost-effective, and robust technology to study entire transcriptomes in various manners. In particular, for non-model organisms and in the absence of an appropriate reference genome, RNA-Seq is used to reconstruct the transcriptome de novo. Although the de novo transcriptome assembly of non-model organisms has been on the rise recently and new tools are frequently developing, there is still a knowledge gap about which assembly software should be used to build a comprehensive de novo assembly. RESULTS Here, we present a large-scale comparative study in which 10 de novo assembly tools are applied to 9 RNA-Seq data sets spanning different kingdoms of life. Overall, we built >200 single assemblies and evaluated their performance on a combination of 20 biological-based and reference-free metrics. Our study is accompanied by a comprehensive and extensible Electronic Supplement that summarizes all data sets, assembly execution instructions, and evaluation results. Trinity, SPAdes, and Trans-ABySS, followed by Bridger and SOAPdenovo-Trans, generally outperformed the other tools compared. Moreover, we observed species-specific differences in the performance of each assembler. No tool delivered the best results for all data sets. CONCLUSIONS We recommend a careful choice and normalization of evaluation metrics to select the best assembling results as a critical step in the reconstruction of a comprehensive de novo transcriptome assembly.
Collapse
Affiliation(s)
- Martin Hölzer
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University, Leutragraben 1, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University, Leutragraben 1, 07743 Jena, Germany
| | - Manja Marz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University, Leutragraben 1, 07743 Jena, Germany
- European Virus Bioinformatics Center, Friedrich Schiller University, Leutragraben 1, 07743 Jena, Germany
- FLI Leibniz Institute for Age Research, Beutenbergstraße 11, 07743 Jena, Germany
| |
Collapse
|
21
|
Frare R, Ayub N, Alleva K, Soto G. The Ammonium Channel NOD26 is the Evolutionary Innovation that Drives the Emergence, Consolidation, and Dissemination of Nitrogen-Fixing Symbiosis in Angiosperms. J Mol Evol 2018; 86:554-565. [DOI: 10.1007/s00239-018-9867-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 09/12/2018] [Indexed: 12/01/2022]
|
22
|
Chopra R, Simpson CE, Hillhouse A, Payton P, Sharma J, Burow MD. SNP genotyping reveals major QTLs for plant architectural traits between A-genome peanut wild species. Mol Genet Genomics 2018; 293:1477-1491. [PMID: 30069598 DOI: 10.1007/s00438-018-1472-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 07/09/2018] [Indexed: 12/19/2022]
Abstract
KEY MESSAGE QTL mapping of important architectural traits was successfully applied to an A-genome diploid population using gene-specific variations. Peanut wild species are an important source of resistance to biotic and possibly abiotic stress; because these species differ from the cultigen in many traits, we have undertaken to identify QTLs for several plant architecture-related traits. In this study, we took recently identified SNPs, converted them into markers, and identified QTLs for architectural traits. SNPs from RNASeq data distinguishing two parents, A. duranensis (KSSc38901) and A. cardenasii (GKP10017), of a mapping population were identified using three references-A. duranensis V14167 genome sequence, and transcriptome sequences of A. duranensis KSSc38901 and OLin. More than 49,000 SNPs differentiated the parents, and 87.9% of the 190 SNP calls tested were validated. SNPs were then genotyped on 91 F2 lines using KASP chemistry on a Roche LightCycler 480 and a Fluidigm Biomark HD, and using SNPType chemistry on the Fluidigm Biomark HD. A linkage map was constructed having ten linkage groups, with 144 loci spanning a total map distance of 1040 cM. Comparison of the A-genome map to the A. duranensis genome sequence revealed a high degree of synteny. QTL analysis was also performed on the mapping population for important architectural traits. Fifteen definitive and 16 putative QTLs for petiole length, leaflet length and width, leaflet area, leaflet length/width ratio, main stem height, presence of flowers on the main stem, and seed mass were identified. Results demonstrate that SNPs identified from transcriptome sequencing could be converted to KASP or SNPType markers with a high success rate, and used to identify alleles with significant phenotypic effects, These could serve as information useful for introgression of alleles into cultivated peanut from wild species and have the potential to allow breeders to more easily fix these alleles using a marker-assisted backcrossing approach.
Collapse
Affiliation(s)
- Ratan Chopra
- Department of Plant and Soil Sciences, Texas Tech University, Lubbock, TX, 79409, USA
| | | | - Andrew Hillhouse
- Department of Molecular and Cellular Medicine, Texas A&M University, College Station, TX, 77843, USA
| | | | - Jyotsna Sharma
- Department of Plant and Soil Sciences, Texas Tech University, Lubbock, TX, 79409, USA
| | - Mark D Burow
- Department of Plant and Soil Sciences, Texas Tech University, Lubbock, TX, 79409, USA.
- Texas A&M AgriLife Research, Lubbock, TX, 79403, USA.
| |
Collapse
|
23
|
Pownall ME, Cutler RR, Saha MS. Transcriptome of Xenopus andrei, an octoploid frog, during embryonic development. Data Brief 2018; 19:501-505. [PMID: 29900348 PMCID: PMC5997840 DOI: 10.1016/j.dib.2018.05.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2018] [Accepted: 05/04/2018] [Indexed: 12/02/2022] Open
Abstract
Although polyploidy occurs throughout the fish and amphibian lineages, the Xenopus genus exhibits a high incidence of polyploidy, with 25 out of the 26 known species being polyploid. However, transcriptomic information is currently available for only one of these species, the tetraploid Xenopus laevis. Xenopus andrei, an octoploid species within the Xenopus genus, offers an opportunity for assessing a novel polyploid transcriptome during vertebrate development. RNA-Seq data was generated at nine different developmental stages ranging from unfertilized eggs through swimming tadpole stages and raw FASTQ files were deposited in the NCBI SRA database (accession number SRP134281). Additionally, RNA-seq data from all nine stages were pooled to create a de novo assembly of the transcriptome using Trinity and has been deposited in the NCBI GEO database (accession number GSE111639). To our knowledge, this represents the first published assembly of an octoploid vertebrate transcriptome. In total, 849 Mb were assembled, which led to the identification of 1,650,048 transcripts in the assembly with a contig N50 of 630 bases. This RNA-Seq and transcriptome data will be valuable for comparing polyploid transcriptomes across Xenopus species, as well as understanding evolutionary implications of whole-genome duplication and polyploidy in vertebrates.
Collapse
Affiliation(s)
- Mark E Pownall
- College of William & Mary, Biology Department, Williamsburg, VA 23185, United States
| | - Ronald R Cutler
- College of William & Mary, Biology Department, Williamsburg, VA 23185, United States
| | - Margaret S Saha
- College of William & Mary, Biology Department, Williamsburg, VA 23185, United States
| |
Collapse
|
24
|
Hoang NV, Furtado A, Thirugnanasambandam PP, Botha FC, Henry RJ. De novo assembly and characterizing of the culm-derived meta-transcriptome from the polyploid sugarcane genome based on coding transcripts. Heliyon 2018; 4:e00583. [PMID: 29862346 PMCID: PMC5968133 DOI: 10.1016/j.heliyon.2018.e00583] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 03/02/2018] [Accepted: 03/16/2018] [Indexed: 12/31/2022] Open
Abstract
Sugarcane biomass has been used for sugar, bioenergy and biomaterial production. The majority of the sugarcane biomass comes from the culm, which makes it important to understand the genetic control of biomass production in this part of the plant. A meta-transcriptome of the culm was obtained in an earlier study by using about one billion paired-end (150 bp) reads of deep RNA sequencing of samples from 20 diverse sugarcane genotypes and combining de novo assemblies from different assemblers and different settings. Although many genes could be recovered, this resulted in a large combined assembly which created the need for clustering to reduce transcript redundancy while maintaining gene content. Here, we present a comprehensive analysis of the effect of different assembly settings and clustering methods on de novo assembly, annotation and transcript profiling focusing especially on the coding transcripts from the highly polyploid sugarcane genome. The new coding sequence-based transcript clustering resulted in a better representation of transcripts compared to the earlier approach, having 121,987 contigs, which included 78,052 main and 43,935 alternative transcripts. About 73%, 67%, 61% and 10% of the transcriptome was annotated against the NCBI NR protein database, GO terms, orthologous groups and KEGG orthologies, respectively. Using this set for a differential gene expression analysis between the young and mature sugarcane culm tissues, a total of 822 transcripts were found to be differentially expressed, including key transcripts involved in sugar/fiber accumulation in sugarcane. In the context of the lack of a whole genome sequence for sugarcane, the availability of a well annotated culm-derived meta-transcriptome through deep sequencing provides useful information on coding genes specific to the sugarcane culm and will certainly contribute to understanding the process of carbon partitioning, and biomass accumulation in the sugarcane culm.
Collapse
Affiliation(s)
- Nam V. Hoang
- College of Agriculture and Forestry, Hue University, Hue, Vietnam
| | - Agnelo Furtado
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Queensland, 4072, Australia
| | - Prathima P. Thirugnanasambandam
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Queensland, 4072, Australia
- ICAR - Sugarcane Breeding Institute, Coimbatore, Tamil Nadu, India
| | - Frederik C. Botha
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Queensland, 4072, Australia
- Sugar Research Australia, Indooroopilly, Queensland, Australia
| | - Robert J. Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Queensland, 4072, Australia
| |
Collapse
|
25
|
Carruthers M, Yurchenko AA, Augley JJ, Adams CE, Herzyk P, Elmer KR. De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species. BMC Genomics 2018; 19:32. [PMID: 29310597 PMCID: PMC5759245 DOI: 10.1186/s12864-017-4379-x] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Accepted: 12/11/2017] [Indexed: 12/21/2022] Open
Abstract
Background Salmonid fishes exhibit high levels of phenotypic and ecological variation and are thus ideal model systems for studying evolutionary processes of adaptive divergence and speciation. Furthermore, salmonids are of major interest in fisheries, aquaculture, and conservation research. Improving understanding of the genetic mechanisms underlying traits in these species would significantly progress research in these fields. Here we generate high quality de novo transcriptomes for four salmonid species: Atlantic salmon (Salmo salar), brown trout (Salmo trutta), Arctic charr (Salvelinus alpinus), and European whitefish (Coregonus lavaretus). All species except Atlantic salmon have no reference genome publicly available and few if any genomic studies to date. Results We used paired-end RNA-seq on Illumina to generate high coverage sequencing of multiple individuals, yielding between 180 and 210 M reads per species. After initial assembly, strict filtering was used to remove duplicated, redundant, and low confidence transcripts. The final assemblies consisted of 36,505 protein-coding transcripts for Atlantic salmon, 35,736 for brown trout, 33,126 for Arctic charr, and 33,697 for European whitefish and are made publicly available. Assembly completeness was assessed using three approaches, all of which supported high quality of the assemblies: 1) ~78% of Actinopterygian single-copy orthologs were successfully captured in our assemblies, 2) orthogroup inference identified high overlap in the protein sequences present across all four species (40% shared across all four and 84% shared by at least two), and 3) comparison with the published Atlantic salmon genome suggests that our assemblies represent well covered (~98%) protein-coding transcriptomes. Thorough comparison of the generated assemblies found that 84-90% of transcripts in each assembly were orthologous with at least one of the other three species. We also identified 34-37% of transcripts in each assembly as paralogs. We further compare completeness and annotation statistics of our new assemblies to available related species. Conclusion New, high-confidence protein-coding transcriptomes were generated for four ecologically and economically important species of salmonids. This offers a high quality pipeline for such complex genomes, represents a valuable contribution to the existing genomic resources for these species and provides robust tools for future investigation of gene expression and sequence evolution in these and other salmonid species. Electronic supplementary material The online version of this article (10.1186/s12864-017-4379-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Madeleine Carruthers
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, G12 8QQ, Glasgow, UK
| | - Andrey A Yurchenko
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, G12 8QQ, Glasgow, UK
| | - Julian J Augley
- Glasgow Polyomics, Wolfson Wohl Cancer Research Centre, University of Glasgow, G61 1QH, Glasgow, UK.,Present Address: Fios Genomics Ltd., Nine Edinburgh Bioquarter, 9 Little France Road, Edinburgh, EH16 4UX, UK
| | - Colin E Adams
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, G12 8QQ, Glasgow, UK.,Scottish Centre for Ecology and the Natural Environment, University of Glasgow, Rowardennan, G63 0AW, UK
| | - Pawel Herzyk
- Glasgow Polyomics, Wolfson Wohl Cancer Research Centre, University of Glasgow, G61 1QH, Glasgow, UK.,Institute of Molecular, Cell & Systems Biology, College of Medical, Veterinary & Life Sciences, University of Glasgow, G12 8QQ, Glasgow, UK
| | - Kathryn R Elmer
- Institute of Biodiversity, Animal Health & Comparative Medicine, College of Medical, Veterinary & Life Sciences, University of Glasgow, G12 8QQ, Glasgow, UK.
| |
Collapse
|
26
|
Schedina IM, Groth D, Schlupp I, Tiedemann R. The gonadal transcriptome of the unisexual Amazon molly Poecilia formosa in comparison to its sexual ancestors, Poecilia mexicana and Poecilia latipinna. BMC Genomics 2018; 19:12. [PMID: 29298680 PMCID: PMC5753479 DOI: 10.1186/s12864-017-4382-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 12/14/2017] [Indexed: 12/15/2022] Open
Abstract
Background The unisexual Amazon molly (Poecilia formosa) originated from a hybridization between two sexual species, the sailfin molly (Poecilia latipinna) and the Atlantic molly (Poecilia mexicana). The Amazon molly reproduces clonally via sperm-dependent parthenogenesis (gynogenesis), in which the sperm of closely related species triggers embryogenesis of the apomictic oocytes, but typically does not contribute genetic material to the next generation. We compare for the first time the gonadal transcriptome of the Amazon molly to those of both ancestral species, P. mexicana and P. latipinna. Results We sequenced the gonadal transcriptomes of the P. formosa and its parental species P. mexicana and P. latipinna using Illumina RNA-sequencing techniques (paired-end, 100 bp). De novo assembly of about 50 million raw read pairs for each species was performed using Trinity, yielding 106,922 transcripts for P. formosa, 115,175 for P. latipinna, and 133,025 for P. mexicana after eliminating contaminations. On the basis of sequence similarity comparisons to other teleost species and the UniProt databases, functional annotation, and differential expression analysis, we demonstrate the similarity of the transcriptomes among the three species. More than 40% of the transcripts for each species were functionally annotated and about 70% were assigned to orthologous genes of a closely related species. Differential expression analysis between the sexual and unisexual species uncovered 2035 up-regulated and 564 down-regulated genes in P. formosa. This was exemplary validated for six genes by qRT-PCR. Conclusions We identified more than 130 genes related to meiosis and reproduction within the apomictically reproducing P. formosa. Overall expression of these genes seems to be down-regulated in the P. formosa transcriptome compared to both ancestral species (i.e., 106 genes down-regulated, 29 up-regulated). A further 35 meiosis and reproduction related genes were not found in the P. formosa transcriptome, but were only expressed in the sexual species. Our data support the hypothesis of general down-regulation of meiosis-related genes in the apomictic Amazon molly. Furthermore, the obtained dataset and identified gene catalog will serve as a resource for future research on the molecular mechanisms behind the reproductive mode of this unisexual species. Electronic supplementary material The online version of this article (10.1186/s12864-017-4382-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ina Maria Schedina
- Unit of Evolutionary Biology/Systematic Zoology, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Straße 24-25, Haus 26, 14476, Potsdam, Germany
| | - Detlef Groth
- Department of Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Straße 24-25, Haus 14, 14476, Potsdam, Germany
| | - Ingo Schlupp
- Department of Biology, University of Oklahoma, 730 Van Vleet Oval, Norman, OK, 73019, USA
| | - Ralph Tiedemann
- Unit of Evolutionary Biology/Systematic Zoology, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Straße 24-25, Haus 26, 14476, Potsdam, Germany.
| |
Collapse
|
27
|
Ojeda PG, Ramírez D, Alzate-Morales J, Caballero J, Kaas Q, González W. Computational Studies of Snake Venom Toxins. Toxins (Basel) 2017; 10:E8. [PMID: 29271884 PMCID: PMC5793095 DOI: 10.3390/toxins10010008] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Revised: 12/09/2017] [Accepted: 12/18/2017] [Indexed: 12/17/2022] Open
Abstract
Most snake venom toxins are proteins, and participate to envenomation through a diverse array of bioactivities, such as bleeding, inflammation, and pain, cytotoxic, cardiotoxic or neurotoxic effects. The venom of a single snake species contains hundreds of toxins, and the venoms of the 725 species of venomous snakes represent a large pool of potentially bioactive proteins. Despite considerable discovery efforts, most of the snake venom toxins are still uncharacterized. Modern bioinformatics tools have been recently developed to mine snake venoms, helping focus experimental research on the most potentially interesting toxins. Some computational techniques predict toxin molecular targets, and the binding mode to these targets. This review gives an overview of current knowledge on the ~2200 sequences, and more than 400 three-dimensional structures of snake toxins deposited in public repositories, as well as of molecular modeling studies of the interaction between these toxins and their molecular targets. We also describe how modern bioinformatics have been used to study the snake venom protein phospholipase A2, the small basic myotoxin Crotamine, and the three-finger peptide Mambalgin.
Collapse
Affiliation(s)
- Paola G Ojeda
- Center for Bioinformatics and Molecular Simulations (CBSM), Universidad de Talca, 3460000 Talca, Chile.
- Facultad de Ciencias de la Salud, Instituto de Ciencias Biomedicas, Universidad Autonoma de Chile, 3460000 Talca, Chile.
| | - David Ramírez
- Center for Bioinformatics and Molecular Simulations (CBSM), Universidad de Talca, 3460000 Talca, Chile.
- Facultad de Ciencias de la Salud, Instituto de Ciencias Biomedicas, Universidad Autonoma de Chile, 3460000 Talca, Chile.
| | - Jans Alzate-Morales
- Center for Bioinformatics and Molecular Simulations (CBSM), Universidad de Talca, 3460000 Talca, Chile.
| | - Julio Caballero
- Center for Bioinformatics and Molecular Simulations (CBSM), Universidad de Talca, 3460000 Talca, Chile.
| | - Quentin Kaas
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia.
| | - Wendy González
- Center for Bioinformatics and Molecular Simulations (CBSM), Universidad de Talca, 3460000 Talca, Chile.
- Millennium Nucleus of Ion Channels-Associated Diseases (MiNICAD), Universidad de Talca, 3460000 Talca, Chile.
| |
Collapse
|
28
|
Chen L, Guo X, Cui Y, Zheng X, Yang H. Comparative Transcriptome Analysis Reveals Hormone Signaling Genes Involved in the Launch of Culm-Shape Differentiation in Dendrocalamus sinicus. Genes (Basel) 2017; 9:E4. [PMID: 29271945 PMCID: PMC5793157 DOI: 10.3390/genes9010004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Revised: 12/09/2017] [Accepted: 12/20/2017] [Indexed: 12/23/2022] Open
Abstract
Dendrocalamus sinicus is a sympodial bamboo species endemic to Yunnan Province, China, and is the strongest bamboo species in the world. However, there is substantial variation in the basal culm shape, i.e., straight culm (SC) and bent culm (BC), among different D. sinicus as a result of genetic and growth factors. This study performed a comparative transcriptomic analysis of bamboo shoots of two variants at the early, mid-, and late shoot-development stages to examine the molecular basis of this variation. In total, 98,479 unigenes were annotated, of which 13,495 were differentially expressed in pairwise comparisons of the six libraries. More differentially expressed genes (DEGs) were involved in SC than in BC culm development. The DEGs between BC and SC were assigned to 108 metabolic pathways. The 1064 DEGs in early development might mainly control the launch of culm-shape differentiation. Sixty genes encoding components of hormone signaling pathways were differentially expressed between BC5 and SC5, indicating complex hormonal regulation of culm differentiation. The AUX/IAA, ARF, PP2C, SnRK2, and ABF genes involved in auxin and abscisic acid signaling played key roles. These results help us to understand the molecular mechanism of culm variation and other aspects of culm development in D. sinicus.
Collapse
Affiliation(s)
- Lingna Chen
- Research Institute of Resources Insects, Chinese Academy of Forestry, Bailongsi, Panlong, Kunming 650233, China.
| | - Xiaojuan Guo
- Research Institute of Resources Insects, Chinese Academy of Forestry, Bailongsi, Panlong, Kunming 650233, China.
| | - Yongzhong Cui
- Research Institute of Resources Insects, Chinese Academy of Forestry, Bailongsi, Panlong, Kunming 650233, China.
| | - Xianggan Zheng
- Research Institute of Resources Insects, Chinese Academy of Forestry, Bailongsi, Panlong, Kunming 650233, China.
| | - Hanqi Yang
- Research Institute of Resources Insects, Chinese Academy of Forestry, Bailongsi, Panlong, Kunming 650233, China.
| |
Collapse
|
29
|
Narożna D, Książkiewicz M, Przysiecka Ł, Króliczak J, Wolko B, Naganowska B, Mądrzak CJ. Legume isoflavone synthase genes have evolved by whole-genome and local duplications yielding transcriptionally active paralogs. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2017; 264:149-167. [PMID: 28969795 DOI: 10.1016/j.plantsci.2017.09.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Revised: 09/05/2017] [Accepted: 09/11/2017] [Indexed: 05/04/2023]
Abstract
Isoflavone synthase (IFS) is the key enzyme of isoflavonoid biosynthesis. IFS genes were identified in numerous species, although their evolutionary patterns have not yet been reconstructed. To address this issue, we performed structural and functional genomic analysis. Narrow leafed lupin, Lupinus angustifolius L., was used as a reference species for the genus, because it has the most developed molecular tools available. Nuclear genome BAC library clones carrying IFS homologs were localized by linkage mapping and fluorescence in situ hybridization in three chromosome pairs. Annotation of BAC, scaffold and transcriptome sequences confirmed the presence of three full-length IFS genes in the genome. Microsynteny analysis and Bayesian inference provided clear evidence that IFS genes in legumes have evolved by lineage-specific whole-genome and tandem duplications. Gene expression profiling and RNA-seq data mining showed that the vast majority of legume IFS copies have maintained their transcriptional activity. L. angustifolius IFS homologs exhibited organ-specific expression patterns similar to those observed in other Papilionoideae. Duplicated lupin IFS homologs retained non-negligible levels of substitutions in conserved motifs, putatively due to positive selection acting during early evolution of the genus, before the whole-genome duplication. Strong purifying selection preserved newly arisen IFS duplicates from further nonsynonymous changes.
Collapse
Affiliation(s)
- Dorota Narożna
- Department of Biochemistry and Biotechnology, Faculty of Agronomy and Bioengineering, Poznań University of Life Sciences, Dojazd 11, 60-632, Poznań, Poland.
| | - Michał Książkiewicz
- Department of Genomics, Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland.
| | - Łucja Przysiecka
- Department of Genomics, Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland; NanoBioMedical Centre, Adam Mickiewicz University, Umultowska 85, 61-614, Poznań, Poland.
| | - Joanna Króliczak
- Department of Biochemistry and Biotechnology, Faculty of Agronomy and Bioengineering, Poznań University of Life Sciences, Dojazd 11, 60-632, Poznań, Poland.
| | - Bogdan Wolko
- Department of Genomics, Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland.
| | - Barbara Naganowska
- Department of Genomics, Institute of Plant Genetics, Polish Academy of Sciences, Strzeszyńska 34, 60-479, Poznań, Poland.
| | - Cezary J Mądrzak
- Department of Biochemistry and Biotechnology, Faculty of Agronomy and Bioengineering, Poznań University of Life Sciences, Dojazd 11, 60-632, Poznań, Poland.
| |
Collapse
|
30
|
Gene expression atlas of fruit ripening and transcriptome assembly from RNA-seq data in octoploid strawberry (Fragaria × ananassa). Sci Rep 2017; 7:13737. [PMID: 29062051 PMCID: PMC5653846 DOI: 10.1038/s41598-017-14239-6] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 10/06/2017] [Indexed: 01/09/2023] Open
Abstract
RNA-seq has been used to perform global expression analysis of the achene and the receptacle at four stages of fruit ripening, and of the roots and leaves of strawberry (Fragaria × ananassa). About 967 million reads and 191 Gb of sequence were produced, using Illumina sequencing. Mapping the reads in the related genome of the wild diploid Fragaria vesca revealed differences between the achene and receptacle development program, and reinforced the role played by ethylene in the ripening receptacle. For the strawberry transcriptome assembly, a de novo strategy was followed, generating separate assemblies for each of the ten tissues and stages sampled. The Trinity program was used for these assemblies, resulting in over 1.4 M isoforms. Filtering by a threshold of 0.3 FPKM, and doing Blastx (E-value < 1 e-30) against the UniProt database of plants reduced the number to 472,476 isoforms. Their assembly with the MIRA program (90% homology) resulted in 26,087 contigs. From these, 91.34 percent showed high homology to Fragaria vesca genes and 87.30 percent Fragaria iinumae (BlastN E-value < 1 e-100). Mapping back the reads on the MIRA contigs identified polymorphisms at nucleotide level, using FREEBAYES, as well as estimate their relative abundance in each sample.
Collapse
|
31
|
Moazzzam Jazi M, Seyedi SM, Ebrahimie E, Ebrahimi M, De Moro G, Botanga C. A genome-wide transcriptome map of pistachio (Pistacia vera L.) provides novel insights into salinity-related genes and marker discovery. BMC Genomics 2017; 18:627. [PMID: 28814265 PMCID: PMC5559799 DOI: 10.1186/s12864-017-3989-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 08/01/2017] [Indexed: 12/18/2022] Open
Abstract
Background Pistachio (Pistacia vera L.) is one of the most important commercial nut crops worldwide. It is a salt-tolerant and long-lived tree, with the largest cultivation area in Iran. Climate change and subsequent increased soil salt content have adversely affected the pistachio yield in recent years. However, the lack of genomic/global transcriptomic sequences on P. vera impedes comprehensive researches at the molecular level. Hence, whole transcriptome sequencing is required to gain insight into functional genes and pathways in response to salt stress. Results RNA sequencing of a pooled sample representing 24 different tissues of two pistachio cultivars with contrasting salinity tolerance under control and salt treatment by Illumina Hiseq 2000 platform resulted in 368,953,262 clean 100 bp paired-ends reads (90 Gb). Following creating several assemblies and assessing their quality from multiple perspectives, we found that using the annotation-based metrics together with the length-based parameters allows an improved assessment of the transcriptome assembly quality, compared to the solely use of the length-based parameters. The generated assembly by Trinity was adopted for functional annotation and subsequent analyses. In total, 29,119 contigs annotated against all of five public databases, including NR, UniProt, TAIR10, KOG and InterProScan. Among 279 KEGG pathways supported by our assembly, we further examined the pathways involved in the plant hormone biosynthesis and signaling as well as those to be contributed to secondary metabolite biosynthesis due to their importance under salinity stress. In total, 11,337 SSRs were also identified, which the most abundant being dinucleotide repeats. Besides, 13,097 transcripts as candidate stress-responsive genes were identified. Expression of some of these genes experimentally validated through quantitative real-time PCR (qRT-PCR) that further confirmed the accuracy of the assembly. From this analysis, the contrasting expression pattern of NCED3 and SOS1 genes were observed between salt-sensitive and salt-tolerant cultivars. Conclusion This study, as the first report on the whole transcriptome survey of P. vera, provides important resources and paves the way for functional and comparative genomic studies on this major tree to discover the salinity tolerance-related markers and stress response mechanisms for breeding of new pistachio cultivars with more salinity tolerance. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3989-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maryam Moazzzam Jazi
- Plant Biotechnology Department, National Institute of Genetic Engineering and Biotechnology, Tehran, Iran
| | - Seyed Mahdi Seyedi
- Plant Biotechnology Department, National Institute of Genetic Engineering and Biotechnology, Tehran, Iran.
| | - Esmaeil Ebrahimie
- School of Medicine, The University of Adelaide, Adelaide, Australia.,Institute of Biotechnology, Shiraz University, Shiraz, Iran.,Division of Information Technology, Engineering and the Environment, School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, Australia.,School of Biological Sciences, Faculty of Science and Engineering, Flinders University, Adelaide, Australia
| | | | - Gianluca De Moro
- Center of Marine Sciences (CCMAR), University of Algarve, Faro, Portugal
| | - Christopher Botanga
- Department of Biological Sciences, Chicago State University, Chicago, IL, USA
| |
Collapse
|
32
|
Exploring the heat-responsive chaperones and microsatellite markers associated with terminal heat stress tolerance in developing wheat. Funct Integr Genomics 2017; 17:621-640. [PMID: 28573536 DOI: 10.1007/s10142-017-0560-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Revised: 04/06/2017] [Accepted: 04/17/2017] [Indexed: 10/19/2022]
Abstract
Global warming is a major threat for agriculture and food security, and in many cases the negative impacts are already apparent. Wheat is one of the most important staple food crops and is highly sensitive to the heat stress (HS) during reproductive and grain-filling stages. Here, whole transcriptome analysis of thermotolerant wheat cv. HD2985 was carried out at the post-anthesis stage under control (22 ± 3 °C) and HS-treated (42 °C, 2 h) conditions using Illumina Hiseq and Roche GS-FLX 454 platforms. We assembled ~24 million (control) and ~23 million (HS-treated) high-quality trimmed reads using different assemblers with optimal parameters. De novo assembly yielded 52,567 (control) and 59,658 (HS-treated) unigenes. We observed 785 transcripts to be upregulated and 431 transcripts to be downregulated under HS; 78 transcripts showed >10-fold upregulation such as HSPs, metabolic pathway-related genes, etc. Maximum number of upregulated genes was observed to be associated with processes such as HS-response, protein-folding, oxidation-reduction and photosynthesis. We identified 2008 and 2483 simple sequence repeats (SSRs) markers from control and HS-treated samples; 243 SSRs were observed to be overlying on stress-associated genes. Polymorphic study validated four SSRs to be heat-responsive in nature. Expression analysis of identified differentially expressed transcripts (DETs) showed very high fold increase in the expression of catalytic chaperones (HSP26, HSP17, and Rca) in contrasting wheat cvs. HD2985 and HD2329 under HS. We observed positive correlation between RNA-seq and qRT-PCR expression data. The present study culminated in greater understanding of the heat-response of tolerant genotype and has provided good candidate genes for the marker development and screening of wheat germplasm for thermotolerance.
Collapse
|
33
|
|
34
|
Transcriptome Sequencing of Diverse Peanut (Arachis) Wild Species and the Cultivated Species Reveals a Wealth of Untapped Genetic Variability. G3-GENES GENOMES GENETICS 2016; 6:3825-3836. [PMID: 27729436 PMCID: PMC5144954 DOI: 10.1534/g3.115.026898] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
To test the hypothesis that the cultivated peanut species possesses almost no molecular variability, we sequenced a diverse panel of 22 Arachis accessions representing Arachis hypogaea botanical classes, A-, B-, and K- genome diploids, a synthetic amphidiploid, and a tetraploid wild species. RNASeq was performed on pools of three tissues, and de novo assembly was performed. Realignment of individual accession reads to transcripts of the cultivar OLin identified 306,820 biallelic SNPs. Among 10 naturally occurring tetraploid accessions, 40,382 unique homozygous SNPs were identified in 14,719 contigs. In eight diploid accessions, 291,115 unique SNPs were identified in 26,320 contigs. The average SNP rate among the 10 cultivated tetraploids was 0.5, and among eight diploids was 9.2 per 1000 bp. Diversity analysis indicated grouping of diploids according to genome classification, and cultivated tetraploids by subspecies. Cluster analysis of variants indicated that sequences of B genome species were the most similar to the tetraploids, and the next closest diploid accession belonged to the A genome species. A subset of 66 SNPs selected from the dataset was validated; of 782 SNP calls, 636 (81.32%) were confirmed using an allele-specific discrimination assay. We conclude that substantial genetic variability exists among wild species. Additionally, significant but lesser variability at the molecular level occurs among accessions of the cultivated species. This survey is the first to report significant SNP level diversity among transcripts, and may explain some of the phenotypic differences observed in germplasm surveys. Understanding SNP variants in the Arachis accessions will benefit in developing markers for selection.
Collapse
|
35
|
Boutte J, Ferreira de Carvalho J, Rousseau-Gueutin M, Poulain J, Da Silva C, Wincker P, Ainouche M, Salmon A. Reference Transcriptomes and Detection of Duplicated Copies in Hexaploid and Allododecaploid Spartina Species (Poaceae). Genome Biol Evol 2016; 8:3030-3044. [PMID: 27614235 PMCID: PMC5633685 DOI: 10.1093/gbe/evw209] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/20/2016] [Indexed: 01/19/2023] Open
Abstract
In this study, we report the assembly and annotation of five reference transcriptomes for the European hexaploid Spartina species (S. maritima, S. alterniflora and their homoploid hybrids S. x townsendii and S. x neyrautii) and the allododecaploid invasive species S. anglica These transcriptomes were constructed from various leaf and root cDNA libraries that were sequenced using both Roche-454 and Illumina technologies. Considering the high ploidy levels of the Spartina genomes under study, and considering the absence of diploid reference genome and the need of an appropriate analytical strategy, we developed generic bioinformatics tools to (1) detect different haplotypes of each gene within each species and (2) assign a parental origin to haplotypes detected in the hexaploid hybrids and the neo-allopolyploid. The approach described here allows the detection of putative homeologs from sets of short reads. Synonymous substitution rate (KS) comparisons between haplotypes from the hexaploid species revealed the presence of one KS peak (likely resulting from the tetraploid duplication event). The procedure developed in this study can be applied for future differential gene expression or genomics experiments to study the fate of duplicated genes in the invasive allododecaploid S. anglica.
Collapse
Affiliation(s)
- Julien Boutte
- UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
| | - Julie Ferreira de Carvalho
- UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
| | - Mathieu Rousseau-Gueutin
- UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France UMR Institut de Génétique, Environnement et Protection des Plantes, Institut National de la Recherche Agronomique, Le Rheu Cedex, France
| | | | | | | | - Malika Ainouche
- UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
| | - Armel Salmon
- UMR CNRS 6553 Ecobio, OSUR (Observatoire des Sciences de l'Univers de Rennes), University of Rennes 1, Rennes Cedex, France
| |
Collapse
|
36
|
Greenwood JM, Ezquerra AL, Behrens S, Branca A, Mallet L. Current analysis of host–parasite interactions with a focus on next generation sequencing data. ZOOLOGY 2016; 119:298-306. [DOI: 10.1016/j.zool.2016.06.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 06/22/2016] [Accepted: 06/22/2016] [Indexed: 01/21/2023]
|
37
|
Liu B, Wang S, Dong Q, Li S, Liu X. Identification of DNA-binding proteins by combining auto-cross covariance transformation and ensemble learning. IEEE Trans Nanobioscience 2016; 15:328-334. [PMID: 28113908 DOI: 10.1109/tnb.2016.2555951] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next generation of sequencing technique, the number of protein sequences is unprecedentedly increasing. Thus it is necessary to develop computational methods to identify the DNA-binding proteins only based on the protein sequence information. In this study, a novel method called iDNA-KACC is presented, which combines the Support Vector Machine (SVM) and the auto-cross covariance transformation. The protein sequences are first converted into profile-based protein representation, and then converted into a series of fixed-length vectors by the auto-cross covariance transformation with Kmer composition. The sequence order effect can be effectively captured by this scheme. These vectors are then fed into Support Vector Machine (SVM) to discriminate the DNA-binding proteins from the non DNA-binding ones. iDNA-KACC achieves an overall accuracy of 75.16% and Matthew correlation coefficient of 0.5 by a rigorous jackknife test. Its performance is further improved by employing an ensemble learning approach, and the improved predictor is called iDNA-KACC-EL. Experimental results on an independent dataset shows that iDNA-KACC-EL outperforms all the other state-of-the-art predictors, indicating that it would be a useful computational tool for DNA binding protein identification. .
Collapse
|
38
|
Comparison of De Novo Transcriptome Assemblers and k-mer Strategies Using the Killifish, Fundulus heteroclitus. PLoS One 2016; 11:e0153104. [PMID: 27054874 PMCID: PMC4824410 DOI: 10.1371/journal.pone.0153104] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 03/23/2016] [Indexed: 11/25/2022] Open
Abstract
Background De novo assembly of non-model organism’s transcriptomes has recently been on the rise in concert with the number of de novo transcriptome assembly software programs. There is a knowledge gap as to what assembler software or k-mer strategy is best for construction of an optimal de novo assembly. Additionally, there is a lack of consensus on which evaluation metrics should be used to assess the quality of de novo transcriptome assemblies. Result Six different assembly strategies were evaluated from four different assemblers. The Trinity assembly was used in its default 25 single k-mer value while Bridger, Oases, and SOAPdenovo-Trans were performed with multiple k-mer strategies. Bridger, Oases, and SOAPdenovo-Trans used a small multiple k-mer (SMK) strategy consisting of the k-mer lengths of 21, 25, 27, 29, 31, and 33. Additionally, Oases and SOAPdenovo-Trans were performed using a large multiple k-mer (LMK) strategy consisting of k-mer lengths of 25, 35, 45, 55, 65, 75, and 85. Eleven metrics were used to evaluate each assembly strategy including three genome related evaluation metrics (contig number, N50 length, Contigs >1 kb, reads) and eight transcriptome evaluation metrics (mapped back to transcripts (RMBT), number of full length transcripts, number of open reading frames, Detonate RSEM-EVAL score, and percent alignment to the southern platyfish, Amazon molly, BUSCO and CEGMA databases). The assembly strategy that performed the best, that is it was within the top three of each evaluation metric, was the Bridger assembly (10 of 11) followed by the Oases SMK assembly (8 of 11), the Oases LMK assembly (6 of 11), the Trinity assembly (4 of 11), the SOAP LMK assembly (4 of 11), and the SOAP SMK assembly (3 of 11). Conclusion This study provides an in-depth multi k-mer strategy investigation concluding that the assembler itself had a greater impact than k-mer size regardless of the strategy employed. Additionally, the comprehensive performance transcriptome evaluation metrics utilized in this study identified the need for choosing metrics centered on user defined research goals. Based on the evaluation metrics performed, the Bridger assembly was able to construct the best assembly of the testis transcriptome in Fundulus heteroclitus.
Collapse
|
39
|
Kono N, Nakamura H, Ito Y, Tomita M, Arakawa K. Evaluation of the impact of RNA preservation methods of spiders for de novo transcriptome assembly. Mol Ecol Resour 2015; 16:662-72. [PMID: 26561354 DOI: 10.1111/1755-0998.12485] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2015] [Revised: 10/26/2015] [Accepted: 10/29/2015] [Indexed: 11/30/2022]
Abstract
With advances in high-throughput sequencing technologies, de novo transcriptome sequencing and assembly has become a cost-effective method to obtain comprehensive genetic information of a species of interest, especially in nonmodel species with large genomes such as spiders. However, high-quality RNA is essential for successful sequencing, and sample preservation conditions require careful consideration for the effective storage of field-collected samples. To this end, we report a streamlined feasibility study of various storage conditions and their effects on de novo transcriptome assembly results. The storage parameters considered include temperatures ranging from room temperature to -80°C; preservatives, including ethanol, RNAlater, TRIzol and RNAlater-ICE; and sample submersion states. As a result, intact RNA was extracted and assembly was successful when samples were preserved at low temperatures regardless of the type of preservative used. The assemblies as well as the gene expression profiles were shown to be robust to RNA degradation, when 30 million 150-bp paired-end reads are obtained. The parameters for sample storage, RNA extraction, library preparation, sequencing and in silico assembly considered in this work provide a guideline for the study of field-collected samples of spiders.
Collapse
Affiliation(s)
- Nobuaki Kono
- Institute for Advanced Biosciences, Keio University, Mizukami 246-2, Kakuganji, Tsuruoka, Yamagata, 997-0052, Japan
| | - Hiroyuki Nakamura
- Spiber Inc., Mizukami 234-1, Kakuganji, Tsuruoka, Yamagata, 997-0052, Japan
| | - Yusuke Ito
- Spiber Inc., Mizukami 234-1, Kakuganji, Tsuruoka, Yamagata, 997-0052, Japan
| | - Masaru Tomita
- Institute for Advanced Biosciences, Keio University, Mizukami 246-2, Kakuganji, Tsuruoka, Yamagata, 997-0052, Japan
| | - Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Mizukami 246-2, Kakuganji, Tsuruoka, Yamagata, 997-0052, Japan
| |
Collapse
|
40
|
Bankar KG, Todur VN, Shukla RN, Vasudevan M. Ameliorated de novo transcriptome assembly using Illumina paired end sequence data with Trinity Assembler. GENOMICS DATA 2015; 5:352-9. [PMID: 26484285 PMCID: PMC4583709 DOI: 10.1016/j.gdata.2015.07.012] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Accepted: 07/12/2015] [Indexed: 11/28/2022]
Abstract
Advent of Next Generation Sequencing has led to possibilities of de novo transcriptome assembly of organisms without availability of complete genome sequence. Among various sequencing platforms available, Illumina is the most widely used platform based on data quality, quantity and cost. Various de novo transcriptome assemblers are also available today for construction of de novo transcriptome. In this study, we aimed at obtaining an ameliorated de novo transcriptome assembly with sequence reads obtained from Illumina platform and assembled using Trinity Assembler. We found that, primary transcriptome assembly obtained as a result of Trinity can be ameliorated on the basis of transcript length, coverage, and depth and protein homology. Our approach to ameliorate is reproducible and could enhance the sensitivity and specificity of the assembled transcriptome which could be critical for validation of the assembled transcripts and for planning various downstream biological assays.
Collapse
Affiliation(s)
- Kiran Gopinath Bankar
- Genome Informatics Research Group, Bionivid Technology Pvt Ltd, Bangalore 560043, India
| | - Vivek Nagaraj Todur
- Genome Informatics Research Group, Bionivid Technology Pvt Ltd, Bangalore 560043, India
| | - Rohit Nandan Shukla
- Genome Informatics Research Group, Bionivid Technology Pvt Ltd, Bangalore 560043, India
| | - Madavan Vasudevan
- Genome Informatics Research Group, Bionivid Technology Pvt Ltd, Bangalore 560043, India
| |
Collapse
|
41
|
Bosamia TC, Mishra GP, Thankappan R, Dobaria JR. Novel and Stress Relevant EST Derived SSR Markers Developed and Validated in Peanut. PLoS One 2015; 10:e0129127. [PMID: 26046991 PMCID: PMC4457858 DOI: 10.1371/journal.pone.0129127] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 05/04/2015] [Indexed: 11/18/2022] Open
Abstract
With the aim to increase the number of functional markers in resource poor crop like cultivated peanut (Arachis hypogaea), large numbers of available expressed sequence tags (ESTs) in the public databases, were employed for the development of novel EST derived simple sequence repeat (SSR) markers. From 16424 unigenes, 2784 (16.95%) SSRs containing unigenes having 3373 SSR motifs were identified. Of these, 2027 (72.81%) sequences were annotated and 4124 gene ontology terms were assigned. Among different SSR motif-classes, tri-nucleotide repeats (33.86%) were the most abundant followed by di-nucleotide repeats (27.51%) while AG/CT (20.7%) and AAG/CTT (13.25%) were the most abundant repeat-motifs. A total of 2456 EST-SSR novel primer pairs were designed, of which 366 unigenes having relevance to various stresses and other functions, were PCR validated using a set of 11 diverse peanut genotypes. Of these, 340 (92.62%) primer pairs yielded clear and scorable PCR products and 39 (10.66%) primer pairs exhibited polymorphisms. Overall, the number of alleles per marker ranged from 1-12 with an average of 3.77 and the PIC ranged from 0.028 to 0.375 with an average of 0.325. The identified EST-SSRs not only enriched the existing molecular markers kitty, but would also facilitate the targeted research in marker-trait association for various stresses, inter-specific studies and genetic diversity analysis in peanut.
Collapse
Affiliation(s)
- Tejas C. Bosamia
- Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India
- Department of Biotechnology, College of Agriculture, Junagadh Agricultural University, Junagadh, Gujarat, 362001,India
| | - Gyan P. Mishra
- Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India
| | - Radhakrishnan Thankappan
- Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India
| | - Jentilal R. Dobaria
- Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India
| |
Collapse
|
42
|
Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L. Mol Genet Genomics 2015; 290:1169-80. [PMID: 25663138 DOI: 10.1007/s00438-014-0976-4] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Accepted: 12/06/2014] [Indexed: 10/24/2022]
Abstract
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Collapse
|