1
|
Transcriptome analysis of mulberry (Morus alba L.) leaves to identify differentially expressed genes associated with post-harvest shelf-life elongation. Sci Rep 2022; 12:18195. [PMID: 36307466 PMCID: PMC9616847 DOI: 10.1038/s41598-022-21828-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 10/04/2022] [Indexed: 12/31/2022] Open
Abstract
Present study deals with molecular expression patterns responsible for post-harvest shelf-life extension of mulberry leaves. Quantitative profiling showed retention of primary metabolite and accumulation of stress markers in NS7 and CO7 respectively. The leaf mRNA profiles was sequenced using the Illumina platform to identify DEGs. A total of 3413 DEGs were identified between the treatments. Annotation with Arabidopsis database has identified 1022 DEGs unigenes. STRING generated protein-protein interaction, identified 1013 DEGs nodes with p < 1.0e-16. KEGG classifier has identified genes and their participating biological processes. MCODE and BiNGO detected sub-networking and ontological enrichment, respectively at p ≤ 0.05. Genes associated with chloroplast architecture, photosynthesis, detoxifying ROS and RCS, and innate-immune response were significantly up-regulated, responsible for extending shelf-life in NS7. Loss of storage sucrose, enhanced activity of senescence-related hormones, accumulation of xenobiotics, and development of osmotic stress inside tissue system was the probable reason for tissue deterioration in CO7. qPCR validation of DEGs was in good agreement with RNA sequencing results, indicating the reliability of the sequencing platform. Present outcome provides a molecular insight regarding involvement of genes in self-life extension, which might help the sericulture industry to overcome their pre-existing problems related to landless farmers and larval feeding during monsoon.
Collapse
|
2
|
Singh KP, Kumari P, Yadava DK. Development of de-novo transcriptome assembly and SSRs in allohexaploid Brassica with functional annotations and identification of heat-shock proteins for thermotolerance. Front Genet 2022; 13:958217. [PMID: 36186472 PMCID: PMC9524822 DOI: 10.3389/fgene.2022.958217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 08/23/2022] [Indexed: 11/20/2022] Open
Abstract
Crop Brassicas contain monogenomic and digenomic species, with no evidence of a trigenomic Brassica in nature. Through somatic fusion (Sinapis alba + B. juncea), a novel allohexaploid trigenomic Brassica (H1 = AABBSS; 2n = 60) was produced and used for transcriptome analysis to uncover genes for thermotolerance, annotations, and microsatellite markers for future molecular breeding. Illumina Novaseq 6000 generated a total of 76,055,546 paired-end raw reads, which were used for de-novo assembly, resulting in the development of 486,066 transcripts. A total of 133,167 coding sequences (CDSs) were predicted from transcripts with a mean length of 507.12 bp and 46.15% GC content. The BLASTX search of CDSs against public protein databases showed a maximum of 126,131 (94.72%) and a minimum of 29,810 (22.39%) positive hits. Furthermore, 953,773 gene ontology (GO) terms were found in 77,613 (58.28%) CDSs, which were divided into biological processes (49.06%), cellular components (31.67%), and molecular functions (19.27%). CDSs were assigned to 144 pathways by a pathway study using the KEGG database and 1,551 pathways by a similar analysis using the Reactome database. Further investigation led to the discovery of genes encoding over 2,000 heat shock proteins (HSPs). The discovery of a large number of HSPs in allohexaploid Brassica validated our earlier findings for heat tolerance at seed maturity. A total of 15,736 SSRs have been found in 13,595 CDSs, with an average of one SSR per 4.29 kb length and an SSR frequency of 11.82%. The first transcriptome assembly of a meiotically stable allohexaploid Brassica has been given in this article, along with functional annotations and the presence of SSRs, which could aid future genetic and genomic studies.
Collapse
Affiliation(s)
| | - Preetesh Kumari
- Genetics Division, ICAR—Indian Agricultural Research Institute, New Delhi, India
- *Correspondence: Preetesh Kumari,
| | | |
Collapse
|
3
|
An empirical analysis of mtSSRs: could microsatellite distribution patterns explain the evolution of mitogenomes in plants? Funct Integr Genomics 2021; 22:35-53. [PMID: 34751851 DOI: 10.1007/s10142-021-00815-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 10/18/2021] [Accepted: 10/19/2021] [Indexed: 10/19/2022]
Abstract
Microsatellites (SSRs) are tandem repeat sequences in eukaryote genomes, including plant cytoplasmic genomes. The mitochondrial genome (mtDNA) has been shown to vary in size, number, and distribution of SSRs among different plant groups. Thus, SSRs contribute with genomic diversity in mtDNAs. However, the abundance, distribution, and evolutionary significance of SSRs in mtDNA from a wide range of algae and plants have not been explored. In this study, the mtDNAs of 204 plant and algal species were investigated related to the presence of SSRs. The number of SSRs was positively correlated with genome size. Its distribution is dependent on plant and algal groups analyzed, although the cluster analysis indicates the conservation of some common motifs in algal and terrestrial plants that reflect common ancestry of groups. Many SSRs in coding and non-coding regions can be useful for molecular markers. Moreover, mitochondrial SSRs are highly abundant, representing an important source for natural or induced genetic variation, i.e., for biotechnological approaches that can modulate mtDNA gene regulation. Thus, this comparative study increases the understanding of the plant and algal SSR evolution and brings perspectives for further studies.
Collapse
|
4
|
Malhotra EV, Jain R, Bansal S, Mali SC, Sharma N, Agrawal A. Development of a new set of genic SSR markers in the genus Gentiana: in silico mining, characterization and validation. 3 Biotech 2021; 11:430. [PMID: 34527507 DOI: 10.1007/s13205-021-02969-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 08/19/2021] [Indexed: 11/26/2022] Open
Abstract
Gentiana is an important genus of around 360 medicinally important species, majority of which are not well characterized. Despite its importance, very few genomic resources are available for Gentiana L. Till date, the number of informative and robust simple sequence repeat (SSR)-based markers is limited and very few efforts have been made for their development. A set of robust, freely accessible and informative SSR markers for Gentiana is a pre-requisite for any molecular systematic as well as improvement studies in this group of pharmacologically valuable plants. In view of the importance of these plants, Expressed Sequence Tag (EST) sequences of 18 Gentiana species were surveyed for the development of a large set of non-redundant SSR markers. A total of 5808 perfect SSR with an average length of 17 bp and relative abundance of 214 loci/Mb were identified in the analysed 47,487 EST sequences using Krait software. Mapping of the ESTs resulted in gene ontology annotations of 49.14% of the sequences. Based on these perfect SSRs, 2902 primer pairs were designed, and 60 markers were randomly selected and validated on a set of Gentiana kurroo Royle accessions. Among the screened markers, 39 (65%) were found to be cross-species transferable. This is the first report of the largest set of functional, novel genic SSR markers in Gentiana, which will be a valuable resource for future characterization, genotype identification, conservation and genomic studies in the various species of this group of important medicinal plants. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s13205-021-02969-4.
Collapse
Affiliation(s)
- Era Vaidya Malhotra
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Rishu Jain
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Sangita Bansal
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Suresh Chand Mali
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Neelam Sharma
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Anuradha Agrawal
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| |
Collapse
|
5
|
Zhu L, Wu H, Li H, Tang H, Zhang L, Xu H, Jiao F, Wang N, Yang L. Short Tandem Repeats in plants: Genomic distribution and function prediction. ELECTRON J BIOTECHN 2021. [DOI: 10.1016/j.ejbt.2020.12.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
6
|
Singh A, Majeed A, Bhardwaj P. Transcriptome characterization and generation of marker resource for Himalayan vulnerable species, Ulmus wallichiana. Mol Biol Rep 2021; 48:721-729. [PMID: 33439411 DOI: 10.1007/s11033-021-06138-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Accepted: 01/05/2021] [Indexed: 02/02/2023]
Abstract
Ulmus wallichiana is a traditional medicinal plant listed as a vulnerable in the IUCN red list data. Genomic and transcriptomic resources for this species are lacking, hindering its genetic exploration. Further, no polymorphic marker resource is available for this species, thus limiting the elucidation of its underlying genetic diversity, which is a pre-requisite for its conservation. This study was therefore aimed to generate a functionally annotated transcriptomic resource and screen it for SSR regions. We used paired-end Illumina based RNAseq technology and trinity based de novo assembly approach to generate full length transcripts, which were screened for SSR regions and functionally annotated. Around 6.6 million raw reads were de novo assembled transcripts, which were clustered into 146,083 unigenes. 19,909 transcripts were provided with 3986 unique KEGG ids, 70,519 transcripts with 6621 unique Pfam domains, and 45,125 transcripts with 7302 unique INTERPRO domains. 1456 transcripts were identified as transcriptions factors (TFs). Further, 8868 unique GO terms were obtained for the unigenes. The transcripts mapped to 23,056 known pre-determined orthology clusters in the eggNOG database. A total of 16,570 SSRs were identified from the unigenes. Out of the 90 SSRs selected for characterization on 20 genotypes, 28 were polymorphic. Mean effective alleles (Ne) of 2.53, mean observed heterozygosity (Ho) of 0.77, and average polymorphic information content (PIC) of 0.57 were found. This study may facilitate the genetic exploration of this species. The polymorphic SSRs would prove useful to explore its genetic diversity patterns, required for its conservation.
Collapse
Affiliation(s)
- Amandeep Singh
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, VPO Ghudda, Distt, Bathinda, 151401, India
| | - Aasim Majeed
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, VPO Ghudda, Distt, Bathinda, 151401, India
| | - Pankaj Bhardwaj
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, VPO Ghudda, Distt, Bathinda, 151401, India.
| |
Collapse
|
7
|
Satyam R, Jha NK, Kar R, Jha SK, Sharma A, Kumar D, Nand P, Ruokolainen J, Kesari KK, Kamal MA. Deciphering the SSR incidences across viral members of Coronaviridae family. Chem Biol Interact 2020; 331:109226. [PMID: 32971122 PMCID: PMC7505113 DOI: 10.1016/j.cbi.2020.109226] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 08/05/2020] [Accepted: 08/11/2020] [Indexed: 12/19/2022]
Abstract
Presence of Simple Sequence Repeats (SSRs), both in genic and intergenic regions, have been widely studied in eukaryotes, prokaryotes, and viruses. In the current study, we undertook a survey to analyze the frequency and distribution of microsatellites or SSRs in multiple genomes of Coronaviridae members. We successfully identified 919 SSRs with length ≥12 bp across 55 reference genomes majority of which (838 SSRs) were found abundant in genic regions. The in-silico analysis further identified the preferential abundance of hexameric SSRs than any other size-based motif class. Our analysis shows that the genome size and GC content of the genome had a weak influence on SSR frequency and density. However, we find a positive correlation of SSRs GC content with genomic GC content. We also report relatively low abundances of all theoretically possible 501 repeat motif classes in all the genomes of Coronaviridae. The majority of SSRs were AT-rich. Overall, we see an underrepresentation of SSRs across the genomes of Coronaviridae. Besides, our integrative study highlights the presence of SSRs in ORF1ab (nsp3, nsp4, nsp5A_3CLpro and nsp5B_3CLpro, nsp6, nsp10, nsp12, nsp13, & nsp15 domains), S, ORF3a, ORF7a, N & 3' UTR regions of SARS-CoV-2 and harbours multiple mutations (3'UTR and ORF1ab SSRs serving as major mutational hotspots). This indicates the genic SSRs are under selection pressure against mutations that might alter the reading frame and at the same time responsible for rapid protein evolution. Our preliminary results indicate the significance of the limited repertoire of SSRs in the genomes of Coronaviridae.
Collapse
Affiliation(s)
- Rohit Satyam
- Department of Biotechnology, Noida Institute of Engineering and Technology (NIET), Greater Noida, India
| | - Niraj Kumar Jha
- Department of Biotechnology, School of Engineering & Technology (SET), Sharda University, Greater Noida, 201310, India.
| | - Rohan Kar
- Indian Institute of Management Ahmedabad (IIMA), Gujarat, 380015, India
| | - Saurabh Kumar Jha
- Department of Biotechnology, School of Engineering & Technology (SET), Sharda University, Greater Noida, 201310, India
| | - Ankur Sharma
- Department of Life Science, School of Basic Science & Research, Sharda University, Greater Noida, 201310, India
| | - Dhruv Kumar
- Amity Institute of Molecular Medicine and Stem Cell Research (AIMMSCR), Amity University Uttar Pradesh, Noida, 201313, India
| | - Parma Nand
- Department of Biotechnology, School of Engineering & Technology (SET), Sharda University, Greater Noida, 201310, India
| | | | | | - Mohammad Amjad Kamal
- King Fahd Medical Research Center, King Abdulaziz University, P. O. Box 80216, Jeddah, 21589, Saudi Arabia; Enzymoics, Novel Global Community Educational Foundation, 7 Peterlee Place, Hebersham, NSW, 2770, Australia
| |
Collapse
|
8
|
Biswas MK, Darbar JN, Borrell JS, Bagchi M, Biswas D, Nuraga GW, Demissew S, Wilkin P, Schwarzacher T, Heslop-Harrison JS. The landscape of microsatellites in the enset (Ensete ventricosum) genome and web-based marker resource development. Sci Rep 2020; 10:15312. [PMID: 32943659 PMCID: PMC7498607 DOI: 10.1038/s41598-020-71984-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 08/24/2020] [Indexed: 12/25/2022] Open
Abstract
Ensete ventricosum (Musaceae, enset) is an Ethiopian food security crop. To realize the potential of enset for rural livelihoods, further knowledge of enset diversity, genetics and genomics is required to support breeding programs and conservation. This study was conducted to explore the enset genome to develop molecular markers, genomics resources, and characterize enset landraces while giving insight into the organization of the genome. We identified 233 microsatellites (simple sequence repeats, SSRs) per Mbp in the enset genome, representing 0.28% of the genome. Mono- and di-nucleotide repeats motifs were found in a higher proportion than other classes of SSR-motifs. In total, 154,586 non-redundant enset microsatellite markers (EMM) were identified and 40 selected for primer development. Marker validation by PCR and low-cost agarose gel electrophoresis revealed that 92.5% were polymorphic, showing a high PIC (Polymorphism Information Content; 0.87) and expected heterozygosity (He = 0.79-0.82). In silico analysis of genomes of closely related species showed 46.86% of the markers were transferable among enset species and 1.90% were transferable to Musa. The SSRs are robust (with basic PCR methods and agarose gel electrophoresis), informative, and applicable in measuring enset diversity, genotyping, selection and potentially breeding. Enset SSRs are available in a web-based database at https://enset-project.org/EnMom@base.html (or https://enset.aau.edu.et/index.html , downloadable from Figshare).
Collapse
Affiliation(s)
- Manosh Kumar Biswas
- Department of Genetics and Genome Biology, University of Leicester, Leicester, LE1 7RH, UK.
| | - Jaypal N Darbar
- Department of Genetics and Genome Biology, University of Leicester, Leicester, LE1 7RH, UK
| | | | - Mita Bagchi
- Department of Genetics and Genome Biology, University of Leicester, Leicester, LE1 7RH, UK
| | - Dhiman Biswas
- Department of Computer Science and Engineering, Maulana Abul Kalam Azad University of Technology, Kolkata, West Bengal, India
| | - Gizachew Woldesenbet Nuraga
- Department of Genetics and Genome Biology, University of Leicester, Leicester, LE1 7RH, UK.,Institute of Biotechnology, Addis Ababa University, Addis Ababa, Ethiopia
| | - Sebsebe Demissew
- Department of Plant Biology and Biodiversity Management, Addis Ababa University, Addis Ababa, Ethiopia
| | - Paul Wilkin
- Royal Botanic Gardens, Kew, Richmond, TW9 3AE, Surrey, UK
| | - Trude Schwarzacher
- Department of Genetics and Genome Biology, University of Leicester, Leicester, LE1 7RH, UK.,South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, People's Republic of China
| | - J S Heslop-Harrison
- Department of Genetics and Genome Biology, University of Leicester, Leicester, LE1 7RH, UK. .,South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, People's Republic of China.
| |
Collapse
|
9
|
Sharma H, Bhandawat A, Kumar P, Rahim MS, Parveen A, Kumar P, Madhawan A, Rishi V, Roy J. Development and characterization of bZIP transcription factor based SSRs in wheat. Gene 2020; 756:144912. [PMID: 32574760 DOI: 10.1016/j.gene.2020.144912] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 05/31/2020] [Accepted: 06/17/2020] [Indexed: 02/08/2023]
Abstract
Assessment of existing diversity is the key for germplasm conservation and crop improvement. Wheat (Triticum aestivum L.) is among the most important cereal crop and consumed by two billion world's populations. DNA-based markers are predominantly used for diversity characterization because they are easy to develop and not influenced by environment. Among them microsatellites (simple sequence repeats, SSRs) are most suitable due to their genome-wide distribution, hypervariability and reproducibility for their applications in diversity, genetic improvement, and molecular breeding. bZIP transcription factors play major roles in plants in light and stress signalling, seed development, and defence. A total of 846 SSRs were identified from 370 wheat cDNA sequences and a sub-set of 35 polymorphic TabZIPMS (TriticumaestivumbZIP MicroSatellites) was used for diversity and genetic structure analysis of 92 Indian wheat varieties and related species. 114 SSR variants ranging from 2 to 5 per SSR locus were detected for 35 SSRs in the varieties. Average polymorphic information content (PIC) and observed heterozygosity was found to be 0.135 and 0.838, respectively. Thirty-four SSRs showed cross-transferability into different related species. Combined Bayesian model and Jaccard's similarity based genetic clustering analysis revealed two clusters of 80 bread wheat varieties and one separate cluster of related species. In this study, a total 35 novel bZIP-derived SSRs were identified in a set 370 bZIP genes and shown high polymorphism and cross-species transferability in wheat. The findings provide resources for future utilization in genetic resource conservation, trait introgression, breeding and varietal development.
Collapse
Affiliation(s)
- Himanshu Sharma
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Abhishek Bhandawat
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Pankaj Kumar
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Mohammed Saba Rahim
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Afsana Parveen
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Prashant Kumar
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Akansha Madhawan
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Vikas Rishi
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Joy Roy
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India.
| |
Collapse
|
10
|
Uncu AO, Uncu AT. High-throughput simple sequence repeat (SSR) mining saturates the carrot (Daucus carota L.) genome with chromosome-anchored markers. BIOTECHNOL BIOTEC EQ 2019. [DOI: 10.1080/13102818.2019.1701551] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Affiliation(s)
- Ayse Ozgur Uncu
- Department of Biotechnology, Faculty of Science, Necmettin Erbakan University, Meram, Turkey
| | - Ali Tevfik Uncu
- Department of Molecular Biology & Genetics, Faculty of Science, Necmettin Erbakan University, Meram, Turkey
| |
Collapse
|
11
|
Zhang Z, Xie W, Zhao Y, Zhang J, Wang N, Ntakirutimana F, Yan J, Wang Y. EST-SSR marker development based on RNA-sequencing of E. sibiricus and its application for phylogenetic relationships analysis of seventeen Elymus species. BMC PLANT BIOLOGY 2019; 19:235. [PMID: 31159732 PMCID: PMC6547490 DOI: 10.1186/s12870-019-1825-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Accepted: 05/09/2019] [Indexed: 05/20/2023]
Abstract
BACKGROUND Elymus L. is the largest genus in the tribe Triticeae Dumort., encompassing approximately 150 polyploid perennial species widely distributed in the temperate regions of the world. It is considered to be an important gene pool for improving cereal crops. However, a shortage of molecular marker limits the efficiency and accuracy of genetic breeding for Elymus species. High-throughput transcriptome sequencing data is essential for gene discovery and molecular marker development. RESULTS We obtained the transcriptome dataset of E. sibiricus, the type species of the genus Elymus, and identified a total of 8871 putative EST-SSRs from 6685 unigenes. Trinucleotides were the dominant repeat motif (4760, 53.66%), followed by dinucleotides (1993, 22.47%) and mononucleotides (1876, 21.15%). The most dominant trinucleotide repeat motif was CCG/CGG (1119, 23.5%). Sequencing of PCR products showed that the sequenced alleles from different Elymus species were homologous to the original SSR locus from which the primer was designed. Different types of tri-repeats as abundant SSR motifs were observed in repeat regions. Two hundred EST-SSR primer pairs were designed and selected to amplify ten DNA samples of Elymus species. Eighty-seven pairs of primer (43.5%) generated clear and reproducible bands with expected size, and showed good transferability across different Elymus species. Finally, thirty primer pairs successfully amplified ninety-five accessions of seventeen Elymus species, and detected significant amounts of polymorphism. In general, hexaploid Elymus species with genomes StStHHYY had a relatively higher level of genetic diversity (H = 0.219, I = 0.330, %P = 63.7), while tetraploid Elymus species with genomes StStYY had low level of genetic diversity (H = 0.182, I = 0.272, %P = 50.4) in the study. The cluster analysis showed that all ninety-five accessions were clustered into three major clusters. The accessions were grouped mainly according to their genomic components and origins. CONCLUSIONS This study demonstrated that transcriptome sequencing is a fast and cost-effective approach to molecular marker development. These EST-SSR markers developed in this study are valuable tools for genetic diversity, evolutionary, and molecular breeding in E. sibiricus, and other Elymus species.
Collapse
Affiliation(s)
- Zongyu Zhang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Wengang Xie
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Yongqiang Zhao
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Junchao Zhang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Na Wang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Fabrice Ntakirutimana
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Jiajun Yan
- Sichuan Academy of Grassland Science, Chengdu, Sichuan 611731 People’s Republic of China
| | - Yanrong Wang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| |
Collapse
|
12
|
Characterization and expression profiles of muscle transcriptome in Schizothoracine fish, Schizothorax prenanti. Gene 2019; 685:156-163. [DOI: 10.1016/j.gene.2018.10.070] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Revised: 10/17/2018] [Accepted: 10/24/2018] [Indexed: 01/13/2023]
|
13
|
Das R, Arora V, Jaiswal S, Iquebal MA, Angadi UB, Fatma S, Singh R, Shil S, Rai A, Kumar D. PolyMorphPredict: A Universal Web-Tool for Rapid Polymorphic Microsatellite Marker Discovery From Whole Genome and Transcriptome Data. FRONTIERS IN PLANT SCIENCE 2019; 9:1966. [PMID: 30687361 PMCID: PMC6337687 DOI: 10.3389/fpls.2018.01966] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 12/18/2018] [Indexed: 06/09/2023]
Abstract
Microsatellites are ubiquitously distributed, polymorphic repeat sequence valuable for association, selection, population structure and identification. They can be mined by genomic library, probe hybridization and sequencing of selected clones. Such approach has many limitations like biased hybridization and selection of larger repeats. In silico mining of polymorphic markers using data of various genotypes can be rapid and economical. Available tools lack in some or other aspects like: targeted user defined primer generation, polymorphism discovery using multiple sequence, size and number limits of input sequence, no option for primer generation and e-PCR evaluation, transferability, lack of complete automation and user-friendliness. They also lack the provision to evaluate published primers in e-PCR mode to generate additional allelic data using re-sequenced data of various genotypes for judicious utilization of previously generated data. We developed the tool (PolyMorphPredict) using Perl, R, Java and launched at Apache which is available at http://webtom.cabgrid.res.in/polypred/. It mines microsatellite loci and computes primers from genome/transcriptome data of any species. It can perform e-PCR using published primers for polymorphism discovery and across species transferability of microsatellite loci. Present tool has been evaluated using five species of different genome size having 21 genotypes. Though server is equipped with genomic data of three species for test run with gel simulation, but can be used for any species. Further, polymorphism predictability has been validated using in silico and in vitro PCR of four rice genotypes. This tool can accelerate the in silico microsatellite polymorphism discovery in re-sequencing projects of any species of plant and animal for their diversity estimation along with variety/breed identification, population structure, MAS, QTL and gene discovery, traceability, parentage testing, fungal diagnostics and genome finishing.
Collapse
Affiliation(s)
- Ritwika Das
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Vasu Arora
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sarika Jaiswal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - MA Iquebal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - UB Angadi
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Samar Fatma
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Rakesh Singh
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Sandip Shil
- Research Center, ICAR-Central Plantation Crops Research Institute, Jalpaiguri, India
| | - Anil Rai
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Dinesh Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| |
Collapse
|
14
|
Wang Y, Li N, Chen T, Gong Y. Generation and characterization of expressed sequence tags (ESTs) from coralloid root cDNA library of Cycas debaoensis. PLANT DIVERSITY 2018; 40:245-252. [PMID: 30740571 PMCID: PMC6224666 DOI: 10.1016/j.pld.2018.09.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Revised: 08/31/2018] [Accepted: 09/03/2018] [Indexed: 06/09/2023]
Abstract
A normalized full-length cDNA library was constructed from the coralloid roots of Cycas debaoensis by the DSN (duplex-specific nuclease) normalization method combined with the SMART (Switching Mechanism At 5' end of the RNA Transcript) technique. The titer of the original cDNA library was about 1.5 × 106 cfu·mL-1 and the average insertion size was about 1 kb with a high recombination rate (97%). The 5011 high-quality expressed sequence tags (ESTs) were obtained from 5393 randomly picked cDNA clones. Clustering and assembly of ESTs resulted in 2984 unique sequences, consisting of 618 contigs and 2366 singlets. EST sequence annotation revealed that 2333 and 1901 unigenes were functionally annotated in the NCBI non-redundant database and Swiss-Prot protein database, respectively. Functional analysis demonstrated that 1495 (50.1%) unigenes were associated with 4082 Gene Ontology (GO) terms. A total of 847 unigenes were grouped into 22 Cluster of Orthologous Groups (COG) functional categories. Based on the EST dataset, 22 ESTs that encoded putative receptor-like protein kinase (RLK) genes were screened. Furthermore, a total of 94 simple sequence repeats (SSRs) were discovered, of which 20 loci were successfully amplified in C. debaoensis. This study is the first EST analysis for the coralloid roots of C. debaoensis and provides a valuable genomic resource for novel gene discovery, gene expression and comparative genomics, conservation and management studies as well as applications in C. debaoensis and related cycad species.
Collapse
Affiliation(s)
- Yunhua Wang
- Shenzhen Key Laboratory of Southern Subtropical Plant Diversity, Fairylake Botanical Garden, Shenzhen & Chinese Academy of Sciences, Shenzhen, 518004, Guangdong, China
| | | | | | | |
Collapse
|
15
|
Qi WH, Jiang XM, Yan CC, Zhang WQ, Xiao GS, Yue BS, Zhou CQ. Distribution patterns and variation analysis of simple sequence repeats in different genomic regions of bovid genomes. Sci Rep 2018; 8:14407. [PMID: 30258087 PMCID: PMC6158176 DOI: 10.1038/s41598-018-32286-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 09/04/2018] [Indexed: 01/23/2023] Open
Abstract
As the first examination of distribution, guanine-cytosine (GC) pattern, and variation analysis of microsatellites (SSRs) in different genomic regions of six bovid species, SSRs displayed nonrandomly distribution in different regions. SSR abundances are much higher in the introns, transposable elements (TEs), and intergenic regions compared to the 3′-untranslated regions (3′UTRs), 5′UTRs and coding regions. Trinucleotide perfect SSRs (P-SSRs) were the most frequent in the coding regions, whereas, mononucleotide P-SSRs were the most in the introns, 3′UTRs, TEs, and intergenic regions. Trifold P-SSRs had more GC-contents in the 5′UTRs and coding regions than that in the introns, 3′UTRs, TEs, and intergenic regions, whereas mononucleotide P-SSRs had the least GC-contents in all genomic regions. The repeat copy numbers (RCN) of the same mono- to hexanucleotide P-SSRs showed significantly different distributions in different regions (P < 0.01). Except for the coding regions, mononucleotide P-SSRs had the most RCNs, followed by the pattern: di- > tri- > tetra- > penta- > hexanucleotide P-SSRs in the same regions. The analysis of coefficient of variability (CV) of SSRs showed that the CV variations of RCN of the same mono- to hexanucleotide SSRs were relative higher in the intronic and intergenic regions, followed by the CV variation of RCN in the TEs, and the relative lower was in the 5′UTRs, 3′UTRs, and coding regions. Wide SSR analysis of different genomic regions has helped to reveal biological significances of their distributions.
Collapse
Affiliation(s)
- Wen-Hua Qi
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Xue-Mei Jiang
- College of Environmental and Chemistry Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Chao-Chao Yan
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Wan-Qing Zhang
- College of Life Sciences, Sichuan Agricultural University, Ya'an, Sichuan Province, 625014, P. R. China
| | - Guo-Sheng Xiao
- College of Biology and Food Engineering, Chongqing Three Gorges University, Chongqing, 404100, P. R. China
| | - Bi-Song Yue
- Key Laboratory of Bio-resources and Eco-environment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu, 610064, P. R. China
| | - Cai-Quan Zhou
- Key Laboratory of Southwest China Wildlife Resources Conservation (Ministry of Education), China West Normal University, Nanchong, 637009, P. R. China.
| |
Collapse
|
16
|
Sen S, Dehury B, Sahu J, Rathi S, Yadav RNS. Mining and comparative survey of EST-SSR markers among members of Euphorbiaceae family. Mol Biol Rep 2018; 45:453-468. [PMID: 29626317 DOI: 10.1007/s11033-018-4181-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 04/02/2018] [Indexed: 11/30/2022]
Abstract
Euphorbiaceae represents flowering plants family of tropical and sub-tropical region rich in secondary metabolites of economic importance. To understand and assess the genetic makeup among the members, this study was undertaken to characterize and compare SSR markers from publicly available ESTs and GSSs of nine selected species of the family. Mining of SSRs was performed by MISA, primer designing by Primer3, while functional annotation, gene ontology (GO) and enrichment analysis were performed by Blast2GO. A total 12,878 number of SSRs were detected from 101,701 number of EST sequences. SSR density ranged from 1 SSR/3.22 kb to 1 SSR/15.65 kb. A total of 1873 primer pairs were designed for the annotated SSR-Contigs. About 77.07% SSR-ESTs could be assigned a significant match to the protein database. 3037 unique SSR-FDM were assigned and IPR003657 (WRKY Domain) was found to be the most dominant FDM among the members. 1810 unique GO terms obtained were further subjected to enrichment analysis to obtain 513 statistically significant GO terms mapped to the SSR containing ESTs. Most frequent enriched GO terms were, GO:0003824 for molecular function, GO:0006350 for biological process and GO:0005886 for cellular component, justifying the richness of defensive secondary metabolites and phytomedicine within the family. The results from this study provides tangible insight to genetic make-up and distribution of SSRs. Functional annotation corresponded many genes of unknown functions which may be considered as novel genes or genes responsible for stress specific secondary metabolites. Further studies are required to understand stress specific genes accountable for leveraging the synthesis of secondary metabolites.
Collapse
Affiliation(s)
- Surojit Sen
- Centre for Biotechnology and Bioinformatics, Dibrugarh University, Dibrugarh, Assam, India.
| | - Budheswar Dehury
- Biomedical Informatics Centre, ICMR-Regional Medical Research Centre, Nalco Square, Chandrasekharpur, Bhubaneswar, Odisha, 751023, India
| | - Jagajjit Sahu
- Distributed Information Center, Department of Agricultural Biotechnology, Assam Agricultural University, Jorhat, Assam, 785013, India
| | - Sunayana Rathi
- Department of Biochemistry and Agricultural Chemistry, Assam Agricultural University, Jorhat, Assam, 785013, India
| | | |
Collapse
|
17
|
Liu Z, Tao J, Luo Y. Development and characterization of polymorphic genomic-SSR markers in Asian long-horned beetle (Anoplophora glabripennis). BULLETIN OF ENTOMOLOGICAL RESEARCH 2017; 107:749-755. [PMID: 28390443 DOI: 10.1017/s0007485317000268] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
The Asian long-horned beetle (ALB), Anoplophora glabripennis (Motschulsky) (Coleoptera: Cerambycidae: Lamiinae), is a wood-borer and polyphagous xylophage that is native to Asia. It infests and seriously harms healthy trees, and therefore is a cause for considerable environmental concern. The analysis of population genetic structure of ALB and sibling species Anoplophora nobilis (Ganglbauer) will not only help to clarify the relationship between environmental variables and mechanisms of speciation, but also will enhance our understanding of evolutionary processes. However, the known genetic markers, particularly microsatellites, are limited for this species. SSRLocator software was used to analyze the distribution and frequencies of genomic simple sequence repeat (SSR), to infer the basic characteristics of repeat motifs, and to design primers. We developed SSR loci of 2-6 repeated units, including 10,650 perfect SSRs, and found 140 types of repeat motifs. A total of 2621 SSR markers were discovered in ALB whole-genome shotgun sequences. 48 pairs of SSR primers were randomly chosen from 2621 SSR markers, and half of these 48 pairs were polymorphic containing 4 di-, 7 tri-, 2 tetra-, and 11-hexamer SSRs. Four populations test the effectiveness of the primers. These results suggest that our method for whole-genome SSR screening is feasible and efficient, and the SSR markers developed in this study are suitable for further population genetics studies of ALB. Moreover, they may also be useful for the development of SSRs for other Coleoptera.
Collapse
Affiliation(s)
- Zhaoyang Liu
- Key Laboratory for Silviculture and Conservation of Ministry of Education, College of Forestry, Beijing Forestry University, No.35 Tsinghua East Road, Haidian District, Beijing 100083, People's Republic of China
| | - Jing Tao
- Key Laboratory for Silviculture and Conservation of Ministry of Education, College of Forestry, Beijing Forestry University, No.35 Tsinghua East Road, Haidian District, Beijing 100083, People's Republic of China
| | - Youqing Luo
- Key Laboratory for Silviculture and Conservation of Ministry of Education, College of Forestry, Beijing Forestry University, No.35 Tsinghua East Road, Haidian District, Beijing 100083, People's Republic of China
| |
Collapse
|
18
|
Distinct patterns of simple sequence repeats and GC distribution in intragenic and intergenic regions of primate genomes. Aging (Albany NY) 2017; 8:2635-2654. [PMID: 27644032 PMCID: PMC5191860 DOI: 10.18632/aging.101025] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 08/22/2016] [Indexed: 01/23/2023]
Abstract
As the first systematic examination of simple sequence repeats (SSRs) and guanine-cytosine (GC) distribution in intragenic and intergenic regions of ten primates, our study showed that SSRs and GC displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation. Our results suggest that the majority of SSRs are distributed in non-coding regions, such as the introns, TEs, and intergenic regions. In these primates, trinucleotide perfect (P) SSRs were the most abundant repeats type in the 5'UTRs and CDSs, whereas, mononucleotide P-SSRs were the most in the intron, 3'UTRs, TEs, and intergenic regions. The GC-contents varied greatly among different intragenic and intergenic regions: 5'UTRs > CDSs > 3'UTRs > TEs > introns > intergenic regions, and high GC-content was frequently distributed in exon-rich regions. Our results also showed that in the same intragenic and intergenic regions, the distribution of GC-contents were great similarity in the different primates. Tri- and hexanucleotide P-SSRs had the most GC-contents in the 5'UTRs and CDSs, whereas mononucleotide P-SSRs had the least GC-contents in the six genomic regions of these primates. The most frequent motifs for different length varied obviously with the different genomic regions.
Collapse
|
19
|
Lopez L, Wolf EM, Pires JC, Edger PP, Koch MA. Molecular Resources from Transcriptomes in the Brassicaceae Family. FRONTIERS IN PLANT SCIENCE 2017; 8:1488. [PMID: 28900436 PMCID: PMC5581910 DOI: 10.3389/fpls.2017.01488] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2017] [Accepted: 08/11/2017] [Indexed: 06/07/2023]
Abstract
The rapidly falling costs and the increasing availability of large DNA sequence data sets facilitate the fast and affordable mining of large molecular markers data sets for comprehensive evolutionary studies. The Brassicaceae (mustards) are an important species-rich family in the plant kingdom with taxa distributed worldwide and a complex evolutionary history. We performed Simple Sequence Repeats (SSRs) mining using de novo assembled transcriptomes from 19 species across the Brassicaceae in order to study SSR evolution and provide comprehensive sets of molecular markers for genetic studies within the family. Moreover, we selected the genus Cochlearia to test the transferability and polymorphism of these markers among species. Additionally, we annotated Cochlearia pyrenaica transcriptome in order to identify the position of each of the mined SSRs. While we introduce a new set of tools that will further enable evolutionary studies across the Brassicaceae, we also discuss some broader aspects of SSR evolution. Overall, we developed 2012 ready-to-use SSR markers with their respective primers in 19 Brassicaceae species and a high quality annotated transcriptome for C. pyrenaica. As indicated by our transferability test with the genus Cochlearia these SSRs are transferable to species within the genus increasing exponentially the number of targeted species. Also, our polymorphism results showed substantial levels of variability for these markers. Finally, despite its complex evolutionary history, SSR evolution across the Brassicaceae family is highly conserved and we found no deviation from patterns reported in other Angiosperms.
Collapse
Affiliation(s)
- Lua Lopez
- Biodiversity and Plant Systematics, Centre of Organismal Studies, University of HeidelbergHeidelberg, Germany
| | - Eva M. Wolf
- Biodiversity and Plant Systematics, Centre of Organismal Studies, University of HeidelbergHeidelberg, Germany
| | - J. Chris Pires
- Division of Biological Sciences, University of MissouriColumbia, MO, United States
| | - Patrick P. Edger
- Department of Horticulture, Michigan State UniversityEast Lansing, MI, United States
- Ecology, Evolutionary Biology and Behavior, Michigan State UniversityEast Lansing, MI, United States
| | - Marcus A. Koch
- Biodiversity and Plant Systematics, Centre of Organismal Studies, University of HeidelbergHeidelberg, Germany
| |
Collapse
|
20
|
Cavaiuolo M, Cocetta G, Spadafora ND, Müller CT, Rogers HJ, Ferrante A. Gene expression analysis of rocket salad under pre-harvest and postharvest stresses: A transcriptomic resource for Diplotaxis tenuifolia. PLoS One 2017; 12:e0178119. [PMID: 28558066 PMCID: PMC5448768 DOI: 10.1371/journal.pone.0178119] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 05/07/2017] [Indexed: 11/25/2022] Open
Abstract
Diplotaxis tenuifolia L. is of important economic value in the fresh-cut industry for its nutraceutical and sensorial properties. However, information on the molecular mechanisms conferring tolerance of harvested leaves to pre- and postharvest stresses during processing and shelf-life have never been investigated. Here, we provide the first transcriptomic resource of rocket by de novo RNA sequencing assembly, functional annotation and stress-induced expression analysis of 33874 transcripts. Transcriptomic changes in leaves subjected to commercially-relevant pre-harvest (salinity, heat and nitrogen starvation) and postharvest stresses (cold, dehydration, dark, wounding) known to affect quality and shelf-life were analysed 24h after stress treatment, a timing relevant to subsequent processing of salad leaves. Transcription factors and genes involved in plant growth regulator signaling, autophagy, senescence and glucosinolate metabolism were the most affected by the stresses. Hundreds of genes with unknown function but uniquely expressed under stress were identified, providing candidates to investigate stress responses in rocket. Dehydration and wounding had the greatest effect on the transcriptome and different stresses elicited changes in the expression of genes related to overlapping groups of hormones. These data will allow development of approaches targeted at improving stress tolerance, quality and shelf-life of rocket with direct applications in the fresh-cut industries.
Collapse
Affiliation(s)
- Marina Cavaiuolo
- Department of Agricultural and Environmental Sciences, Università degli Studi di Milano, Milano, Italy
| | - Giacomo Cocetta
- Department of Agricultural and Environmental Sciences, Università degli Studi di Milano, Milano, Italy
| | | | | | - Hilary J. Rogers
- School of Biosciences, Cardiff University, Cardiff, United Kingdom
| | - Antonio Ferrante
- Department of Agricultural and Environmental Sciences, Università degli Studi di Milano, Milano, Italy
- * E-mail:
| |
Collapse
|
21
|
Sablok G, Pérez-Pulido AJ, Do T, Seong TY, Casimiro-Soriguer CS, La Porta N, Ralph PJ, Squartini A, Muñoz-Merida A, Harikrishna JA. PlantFuncSSR: Integrating First and Next Generation Transcriptomics for Mining of SSR-Functional Domains Markers. FRONTIERS IN PLANT SCIENCE 2016; 7:878. [PMID: 27446111 PMCID: PMC4922199 DOI: 10.3389/fpls.2016.00878] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 06/03/2016] [Indexed: 05/30/2023]
Abstract
Analysis of repetitive DNA sequence content and divergence among the repetitive functional classes is a well-accepted approach for estimation of inter- and intra-generic differences in plant genomes. Among these elements, microsatellites, or Simple Sequence Repeats (SSRs), have been widely demonstrated as powerful genetic markers for species and varieties discrimination. We present PlantFuncSSRs platform having more than 364 plant species with more than 2 million functional SSRs. They are provided with detailed annotations for easy functional browsing of SSRs and with information on primer pairs and associated functional domains. PlantFuncSSRs can be leveraged to identify functional-based genic variability among the species of interest, which might be of particular interest in developing functional markers in plants. This comprehensive on-line portal unifies mining of SSRs from first and next generation sequencing datasets, corresponding primer pairs and associated in-depth functional annotation such as gene ontology annotation, gene interactions and its identification from reference protein databases. PlantFuncSSRs is freely accessible at: http://www.bioinfocabd.upo.es/plantssr.
Collapse
Affiliation(s)
- Gaurav Sablok
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology, SydneyNSW, Australia
| | - Antonio J. Pérez-Pulido
- Centro Andaluz de Biología del Desarrollo (CABD-CSIC), Universidad Pablo de OlavideSevilla, Spain
| | - Thac Do
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology, SydneyNSW, Australia
| | - Tan Y. Seong
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of MalayaKuala Lumpur, Malaysia
| | | | - Nicola La Porta
- Department of Sustainable Agro-Ecosystems and Bioresources, Research and Innovation Centre, Fondazione Edmund MachTrento, Italy
- MountFOR Project Centre, European Forest InstituteTrento, Italy
- Consiglio Nazionale delle Ricerche, Istituto per la Valorizzazione del Legno e delle Specie ArboreeFlorence, Italy
| | - Peter J. Ralph
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology, SydneyNSW, Australia
| | - Andrea Squartini
- Department of Agronomy, Food, Natural Resources, Animals and Environment, University of PadovaPadova, Italy
| | - Antonio Muñoz-Merida
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO, Universidade do PortoVila do Conde, Portugal
| | - Jennifer A. Harikrishna
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of MalayaKuala Lumpur, Malaysia
| |
Collapse
|
22
|
Assessment of Functional EST-SSR Markers (Sugarcane) in Cross-Species Transferability, Genetic Diversity among Poaceae Plants, and Bulk Segregation Analysis. GENETICS RESEARCH INTERNATIONAL 2016; 2016:7052323. [PMID: 27340568 PMCID: PMC4908241 DOI: 10.1155/2016/7052323] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Revised: 04/13/2016] [Accepted: 04/26/2016] [Indexed: 11/17/2022]
Abstract
Expressed sequence tags (ESTs) are important resource for gene discovery, gene expression and its regulation, molecular marker development, and comparative genomics. We procured 10000 ESTs and analyzed 267 EST-SSRs markers through computational approach. The average density was one SSR/10.45 kb or 6.4% frequency, wherein trinucleotide repeats (66.74%) were the most abundant followed by di- (26.10%), tetra- (4.67%), penta- (1.5%), and hexanucleotide (1.2%) repeats. Functional annotations were done and after-effect newly developed 63 EST-SSRs were used for cross transferability, genetic diversity, and bulk segregation analysis (BSA). Out of 63 EST-SSRs, 42 markers were identified owing to their expansion genetics across 20 different plants which amplified 519 alleles at 180 loci with an average of 2.88 alleles/locus and the polymorphic information content (PIC) ranged from 0.51 to 0.93 with an average of 0.83. The cross transferability ranged from 25% for wheat to 97.22% for Schlerostachya, with an average of 55.86%, and genetic relationships were established based on diversification among them. Moreover, 10 EST-SSRs were recognized as important markers between bulks of pooled DNA of sugarcane cultivars through BSA. This study highlights the employability of the markers in transferability, genetic diversity in grass species, and distinguished sugarcane bulks.
Collapse
|
23
|
Tanhuanpää P, Erkkilä M, Kalendar R, Schulman AH, Manninen O. Assessment of genetic diversity in Nordic timothy ( Phleum pratense L .). Hereditas 2016; 153:5. [PMID: 28096767 PMCID: PMC5226114 DOI: 10.1186/s41065-016-0009-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 04/19/2016] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND Timothy (Phleum pratense L.), a cool-season hexaploid perennial, is the most important forage grass species in Nordic countries. Earlier analyses of genetic diversity in a collection of 96 genebank accessions of timothy with SSR markers demonstrated high levels of diversity but could not resolve population structure. Therefore, we examined a subset of 51 accessions with REMAP markers, which are based on retrotransposons, and compared the diversity results with those obtained with SSR markers. RESULTS Using four primer combinations, 533 REMAP markers were analyzed, compared with 464 polymorphic alleles in the 13 SSR loci previously. The average marker index, which describes information obtained per experiment (per primer combination or locus) was over six times higher with REMAPs. Most of the variation found was within accessions, with somewhat less, 89 %, for REMAPs, than for SSR, with 93 %. CONCLUSIONS SSRs revealed differences in the level of diversity slightly better than REMAPs but neither marker type could reveal any clear clustering of accessions based on countries, vegetation zones, or different cultivar types. In our study, reliable evaluation of SSR allele dosages was not possible, so each allele had to be handled as a dominant marker. SSR and REMAP, which report from different mechanisms of generating genetic diversity and from different genomic regions, together indicate a lack of population structure. Taken together, this likely reflects the outcrossing and hexaploid nature of timothy rather than failures of either marker system.
Collapse
Affiliation(s)
- Pirjo Tanhuanpää
- Green Technology, Natural Resources Institute Finland (Luke), Myllytie 1, FI-31600 Jokioinen, Finland
| | - Maria Erkkilä
- Internal Expert Services, Natural Resources Institute Finland (Luke), Humppilantie 14, FI-31600 Jokioinen, Finland
| | - Ruslan Kalendar
- Internal Expert Services, Natural Resources Institute Finland (Luke), Humppilantie 14, FI-31600 Jokioinen, Finland
| | - Alan Howard Schulman
- Green Technology, Natural Resources Institute Finland (Luke), Myllytie 1, FI-31600 Jokioinen, Finland ; Luke/BI Plant Genome Dynamics Laboratory, Institute of Biotechnology, Viikki Biocenter, University of Helsinki, P.O. Box 56, Viikinkaari 1, FI-00014 Helsinki, Finland
| | - Outi Manninen
- Boreal Plant Breeding Ltd, Myllytie 10, FI-31600 Jokioinen, Finland
| |
Collapse
|
24
|
Ranade SS, Lin YC, Van de Peer Y, García-Gil MR. Comparative in silico analysis of SSRs in coding regions of high confidence predicted genes in Norway spruce (Picea abies) and Loblolly pine (Pinus taeda). BMC Genet 2015; 16:149. [PMID: 26706685 PMCID: PMC4691297 DOI: 10.1186/s12863-015-0304-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Accepted: 12/10/2015] [Indexed: 11/24/2022] Open
Abstract
Background Microsatellites or simple sequence repeats (SSRs) are DNA sequences consisting of 1–6 bp tandem repeat motifs present in the genome. SSRs are considered to be one of the most powerful tools in genetic studies. We carried out a comparative study of perfect SSR loci belonging to class I (≥20) and class II (≥12 and <20 bp) types located in coding regions of high confidence genes in Picea abies and Pinus taeda. SSRLocator was used to retrieve SSRs from the full length CDS of predicted genes in both species. Results Trimers were the most abundant motifs in class I followed by hexamers in Picea abies, while trimers and hexamers were equally abundant in Pinus taeda class I SSRs. Hexamers were most frequent within class II SSRs followed by trimers, in both species. Although the frequency of genes containing SSRs was slightly higher in Pinus taeda, SSR counts per Mbp for class I was similar in both species (P-value = 0.22); while for class II SSRs, it was significantly higher in Picea abies (P-value = 0.00009). AT-rich motifs were higher in abundance than the GC-rich motifs, within class II SSRs in both the species (P-values = 10−9 and 0). With reference to class I SSRs, AT-rich and GC-rich motifs were detected with equal frequency in Pinus taeda (P-value = 0.24); while in Picea abies, GC-rich motifs were detected with higher frequency than the AT-rich motifs (P-value = 0.0005). Conclusions Our study gives a comparative overview of the genome SSRs composition based on high confidence genes in the two recently sequenced and economically important conifers and, also provides information on functional molecular markers that can be applied in genetic studies in Pinus and Picea species. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0304-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sonali Sachin Ranade
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre, Swedish University of Agricultural Sciences, SE-901 83, Umeå, Sweden.
| | - Yao-Cheng Lin
- Department of Plant Systems Biology (VIB) and Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, 9052, Ghent, Belgium.
| | - Yves Van de Peer
- Department of Plant Systems Biology (VIB) and Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, 9052, Ghent, Belgium. .,Genomics Research Institute, University of Pretoria, Hatfield Campus, Pretoria, 0028, South Africa. .,Bioinformatics Institute Ghent, Ghent University, 9052, Ghent, Belgium.
| | - María Rosario García-Gil
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre, Swedish University of Agricultural Sciences, SE-901 83, Umeå, Sweden.
| |
Collapse
|
25
|
Qin Z, Wang Y, Wang Q, Li A, Hou F, Zhang L. Evolution Analysis of Simple Sequence Repeats in Plant Genome. PLoS One 2015; 10:e0144108. [PMID: 26630570 PMCID: PMC4668000 DOI: 10.1371/journal.pone.0144108] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2015] [Accepted: 11/13/2015] [Indexed: 01/30/2023] Open
Abstract
Simple sequence repeats (SSRs) are widespread units on genome sequences, and play many important roles in plants. In order to reveal the evolution of plant genomes, we investigated the evolutionary regularities of SSRs during the evolution of plant species and the plant kingdom by analysis of twelve sequenced plant genome sequences. First, in the twelve studied plant genomes, the main SSRs were those which contain repeats of 1–3 nucleotides combination. Second, in mononucleotide SSRs, the A/T percentage gradually increased along with the evolution of plants (except for P. patens). With the increase of SSRs repeat number the percentage of A/T in C. reinhardtii had no significant change, while the percentage of A/T in terrestrial plants species gradually declined. Third, in dinucleotide SSRs, the percentage of AT/TA increased along with the evolution of plant kingdom and the repeat number increased in terrestrial plants species. This trend was more obvious in dicotyledon than monocotyledon. The percentage of CG/GC showed the opposite pattern to the AT/TA. Forth, in trinucleotide SSRs, the percentages of combinations including two or three A/T were in a rising trend along with the evolution of plant kingdom; meanwhile with the increase of SSRs repeat number in plants species, different species chose different combinations as dominant SSRs. SSRs in C. reinhardtii, P. patens, Z. mays and A. thaliana showed their specific patterns related to evolutionary position or specific changes of genome sequences. The results showed that, SSRs not only had the general pattern in the evolution of plant kingdom, but also were associated with the evolution of the specific genome sequence. The study of the evolutionary regularities of SSRs provided new insights for the analysis of the plant genome evolution.
Collapse
Affiliation(s)
- Zhen Qin
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
| | - Yanping Wang
- Shandong Key Laboratory of Animal Disease Control and Breeding/Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, Jinan, China
| | - Qingmei Wang
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
| | - Aixian Li
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
| | - Fuyun Hou
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
| | - Liming Zhang
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
- * E-mail:
| |
Collapse
|
26
|
Mining microsatellite markers from public expressed sequence tags databases for the study of threatened plants. BMC Genomics 2015; 16:781. [PMID: 26463180 PMCID: PMC4603344 DOI: 10.1186/s12864-015-2031-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 10/09/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Simple Sequence Repeats (SSRs) are widely used in population genetic studies but their classical development is costly and time-consuming. The ever-increasing available DNA datasets generated by high-throughput techniques offer an inexpensive alternative for SSRs discovery. Expressed Sequence Tags (ESTs) have been widely used as SSR source for plants of economic relevance but their application to non-model species is still modest. METHODS Here, we explored the use of publicly available ESTs (GenBank at the National Center for Biotechnology Information-NCBI) for SSRs development in non-model plants, focusing on genera listed by the International Union for the Conservation of Nature (IUCN). We also search two model genera with fully annotated genomes for EST-SSRs, Arabidopsis and Oryza, and used them as controls for genome distribution analyses. Overall, we downloaded 16 031 555 sequences for 258 plant genera which were mined for SSRsand their primers with the help of QDD1. Genome distribution analyses in Oryza and Arabidopsis were done by blasting the sequences with SSR against the Oryza sativa and Arabidopsis thaliana reference genomes implemented in the Basal Local Alignment Tool (BLAST) of the NCBI website. Finally, we performed an empirical test to determine the performance of our EST-SSRs in a few individuals from four species of two eudicot genera, Trifolium and Centaurea. RESULTS We explored a total of 14 498 726 EST sequences from the dbEST database (NCBI) in 257 plant genera from the IUCN Red List. We identify a very large number (17 102) of ready-to-test EST-SSRs in most plant genera (193) at no cost. Overall, dinucleotide and trinucleotide repeats were the prevalent types but the abundance of the various types of repeat differed between taxonomic groups. Control genomes revealed that trinucleotide repeats were mostly located in coding regions while dinucleotide repeats were largely associated with untranslated regions. Our results from the empirical test revealed considerable amplification success and transferability between congenerics. CONCLUSIONS The present work represents the first large-scale study developing SSRs by utilizing publicly accessible EST databases in threatened plants. Here we provide a very large number of ready-to-test EST-SSR (17 102) for 193 genera. The cross-species transferability suggests that the number of possible target species would be large. Since trinucleotide repeats are abundant and mainly linked to exons they might be useful in evolutionary and conservation studies. Altogether, our study highly supports the use of EST databases as an extremely affordable and fast alternative for SSR developing in threatened plants.
Collapse
|
27
|
Olango TM, Tesfaye B, Pagnotta MA, Pè ME, Catellani M. Development of SSR markers and genetic diversity analysis in enset (Ensete ventricosum (Welw.) Cheesman), an orphan food security crop from Southern Ethiopia. BMC Genet 2015. [PMID: 26243662 DOI: 10.1186/s12863-12015-10250-12868] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023] Open
Abstract
BACKGROUND Enset (Ensete ventricosum (Welw.) Cheesman; Musaceae) is a multipurpose drought-tolerant food security crop with high conservation and improvement concern in Ethiopia, where it supplements the human calorie requirements of around 20 million people. The crop also has an enormous potential in other regions of Sub-Saharan Africa, where it is known only as a wild plant. Despite its potential, genetic and genomic studies supporting breeding programs and conservation efforts are very limited. Molecular methods would substantially improve current conventional approaches. Here we report the development of the first set of SSR markers from enset, their cross-transferability to Musa spp., and their application in genetic diversity, relationship and structure assessments in wild and cultivated enset germplasm. RESULTS SSR markers specific to E. ventricosum were developed through pyrosequencing of an enriched genomic library. Primer pairs were designed for 217 microsatellites with a repeat size > 20 bp from 900 candidates. Primers were validated in parallel by in silico and in vitro PCR approaches. A total of 67 primer pairs successfully amplified specific loci and 59 showed polymorphism. A subset of 34 polymorphic SSR markers were used to study 70 both wild and cultivated enset accessions. A large number of alleles were detected along with a moderate to high level of genetic diversity. AMOVA revealed that intra-population allelic variations contributed more to genetic diversity than inter-population variations. UPGMA based phylogenetic analysis and Discriminant Analysis of Principal Components show that wild enset is clearly separated from cultivated enset and is more closely related to the out-group Musa spp. No cluster pattern associated with the geographical regions, where this crop is grown, was observed for enset landraces. Our results reaffirm the long tradition of extensive seed-sucker exchange between enset cultivating communities in Southern Ethiopia. CONCLUSION The first set of genomic SSR markers were developed in enset. A large proportion of these markers were polymorphic and some were also transferable to related species of the genus Musa. This study demonstrated the usefulness of the markers in assessing genetic diversity and structure in enset germplasm, and provides potentially useful information for developing conservation and breeding strategies in enset.
Collapse
Affiliation(s)
- Temesgen Magule Olango
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy.
- Hawassa University, School of Plant and Horticulture Science, P.O.Box 5, Awassa, Ethiopia.
| | - Bizuayehu Tesfaye
- Hawassa University, School of Plant and Horticulture Science, P.O.Box 5, Awassa, Ethiopia.
| | - Mario Augusto Pagnotta
- Department of Science and Technologies for Agriculture, Forestry, Nature and Energy (DAFNE), Università degli Studi della Tuscia, Via San Camillo de Lellis, 01100, Viterbo, Italy.
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy.
| | - Marcello Catellani
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy.
- ENEA, UT BIORAD, Laboratory of Biotechnology, Research Center Casaccia, Via Anguillarese 301, 00123, Rome, Italy.
| |
Collapse
|
28
|
Olango TM, Tesfaye B, Pagnotta MA, Pè ME, Catellani M. Development of SSR markers and genetic diversity analysis in enset (Ensete ventricosum (Welw.) Cheesman), an orphan food security crop from Southern Ethiopia. BMC Genet 2015; 16:98. [PMID: 26243662 PMCID: PMC4524394 DOI: 10.1186/s12863-015-0250-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Accepted: 07/09/2015] [Indexed: 11/13/2022] Open
Abstract
Background Enset (Ensete ventricosum (Welw.) Cheesman; Musaceae) is a multipurpose drought-tolerant food security crop with high conservation and improvement concern in Ethiopia, where it supplements the human calorie requirements of around 20 million people. The crop also has an enormous potential in other regions of Sub-Saharan Africa, where it is known only as a wild plant. Despite its potential, genetic and genomic studies supporting breeding programs and conservation efforts are very limited. Molecular methods would substantially improve current conventional approaches. Here we report the development of the first set of SSR markers from enset, their cross-transferability to Musa spp., and their application in genetic diversity, relationship and structure assessments in wild and cultivated enset germplasm. Results SSR markers specific to E. ventricosum were developed through pyrosequencing of an enriched genomic library. Primer pairs were designed for 217 microsatellites with a repeat size > 20 bp from 900 candidates. Primers were validated in parallel by in silico and in vitro PCR approaches. A total of 67 primer pairs successfully amplified specific loci and 59 showed polymorphism. A subset of 34 polymorphic SSR markers were used to study 70 both wild and cultivated enset accessions. A large number of alleles were detected along with a moderate to high level of genetic diversity. AMOVA revealed that intra-population allelic variations contributed more to genetic diversity than inter-population variations. UPGMA based phylogenetic analysis and Discriminant Analysis of Principal Components show that wild enset is clearly separated from cultivated enset and is more closely related to the out-group Musa spp. No cluster pattern associated with the geographical regions, where this crop is grown, was observed for enset landraces. Our results reaffirm the long tradition of extensive seed-sucker exchange between enset cultivating communities in Southern Ethiopia. Conclusion The first set of genomic SSR markers were developed in enset. A large proportion of these markers were polymorphic and some were also transferable to related species of the genus Musa. This study demonstrated the usefulness of the markers in assessing genetic diversity and structure in enset germplasm, and provides potentially useful information for developing conservation and breeding strategies in enset. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0250-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Temesgen Magule Olango
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy. .,Hawassa University, School of Plant and Horticulture Science, P.O.Box 5, Awassa, Ethiopia.
| | - Bizuayehu Tesfaye
- Hawassa University, School of Plant and Horticulture Science, P.O.Box 5, Awassa, Ethiopia.
| | - Mario Augusto Pagnotta
- Department of Science and Technologies for Agriculture, Forestry, Nature and Energy (DAFNE), Università degli Studi della Tuscia, Via San Camillo de Lellis, 01100, Viterbo, Italy.
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy.
| | - Marcello Catellani
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy. .,ENEA, UT BIORAD, Laboratory of Biotechnology, Research Center Casaccia, Via Anguillarese 301, 00123, Rome, Italy.
| |
Collapse
|
29
|
Biswas MK, Liu Y, Li C, Sheng O, Mayer C, Yi G. Genome-Wide Computational Analysis of Musa Microsatellites: Classification, Cross-Taxon Transferability, Functional Annotation, Association with Transposons & miRNAs, and Genetic Marker Potential. PLoS One 2015; 10:e0131312. [PMID: 26121637 PMCID: PMC4488140 DOI: 10.1371/journal.pone.0131312] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2015] [Accepted: 06/01/2015] [Indexed: 01/14/2023] Open
Abstract
The development of organized, informative, robust, user-friendly, and freely accessible molecular markers is imperative to the Musa marker assisted breeding program. Although several hundred SSR markers have already been developed, the number of informative, robust, and freely accessible Musa markers remains inadequate for some breeding applications. In view of this issue, we surveyed SSRs in four different data sets, developed large-scale non-redundant highly informative therapeutic SSR markers, and classified them according to their attributes, as well as analyzed their cross-taxon transferability and utility for the genetic study of Musa and its relatives. A high SSR frequency (177 per Mbp) was found in the Musa genome. AT-rich dinucleotide repeats are predominant, and trinucleotide repeats are the most abundant in transcribed regions. A significant number of Musa SSRs are associated with pre-miRNAs, and 83% of these SSRs are promising candidates for the development of therapeutic SSR markers. Overall, 74% of the SSR markers were polymorphic, and 94% were transferable to at least one Musa spp. Two hundred forty-three markers generated a total of 1047 alleles, with 2-8 alleles each and an average of 4.38 alleles per locus. The PIC values ranged from 0.31 to 0.89 and averaged 0.71. We report the largest set of non-redundant, polymorphic, new SSR markers to be developed in Musa. These additional markers could be a valuable resource for marker-assisted breeding, genetic diversity and genomic studies of Musa and related species.
Collapse
Affiliation(s)
- Manosh Kumar Biswas
- Institution of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, Guangdong Province, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- The College of Life Science, South China Agricultural University, Guangzhou, China
| | - Yuxuan Liu
- Key Laboratory of Horticultural Plant Biology, Ministry of Education, Huazhong Agricultural University, Wuhan, Hubei, China
| | - Chunyu Li
- Institution of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, Guangdong Province, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
| | - Ou Sheng
- Institution of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, Guangdong Province, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
| | - Christoph Mayer
- Forschungsmuseum Alexander Koenig, Bonn, Adenauerallee 160, 53113 Bonn, Germany
| | - Ganjun Yi
- Institution of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Guangzhou, Guangdong Province, China
- Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Ministry of Agriculture, Guangzhou, China
- * E-mail:
| |
Collapse
|
30
|
Zeng T, Zhang L, Li J, Wang D, Tian Y, Lu L. De novo assembly and characterization of Muscovy duck liver transcriptome and analysis of differentially regulated genes in response to heat stress. Cell Stress Chaperones 2015; 20:483-93. [PMID: 25663538 PMCID: PMC4406934 DOI: 10.1007/s12192-015-0573-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2014] [Revised: 12/28/2014] [Accepted: 01/14/2015] [Indexed: 01/20/2023] Open
Abstract
High temperature is a major abiotic stress limiting animal growth and productivity worldwide. The Muscovy duck (Cairina moschata), sometimes called the Barbary drake, is a type of duck with a fairly unusual domestication history. In Southeast Asia, duck meat is one of the top meats consumed, and as such, the production of the meat is an important topic of research. The transcriptomic and genomic data presently available are insufficient to understanding the molecular mechanism underlying the heat tolerance of Muscovy ducks. Thus, transcriptome and expression profiling data for this species are required as important resource for identifying genes and developing molecular marker. In this study, de novo transcriptome assembly and gene expression analysis using Illumina sequencing technology were performed. More than 225 million clean reads were generated and assembled into 36,903 unique transcripts with an average length of 1,135 bp. A total of 21,221 (57.50 %) unigenes were annotated. Gene Ontology (GO) analysis of the annotated unigenes revealed that the majority of sequenced genes were associated with transcription, signal transduction, and apoptosis. We also performed gene expression profiling analysis upon heat treatment in Muscovy ducks and identified 470 heat-response unique transcripts. GO term enrichment showed that protein folding and chaperone binding were significant enrichment, whereas KEGG pathway analyses showed that Ras and MAPKs were activated after heat stress in Muscovy ducks. Our research enriched sequences information of Muscovy duck, provided novel insights into responses to heat stress in these ducks, and serve as candidate genes or markers that can be used to guide future efforts to breed heat-tolerant duck strains.
Collapse
Affiliation(s)
- Tao Zeng
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Liping Zhang
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
- />College of Chemistry and Life Sciences, Zhejiang Normal University, Jinhua, 321004 China
| | - Jinjun Li
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Deqian Wang
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Yong Tian
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| | - Lizhi Lu
- />Institute of Animal Husbandry and Veterinary Medicine, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021 China
| |
Collapse
|
31
|
Zhang S, Shi Y, Cheng N, Du H, Fan W, Wang C. De novo characterization of fall dormant and nondormant alfalfa (Medicago sativa L.) leaf transcriptome and identification of candidate genes related to fall dormancy. PLoS One 2015; 10:e0122170. [PMID: 25799491 PMCID: PMC4370819 DOI: 10.1371/journal.pone.0122170] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Accepted: 02/08/2015] [Indexed: 12/03/2022] Open
Abstract
Alfalfa (Medicago sativa L.) is one of the most widely cultivated perennial forage legumes worldwide. Fall dormancy is an adaptive character related to the biomass production and winter survival in alfalfa. The physiological, biochemical and molecular mechanisms causing fall dormancy and the related genes have not been well studied. In this study, we sequenced two standard varieties of alfalfa (dormant and non-dormant) at two time points and generated approximately 160 million high quality paired-end sequence reads using sequencing by synthesis (SBS) technology. The de novo transcriptome assembly generated a set of 192,875 transcripts with an average length of 856 bp representing about 165.1 Mb of the alfalfa leaf transcriptome. After assembly, 111,062 (57.6%) transcripts were annotated against the NCBI non-redundant database. A total of 30,165 (15.6%) transcripts were mapped to 323 Kyoto Encyclopedia of Genes and Genomes pathways. We also identified 41,973 simple sequence repeats, which can be used to generate markers for alfalfa, and 1,541 transcription factors were identified across 1,350 transcripts. Gene expression between dormant and non-dormant alfalfa at different time points were performed, and we identified several differentially expressed genes potentially related to fall dormancy. The Gene Ontology and pathways information were also identified. We sequenced and assembled the leaf transcriptome of alfalfa related to fall dormancy, and also identified some genes of interest involved in the fall dormancy mechanism. Thus, our research focused on studying fall dormancy in alfalfa through transcriptome sequencing. The sequencing and gene expression data generated in this study may be used further to elucidate the complete mechanisms governing fall dormancy in alfalfa.
Collapse
Affiliation(s)
- Senhao Zhang
- College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, Henan 450002, China
| | - Yinghua Shi
- College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, Henan 450002, China
| | - Ningning Cheng
- College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, Henan 450002, China
| | - Hongqi Du
- College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, Henan 450002, China
| | - Wenna Fan
- College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, Henan 450002, China
| | - Chengzhang Wang
- College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, Henan 450002, China
- * E-mail:
| |
Collapse
|
32
|
Deng SY, Wang XR, Zhu PL, Wen Q, Yang CX. Development of polymorphic microsatellite markers in the medicinal plant Gardenia jasminoides (Rubiaceae). BIOCHEM SYST ECOL 2015. [DOI: 10.1016/j.bse.2014.11.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
33
|
Wu N, Zhu Y, Song W, Li Y, Yan Y, Hu Y. Unusual tandem expansion and positive selection in subgroups of the plant GRAS transcription factor superfamily. BMC PLANT BIOLOGY 2014; 14:373. [PMID: 25524588 PMCID: PMC4279901 DOI: 10.1186/s12870-014-0373-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2014] [Accepted: 12/08/2014] [Indexed: 05/18/2023]
Abstract
BACKGROUND GRAS proteins belong to a plant transcription factor family that is involved with multifarious roles in plants. Although previous studies of this protein family have been reported for Arabidopsis, rice, Chinese cabbage and other species, investigation of expansion patterns and evolutionary rate on the basis of comparative genomics in different species remains inadequate. RESULTS A total of 289 GRAS genes were identified in Arabidopsis, B. distachyon, rice, soybean, S. moellendorffii, and P. patens and were grouped into seven subfamilies, supported by the similarity of their exon-intron patterns and structural motifs. All of tandem duplicated genes were found in group II except one cluster of rice, indicating that tandem duplication greatly promoted the expansion of group II. Furthermore, segment duplications were mainly found in the soybean genome, whereas no single expansion pattern dominated in other plant species indicating that GRAS genes from these five species might be subject to a more complex evolutionary mechanism. Interestingly, branch-site model analyses of positive selection showed that a number of sites were positively selected under foreground branches I and V. These results strongly indicated that these groups were experiencing higher positive selection pressure. Meanwhile, the site-specific model revealed that the GRAS genes were under strong positive selection in P. patens. DIVERGE v2.0 was used to detect critical amino acid sites, and the results showed that the shifted evolutionary rate was mainly attributed to the functional divergence between the GRAS genes in the two groups. In addition, the results also demonstrated the expression divergence of the GRAS duplicated genes in the evolution. In short, the results above provide a solid foundation for further functional dissection of the GRAS gene superfamily. CONCLUSIONS In this work, differential expression, evolutionary rate, and expansion patterns of the GRAS gene family in the six species were predicted. Especially, tandem duplication events played an important role in expansion of group II. Together, these results contribute to further functional analysis and the molecular evolution of the GRAS gene superfamily.
Collapse
Affiliation(s)
- Ningning Wu
- College of Life Sciences, Capital Normal University, Beijing, 100048 China
| | - Yan Zhu
- College of Life Sciences, Capital Normal University, Beijing, 100048 China
| | - Wanlu Song
- College of Life Sciences, Capital Normal University, Beijing, 100048 China
| | - Yaxuan Li
- College of Life Sciences, Capital Normal University, Beijing, 100048 China
| | - Yueming Yan
- College of Life Sciences, Capital Normal University, Beijing, 100048 China
| | - Yingkao Hu
- College of Life Sciences, Capital Normal University, Beijing, 100048 China
| |
Collapse
|
34
|
Sahu J, Sen P, Choudhury MD, Dehury B, Barooah M, Modi MK, Talukdar AD. Rediscovering medicinal plants' potential with OMICS: microsatellite survey in expressed sequence tags of eleven traditional plants with potent antidiabetic properties. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:298-309. [PMID: 24802971 DOI: 10.1089/omi.2013.0147] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic stock for cross transferability in these plants and the literature on biomarkers and novel drug discovery for common chronic diseases such as diabetes.
Collapse
Affiliation(s)
- Jagajjit Sahu
- 1 Agri-Bioinformatics Promotion Programme, Department of Agricultural Biotechnology, Assam Agricultural University , Assam, India
| | | | | | | | | | | | | |
Collapse
|
35
|
Development of a gene-centered ssr atlas as a resource for papaya (Carica papaya) marker-assisted selection and population genetic studies. PLoS One 2014; 9:e112654. [PMID: 25393538 PMCID: PMC4231050 DOI: 10.1371/journal.pone.0112654] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 10/08/2014] [Indexed: 01/08/2023] Open
Abstract
Carica papaya (papaya) is an economically important tropical fruit. Molecular marker-assisted selection is an inexpensive and reliable tool that has been widely used to improve fruit quality traits and resistance against diseases. In the present study we report the development and validation of an atlas of papaya simple sequence repeat (SSR) markers. We integrated gene predictions and functional annotations to provide a gene-centered perspective for marker-assisted selection studies. Our atlas comprises 160,318 SSRs, from which 21,231 were located in genic regions (i.e. inside exons, exon-intron junctions or introns). A total of 116,453 (72.6%) of all identified repeats were successfully mapped to one of the nine papaya linkage groups. Primer pairs were designed for markers from 9,594 genes (34.5% of the papaya gene complement). Using papaya-tomato orthology assessments, we assembled a list of 300 genes (comprising 785 SSRs) potentially involved in fruit ripening. We validated our atlas by screening 73 SSR markers (including 25 fruit ripening genes), achieving 100% amplification rate and uncovering 26% polymorphism rate between the parental genotypes (Sekati and JS12). The SSR atlas presented here is the first comprehensive gene-centered collection of annotated and genome positioned papaya SSRs. These features combined with thousands of high-quality primer pairs make the atlas an important resource for the papaya research community.
Collapse
|
36
|
Müller BSDF, Sakamoto T, de Menezes IPP, Prado GS, Martins WS, Brondani C, de Barros EG, Vianello RP. Analysis of BAC-end sequences in common bean (Phaseolus vulgaris L.) towards the development and characterization of long motifs SSRs. PLANT MOLECULAR BIOLOGY 2014; 86:455-470. [PMID: 25164100 DOI: 10.1007/s11103-014-0240-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Accepted: 08/14/2014] [Indexed: 06/03/2023]
Abstract
The increasing volume of genomic data on the Phaseolus vulgaris species have contributed to its importance as a model genetic species and positively affected the investigation of other legumes of scientific and economic value. To expand and gain a more in-depth knowledge of the common bean genome, the ends of a number of bacterial artificial chromosome (BAC) were sequenced, annotated and the presence of repetitive sequences was determined. In total, 52,270 BESs (BAC-end sequences), equivalent to 32 Mbp (~6 %) of the genome, were processed. In total, 3,789 BES-SSRs were identified, with a distribution of one SSR (simple sequence repeat) per 8.36 kbp and 2,000 were suitable for the development of SSRs, of which 194 were evaluated in low-resolution screening. From 40 BES-SSRs based on long motifs SSRs (≥ trinucleotides) analyzed in high-resolution genotyping, 34 showed an equally good amplification for the Andean and for the Mesoamerican genepools, exhibiting an average gene diversity (H E) of 0.490 and 5.59 alleles/locus, of which six classified as Class I showed a H E ≥ 0.7. The PCoA and structure analysis allowed to discriminate the gene pools (K = 2, FST = 0.733). From the 52,270 BESs, 2 % corresponded to transcription factors and 3 % to transposable elements. Putative functions for 24,321 BESs were identified and for 19,363 were assigned functional categories (gene ontology). This study identified highly polymorphic BES-SSRs containing tri- to hexanucleotides motifs and bringing together relevant genetic characteristics useful for breeding programs. Additionally, the BESs were incorporated into the international genome-sequencing project for the common bean.
Collapse
Affiliation(s)
- Bárbara Salomão de Faria Müller
- Laboratório de Genética Molecular de Plantas, Instituto de Biotecnologia Aplicada à Agropecuária (BIOAGRO), Universidade Federal de Viçosa (UFV), Viçosa, MG, Brazil
| | | | | | | | | | | | | | | |
Collapse
|
37
|
Identification and characterization of microsatellites in expressed sequence tags and their cross transferability in different plants. Int J Genomics 2014; 2014:863948. [PMID: 25389527 PMCID: PMC4217358 DOI: 10.1155/2014/863948] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Revised: 08/31/2014] [Accepted: 09/22/2014] [Indexed: 12/20/2022] Open
Abstract
Expressed sequence tags (EST) are potential source for the development of genic microsatellite markers, gene discovery, comparative genomics, and other genomic studies. In the present study, 7630 ESTs were examined from NCBI for SSR identification and characterization. A total of 263 SSRs were identified with an average density of one SSR/4.2 kb (3.4% frequency). Analysis revealed that trinucleotide repeats (47.52%) were most abundant followed by tetranucleotide (19.77%), dinucleotide (19.01%), pentanucleotide (9.12%), and hexanucleotide repeats (4.56%). Functional annotation was done through homology search and gene ontology, and 35 EST-SSRs were selected. Primer pairs were designed for evaluation of cross transferability and polymorphism among 11 plants belonging to five different families. Total 402 alleles were generated at 155 loci with an average of 2.6 alleles/locus and the polymorphic information content (PIC) ranged from 0.15 to 0.92 with an average of 0.75. The cross transferability ranged from 34.84% to 98.06% in different plants, with an average of 67.86%. Thus, the validation study of annotated 35 EST-SSR markers which correspond to particular metabolic activity revealed polymorphism and evolutionary nature in different families of Angiospermic plants.
Collapse
|
38
|
In silico genome comparison and distribution analysis of simple sequences repeats in cassava. Int J Genomics 2014; 2014:471461. [PMID: 25374887 PMCID: PMC4211302 DOI: 10.1155/2014/471461] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Accepted: 09/08/2014] [Indexed: 11/17/2022] Open
Abstract
We conducted a SSRs density analysis in different cassava genomic regions. The information obtained was useful to establish comparisons between cassava's SSRs genomic distribution and those of poplar, flax, and Jatropha. In general, cassava has a low SSR density (~50 SSRs/Mbp) and has a high proportion of pentanucleotides, (24,2 SSRs/Mbp). It was found that coding sequences have 15,5 SSRs/Mbp, introns have 82,3 SSRs/Mbp, 5' UTRs have 196,1 SSRs/Mbp, and 3' UTRs have 50,5 SSRs/Mbp. Through motif analysis of cassava's genome SSRs, the most abundant motif was AT/AT while in intron sequences and UTRs regions it was AG/CT. In addition, in coding sequences the motif AAG/CTT was also found to occur most frequently; in fact, it is the third most used codon in cassava. Sequences containing SSRs were classified according to their functional annotation of Gene Ontology categories. The identified SSRs here may be a valuable addition for genetic mapping and future studies in phylogenetic analyses and genomic evolution.
Collapse
|
39
|
Ranade SS, Lin YC, Zuccolo A, Van de Peer Y, García-Gil MDR. Comparative in silico analysis of EST-SSRs in angiosperm and gymnosperm tree genera. BMC PLANT BIOLOGY 2014; 14:220. [PMID: 25143005 PMCID: PMC4160553 DOI: 10.1186/s12870-014-0220-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 08/05/2014] [Indexed: 05/24/2023]
Abstract
BACKGROUND Simple Sequence Repeats (SSRs) derived from Expressed Sequence Tags (ESTs) belong to the expressed fraction of the genome and are important for gene regulation, recombination, DNA replication, cell cycle and mismatch repair. Here, we present a comparative analysis of the SSR motif distribution in the 5'UTR, ORF and 3'UTR fractions of ESTs across selected genera of woody trees representing gymnosperms (17 species from seven genera) and angiosperms (40 species from eight genera). RESULTS Our analysis supports a modest contribution of EST-SSR length to genome size in gymnosperms, while EST-SSR density was not associated with genome size in neither angiosperms nor gymnosperms. Multiple factors seem to have contributed to the lower abundance of EST-SSRs in gymnosperms that has resulted in a non-linear relationship with genome size diversity. The AG/CT motif was found to be the most abundant in SSRs of both angiosperms and gymnosperms, with a relative increase in AT/AT in the latter. Our data also reveals a higher abundance of hexamers across the gymnosperm genera. CONCLUSIONS Our analysis provides the foundation for future comparative studies at the species level to unravel the evolutionary processes that control the SSR genesis and divergence between angiosperm and gymnosperm tree species.
Collapse
Affiliation(s)
- Sonali Sachin Ranade
- />Umeå Plant Science Centre (UPSC), Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-901-83 Umeå, Sweden
| | - Yao-Cheng Lin
- />Department of Plant Systems Biology (VIB) and Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
| | - Andrea Zuccolo
- />Istituto di Genomica Applicata, Via J. Linussio 51, 33100 Udine, Italy
- />Institute of Life Sciences, Scuola Superiore Sant’Anna, 56127 Pisa, Italy
| | - Yves Van de Peer
- />Department of Plant Systems Biology (VIB) and Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, 9052 Ghent, Belgium
- />Genomics Research Institute, University of Pretoria, Hatfield Campus, Pretoria, 0028 South Africa
| | - María del Rosario García-Gil
- />Umeå Plant Science Centre (UPSC), Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-901-83 Umeå, Sweden
| |
Collapse
|
40
|
In-silico mining, type and frequency analysis of genic microsatellites of finger millet (Eleusine coracana (L.) Gaertn.): a comparative genomic analysis of NBS-LRR regions of finger millet with rice. Mol Biol Rep 2014; 41:3081-90. [PMID: 24477586 DOI: 10.1007/s11033-014-3168-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2013] [Accepted: 01/16/2014] [Indexed: 10/25/2022]
Abstract
In recent years, the increased availability of the DNA sequences has given the possibility to develop and explore the expressed sequence tags (ESTs) derived SSR markers. In the present study, a total of 1956 ESTs of finger millet were used to find the microsatellite type, distribution, frequency and developed a total of 545 primer pairs from the ESTs of finger millet. Thirty-two EST sequences had more than two microsatellites and 1357 sequences did not have any SSR repeats. The most frequent type of repeats was trimeric motif, however the second place was occupied by dimeric motif followed by tetra-, hexa- and penta repeat motifs. The most common dimer repeat motif was GA and in case of trimeric SSRs, it was CGG. The EST sequences of NBS-LRR region of finger millet and rice showed higher synteny and were found on nearly same positions on the rice chromosome map. A total of eight, out of 15 EST based SSR primers were polymorphic among the selected resistant and susceptible finger millet genotypes. The primer FMBLEST5 could able to differentiate them into resistant and susceptible genotypes. The alleles specific to the resistant and susceptible genotypes were sequenced using the ABI 3130XL genetic analyzer and found similarity to NBS-LRR regions of rice and finger millet and contained the characteristic kinase-2 and kinase 3a motifs of plant R-genes belonged to NBS-LRR region. The In-silico and comparative analysis showed that the genes responsible for blast resistance can be identified, mapped and further introgressed through molecular breeding approaches for enhancing the blast resistance in finger millet.
Collapse
|
41
|
Abstract
Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among the 31 species, no significant correlation was detected between the TR density and genome size. Interestingly, green alga Chlamydomonas reinhardtii (42,059 bp/Mbp) and castor bean Ricinus communis (55,454 bp/Mbp) showed much higher TR densities than all other species (13,209 bp/Mbp on average). In the 29 land plants, including 22 dicots, 5 monocots, and 2 bryophytes, 5′-UTR and upstream intergenic 200-nt (UI200) regions had the first and second highest TR densities, whereas in the two green algae (C. reinhardtii and Volvox carteri) the first and second highest densities were found in intron and coding sequence (CDS) regions, respectively. In CDS regions, trinucleotide and hexanucleotide motifs were those most frequently represented in all species. In intron regions, especially in the two green algae, significantly more TRs were detected near the intron–exon junctions. Within intergenic regions in dicots and monocots, more TRs were found near both the 5′ and 3′ ends of genes. GO annotation in two green algae revealed that the genes with TRs in introns are significantly involved in transcriptional and translational processing. As the first systematic examination of TRs in plant and green algal genomes, our study showed that TRs displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation in plants and green algae.
Collapse
|
42
|
Jain A, Chaudhary S, Sharma PC. Mining of microsatellites using next generation sequencing of seabuckthorn (Hippophae rhamnoides L.) transcriptome. PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2014; 20:115-23. [PMID: 24554845 PMCID: PMC3925482 DOI: 10.1007/s12298-013-0210-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2013] [Revised: 08/31/2013] [Accepted: 10/09/2013] [Indexed: 05/26/2023]
Abstract
Gene based microsatellite markers are becoming more popular as compared to traditional random genomic microsatellite markers due to rapid and inexpensive method of isolation and their cross species portability. The present study documents occurrence of microsatellites in the transcriptome of seabuckthorn, a plant with immense medicinal, nutritional and ecological value. De novo assembly of over 80 million high quality short reads generated by high throughput next generation sequencing yielded 88297 putative unigenes. Of these, 7.69 % unigenes harbored microsatellite repeats with an average of one microsatellite per 6.704 Kb transcriptome. Dinucleotide repeats were most abundant followed by trinucleotide repeats. Microsatellites were densely populated in coding regions followed by 3' and 5' untranslated regions. AG and AAG type repeats were most frequently represented. Of the microsatellite positive unigenes, 48.81 % could be assigned gene ontology (GO) terms in order to assess associations between microsatellite containing unigenes and biological role of known genes. Utility of unigene specific microsatellites was assessed on the basis of polymorphism(s) detected in 18 seabuckthorn collections from Leh (India) using a set of randomly selected 25 unigene specific microsatellites. The findings presented here are likely to find immense use in future breeding and molecular biology research projects in seabuckthorn aiming at its overall development as a crop.
Collapse
Affiliation(s)
- Ankit Jain
- University School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector-16C, Dwarka, Delhi 110078 India
| | - Saurabh Chaudhary
- University School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector-16C, Dwarka, Delhi 110078 India
| | - Prakash Chand Sharma
- University School of Biotechnology, Guru Gobind Singh Indraprastha University, Sector-16C, Dwarka, Delhi 110078 India
| |
Collapse
|
43
|
Davey MW, Gudimella R, Harikrishna JA, Sin LW, Khalid N, Keulemans J. "A draft Musa balbisiana genome sequence for molecular genetics in polyploid, inter- and intra-specific Musa hybrids". BMC Genomics 2013; 14:683. [PMID: 24094114 PMCID: PMC3852598 DOI: 10.1186/1471-2164-14-683] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2013] [Accepted: 09/24/2013] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Modern banana cultivars are primarily interspecific triploid hybrids of two species, Musa acuminata and Musa balbisiana, which respectively contribute the A- and B-genomes. The M. balbisiana genome has been associated with improved vigour and tolerance to biotic and abiotic stresses and is thus a target for Musa breeding programs. However, while a reference M. acuminata genome has recently been released (Nature 488:213-217, 2012), little sequence data is available for the corresponding B-genome.To address these problems we carried out Next Generation gDNA sequencing of the wild diploid M. balbisiana variety 'Pisang Klutuk Wulung' (PKW). Our strategy was to align PKW gDNA reads against the published A-genome and to extract the mapped consensus sequences for subsequent rounds of evaluation and gene annotation. RESULTS The resulting B-genome is 79% the size of the A-genome, and contains 36,638 predicted functional gene sequences which is nearly identical to the 36,542 of the A-genome. There is substantial sequence divergence from the A-genome at a frequency of 1 homozygous SNP per 23.1 bp, and a high degree of heterozygosity corresponding to one heterozygous SNP per 55.9 bp. Using expressed small RNA data, a similar number of microRNA sequences were predicted in both A- and B-genomes, but additional novel miRNAs were detected, including some that are unique to each genome. The usefulness of this B-genome sequence was evaluated by mapping RNA-seq data from a set of triploid AAA and AAB hybrids simultaneously to both genomes. Results for the plantains demonstrated the expected 2:1 distribution of reads across the A- and B-genomes, but for the AAA genomes, results show they contain regions of significant homology to the B-genome supporting proposals that there has been a history of interspecific recombination between homeologous A and B chromosomes in Musa hybrids. CONCLUSIONS We have generated and annotated a draft reference Musa B-genome and demonstrate that this can be used for molecular genetic mapping of gene transcripts and small RNA expression data from several allopolyploid banana cultivars. This draft therefore represents a valuable resource to support the study of metabolism in inter- and intraspecific triploid Musa hybrids and to help direct breeding programs.
Collapse
Affiliation(s)
- Mark W Davey
- Laboratory of Fruit Breeding and Biotechnology, Division of Crop Biotechnics, Department of Biosystems, Katholieke Universiteit Leuven, Willem de Croylaan 42, box 2427B-3001, Heverlee, Leuven, Belgium
| | - Ranganath Gudimella
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia
| | - Jennifer Ann Harikrishna
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia
| | - Lee Wan Sin
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia
| | - Norzulaani Khalid
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur 50603, Malaysia
| | - Johan Keulemans
- Laboratory of Fruit Breeding and Biotechnology, Division of Crop Biotechnics, Department of Biosystems, Katholieke Universiteit Leuven, Willem de Croylaan 42, box 2427B-3001, Heverlee, Leuven, Belgium
| |
Collapse
|
44
|
Bhardwaj J, Chauhan R, Swarnkar MK, Chahota RK, Singh AK, Shankar R, Yadav SK. Comprehensive transcriptomic study on horse gram (Macrotyloma uniflorum): De novo assembly, functional characterization and comparative analysis in relation to drought stress. BMC Genomics 2013; 14:647. [PMID: 24059455 PMCID: PMC3853109 DOI: 10.1186/1471-2164-14-647] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 09/13/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Drought tolerance is an attribute maintained in plants by cross-talk between multiple and cascading metabolic pathways. Without a sequenced genome available for horse gram, it is difficult to comprehend such complex networks and intercalated genes associated with drought tolerance of horse gram (Macrotyloma uniflorum). Therefore, de novo transcriptome discovery and associated analyses was done for this highly drought tolerant yet under exploited legume to decipher its genetic makeup. RESULTS Eight samples comprising of shoot and root tissues of two horse gram genotypes (drought-sensitive; M-191 and drought-tolerant; M-249) were used for comparison under control and polyethylene glycol-induced drought stress conditions. Using Illumina sequencing technology, a total of 229,297,896 paired end read pairs were generated and utilized for de novo assembly of horse gram. Significant BLAST hits were obtained for 26,045 transcripts while, 3,558 transcripts had no hits but contained important conserved domains. A total of 21,887 unigenes were identified. SSRs containing sequences covered 16.25% of the transcriptome with predominant tri- and mono-nucleotides (43%). The total GC content of the transcriptome was found to be 43.44%. Under Gene Ontology response to stimulus, DNA binding and catalytic activity was highly expressed during drought stress conditions. Serine/threonine protein kinase was found to dominate in Enzyme Classification while pathways belonging to ribosome metabolism followed by plant pathogen interaction and plant hormone signal transduction were predominant in Kyoto Encyclopedia of Genes and Genomes analysis. Independent search on plant metabolic network pathways suggested valine degradation, gluconeogenesis and purine nucleotide degradation to be highly influenced under drought stress in horse gram. Transcription factors belonging to NAC, MYB-related, and WRKY families were found highly represented under drought stress. qRT-PCR validated the expression profile for 9 out of 10 genes analyzed in response to drought stress. CONCLUSIONS De novo transcriptome discovery and analysis has generated enormous information over horse gram genomics. The genes and pathways identified suggest efficient regulation leading to active adaptation as a basal defense response against drought stress by horse gram. The knowledge generated can be further utilized for exploring other underexploited plants for stress responsive genes and improving plant tolerance.
Collapse
Affiliation(s)
- Jyoti Bhardwaj
- Plant Metabolic Engineering Laboratory, Council of Scientific and Industrial Research-Institute of Himalayan Bioresource Technology, Palampur 176061, HP, India.
| | | | | | | | | | | | | |
Collapse
|
45
|
Toledo-Silva G, Cardoso-Silva CB, Jank L, Souza AP. De novo transcriptome assembly for the tropical grass Panicum maximum Jacq. PLoS One 2013; 8:e70781. [PMID: 23923022 PMCID: PMC3726610 DOI: 10.1371/journal.pone.0070781] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2013] [Accepted: 06/21/2013] [Indexed: 12/31/2022] Open
Abstract
Guinea grass (Panicum maximum Jacq.) is a tropical African grass often used to feed beef cattle, which is an important economic activity in Brazil. Brazil is the leader in global meat exportation because of its exclusively pasture-raised bovine herds. Guinea grass also has potential uses in bioenergy production due to its elevated biomass generation through the C4 photosynthesis pathway. We generated approximately 13 Gb of data from Illumina sequencing of P. maximum leaves. Four different genotypes were sequenced, and the combined reads were assembled de novo into 38,192 unigenes and annotated; approximately 63% of the unigenes had homology to other proteins in the NCBI non-redundant protein database. Functional classification through COG (Clusters of Orthologous Groups), GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) analyses showed that the unigenes from Guinea grass leaves are involved in a wide range of biological processes and metabolic pathways, including C4 photosynthesis and lignocellulose generation, which are important for cattle grazing and bioenergy production. The most abundant transcripts were involved in carbon fixation, photosynthesis, RNA translation and heavy metal cellular homeostasis. Finally, we identified a number of potential molecular markers, including 5,035 microsatellites (SSRs) and 346,456 single nucleotide polymorphisms (SNPs). To the best of our knowledge, this is the first study to characterize the complete leaf transcriptome of P. maximum using high-throughput sequencing. The biological information provided here will aid in gene expression studies and marker-assisted selection-based breeding research in tropical grasses.
Collapse
Affiliation(s)
- Guilherme Toledo-Silva
- Molecular Biology Center and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Claudio Benicio Cardoso-Silva
- Molecular Biology Center and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Liana Jank
- Embrapa Beef Cattle, Campo Grande, Mato Grosso do Sul, Brazil
| | - Anete Pereira Souza
- Molecular Biology Center and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Department of Plant Biology, Biology Institute, University of Campinas (UNICAMP), Campinas, São Paulo, Brazil
| |
Collapse
|
46
|
Chen S, Huang X, Yan X, Liang Y, Wang Y, Li X, Peng X, Ma X, Zhang L, Cai Y, Ma T, Cheng L, Qi D, Zheng H, Yang X, Li X, Liu G. Transcriptome analysis in sheepgrass (Leymus chinensis): a dominant perennial grass of the Eurasian Steppe. PLoS One 2013; 8:e67974. [PMID: 23861841 PMCID: PMC3701641 DOI: 10.1371/journal.pone.0067974] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2013] [Accepted: 05/24/2013] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.
Collapse
Affiliation(s)
- Shuangyan Chen
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- * E-mail: (SC); (XL); (GL)
| | - Xin Huang
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
| | - Xueqing Yan
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
| | - Ye Liang
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
| | - Yuezhu Wang
- Shanghai-MOST Key Laboratory of Health and Disease Genomics, Chinese National Human Genome Center at Shanghai, Shanghai, P. R. China
| | - Xiaofeng Li
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
| | - Xianjun Peng
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
| | - Xingyong Ma
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
| | - Lexin Zhang
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
| | - Yueyue Cai
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
| | - Tian Ma
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
| | - Liqin Cheng
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
| | - Dongmei Qi
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
| | - Huajun Zheng
- Shanghai-MOST Key Laboratory of Health and Disease Genomics, Chinese National Human Genome Center at Shanghai, Shanghai, P. R. China
| | - Xiaohan Yang
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America
| | - Xiaoxia Li
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- Graduate Schoo1 of the Chinese Academy of Sciences, Beijing, P. R. China
- * E-mail: (SC); (XL); (GL)
| | - Gongshe Liu
- Key Laboratory of Plant Resources, Institute of Botany, the Chinese Academy of Sciences, Beijing, P. R. China
- * E-mail: (SC); (XL); (GL)
| |
Collapse
|
47
|
Rhoden SA, Garcia A, Azevedo JL, Pamphile JA. In silico analysis of diverse endophytic fungi by using ITS1-5,8S-ITS2 sequences with isolates from various plant families in Brazil. GENETICS AND MOLECULAR RESEARCH 2013; 12:935-50. [PMID: 23613240 DOI: 10.4238/2013.april.2.10] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Brazil has a great diversity of plants, and considering that all plant species studied to date have endophytic microorganisms (bacteria or fungi), the country is a resource in the search for bioactive compounds. Endophytes live within plants without causing damage and may be in dynamic equilibrium with the health of the plant. Endophytic fungi can be identified by sequencing the region corresponding to internal transcribed spacer 1-5,8S-internal transcribed spacer 2 ribosomal DNA, and carrying out phylogenetic analyses of these sequences helps to identify species. The objective of this research was to perform in silico phylogenetic analysis of fungi isolated from various plant families in Brazil. For this study, we chose 12 articles published between 2005 and 2012 that examined endophytes isolated in Brazil. We analyzed sequences deposited in the National Center for Biotechnology Information GenBank database and carried out alignment to determine the genetic distance of strains using the Molecular Evolutionary Genetics Analysis version 5 program. The articles yielded 73 plant species belonging to 13 families found in the Brazilian States of Amazonas, Bahia, Minas Gerais, Paraná, and São Paulo. The use of GenBank and the Molecular Evolutionary Genetics Analysis program for phylogenetic observation revealed that several endophytes had been incorrectly identified because inconsistencies were apparent in their location in the phylogenetic tree. However, approximately 98% of the sequences deposited in GenBank were consistent with the identification of related genera, indicating that the database is sufficiently robust to support future studies, in which molecular identification of endophytes is made via analysis of ribosomal DNA sequences.
Collapse
Affiliation(s)
- S A Rhoden
- Departamento de Biotecnologia, Genética e Biologia Celular, Universidade Estadual de Maringá, Maringá, PR, Brasil
| | | | | | | |
Collapse
|
48
|
Shi J, Huang S, Fu D, Yu J, Wang X, Hua W, Liu S, Liu G, Wang H. Evolutionary dynamics of microsatellite distribution in plants: insight from the comparison of sequenced brassica, Arabidopsis and other angiosperm species. PLoS One 2013; 8:e59988. [PMID: 23555856 PMCID: PMC3610691 DOI: 10.1371/journal.pone.0059988] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2012] [Accepted: 02/24/2013] [Indexed: 11/19/2022] Open
Abstract
Despite their ubiquity and functional importance, microsatellites have been largely ignored in comparative genomics, mostly due to the lack of genomic information. In the current study, microsatellite distribution was characterized and compared in the whole genomes and both the coding and non-coding DNA sequences of the sequenced Brassica, Arabidopsis and other angiosperm species to investigate their evolutionary dynamics in plants. The variation in the microsatellite frequencies of these angiosperm species was much smaller than those for their microsatellite numbers and genome sizes, suggesting that microsatellite frequency may be relatively stable in plants. The microsatellite frequencies of these angiosperm species were significantly negatively correlated with both their genome sizes and transposable elements contents. The pattern of microsatellite distribution may differ according to the different genomic regions (such as coding and non-coding sequences). The observed differences in many important microsatellite characteristics (especially the distribution with respect to motif length, type and repeat number) of these angiosperm species were generally accordant with their phylogenetic distance, which suggested that the evolutionary dynamics of microsatellite distribution may be generally consistent with plant divergence/evolution. Importantly, by comparing these microsatellite characteristics (especially the distribution with respect to motif type) the angiosperm species (aside from a few species) all clustered into two obviously different groups that were largely represented by monocots and dicots, suggesting a complex and generally dichotomous evolutionary pattern of microsatellite distribution in angiosperms. Polyploidy may lead to a slight increase in microsatellite frequency in the coding sequences and a significant decrease in microsatellite frequency in the whole genome/non-coding sequences, but have little effect on the microsatellite distribution with respect to motif length, type and repeat number. Interestingly, several microsatellite characteristics seemed to be constant in plant evolution, which can be well explained by the general biological rules.
Collapse
Affiliation(s)
- Jiaqin Shi
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Shunmou Huang
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Donghui Fu
- Key Laboratory of Crop Physiology, Ecology and Genetic Breeding, Ministry of Education, Agronomy College, Jiangxi Agricultural University, Nanchang, China
| | - Jinyin Yu
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Xinfa Wang
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Wei Hua
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Shengyi Liu
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Guihua Liu
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Hanzhong Wang
- Key Laboratory of Oil Crop Biology of the Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, China
- * E-mail:
| |
Collapse
|
49
|
Xie C, Li B, Xu Y, Ji D, Chen C. Characterization of the global transcriptome for Pyropia haitanensis (Bangiales, Rhodophyta) and development of cSSR markers. BMC Genomics 2013; 14:107. [PMID: 23414227 PMCID: PMC3626662 DOI: 10.1186/1471-2164-14-107] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Accepted: 02/02/2013] [Indexed: 01/12/2023] Open
Abstract
Background Pyropia haitanensis is an economically important mariculture crop in China and is also valuable in life science research. However, the lack of genetic information of this organism hinders the understanding of the molecular mechanisms of specific traits. Thus, high-throughput sequencing is needed to generate a number of transcriptome sequences to be used for gene discovery and molecular marker development. Results In this study, high-throughput sequencing was used to analyze the global transcriptome of P. haitanensis. Approximately 103 million 90 bp paired-end reads were generated using an Illumina HiSeq 2000. De novo assembly with paired-end information yielded 24,575 unigenes with an average length of 645 bp. Based on sequence similarity searches with known proteins, a total of 16,377 (66.64%) genes were identified. Of these annotated unigenes, 5,471 and 9,168 unigenes were assigned to gene ontology and clusters of orthologous groups, respectively. Searching against the KEGG database indicated that 12,167 (49.51%) unigenes mapped to 124 KEGG pathways. Among the carbon fixation pathways, almost all the essential genes related to the C3- and C4-pathways for P. haitanensis were discovered. Significantly different expression levels of three key genes (Rubisco, PEPC and PEPCK) in different lifecycle stages of P. haitanensis indicated that the carbon fixation pathway in the conchocelis and thallus were different, and the C4-like pathway might play important roles in the conchocelis stage. In addition, 2,727 cSSRs loci were identified in the unigenes. Among them, trinucleotide SSRs were the dominant repeat motif (87.17%, 2,377) and GCC/CCG motifs were the most common repeats (60.07%, 1,638). High quality primers to 824 loci were designed and 100 primer pairs were randomly evaluated in six strains of P. haitanensis. Eighty-seven primer pairs successfully yielded amplicons. Conclusion This study generated a large number of putative P. haitanensis transcript sequences, which can be used for novel gene discovery and gene expression profiling analyses under different physiological conditions. A number of the cSSR markers identified can be used for molecular markers and will facilitate marker assisted selection in P. haitanensis breeding. These sequences and markers will provide valuable resources for further P. haitanensis studies.
Collapse
Affiliation(s)
- Chaotian Xie
- Fisheries College, Jimei University, Xiamen, Fujian Province 361021, People's Republic of China
| | | | | | | | | |
Collapse
|
50
|
Evolution of an Ancient Microsatellite Hotspot in the Conifer Mitochondrial Genome and Comparison with Other Plants. J Mol Evol 2013; 76:146-57. [DOI: 10.1007/s00239-013-9547-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|