1
|
Sun N, Chen J, Wang Y, Hussain I, Lei N, Ma X, Li W, Liu K, Yu H, Zhao K, Zhao T, Zhang Y, Yu X. Development and utility of SSR markers based on Brassica sp. whole-genome in triangle of U. FRONTIERS IN PLANT SCIENCE 2024; 14:1259736. [PMID: 38259948 PMCID: PMC10801002 DOI: 10.3389/fpls.2023.1259736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Accepted: 12/13/2023] [Indexed: 01/24/2024]
Abstract
Introduction Simple sequence repeats (SSR), also known as microsatellites, are crucial molecular markers in both animals and plants. Despite extensive previous research on SSRs, the development of microsatellite markers in Brassica crops remains limited and inefficient. Methods Krait software was used to identify microsatellites by genome-wide and marker development based on three recently sequenced basic species of Brassica crops in the triangle of U (Brassica rapa, B. nigra and B. oleracea), as well as three allotetraploids (B. juncea, B. napus and B. carinata) using public databases. Subsequently, the primers and the characteristics of microsatellites for most of them were accordingly designed on each chromosome of each of the six Brassica species, and their physical locations were identified,and the cross-transferability of primers have been carried out. In addition, a B-genome specific SSR marker was screened out. Results A total of 79341, 92089, 125443, 173964, 173604, and 222160 SSR loci have been identified from the whole genome sequences of Brassica crops within the triangle of U crops, B. rapa (AA), B. nigra (BB), B. oleracea (CC), B. napus (AACC), B. juncea (AABB) and B. carinata (BBCC), respectively. Comparing the number distribution of the three allotetraploid SSR loci in the three subgenomes AA, BB and CC, results indicate that the allotetraploid species have significant reduction in the number of SSR loci in the genome compared with their basic diploid counterparts. Moreover, we compared the basic species with their corresponding varieties, and found that the microsatellite characters between the allotetraploids and their corresponding basic species were very similar or almost identical. Subsequently, each of the 40 SSR primers was employed to investigate the polymorphism potential of B. rapa (85.27%), B. nigra (81.33%) and B. oleracea (73.45%), and B. rapa was found to have a higher cross-transfer rate among the basic species in the triangle of U. Meanwhile, a B-genome specific SSR marker, BniSSR23228 possessing the (AAGGA)3 sequence characteristics was obtained, and it located in chromosome B3 with a total length of 97 bp. Discussion In this study, results suggest that the pattern of distribution may be highly conserved during the differentiation of basic Brassica species and their allotetraploid counterparts. Our data indicated that the allotetraploidization process resulted in a significant reduction in SSR loci in the three subgenomes AA, BB and CC. The reasons may be partial gene dominated chromosomal homologous recombination and rearrangement during the evolution of basic diploid species into allotetraploids. This study provides a basis for future genomics and genetic research on the relatedness of Brassica species.
Collapse
Affiliation(s)
- Nairan Sun
- Group of Vegetable Breeding, Hainan Institute of Zhejiang University, Sanya, China
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Jisuan Chen
- Department of Supply Chain, Ningbo Haitong Food Technology Co., Ltd., Ningbo, China
| | - Yuqi Wang
- Group of Vegetable Breeding, Hainan Institute of Zhejiang University, Sanya, China
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Iqbal Hussain
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Na Lei
- Section of Horticulture and Landscape Architecture, Harbin Academy of Agricultural Sciences, Harbin, China
| | - Xinyan Ma
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Weiqiang Li
- Group of Vegetable Breeding, Hainan Institute of Zhejiang University, Sanya, China
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Kaiwen Liu
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Hongrui Yu
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Kun Zhao
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Tong Zhao
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Yi Zhang
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| | - Xiaolin Yu
- Group of Vegetable Breeding, Hainan Institute of Zhejiang University, Sanya, China
- Department of Horticulture, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China
- Zhejiang Provincial Key Laboratory of Horticultural Plant Integrative Biology, Hangzhou, China
| |
Collapse
|
2
|
Pei D, Song S, Kang J, Zhang C, Wang J, Dong T, Ge M, Pervaiz T, Zhang P, Fang J. Characterization of Simple Sequence Repeat (SSR) Markers Mined in Whole Grape Genomes. Genes (Basel) 2023; 14:genes14030663. [PMID: 36980935 PMCID: PMC10048371 DOI: 10.3390/genes14030663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 02/25/2023] [Accepted: 03/01/2023] [Indexed: 03/09/2023] Open
Abstract
SSR (simple sequence repeat) DNA markers are widely used for genotype DNA identification, QTL mapping, and analyzing genetic biodiversity. However, SSRs in grapes are still in their early stages, with a few primer pairs accessible. With the whole-genome sequencing (WGS) of several grape varieties, characterization of grape SSR changed to be necessary not only to genomics but to also help SSR development and utility. Based on this, we identified the whole-genome SSR of nine grape cultivars (‘PN40024’, ‘Cabernet Sauvignon’, ‘Carménère’, ‘Chardonnay’, ‘Merlot’, ‘Riesling’, ‘Zinfandel’, ‘Shine Muscat’, and ‘Muscat Hamburg’) with whole-genome sequences released publicly and found that there are great differences in the distribution of SSR loci in different varieties. According to the difference in genome size, the number of SSRs ranged from 267,385 (Cabernet Sauvignon) to 627,429 (Carménère), the density of the SSR locus in the genome of nine cultivars was generally 1 per Kb. SSR motif distribution characteristic analysis of these grape cultivars showed that the distribution patterns among grape cultivars were conservative, mainly enriched in A/T. However, there are some differences in motif types (especially tetranucleotides, pentanucleotides, and hexanucleotides), quantity, total length, and average length in different varieties, which might be related to the size of the assembled genome or the specificity of variety domestication. The distribution characteristics of SSRs were revealed by whole-genome analysis of simple repeats of grape varieties. In this study, 32 pairs of primers with lower polymorphism have been screened, which provided an important research foundation for the development of molecular markers of grape variety identification and the construction of linkage maps of important agronomic traits for crop improvement.
Collapse
Affiliation(s)
- Dan Pei
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Siyan Song
- Zhenjiang Institute of Agricultural Sciences in Hilly Area of Jiangsu Province, Zhenjiang 212400, China
| | - Jun Kang
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Chuan Zhang
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Jing Wang
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Tianyu Dong
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Mengqing Ge
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Tariq Pervaiz
- Department of Botany and Plant Sciences, University of California Riverside, Riverside, CA 22963, USA
| | - Peian Zhang
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
| | - Jinggui Fang
- Fruit Crop Genetic Improvement and Seedling Propagation Engineering Center of Jiangsu Province, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
- Correspondence:
| |
Collapse
|
3
|
Optimized Method for the Identification of Candidate Genes and Molecular Maker Development Related to Drought Tolerance in Oil Palm (Elaeis guineensis Jacq.). PLANTS 2022; 11:plants11172317. [PMID: 36079700 PMCID: PMC9460821 DOI: 10.3390/plants11172317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 09/01/2022] [Accepted: 09/02/2022] [Indexed: 11/17/2022]
Abstract
Drought is a major constraint in oil palm (Elaeis guineensis Jacq.) production. As oil palm breeding takes a long time, molecular markers of genes related to drought tolerance characteristics were developed for effective selection. Two methods of gene identification associated with drought, differential display reverse transcription polymerase chain reaction (DDRT-PCR) and pyrosequencing platform, were conducted before developing the EST-SSR marker. By DDRT-PCR, fourteen out of twenty-four primer combinations yielded the polymorphism in leaf as 77.66% and root as 96.09%, respectively. BLASTN and BLASTX revealed nucleotides from 8 out of 236 different banding similarities to genes associated with drought stress. Five out of eight genes gave a similarity with our pyrosequencing sequencing database. Furthermore, pyrosequencing analysis of two oil palm libraries, drought-tolerant, and drought sensitive, found 117 proteins associated with drought tolerance. Thirteen out of sixty EST-SSR primers could be distinguished in 119 oil palm parents in our breeding program. All of our found genes revealed an ability to develop as a molecular marker for drought tolerance. However, the function of the validated genes on drought response in oil palm must be evaluated.
Collapse
|
4
|
High genetic variability of Nosema ceranae populations in Apis mellifera from East Asia compared to central Asia and the Americas. Biol Invasions 2022. [DOI: 10.1007/s10530-022-02835-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
5
|
DNA-Based Tools to Certify Authenticity of Rice Varieties—An Overview. Foods 2022; 11:foods11030258. [PMID: 35159410 PMCID: PMC8834242 DOI: 10.3390/foods11030258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 01/04/2022] [Accepted: 01/12/2022] [Indexed: 02/07/2023] Open
Abstract
Rice (Oryza sativa L.) is one of the most cultivated and consumed crops worldwide. It is mainly produced in Asia but, due to its large genetic pool, it has expanded to several ecosystems, latitudes and climatic conditions. Europe is a rice producing region, especially in the Mediterranean countries, that grow mostly typical japonica varieties. The European consumer interest in rice has increased over the last decades towards more exotic types, often more expensive (e.g., aromatic rice) and Europe is a net importer of this commodity. This has increased food fraud opportunities in the rice supply chain, which may deliver mixtures with lower quality rice, a problem that is now global. The development of tools to clearly identify undesirable mixtures thus became urgent. Among the various tools available, DNA-based markers are considered particularly reliable and stable for discrimination of rice varieties. This review covers aspects ranging from rice diversity and fraud issues to the DNA-based methods used to distinguish varieties and detect unwanted mixtures. Although not exhaustive, the review covers the diversity of strategies and ongoing improvements already tested, highlighting important advantages and disadvantages in terms of costs, reliability, labor-effort and potential scalability for routine fraud detection.
Collapse
|
6
|
Malhotra EV, Jain R, Bansal S, Mali SC, Sharma N, Agrawal A. Development of a new set of genic SSR markers in the genus Gentiana: in silico mining, characterization and validation. 3 Biotech 2021; 11:430. [PMID: 34527507 DOI: 10.1007/s13205-021-02969-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 08/19/2021] [Indexed: 11/26/2022] Open
Abstract
Gentiana is an important genus of around 360 medicinally important species, majority of which are not well characterized. Despite its importance, very few genomic resources are available for Gentiana L. Till date, the number of informative and robust simple sequence repeat (SSR)-based markers is limited and very few efforts have been made for their development. A set of robust, freely accessible and informative SSR markers for Gentiana is a pre-requisite for any molecular systematic as well as improvement studies in this group of pharmacologically valuable plants. In view of the importance of these plants, Expressed Sequence Tag (EST) sequences of 18 Gentiana species were surveyed for the development of a large set of non-redundant SSR markers. A total of 5808 perfect SSR with an average length of 17 bp and relative abundance of 214 loci/Mb were identified in the analysed 47,487 EST sequences using Krait software. Mapping of the ESTs resulted in gene ontology annotations of 49.14% of the sequences. Based on these perfect SSRs, 2902 primer pairs were designed, and 60 markers were randomly selected and validated on a set of Gentiana kurroo Royle accessions. Among the screened markers, 39 (65%) were found to be cross-species transferable. This is the first report of the largest set of functional, novel genic SSR markers in Gentiana, which will be a valuable resource for future characterization, genotype identification, conservation and genomic studies in the various species of this group of important medicinal plants. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s13205-021-02969-4.
Collapse
Affiliation(s)
- Era Vaidya Malhotra
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Rishu Jain
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Sangita Bansal
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Suresh Chand Mali
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Neelam Sharma
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Anuradha Agrawal
- Tissue Culture and Cryopreservation Unit, ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| |
Collapse
|
7
|
Kumar R, Kumar C, Paliwal R, Roy Choudhury D, Singh I, Kumar A, Kumari A, Singh R. Development of Novel Genomic Simple Sequence Repeat (g-SSR) Markers and Their Validation for Genetic Diversity Analyses in Kalmegh [ Andrographis paniculata (Burm. F.) Nees]. PLANTS (BASEL, SWITZERLAND) 2020; 9:E1734. [PMID: 33316874 PMCID: PMC7763852 DOI: 10.3390/plants9121734] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Revised: 11/05/2020] [Accepted: 11/17/2020] [Indexed: 11/16/2022]
Abstract
Kalmegh (Andrographis paniculata (Burm. F.) Nees) is one of the most important medicinal plants and has been widely explored as traditional medicine. To exploit its natural genetic diversity and initiations of molecular breeding to develop novel cultivars or varieties, developments of genomic resources are essential. Four microsatellite-enriched genomic libraries-(CT)14, (GT)12, (AG)15 and (AAC)8-were constructed using the genomic DNA of A. paniculata. Initially, 183 recombinant colonies were screened for the presence of CT, GT, AG, and AAC microsatellite repeats, out of which 47 clones found positive for the desired simple sequence repeats (SSRs). It was found that few colonies had more than one desirable SSR. Thus, a sum of 67 SSRs were designed and synthesized for their validation among 42 A. paniculata accessions. Out of the 67 SSRs used for genotyping, only 41 were found to be polymorphic. The developed set of g-SSR markers showed substantial genetic variability among the selected A. paniculata accessions, with an average polymorphic information content (PIC) value of 0.32. Neighbor-joining tree analysis, population structure analysis, analysis of molecular variance (AMOVA), and principal coordinate analysis (PCoA) illustrated the considerable genetic diversity among them. The novel g-SSR markers developed in the present study could be important genomic resources for future applications in A. paniculata.
Collapse
Affiliation(s)
- Ramesh Kumar
- Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi 110012, India; (R.K.); (R.P.); (D.R.C.)
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida 201313, Uttar Pradesh, India;
| | - Chavlesh Kumar
- Division of Fruits and Horticultural Technology, ICAR-Indian Agricultural Research Institute, New Delhi 110012, India;
| | - Ritu Paliwal
- Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi 110012, India; (R.K.); (R.P.); (D.R.C.)
| | - Debjani Roy Choudhury
- Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi 110012, India; (R.K.); (R.P.); (D.R.C.)
| | - Isha Singh
- Division of Plant Physiology, ICAR-Indian Agricultural Research Institute, New Delhi 110012, India;
- School of Biomolecular and Biomedical Sciences, University College of Dublin, D04V1W8 Dublin, Ireland
| | - Ashok Kumar
- Division of Germplasm Evaluation, ICAR-National Bureau of Plant Genetic Resources, New Delhi 110012, India;
| | - Abha Kumari
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida 201313, Uttar Pradesh, India;
| | - Rakesh Singh
- Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi 110012, India; (R.K.); (R.P.); (D.R.C.)
| |
Collapse
|
8
|
Sahoo A, Kar B, Sahoo S, Jena S, Ray A, Padhiari BM, Nayak S. In silico mining of SSR markers from expressed sequence tags of Clematis chinensis. GENE REPORTS 2020. [DOI: 10.1016/j.genrep.2020.100810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
9
|
Transcriptome wide SSR discovery cross-taxa transferability and development of marker database for studying genetic diversity population structure of Lilium species. Sci Rep 2020; 10:18621. [PMID: 33122761 PMCID: PMC7596044 DOI: 10.1038/s41598-020-75553-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Accepted: 10/05/2020] [Indexed: 01/06/2023] Open
Abstract
Lily belongs to family liliaceae, which mainly propagates vegetatively. Therefore, sufficient number of polymorphic, informative, and functional molecular markers are essential for studying a wide range of genetic parameters in Lilium species. We attempted to develop, characterize and design SSR (simple sequence repeat) markers using online genetic resources for analyzing genetic diversity and population structure of Lilium species. We found di-nucleotide repeat motif were more frequent (4684) within 0.14 gb (giga bases) transcriptome than other repeats, of which was two times higher than tetra-repeat motifs. Frequency of di-(AG/CT), tri-(AGG/CTT), tetra-(AAAT), penta-(AGAGG), and hexa-(AGAGGG) repeats was 34.9%, 7.0%, 0.4%, 0.3%, and 0.2%, respectively. A total of 3607 non-redundant SSR primer pairs was designed based on the sequences of CDS, 5'-UTR and 3'-UTR region covering 34%, 14%, 23%, respectively. Among them, a sub set of primers (245 SSR) was validated using polymerase chain reaction (PCR) amplification, of which 167 primers gave expected PCR amplicon and 101 primers showed polymorphism. Each locus contained 2 to 12 alleles on average 0.82 PIC (polymorphic information content) value. A total of 87 lily accessions was subjected to genetic diversity analysis using polymorphic SSRs and found to separate into seven groups with 0.73 to 0.79 heterozygosity. Our data on large scale SSR based genetic diversity and population structure analysis may help to accelerate the breeding programs of lily through utilizing different genomes, understanding genetics and characterizing germplasm with efficient manner.
Collapse
|
10
|
Sharma H, Bhandawat A, Kumar P, Rahim MS, Parveen A, Kumar P, Madhawan A, Rishi V, Roy J. Development and characterization of bZIP transcription factor based SSRs in wheat. Gene 2020; 756:144912. [PMID: 32574760 DOI: 10.1016/j.gene.2020.144912] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 05/31/2020] [Accepted: 06/17/2020] [Indexed: 02/08/2023]
Abstract
Assessment of existing diversity is the key for germplasm conservation and crop improvement. Wheat (Triticum aestivum L.) is among the most important cereal crop and consumed by two billion world's populations. DNA-based markers are predominantly used for diversity characterization because they are easy to develop and not influenced by environment. Among them microsatellites (simple sequence repeats, SSRs) are most suitable due to their genome-wide distribution, hypervariability and reproducibility for their applications in diversity, genetic improvement, and molecular breeding. bZIP transcription factors play major roles in plants in light and stress signalling, seed development, and defence. A total of 846 SSRs were identified from 370 wheat cDNA sequences and a sub-set of 35 polymorphic TabZIPMS (TriticumaestivumbZIP MicroSatellites) was used for diversity and genetic structure analysis of 92 Indian wheat varieties and related species. 114 SSR variants ranging from 2 to 5 per SSR locus were detected for 35 SSRs in the varieties. Average polymorphic information content (PIC) and observed heterozygosity was found to be 0.135 and 0.838, respectively. Thirty-four SSRs showed cross-transferability into different related species. Combined Bayesian model and Jaccard's similarity based genetic clustering analysis revealed two clusters of 80 bread wheat varieties and one separate cluster of related species. In this study, a total 35 novel bZIP-derived SSRs were identified in a set 370 bZIP genes and shown high polymorphism and cross-species transferability in wheat. The findings provide resources for future utilization in genetic resource conservation, trait introgression, breeding and varietal development.
Collapse
Affiliation(s)
- Himanshu Sharma
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Abhishek Bhandawat
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Pankaj Kumar
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Mohammed Saba Rahim
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Afsana Parveen
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Prashant Kumar
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Akansha Madhawan
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Vikas Rishi
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India
| | - Joy Roy
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute, Knowledge City, Sector-81, Mohali, Punjab 140306, India.
| |
Collapse
|
11
|
Kumari R, Wankhede DP, Bajpai A, Maurya A, Prasad K, Gautam D, Rangan P, Latha M, John K. J, A. S, Bhat KV, Gaikwad AB. Genome wide identification and characterization of microsatellite markers in black pepper (Piper nigrum): A valuable resource for boosting genomics applications. PLoS One 2019; 14:e0226002. [PMID: 31834893 PMCID: PMC6910694 DOI: 10.1371/journal.pone.0226002] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2019] [Accepted: 11/18/2019] [Indexed: 11/23/2022] Open
Abstract
Black pepper is one of the most valued and widely used spices in the world and dominates multi-billion dollar global spices trade. India is amongst the major producers, consumers and exporters of black pepper. In spite of its commercial and cultural importance, black pepper has received meagre attention in terms of generation of genomic resources. Availability of markers distributed throughout the genome would facilitate and accelerate genetic studies, QTL identification, genetic enhancement and crop improvement in black pepper. In this perspective, the sequence information from the recently sequenced black pepper (Piper nigrum) genome has been used for identification and characterisation of Simple Sequence Repeats (SSRs). Total 69,126 SSRs were identified from assembled genomic sequence of P. nigrum. The SSR frequency was 158 per MB making it, one SSR for every 6.3 kb in the assembled genome. Among the different types of microsatellite repeat motifs, dinucleotides were the most abundant (48.6%), followed by trinucleotide (23.7%) and compound repeats (20.62%). A set of 85 SSRs were used for validation, of which 74 produced amplification products of expected size. Genetic diversity of 30 black pepper accessions using 50 SSRs revealed four distinct clusters. Further, the cross species transferability of the SSRs was checked in nine other Piper species. Out of 50 SSRs used, 19 and 31 SSRs were amplified in nine and seven species, respectively. Thus the identified SSRs may have application in other species of the genus Piper where genome sequence is not available yet. Present study reports the first NGS based genomic SSRs in black pepper and thus constitute a valuable resource for a whole fleet of applications in genetics and plant breeding studies such as genetic map construction, QTL identification, map-based gene cloning, marker-assisted selection and evolutionary studies in Piper nigrum and related species.
Collapse
Affiliation(s)
- Ratna Kumari
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | | | - Akansha Bajpai
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Avantika Maurya
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Kartikay Prasad
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Dikshant Gautam
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Parimalan Rangan
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - M. Latha
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Joseph John K.
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Suma A.
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Kangila V. Bhat
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Ambika B. Gaikwad
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
- * E-mail:
| |
Collapse
|
12
|
Ruan X, Wang Z, Wang T, Su Y. Characterization and Application of EST-SSR Markers Developed From the Transcriptome of Amentotaxus argotaenia (Taxaceae), a Relict Vulnerable Conifer. Front Genet 2019; 10:1014. [PMID: 31681436 PMCID: PMC6813739 DOI: 10.3389/fgene.2019.01014] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 09/24/2019] [Indexed: 11/13/2022] Open
Abstract
Amentotaxus argotaenia (Taxaceae) is a vulnerable coniferous species with preference for shade and moist environment. Accurate estimation of genetic variation is crucial for its conservation, especially in the context of global warming. In this study, we acquired a transcriptome from A. argotaenia leaves using Illumina sequencing and de novo assembled 62,896 unigenes, of which 5510 EST-SSRs were detected. Twenty-two polymorphic EST-SSRs were successfully developed and further used to investigate genetic variation, linkage disequilibrium, and bottleneck signatures of A. argotaenia. The results showed that A. argotaenia had moderate genetic variation and high genetic differentiation, which may provide raw material to protect against climatic changes and accelerate local adaptation, respectively. No bottlenecks were found to occur in A. argotaenia. Our study not only showed that these EST markers are very effective in population genetic analysis but also lay a solid foundation for further investigating adaptive evolution and conservation strategies of A. argotaenia.
Collapse
Affiliation(s)
- Xiaoxian Ruan
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Zhen Wang
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Ting Wang
- College of Life Sciences, South China Agricultural University, Guangzhou, China
| | - Yingjuan Su
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
- Research Institute of Sun Yat-sen University, Shenzhen, China
| |
Collapse
|
13
|
De novo transcriptome of Gymnema sylvestre identified putative lncRNA and genes regulating terpenoid biosynthesis pathway. Sci Rep 2019; 9:14876. [PMID: 31619732 PMCID: PMC6795813 DOI: 10.1038/s41598-019-51355-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 09/16/2019] [Indexed: 01/06/2023] Open
Abstract
Gymnema sylvestre is a highly valuable medicinal plant in traditional Indian system of medicine and used in many polyherbal formulations especially in treating diabetes. However, the lack of genomic resources has impeded its research at molecular level. The present study investigated functional gene profile of G. sylvestre via RNA sequencing technology. The de novo assembly of 88.9 million high quality reads yielded 23,126 unigenes, of which 18116 were annotated against databases such as NCBI nr database, gene ontology (GO), KEGG, Pfam, CDD, PlantTFcat, UniProt & GreeNC. Total 808 unigenes mapped to 78 different Transcription Factor families, whereas 39 unigenes assigned to CYP450 and 111 unigenes coding for enzymes involved in the biosynthesis of terpenoids including transcripts for synthesis of important compounds like Vitamin E, beta-amyrin and squalene. Among them, presence of six important enzyme coding transcripts were validated using qRT-PCR, which showed high expression of enzymes involved in methyl-erythritol phosphate (MEP) pathway. This study also revealed 1428 simple sequence repeats (SSRs), which may aid in molecular breeding studies. Besides this, 8 putative long non-coding RNAs (lncRNAs) were predicted from un-annotated sequences, which may hold key role in regulation of essential biological processes in G. sylvestre. The study provides an opportunity for future functional genomic studies and to uncover functions of the lncRNAs in G. sylvestre.
Collapse
|
14
|
Transcriptome analysis and codominant markers development in caper, a drought tolerant orphan crop with medicinal value. Sci Rep 2019; 9:10411. [PMID: 31320697 PMCID: PMC6639398 DOI: 10.1038/s41598-019-46613-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Accepted: 06/27/2019] [Indexed: 01/08/2023] Open
Abstract
Caper (Capparis spinosa L.) is a xerophytic shrub cultivated for its flower buds and fruits, used as food and for their medicinal properties. Breeding programs and even proper taxonomic classification of the genus Capparis has been hampered so far by the lack of reliable genetic information and molecular markers. Here, we present the first genomic resource for C. spinosa, generated by transcriptomic approach and de novo assembly. The sequencing effort produced nearly 80 million clean reads assembled into 124,723 unitranscripts. Careful annotation and comparison with public databases revealed homologs to genes with a key role in important metabolic pathways linked to abiotic stress tolerance and bio-compounds production, such purine, thiamine and phenylpropanoid biosynthesis, α-linolenic acid and lipid metabolism. Additionally, a panel of genes involved in stomatal development/distribution and encoding for Stress Associated Proteins (SAPs) was also identified. We also used the transcriptomic data to uncover novel molecular markers for caper. Out of 50 SSRs tested, 14 proved polymorphic and represent the first set of SSR markers for the genus Capparis. This transcriptome will be an important contribution to future studies and breeding programs for this orphan crop, aiding to the development of improved varieties to sustain agriculture in arid conditions.
Collapse
|
15
|
Zhang Z, Xie W, Zhao Y, Zhang J, Wang N, Ntakirutimana F, Yan J, Wang Y. EST-SSR marker development based on RNA-sequencing of E. sibiricus and its application for phylogenetic relationships analysis of seventeen Elymus species. BMC PLANT BIOLOGY 2019; 19:235. [PMID: 31159732 PMCID: PMC6547490 DOI: 10.1186/s12870-019-1825-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Accepted: 05/09/2019] [Indexed: 05/20/2023]
Abstract
BACKGROUND Elymus L. is the largest genus in the tribe Triticeae Dumort., encompassing approximately 150 polyploid perennial species widely distributed in the temperate regions of the world. It is considered to be an important gene pool for improving cereal crops. However, a shortage of molecular marker limits the efficiency and accuracy of genetic breeding for Elymus species. High-throughput transcriptome sequencing data is essential for gene discovery and molecular marker development. RESULTS We obtained the transcriptome dataset of E. sibiricus, the type species of the genus Elymus, and identified a total of 8871 putative EST-SSRs from 6685 unigenes. Trinucleotides were the dominant repeat motif (4760, 53.66%), followed by dinucleotides (1993, 22.47%) and mononucleotides (1876, 21.15%). The most dominant trinucleotide repeat motif was CCG/CGG (1119, 23.5%). Sequencing of PCR products showed that the sequenced alleles from different Elymus species were homologous to the original SSR locus from which the primer was designed. Different types of tri-repeats as abundant SSR motifs were observed in repeat regions. Two hundred EST-SSR primer pairs were designed and selected to amplify ten DNA samples of Elymus species. Eighty-seven pairs of primer (43.5%) generated clear and reproducible bands with expected size, and showed good transferability across different Elymus species. Finally, thirty primer pairs successfully amplified ninety-five accessions of seventeen Elymus species, and detected significant amounts of polymorphism. In general, hexaploid Elymus species with genomes StStHHYY had a relatively higher level of genetic diversity (H = 0.219, I = 0.330, %P = 63.7), while tetraploid Elymus species with genomes StStYY had low level of genetic diversity (H = 0.182, I = 0.272, %P = 50.4) in the study. The cluster analysis showed that all ninety-five accessions were clustered into three major clusters. The accessions were grouped mainly according to their genomic components and origins. CONCLUSIONS This study demonstrated that transcriptome sequencing is a fast and cost-effective approach to molecular marker development. These EST-SSR markers developed in this study are valuable tools for genetic diversity, evolutionary, and molecular breeding in E. sibiricus, and other Elymus species.
Collapse
Affiliation(s)
- Zongyu Zhang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Wengang Xie
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Yongqiang Zhao
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Junchao Zhang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Na Wang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Fabrice Ntakirutimana
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| | - Jiajun Yan
- Sichuan Academy of Grassland Science, Chengdu, Sichuan 611731 People’s Republic of China
| | - Yanrong Wang
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Engineering Research Center of Grassland Industry, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020 People’s Republic of China
| |
Collapse
|
16
|
Shamanskiy VA, Timonina VN, Popadin KY, Gunbin KV. ImtRDB: a database and software for mitochondrial imperfect interspersed repeats annotation. BMC Genomics 2019; 20:295. [PMID: 31284879 PMCID: PMC6614062 DOI: 10.1186/s12864-019-5536-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Mitochondria is a powerhouse of all eukaryotic cells that have its own circular DNA (mtDNA) encoding various RNAs and proteins. Somatic perturbations of mtDNA are accumulating with age thus it is of great importance to uncover the main sources of mtDNA instability. Recent analyses demonstrated that somatic mtDNA deletions depend on imperfect repeats of various nature between distant mtDNA segments. However, till now there are no comprehensive databases annotating all types of imperfect repeats in numerous species with sequenced complete mitochondrial genome as well as there are no algorithms capable to call all types of imperfect repeats in circular mtDNA. RESULTS We implemented naïve algorithm of pattern recognition by analogy to standard dot-plot construction procedures allowing us to find both perfect and imperfect repeats of four main types: direct, inverted, mirror and complementary. Our algorithm is adapted to specific characteristics of mtDNA such as circularity and an excess of short repeats - it calls imperfect repeats starting from the length of 10 b.p. We constructed interactive web available database ImtRDB depositing perfect and imperfect repeats positions in mtDNAs of more than 3500 Vertebrate species. Additional tools, such as visualization of repeats within a genome, comparison of repeat densities among different genomes and a possibility to download all results make this database useful for many biologists. Our first analyses of the database demonstrated that mtDNA imperfect repeats (i) are usually short; (ii) associated with unfolded DNA structures; (iii) four types of repeats positively correlate with each other forming two equivalent pairs: direct and mirror versus inverted and complementary, with identical nucleotide content and similar distribution between species; (iv) abundance of repeats is negatively associated with GC content; (v) dinucleotides GC versus CG are overrepresented on light chain of mtDNA covered by repeats. CONCLUSIONS ImtRDB is available at http://bioinfodbs.kantiana.ru/ImtRDB/ . It is accompanied by the software calling all types of interspersed repeats with different level of degeneracy in circular DNA. This database and software can become a very useful tool in various areas of mitochondrial and chloroplast DNA research.
Collapse
Affiliation(s)
- Viktor A Shamanskiy
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia
| | - Valeria N Timonina
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia
| | - Konstantin Yu Popadin
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Konstantin V Gunbin
- Center for Mitochondrial Functional Genomics, School of Life Science, Immanuel Kant Baltic Federal University, Kaliningrad, Russia. .,Center of Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia.
| |
Collapse
|
17
|
Cai K, Zhu L, Zhang K, Li L, Zhao Z, Zeng W, Lin X. Development and Characterization of EST-SSR Markers From RNA-Seq Data in Phyllostachys violascens. FRONTIERS IN PLANT SCIENCE 2019; 10:50. [PMID: 30774640 PMCID: PMC6367221 DOI: 10.3389/fpls.2019.00050] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Accepted: 01/15/2019] [Indexed: 05/02/2023]
Abstract
Bamboo are woody grass species containing important economic and ecological values. Lei bamboo (Phyllostachys violascens) is a kind of shoot-producing bamboo species with the highest economic yield per unit area. However, identifying different varieties of Lei bamboo based on morphological characteristics is difficult. Microsatellites play an important role in plant identification and genetic diversity analysis and are superior to other molecular markers. In this study, we identified 18,356 expressed sequence tag-simple sequence repeat (EST-SSR) loci in Lei bamboo transcriptome data. A total of 11,264 primer pairs were successfully designed from unigenes of all EST-SSR loci, and 96 primer pairs were randomly selected and synthesized. A total of 54 primer pairs were used for classifying 16 Lei bamboo varieties and 10 different Phyllostachys species. The number of polymorphism alleles among the 54 primer pairs ranged from 3 to 12 for P. violascens varieties and 3 to 20 for Phyllostachys. The phylogenetic tree based on polymorphism alleles successfully distinguished 16 P. violascens varieties and 10 Phyllostachys species. Our study provides abundant EST-SSR resources that are useful for genetic diversity analysis and molecular verification of bamboo and suggests that SSR markers developed from Lei bamboo are more efficient and reliable than ISSR, SRAP or AFLP markers.
Collapse
Affiliation(s)
- Kai Cai
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- Zhejiang Provincial Collaborative Innovation Center for Bamboo Resources and High-Efficiency Utilization, Zhejiang A & F University, Lin’an, China
| | - Longfei Zhu
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- Department of Genome Biology, Adam Mickiewicz University, Poznań, Poland
| | - Keke Zhang
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
| | - Ling Li
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
| | - Zhongyu Zhao
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
| | - Wei Zeng
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
| | - Xinchun Lin
- Sino-Australia Plant Cell Wall Research Centre, State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Lin’an, China
- Zhejiang Provincial Collaborative Innovation Center for Bamboo Resources and High-Efficiency Utilization, Zhejiang A & F University, Lin’an, China
- *Correspondence: Xinchun Lin,
| |
Collapse
|
18
|
Durigan M, Cardoso-Silva CB, Ciampi-Guillardi M, Toledo-Silva G, Mori GM, Franco RMB, Souza AP. Molecular genotyping, diversity studies and high-resolution molecular markers unveiled by microsatellites in Giardia duodenalis. PLoS Negl Trop Dis 2018; 12:e0006928. [PMID: 30500829 PMCID: PMC6291164 DOI: 10.1371/journal.pntd.0006928] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Revised: 12/12/2018] [Accepted: 10/16/2018] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Giardia duodenalis (synonyms G. lamblia and G. intestinalis) is an enteric protozoan parasite of a wide range of mammalian hosts, including humans and various domestic and wild animals. There is considerable genetic variability in G. duodenalis and isolates of this parasite have been divided into eight genetic assemblages. Microsatellites markers can be used to discriminate isolates with a high level of sensitivity. This study was conducted to identify and characterize genomic microsatellites (simple sequence repeats-SSRs), sequences of one- to six-nucleotide motifs repeated in tandem, present in the available genomes of G. duodenalis and to develop new markers that can serve as a tool for detection and for characterizing the genetic diversity of this parasite. METHODOLOGY/ PRINCIPAL FINDINGS For each genetic assemblage, polymorphism levels for the microsatellite markers were evaluated. After performing the analysis using the MISA and SciRoKo software, 1,853 simple sequence repeats (SSRs) were identified. In all the genomes, trinucleotide repeats were the most common class followed by tetranucleotide. Many of the SSR loci are assemblage-specific, and 36 SSR loci shared among all the genomes were identified. Together with hypothetical proteins, variant-specific surface proteins represented nearly half of the annotated SSR loci. The results regarding the most common repeat among the SSRs led us to infer that positive selection occurred to avoid frameshift mutations. Additionally, based on inter- and intra-genetic assemblages polymorphism analyses, we unveiled previously undetected genetic variation, indicating that the microsatellite markers we developed are useful molecular tools for epidemiological inferences based on population genetics patterns and processes. CONCLUSIONS There is increasing demand for the development of new molecular markers and for the characterization of pathogens at a higher resolution level. In this study, we present 60 G. duodenalis microsatellites markers that exhibited high polymerase chain reaction (PCR) amplification efficiency among the different genetic assemblages. Twenty of these markers presented nucleotide sequence polymorphisms and may be used as a genotyping tool. The monomorphic markers can be used for the detection of the parasite at the species and genetic assemblage level. These polymorphic markers revealed a genetic diversity that was previously undetectable, thus they can be considered valuable molecular tools for high resolution markers in future studies investigating Giardia and may also be used for epidemiological inferences based on populations genetics patterns and processes.
Collapse
Affiliation(s)
- Maurício Durigan
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Claudio Benício Cardoso-Silva
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Maísa Ciampi-Guillardi
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Departamento de Fitopatologia–ESALQ–Universidade de São Paulo, Piracicaba, São Paulo, Brazil
| | - Guilherme Toledo-Silva
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Gustavo M. Mori
- Instituto de Biociências, Campus do Litoral Paulista, Universidade Estadual Paulista (Unesp), São Vicente, Sao Paulo, Brazil
| | - Regina M. B. Franco
- Departamento de Biologia Animal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| | - Anete P. Souza
- Centro de Biologia Molecular e Engenharia Genética (CBMEG), Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, São Paulo, Brazil
| |
Collapse
|
19
|
Purru S, Sahu S, Rai S, Rao AR, Bhat KV. GinMicrosatDb: a genome-wide microsatellite markers database for sesame ( Sesamum indicum L.). PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2018; 24:929-937. [PMID: 30150867 PMCID: PMC6103941 DOI: 10.1007/s12298-018-0558-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Revised: 03/08/2018] [Accepted: 05/21/2018] [Indexed: 06/08/2023]
Abstract
Molecular breeding in sesame is still at infancy due to limited number of microsatellite markers available and the low level of polymorphism exhibited by them. Therefore, whole genome sequencing was used for development of microsatellite markers so as to ensure availability of substantial number of polymorphic markers for use in marker assisted breeding programs. Whole genome sequencing of sesame variety 'Swetha' was done using Illumina paired-end sequencing and Roche 454 shotgun sequencing technologies (GCA_000975565.1 in GenBank). 'GinMicrosatDb', a genome-wide microsatellite marker database has been developed using the whole genome sequence data of sesame variety 'Swetha'. The database consists of microsatellites localized on both linkage groups and scaffolds with their genomic co-ordinates. It provides five sets of forward and reverse primers for each of the microsatellite loci along with the flanking sequences, primer GC content, product size and melting temperature etc. The distribution of microsatellites can be viewed and selected through a genome browser as well as through a physical map. The newly identified microsatellite markers are expected to help sesame breeders in developing marker tags for traits of economic importance thereby bringing about greater efficiency in marker-assisted selection programs.
Collapse
Affiliation(s)
- Supriya Purru
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sarika Sahu
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Saurabh Rai
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - A. R. Rao
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - K. V. Bhat
- Division of Genomic Resources, ICAR-National Bureau of Plant Genetic Resources, New Delhi, 110012 India
| |
Collapse
|
20
|
Hussain T, Plunkett B, Ejaz M, Espley RV, Kayser O. Identification of Putative Precursor Genes for the Biosynthesis of Cannabinoid-Like Compound in Radula marginata. FRONTIERS IN PLANT SCIENCE 2018; 9:537. [PMID: 29868043 PMCID: PMC5954354 DOI: 10.3389/fpls.2018.00537] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Accepted: 04/06/2018] [Indexed: 05/06/2023]
Abstract
The liverwort Radula marginata belongs to the bryophyte division of land plants and is a prospective alternate source of cannabinoid-like compounds. However, mechanistic insights into the molecular pathways directing the synthesis of these cannabinoid-like compounds have been hindered due to the lack of genetic information. This prompted us to do deep sequencing, de novo assembly and annotation of R. marginata transcriptome, which resulted in the identification and validation of the genes for cannabinoid biosynthetic pathway. In total, we have identified 11,421 putative genes encoding 1,554 enzymes from 145 biosynthetic pathways. Interestingly, we have identified all the upstream genes of the central precursor of cannabinoid biosynthesis, cannabigerolic acid (CBGA), including its two first intermediates, stilbene acid (SA) and geranyl diphosphate (GPP). Expression of all these genes was validated using quantitative real-time PCR. We have characterized the protein structure of stilbene synthase (STS), which is considered as a homolog of olivetolic acid in R. marginata. Moreover, the metabolomics approach enabled us to identify CBGA-analogous compounds using electrospray ionization mass spectrometry (ESI-MS/MS) and gas chromatography mass spectrometry (GC-MS). Transcriptomic analysis revealed 1085 transcription factors (TF) from 39 families. Comparative analysis showed that six TF families have been uniquely predicted in R. marginata. In addition, the bioinformatics analysis predicted a large number of simple sequence repeats (SSRs) and non-coding RNAs (ncRNAs). Our results collectively provide mechanistic insights into the putative precursor genes for the biosynthesis of cannabinoid-like compounds and a novel transcriptomic resource for R. marginata. The large-scale transcriptomic resource generated in this study would further serve as a reference transcriptome to explore the Radulaceae family.
Collapse
Affiliation(s)
- Tajammul Hussain
- Department of Technical Biochemistry, TU Dortmund University, Dortmund, Germany
- *Correspondence: Tajammul Hussain
| | - Blue Plunkett
- The New Zealand Institute for Plant & Food Research Limited (PFR), Auckland, New Zealand
| | - Mahwish Ejaz
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Richard V. Espley
- The New Zealand Institute for Plant & Food Research Limited (PFR), Auckland, New Zealand
| | - Oliver Kayser
- Department of Technical Biochemistry, TU Dortmund University, Dortmund, Germany
- Oliver Kayser
| |
Collapse
|
21
|
Tong GX, Xu W, Zhang YQ, Zhang QY, Yin JS, Kuang YY. De novo assembly and characterization of the Hucho taimen transcriptome. Ecol Evol 2017; 8:1271-1285. [PMID: 29375797 PMCID: PMC5773338 DOI: 10.1002/ece3.3735] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Revised: 11/22/2017] [Accepted: 11/27/2017] [Indexed: 11/10/2022] Open
Abstract
Taimen (Hucho taimen) is an important ecological and economic species that is classified as vulnerable by the IUCN Red List of Threatened Species; however, limited genomic information is available on this species. RNA‐Seq is a useful tool for obtaining genetic information and developing genetic markers for nonmodel species in addition to its application in gene expression profiling. In this study, we performed a comprehensive RNA‐Seq analysis of taimen. We obtained 157 M clean reads (14.7 Gb) and used them to de novo assemble a high‐quality transcriptome with a N50 size of 1,060 bp. In the assembly, 82% of the transcripts were annotated using several databases, and 14,666 of the transcripts contained a full open reading frame. The assembly covered 75% of the transcripts of Atlantic salmon and 57.3% of the protein‐coding genes of rainbow trout. To learn about the genome evolution, we performed a systematic comparative analysis across 11 teleosts including eight salmonids and found 313 unique gene families in taimen. Using Atlantic salmon and rainbow trout transcriptomes as the background, we identified 250 positive selection transcripts. The pathway enrichment analysis revealed a unique characteristic of taimen: It possesses more immune‐related genes than Atlantic salmon and rainbow trout; moreover, some genes have undergone strong positive selection. We also developed a pipeline for identifying microsatellite marker genotypes in samples and successfully identified 24 polymorphic microsatellite markers for taimen. These data and tools are useful for studying conservation genetics, phylogenetics, evolution among salmonids, and selective breeding for threatened taimen.
Collapse
Affiliation(s)
- Guang-Xiang Tong
- Heilongjiang River Fisheries Research Institute of Chinese Academy of Fishery Sciences Daoli District, Harbin Heilongjiang Province China
| | - Wei Xu
- Heilongjiang River Fisheries Research Institute of Chinese Academy of Fishery Sciences Daoli District, Harbin Heilongjiang Province China
| | - Yong-Quan Zhang
- Heilongjiang River Fisheries Research Institute of Chinese Academy of Fishery Sciences Daoli District, Harbin Heilongjiang Province China
| | - Qing-Yu Zhang
- Heilongjiang River Fisheries Research Institute of Chinese Academy of Fishery Sciences Daoli District, Harbin Heilongjiang Province China
| | - Jia-Sheng Yin
- Heilongjiang River Fisheries Research Institute of Chinese Academy of Fishery Sciences Daoli District, Harbin Heilongjiang Province China
| | - You-Yi Kuang
- Heilongjiang River Fisheries Research Institute of Chinese Academy of Fishery Sciences Daoli District, Harbin Heilongjiang Province China
| |
Collapse
|
22
|
Genetic relationships among Iranian and exotic safflower using microsatellite markers. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/s12892-017-0001-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
23
|
Wang J, Chen Z, Jin S, Hu Z, Huang Y, Diao Y. Development and characterization of simple sequence repeat (SSR) markers based on a full-length cDNA library of Napier Grass (Pennisetum purpureum Schum). Genes Genomics 2017. [DOI: 10.1007/s13258-017-0536-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
24
|
Chen H, Wang L, Liu X, Hu L, Wang S, Cheng X. De novo transcriptomic analysis of cowpea (Vigna unguiculata L. Walp.) for genic SSR marker development. BMC Genet 2017; 18:65. [PMID: 28693419 PMCID: PMC5504845 DOI: 10.1186/s12863-017-0531-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Accepted: 06/28/2017] [Indexed: 01/14/2023] Open
Abstract
Background Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Results Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. Conclusions This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species. Electronic supplementary material The online version of this article (doi:10.1186/s12863-017-0531-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Honglin Chen
- The National Key Facility for Crop Gene, Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Lixia Wang
- The National Key Facility for Crop Gene, Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiaoyan Liu
- The National Key Facility for Crop Gene, Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Liangliang Hu
- The National Key Facility for Crop Gene, Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Suhua Wang
- The National Key Facility for Crop Gene, Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xuzhen Cheng
- The National Key Facility for Crop Gene, Resources and Genetic Improvement, Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| |
Collapse
|
25
|
Ponyared P, Ponsawat J, Tongsima S, Seresangtakul P, Akkasaeng C, Tantisuwichwong N. ESAP plus: a web-based server for EST-SSR marker development. BMC Genomics 2016; 17:1035. [PMID: 28155670 PMCID: PMC5260030 DOI: 10.1186/s12864-016-3328-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Background Simple sequence repeats (SSRs) have become widely used as molecular markers in plant genetic studies due to their abundance, high allelic variation at each locus and simplicity to analyze using conventional PCR amplification. To study plants with unknown genome sequence, SSR markers from Expressed Sequence Tags (ESTs), which can be obtained from the plant mRNA (converted to cDNA), must be utilized. With the advent of high-throughput sequencing technology, huge EST sequence data have been generated and are now accessible from many public databases. However, SSR marker identification from a large in-house or public EST collection requires a computational pipeline that makes use of several standard bioinformatic tools to design high quality EST-SSR primers. Some of these computational tools are not users friendly and must be tightly integrated with reference genomic databases. Results A web-based bioinformatic pipeline, called EST Analysis Pipeline Plus (ESAP Plus), was constructed for assisting researchers to develop SSR markers from a large EST collection. ESAP Plus incorporates several bioinformatic scripts and some useful standard software tools necessary for the four main procedures of EST-SSR marker development, namely 1) pre-processing, 2) clustering and assembly, 3) SSR mining and 4) SSR primer design. The proposed pipeline also provides two alternative steps for reducing EST redundancy and identifying SSR loci. Using public sugarcane ESTs, ESAP Plus automatically executed the aforementioned computational pipeline via a simple web user interface, which was implemented using standard PHP, HTML, CSS and Java scripts. With ESAP Plus, users can upload raw EST data and choose various filtering options and parameters to analyze each of the four main procedures through this web interface. All input EST data and their predicted SSR results will be stored in the ESAP Plus MySQL database. Users will be notified via e-mail when the automatic process is completed and they can download all the results through the web interface. Conclusions ESAP Plus is a comprehensive and convenient web-based bioinformatic tool for SSR marker development. ESAP Plus offers all necessary EST-SSR development processes with various adjustable options that users can easily use to identify SSR markers from a large EST collection. With familiar web interface, users can upload the raw EST using the data submission page and visualize/download the corresponding EST-SSR information from within ESAP Plus. ESAP Plus can handle considerably large EST datasets. This EST-SSR discovery tool can be accessed directly from: http://gbp.kku.ac.th/esap_plus/. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3328-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Piyarat Ponyared
- Department of Biology, Faculty of Science, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Jiradej Ponsawat
- Department of Computer Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Sissades Tongsima
- National Center for Genetic Engineering and Biotechnology (BIOTEC), Pathum Thani, 12120, Thailand
| | - Pusadee Seresangtakul
- Department of Computer Science, Faculty of Science, Khon Kaen University, Khon Kaen, 40002, Thailand
| | - Chutipong Akkasaeng
- Department of Plant Science and Agricultural Resources, Faculty of Agriculture, Khon Kaen University, Khon Kaen, 40002, Thailand
| | | |
Collapse
|
26
|
Transcriptome Analysis of the Tadpole Shrimp (Triops longicaudatus) by Illumina Paired-End Sequencing: Assembly, Annotation, and Marker Discovery. Genes (Basel) 2016; 7:genes7120114. [PMID: 27918468 PMCID: PMC5192490 DOI: 10.3390/genes7120114] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Revised: 11/19/2016] [Accepted: 11/24/2016] [Indexed: 11/17/2022] Open
Abstract
The tadpole shrimp (Triops longicaudatus) is an aquatic crustacean that helps control pest populations. It inhabits freshwater ponds and pools and has been described as a living fossil. T. longicaudatus was officially declared an endangered species South Korea in 2005; however, through subsequent protection and conservation management, it was removed from the endangered species list in 2012. The limited number of available genetic resources on T. longicaudatus makes it difficult to obtain valuable genetic information for marker-aided selection programs. In this study, whole-transcriptome sequencing of T. longicaudatus generated 39.74 GB of clean data and a total of 269,822 contigs using the Illumina HiSeq 2500 platform. After clustering, a total of 208,813 unigenes with an N50 length of 1089 bp were generated. A total of 95,105 unigenes were successfully annotated against Protostome (PANM), Unigene, Eukaryotic Orthologous Groups (KOG), Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases using BLASTX with a cut-off of 1E−5. A total of 57,731 unigenes were assigned to GO terms, and 7247 unigenes were mapped to 129 KEGG pathways. Furthermore, 1595 simple sequence repeats (SSRs) were detected from the unigenes with 1387 potential SSR markers. This is the first report of high-throughput transcriptome analysis of T. longicaudatus, and it provides valuable insights for genetic research and molecular-assisted breeding of this important species.
Collapse
|
27
|
Bedre R, Mangu VR, Srivastava S, Sanchez LE, Baisakh N. Transcriptome analysis of smooth cordgrass (Spartina alterniflora Loisel), a monocot halophyte, reveals candidate genes involved in its adaptation to salinity. BMC Genomics 2016; 17:657. [PMID: 27542721 PMCID: PMC4992267 DOI: 10.1186/s12864-016-3017-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 08/13/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Soil salinity affects growth and yield of crop plants. Plants respond to salinity by physiological and biochemical adjustments through a coordinated regulation and expression of a cascade of genes. Recently, halophytes have attracted attention of the biologists to understand their salt adaptation mechanisms. Spartina alterniflora (smooth cordgrass) is a Louisiana native monocot halophyte that can withstand salinity up to double the strength of sea water. To dissect the molecular mechanisms underlying its salinity adaptation, leaf and root transcriptome of S. alterniflora was sequenced using 454/GS-FLX. RESULTS Altogether, 770,690 high quality reads with an average length 324-bp were assembled de novo into 73,131 contigs (average 577-bp long) with 5.9X sequence coverage. Most unigenes (95 %) annotated to proteins with known functions, and had more than 90 % similarity to rice genes. About 28 % unigenes were considered specific to S. alterniflora. Digital expression profiles revealed significant enrichment (P < 0.01) of transporters, vacuolar proton pump members and transcription factors under salt stress, which suggested the role of ion homeostasis and transcriptional regulation in the salinity adaptation of this grass. Also, 10,805 SSRs markers from 9457 unigenes were generated and validated through genetic diversity analysis among 13 accessions of S. alterniflora. CONCLUSIONS The present study explores the transcriptome of S. alterniflora to understand the gene regulation under salt stress in halophytes. The sequenced transcriptome (control and salt-regulated) of S. alterniflora provides a platform for further gene finding studies in grasses. This study and our previously published studies suggested that S. alterniflora is a rich reservoir of salt tolerance genes that can be used to develop salt tolerant cereal crops, especially rice, a major food crop of global importance.
Collapse
Affiliation(s)
- Renesh Bedre
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA
| | - Venkata Ramanarao Mangu
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA
| | - Subodh Srivastava
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, 29634, USA
| | - Luis Eduardo Sanchez
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA
- Current address: Centro de Investigaciones Biotecnológicas del Ecuador (CIBE), Km 30.5 Via Perimetral, Guayaquil, Ecuador
| | - Niranjan Baisakh
- School of Plant, Environmental and Soil Sciences, Louisiana State University Agricultural Center, Baton Rouge, LA, 70803, USA.
| |
Collapse
|
28
|
De novo transcriptome sequencing facilitates genomic resource generation in Tinospora cordifolia. Funct Integr Genomics 2016; 16:581-91. [PMID: 27465295 DOI: 10.1007/s10142-016-0508-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Revised: 06/30/2016] [Accepted: 07/18/2016] [Indexed: 01/23/2023]
Abstract
Tinospora cordifolia is known for its medicinal properties owing to the presence of useful constituents such as terpenes, glycosides, steroids, alkaloids, and flavonoids belonging to secondary metabolism origin. However, there is little information available pertaining to critical genomic elements (ESTs, molecular markers) necessary for judicious exploitation of its germplasm. We employed 454 GS-FLX pyrosequencing of entire transcripts and altogether ∼25 K assembled transcripts or Expressed sequence tags (ESTs) were identified. As the interest in T. cordifolia is primarily due to its secondary metabolite constituents, the ESTs pertaining to terpenoids biosynthetic pathway were identified in the present study. Additionally, several ESTs were assigned to different transcription factor families. To validate our transcripts dataset, the novel EST-SSR markers were generated to assess the genetic diversity among germplasm of T. cordifolia. These EST-SSR markers were found to be polymorphic and the dendrogram based on dice similarity index revealed three distinct clustering of accessions. The present study demonstrates effectiveness in using both NEWBLER and MIRA sequence read assembler software for enriching transcript-dataset and thus enables better exploitation of EST resources for mining candidate genes and designing molecular markers.
Collapse
|
29
|
Zhou X, Dong Y, Zhao J, Huang L, Ren X, Chen Y, Huang S, Liao B, Lei Y, Yan L, Jiang H. Genomic survey sequencing for development and validation of single-locus SSR markers in peanut (Arachis hypogaea L.). BMC Genomics 2016; 17:420. [PMID: 27251557 PMCID: PMC4888616 DOI: 10.1186/s12864-016-2743-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 05/14/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Single-locus markers have many advantages compared with multi-locus markers in genetic and breeding studies because their alleles can be assigned to particular genomic loci in diversity analyses. However, there is little research on single-locus SSR markers in peanut. Through the de novo assembly of DNA sequencing reads of A. hypogaea, we developed single-locus SSR markers in a genomic survey for better application in genetic and breeding studies of peanut. RESULTS In this study, DNA libraries with four different insert sizes were used for sequencing with 150 bp paired-end reads. Approximately 237 gigabases of clean data containing 1,675,631,984 reads were obtained after filtering. These reads were assembled into 2,102,446 contigs with an N50 length of 1,782 bp, and the contigs were further assembled into 1,176,527 scaffolds with an N50 of 3,920 bp. The total length of the assembled scaffold sequences was 2.0 Gbp, and 134,652 single-locus SSRs were identified from 375,180 SSRs. Among these developed single-locus SSRs, trinucleotide motifs were the most abundant, followed by tetra-, di-, mono-, penta- and hexanucleotide motifs. The most common motif repeats for the various types of single-locus SSRs have a tendency to be A/T rich. A total of 1,790 developed in silico single-locus SSR markers were chosen and used in PCR experiments to confirm amplification patterns. Of them, 1,637 markers that produced single amplicons in twelve inbred lines were considered putative single-locus markers, and 290 (17.7 %) showed polymorphisms. A further F2 population study showed that the segregation ratios of the 97 developed SSR markers, which showed polymorphisms between the parents, were consistent with the Mendelian inheritance law for single loci (1:2:1). Finally, 89 markers were assigned to an A. hypogaea linkage map. A subset of 100 single-locus SSR markers was shown to be highly stable and universal in a collection of 96 peanut accessions. A neighbor-joining tree of this natural population showed that genotypes have obviously correlation with botanical varieties. CONCLUSIONS We have shown that the detection of single-locus SSR markers from a de novo genomic assembly of a combination of different-insert-size libraries is highly efficient. This is the first report of the development of genome-wide single-locus markers for A. hypogaea, and the markers developed in this study will be useful for gene tagging, sequence scaffold assignment, linkage map construction, diversity analysis, variety identification and association mapping in peanut.
Collapse
Affiliation(s)
- Xiaojing Zhou
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Yang Dong
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Jiaojiao Zhao
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Li Huang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Xiaoping Ren
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Yuning Chen
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Shunmou Huang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China.,Databridge Technologies Corporation, Wuhan, 430062, Hubei, China
| | - Boshou Liao
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Yong Lei
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Liying Yan
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China
| | - Huifang Jiang
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan, 430062, Hubei, China.
| |
Collapse
|
30
|
Cuadrado Á, Jouve N. Chromosomal detection of simple sequence repeats (SSRs) using nondenaturing FISH (ND-FISH). Chromosoma 2016; 119:495-503. [PMID: 20393739 DOI: 10.1007/s00412-010-0273-x] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Simple Sequence Repeats (SSRs) are known to be scattered and present in high number in eukaryotic genomes. We demonstrate that dye-labeled oligodeoxyribonucleotides with repeated mono-, di-, tri, or tetranucleotide motifs (15-20 nucleotides in length) have an unexpected ability to recognize SSR target sequences in non-denatured chromosomes. The results show that all these probes are able to invade chromosomes, independent of the size of the repeat motif, their nucleotide sequence, or their ability to form alternative B-DNA structures such as triplex DNA. This novel and remarkable property of binding SSR oligonucleotides to duplex DNA targets permitted the development of a non-denaturing fluorescence in situ hybridization method that quickly and efficiently detects SSR-enriched chromosome regions in mitotic, meiotic, and polytene chromosome spreads of different model organisms. These results have implications for genome analysis and for investigating the roles of SSRs in chromosome structure and function.
Collapse
Affiliation(s)
- Ángeles Cuadrado
- Department of Cell Biology and Genetics, University of Alcalá de Henares, 28871, Alcalá de Henares, Madrid, Spain.
| | | |
Collapse
|
31
|
Yi G, Zhao T, Yin Z, Tang P, Qiu Y, Yao X. Mining and characterizing EST-SSR markers from ESTs of Torreya grandis. BIOCHEM SYST ECOL 2016. [DOI: 10.1016/j.bse.2016.02.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
32
|
Zhao YM, Zhou T, Li ZH, Zhao GF. Characterization of Global Transcriptome Using Illumina Paired-End Sequencing and Development of EST-SSR Markers in Two Species of Gynostemma (Cucurbitaceae). Molecules 2015; 20:21214-31. [PMID: 26633323 PMCID: PMC6332360 DOI: 10.3390/molecules201219758] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Revised: 11/05/2015] [Accepted: 11/09/2015] [Indexed: 01/16/2023] Open
Abstract
Gynostemma pentaphyllum is an important medicinal herb of the Cucurbitaceae family, but limited genomic data have hindered genetic studies. In this study, transcriptomes of two closely-related Gynostemma species, Gynostemma cardiospermum and G. pentaphyllum, were sequenced using Illumina paired-end sequencing technology. A total of 71,607 nonredundant unigenes were assembled. Of these unigenes, 60.45% (43,288) were annotated based on sequence similarity search with known proteins. A total of 11,059 unigenes were identified in the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database. A total of 3891 simple sequence repeats (SSRs) were detected in 3526 nonredundant unigenes, 2596 primer pairs were designed and 360 of them were randomly selected for validation. Of these, 268 primer pairs yielded clear products among six G. pentaphyllum samples. Thirty polymorphic SSR markers were used to test polymorphism and transferability in Gynostemma. Finally, 15 SSR makers that amplified in all 12 Gynostemma species were used to assess genetic diversity. Our results generated a comprehensive sequence resource for Gynostemma research.
Collapse
Affiliation(s)
- Yue-Mei Zhao
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, 229 Taibai Bei Road, Xi'an 710069, China.
- College of Biopharmaceutical and Food Engineering, Shangluo University, Beixin Street, Shangluo 726000, China.
| | - Tao Zhou
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, 229 Taibai Bei Road, Xi'an 710069, China.
| | - Zhong-Hu Li
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, 229 Taibai Bei Road, Xi'an 710069, China.
| | - Gui-Fang Zhao
- Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, 229 Taibai Bei Road, Xi'an 710069, China.
| |
Collapse
|
33
|
Gupta S, Kumar T, Verma S, Bharadwaj C, Bhatia S. Development of gene-based markers for use in construction of the chickpea (Cicer arietinum L.) genetic linkage map and identification of QTLs associated with seed weight and plant height. Mol Biol Rep 2015; 42:1571-80. [PMID: 26446030 DOI: 10.1007/s11033-015-3925-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2015] [Accepted: 10/03/2015] [Indexed: 11/28/2022]
Abstract
Seed weight and plant height are important agronomic traits and contribute to seed yield. The objective of this study was to identify QTLs underlying these traits using an intra-specific mapping population of chickpea. A F11 population of 177 recombinant inbred lines derived from a cross between SBD377 (100-seed weight--48 g and plant height--53 cm) and BGD112 (100-seed weight--15 g and plant height--65 cm) was used. A total of 367 novel EST-derived functional markers were developed which included 187 EST-SSRs, 130 potential intron polymorphisms (PIPs) and 50 expressed sequence tag polymorphisms (ESTPs). Along with these, 590 previously published markers including 385 EST-based markers and 205 genomic SSRs were utilized. Of the 957 markers tested for analysis of parental polymorphism between the two parents of the mapping population, 135 (14.64%) were found to be polymorphic. Of these, 131 polymorphic markers could be mapped to the 8 linkage groups. The linkage map had a total length of 1140.54 cM with an average marker density of 8.7 cM. The map was further used for QTL identification using composite interval mapping method (CIM). Two QTLs each for seed weight, qSW-1 and qSW-2 (explaining 11.54 and 19.24% of phenotypic variance, respectively) and plant height, qPH-1 and qPH-2 (explaining 13.98 and 12.17% of phenotypic variance, respectively) were detected. The novel set of genic markers, the intra-specific linkage map and the QTLs identified in the present study will serve as valuable genomic resources in improving the chickpea seed yield using marker-assisted selection (MAS) strategies.
Collapse
Affiliation(s)
- Shefali Gupta
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, PO Box No. 10531, New Delhi, 110067, India
| | - Tapan Kumar
- Indian Agricultural Research Institute, Pusa, New Delhi, 110012, India
| | - Subodh Verma
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, PO Box No. 10531, New Delhi, 110067, India
| | | | - Sabhyata Bhatia
- National Institute of Plant Genome Research, Aruna Asaf Ali Marg, PO Box No. 10531, New Delhi, 110067, India.
| |
Collapse
|
34
|
Characterization and Development of EST-SSRs by Deep Transcriptome Sequencing in Chinese Cabbage (Brassica rapa L. ssp. pekinensis). Int J Genomics 2015; 2015:473028. [PMID: 26504770 PMCID: PMC4609433 DOI: 10.1155/2015/473028] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2015] [Accepted: 03/26/2015] [Indexed: 11/17/2022] Open
Abstract
Simple sequence repeats (SSRs) are among the most important markers for population analysis and have been widely used in plant genetic mapping and molecular breeding. Expressed sequence tag-SSR (EST-SSR) markers, located in the coding regions, are potentially more efficient for QTL mapping, gene targeting, and marker-assisted breeding. In this study, we investigated 51,694 nonredundant unigenes, assembled from clean reads from deep transcriptome sequencing with a Solexa/Illumina platform, for identification and development of EST-SSRs in Chinese cabbage. In total, 10,420 EST-SSRs with over 12 bp were identified and characterized, among which 2744 EST-SSRs are new and 2317 are known ones showing polymorphism with previously reported SSRs. A total of 7877 PCR primer pairs for 1561 EST-SSR loci were designed, and primer pairs for twenty-four EST-SSRs were selected for primer evaluation. In nineteen EST-SSR loci (79.2%), amplicons were successfully generated with high quality. Seventeen (89.5%) showed polymorphism in twenty-four cultivars of Chinese cabbage. The polymorphic alleles of each polymorphic locus were sequenced, and the results showed that most polymorphisms were due to variations of SSR repeat motifs. The EST-SSRs identified and characterized in this study have important implications for developing new tools for genetics and molecular breeding in Chinese cabbage.
Collapse
|
35
|
Burt C, Steed A, Gosman N, Lemmens M, Bird N, Ramirez-Gonzalez R, Holdgate S, Nicholson P. Mapping a Type 1 FHB resistance on chromosome 4AS of Triticum macha and deployment in combination with two Type 2 resistances. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2015; 128:1725-1738. [PMID: 26040404 PMCID: PMC4540761 DOI: 10.1007/s00122-015-2542-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 05/18/2015] [Indexed: 06/04/2023]
Abstract
Markers closely flanking a Type 1 FHB resistance have been produced and the potential of combining this with Type 2 resistances to improve control of FHB has been demonstrated. Two categories of resistance to Fusarium head blight (FHB) in wheat are generally recognised: resistance to initial infection (Type 1) and resistance to spread within the head (Type 2). While numerous sources of Type 2 resistance have been reported, relatively fewer Type 1 resistances have been characterised. Previous study identified a Type 1 FHB resistance (QFhs.jic-4AS) on chromosome 4A in Triticum macha. Little is known about the effect of combining Type 1 and Type 2 resistances on overall FHB symptoms or accumulation of the mycotoxin deoxynivalenol (DON). QFhs.jic-4AS was combined independently with two Type 2 FHB resistances (Fhb1 and one associated with the 1BL/1RS translocation). While combining Type 1 and Type 2 resistances generally reduced visual symptom development, the effect on DON accumulation was marginal. A lack of polymorphic markers and a limited number of recombinants had originally prevented accurate mapping of the QFhs.jic-4AS resistance. Using an array of recently produced markers in combination with new populations, the position of QFhs.jic-4AS has been determined to allow this resistance to be followed in breeding programmes.
Collapse
Affiliation(s)
- C. Burt
- />John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - A. Steed
- />John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - N. Gosman
- />John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | - M. Lemmens
- />IFA-Tulln, University of Natural Resources and Life Sciences, Konrad Lorenz Strasse 20, 3430 Tulln, Austria
| | - N. Bird
- />John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| | | | - S. Holdgate
- />RAGT, Grange Road, Ickleton, Essex, CB10 1TA UK
| | - P. Nicholson
- />John Innes Centre, Norwich Research Park, Norwich, NR4 7UH UK
| |
Collapse
|
36
|
Cai C, Yang Y, Cheng L, Tong C, Feng J. Development and assessment of EST-SSR marker for the genetic diversity among tobaccos (Nicotiana tabacum L.). RUSS J GENET+ 2015. [DOI: 10.1134/s1022795415020064] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
37
|
Nithin C, Patwa N, Thomas A, Bahadur RP, Basak J. Computational prediction of miRNAs and their targets in Phaseolus vulgaris using simple sequence repeat signatures. BMC PLANT BIOLOGY 2015; 15:140. [PMID: 26067253 PMCID: PMC4464996 DOI: 10.1186/s12870-015-0516-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 04/29/2015] [Indexed: 05/02/2023]
Abstract
BACKGROUND MicroRNAs (miRNAs) are endogenous, noncoding, short RNAs directly involved in regulating gene expression at the post-transcriptional level. In spite of immense importance, limited information of P. vulgaris miRNAs and their expression patterns prompted us to identify new miRNAs in P. vulgaris by computational methods. Besides conventional approaches, we have used the simple sequence repeat (SSR) signatures as one of the prediction parameter. Moreover, for all other parameters including normalized Shannon entropy, normalized base pairing index and normalized base-pair distance, instead of taking a fixed cut-off value, we have used 99% probability range derived from the available data. RESULTS We have identified 208 mature miRNAs in P. vulgaris belonging to 118 families, of which 201 are novel. 97 of the predicted miRNAs in P. vulgaris were validated with the sequencing data obtained from the small RNA sequencing of P. vulgaris. Randomly selected predicted miRNAs were also validated using qRT-PCR. A total of 1305 target sequences were identified for 130 predicted miRNAs. Using 80% sequence identity cut-off, proteins coded by 563 targets were identified. The computational method developed in this study was also validated by predicting 229 miRNAs of A. thaliana and 462 miRNAs of G. max, of which 213 for A. thaliana and 397 for G. max are existing in miRBase 20. CONCLUSIONS There is no universal SSR that is conserved among all precursors of Viridiplantae, but conserved SSR exists within a miRNA family and is used as a signature in our prediction method. Prediction of known miRNAs of A. thaliana and G. max validates the accuracy of our method. Our findings will contribute to the present knowledge of miRNAs and their targets in P. vulgaris. This computational method can be applied to any species of Viridiplantae for the successful prediction of miRNAs and their targets.
Collapse
Affiliation(s)
- Chandran Nithin
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India.
| | - Nisha Patwa
- Department of Biotechnology, Visva-Bharati, Santiniketan, 731235, India.
| | - Amal Thomas
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India.
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India.
| | - Jolly Basak
- Department of Biotechnology, Visva-Bharati, Santiniketan, 731235, India.
| |
Collapse
|
38
|
Transcriptome sequencing of mung bean (Vigna radiate L.) genes and the identification of EST-SSR markers. PLoS One 2015; 10:e0120273. [PMID: 25830701 PMCID: PMC4382333 DOI: 10.1371/journal.pone.0120273] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Accepted: 02/02/2015] [Indexed: 11/19/2022] Open
Abstract
Mung bean (Vigna radiate (L.) Wilczek) is an important traditional food legume crop, with high economic and nutritional value. It is widely grown in China and other Asian countries. Despite its importance, genomic information is currently unavailable for this crop plant species or some of its close relatives in the Vigna genus. In this study, more than 103 million high quality cDNA sequence reads were obtained from mung bean using Illumina paired-end sequencing technology. The processed reads were assembled into 48,693 unigenes with an average length of 874 bp. Of these unigenes, 25,820 (53.0%) and 23,235 (47.7%) showed significant similarity to proteins in the NCBI non-redundant protein and nucleotide sequence databases, respectively. Furthermore, 19,242 (39.5%) could be classified into gene ontology categories, 18,316 (37.6%) into Swiss-Prot categories and 10,918 (22.4%) into KOG database categories (E-value < 1.0E-5). A total of 6,585 (8.3%) were mapped onto 244 pathways using the Kyoto Encyclopedia of Genes and Genome (KEGG) pathway database. Among the unigenes, 10,053 sequences contained a unique simple sequence repeat (SSR), and 2,303 sequences contained more than one SSR together in the same expressed sequence tag (EST). A total of 13,134 EST-SSRs were identified as potential molecular markers, with mono-nucleotide A/T repeats being the most abundant motif class and G/C repeats being rare. In this SSR analysis, we found five main repeat motifs: AG/CT (30.8%), GAA/TTC (12.6%), AAAT/ATTT (6.8%), AAAAT/ATTTT (6.2%) and AAAAAT/ATTTTT (1.9%). A total of 200 SSR loci were randomly selected for validation by PCR amplification as EST-SSR markers. Of these, 66 marker primer pairs produced reproducible amplicons that were polymorphic among 31 mung bean accessions selected from diverse geographical locations. The large number of SSR-containing sequences found in this study will be valuable for the construction of a high-resolution genetic linkage maps, association or comparative mapping and genetic analyses of various Vigna species.
Collapse
|
39
|
ProGeRF: proteome and genome repeat finder utilizing a fast parallel hash function. BIOMED RESEARCH INTERNATIONAL 2015; 2015:394157. [PMID: 25811026 PMCID: PMC4355816 DOI: 10.1155/2015/394157] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2014] [Revised: 01/19/2015] [Accepted: 01/31/2015] [Indexed: 12/20/2022]
Abstract
Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population
biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing-exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be efficient, fast, and accurate and
primarily user-friendly web tool allowing many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity.
Collapse
|
40
|
Long Y, Wang Y, Wu S, Wang J, Tian X, Pei X. De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers. PLoS One 2015; 10:e0115805. [PMID: 25629164 PMCID: PMC4309406 DOI: 10.1371/journal.pone.0115805] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2014] [Accepted: 11/27/2014] [Indexed: 01/05/2023] Open
Abstract
Caragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no previous genomic information or EST-SSR marker has been reported in Caragana Fabr. genus. In this study, more than two billion bases of high-quality sequence of C. korshinskii were generated by using illumina sequencing technology and demonstrated the de novo assembly and annotation of genes without prior genome information. These reads were assembled into 86,265 unigenes (mean length = 709 bp). The similarity search indicated that 33,955 and 21,978 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 26,232 a unigenes were separately assigned to Gene Ontology (GO) database. When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 5,598 unigenes were assigned to 5 main categories including 32 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (2,862, 43.7%), suggesting the active metabolic processes in the desert tree. In addition, a total of 19,150 EST-SSRs were identified from 15,484 unigenes, and the characterizations of EST-SSRs were further compared with other four species in Fabraceae. 126 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among the 9 germplasms in Caranaga Fabr. genus, PCR success rate were 93.7% and the phylogenic tree was constructed based on the genotypic data. This research generated a substantial fraction of transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding.
Collapse
Affiliation(s)
- Yan Long
- Institute of Biotechnology, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yanyan Wang
- College of Plant science and technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Shanshan Wu
- College of Plant science and technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Jiao Wang
- Institute of Biotechnology, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xinjie Tian
- Institute of Biotechnology, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xinwu Pei
- Institute of Biotechnology, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- * E-mail:
| |
Collapse
|
41
|
Jung WY, Lee SS, Kim CW, Kim HS, Min SR, Moon JS, Kwon SY, Jeon JH, Cho HS. RNA-seq analysis and de novo transcriptome assembly of Jerusalem artichoke (Helianthus tuberosus Linne). PLoS One 2014; 9:e111982. [PMID: 25375764 PMCID: PMC4222968 DOI: 10.1371/journal.pone.0111982] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2014] [Accepted: 10/09/2014] [Indexed: 11/18/2022] Open
Abstract
Jerusalem artichoke (Helianthus tuberosus L.) has long been cultivated as a vegetable and as a source of fructans (inulin) for pharmaceutical applications in diabetes and obesity prevention. However, transcriptomic and genomic data for Jerusalem artichoke remain scarce. In this study, Illumina RNA sequencing (RNA-Seq) was performed on samples from Jerusalem artichoke leaves, roots, stems and two different tuber tissues (early and late tuber development). Data were used for de novo assembly and characterization of the transcriptome. In total 206,215,632 paired-end reads were generated. These were assembled into 66,322 loci with 272,548 transcripts. Loci were annotated by querying against the NCBI non-redundant, Phytozome and UniProt databases, and 40,215 loci were homologous to existing database sequences. Gene Ontology terms were assigned to 19,848 loci, 15,434 loci were matched to 25 Clusters of Eukaryotic Orthologous Groups classifications, and 11,844 loci were classified into 142 Kyoto Encyclopedia of Genes and Genomes pathways. The assembled loci also contained 10,778 potential simple sequence repeats. The newly assembled transcriptome was used to identify loci with tissue-specific differential expression patterns. In total, 670 loci exhibited tissue-specific expression, and a subset of these were confirmed using RT-PCR and qRT-PCR. Gene expression related to inulin biosynthesis in tuber tissue was also investigated. Exsiting genetic and genomic data for H. tuberosus are scarce. The sequence resources developed in this study will enable the analysis of thousands of transcripts and will thus accelerate marker-assisted breeding studies and studies of inulin biosynthesis in Jerusalem artichoke.
Collapse
Affiliation(s)
- Won Yong Jung
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea; Animal Material Engineering, Gyeongnam National University of Science and Technology, Jinju, Korea
| | - Sang Sook Lee
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Chul Wook Kim
- Animal Material Engineering, Gyeongnam National University of Science and Technology, Jinju, Korea
| | - Hyun-Soon Kim
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Sung Ran Min
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Jae Sun Moon
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Suk-Yoon Kwon
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Jae-Heung Jeon
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| | - Hye Sun Cho
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Korea
| |
Collapse
|
42
|
Jiang Q, Li Q, Yu H, Kong L. Genome-wide analysis of simple sequence repeats in marine animals-a comparative approach. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2014; 16:604-619. [PMID: 24939717 DOI: 10.1007/s10126-014-9580-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2014] [Accepted: 05/22/2014] [Indexed: 06/03/2023]
Abstract
Tandem simple sequence repeats (SSRs) are one of the most popular molecular markers in genetic analysis owing to their ubiquitous occurrence,high reproducibility, multiallelic nature, and codominant mode. High mutability makes SSRs play a role in genome evolution and correspondingly show different patterns. Comparative analysis of genomic SSRs in different taxonomic groups usually focuses on land species, while marine animals have been neglected. This study examined the abundance of genomic SSRs with repeated unit lengths of 1-6 bp in 30 marine animals including nine taxonomic groups and further compared with the land species. More than thousands of SSRs were discovered in every organism which provided a huge resource for the development of molecular markers. Thirty marine animals showed profound differences in SSR characteristics, but some group-specific trends were also found. Both similarities and differences of repeat patterns were discovered between the land and marine species. Two taxon-specific SSR types were discovered: the pentanucleotides motif AGAGG in Euteleostei and the hexanucleotide repeats of ATGTAC in Porifera and Echinodermata. Gene ontology (GO) enrichment analysis of two representative species (Amphimedon queenslandica for Porifera and Strongylocentrotus purpuratus for Echinodermata) revealed functional preference of the ATGTAC motif associated genes, and this might hint at evolutionary significance.
Collapse
Affiliation(s)
- Qun Jiang
- The Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, 266003, Qingdao, China
| | | | | | | |
Collapse
|
43
|
Mantello CC, Cardoso-Silva CB, da Silva CC, de Souza LM, Scaloppi Junior EJ, de Souza Gonçalves P, Vicentini R, de Souza AP. De novo assembly and transcriptome analysis of the rubber tree (Hevea brasiliensis) and SNP markers development for rubber biosynthesis pathways. PLoS One 2014; 9:e102665. [PMID: 25048025 PMCID: PMC4105465 DOI: 10.1371/journal.pone.0102665] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 06/22/2014] [Indexed: 01/26/2023] Open
Abstract
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
Collapse
Affiliation(s)
- Camila Campos Mantello
- Centro de Biologia Molecular e Engenharia Genética (CBMEG) - Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
- * E-mail: (APS); (CCM)
| | - Claudio Benicio Cardoso-Silva
- Centro de Biologia Molecular e Engenharia Genética (CBMEG) - Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
| | - Carla Cristina da Silva
- Centro de Biologia Molecular e Engenharia Genética (CBMEG) - Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
| | - Livia Moura de Souza
- Centro de Biologia Molecular e Engenharia Genética (CBMEG) - Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
| | | | | | - Renato Vicentini
- Centro de Biologia Molecular e Engenharia Genética (CBMEG) - Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
| | - Anete Pereira de Souza
- Centro de Biologia Molecular e Engenharia Genética (CBMEG) - Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, Campinas, São Paulo, Brazil
- * E-mail: (APS); (CCM)
| |
Collapse
|
44
|
Qian G, Ping J, Lu J, Zhang Z, Wang L, Xu D. Construction of Full-Length cDNA Library and Development of EST-Derived Simple Sequence Repeat (EST-SSR) Markers in Senecio scandens. Biochem Genet 2014; 52:494-508. [DOI: 10.1007/s10528-014-9662-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2013] [Accepted: 06/10/2014] [Indexed: 11/28/2022]
|
45
|
Asadi AA, Rashidi Monfared S. Characterization of EST-SSR markers in durum wheat EST library and functional analysis of SSR-containing EST fragments. Mol Genet Genomics 2014; 289:625-40. [PMID: 24652471 DOI: 10.1007/s00438-014-0839-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Accepted: 03/01/2014] [Indexed: 11/28/2022]
Abstract
The goal of this study is to identify characterization of expressed sequence tag (EST)-simple sequence repeats (SSR) markers from EST library of durum wheat and functional analysis of SSR-containing EST sequences for application in comparative genomics and breeding. 19,141 sequences were analyzed among which 18,937 ESTs were selected. Consistent with MISA results, 313 EST-SSRs were yielded. The final EST-SSRs were compared to the GenBank non-redundant database using BLASTX and classified based on these functions. Results indicated that the perfect EST-SSRs are the most frequent. The TTG/CTG imperfect EST-SSR had gamma-gliadin putative function that can be appropriate for durum wheat. Also, the mononucleotides and trinucleotides were the most frequent. Findings suggested that the identified EST-SSRs could be categorized into 83 types. Motifs TTG in trinucleotides and TC in dinucleotides had the highest frequency. TTG is the new motif in durum wheat identified in this study. We identified new EST-SSRs with more than trinucleotide and detected motifs that have potential to code amino acids. Arginine was the most frequent amino acid. Enzymes had the highest frequency among predicted functions. EST-SSRs have been identified in this study can be used for developing ESS-SSR-based detection tool for durum wheat in future studies and will be a useful resource for molecular breeding, genetics, genomics, and environmental stress studies. Motifs coding amino acids could be used as a new source of functional markers and biological study. In addition to, designed new PCR primer pairs are new resources for to identify useful alleles in transcription factors, storage proteins, and enzymes which incorporated them again into the cultivated material.
Collapse
Affiliation(s)
- Ali Akbar Asadi
- Plant Breeding and Biotechnology Department, Agriculture College, Tarbiat Modares University, Tehran, Iran,
| | | |
Collapse
|
46
|
Wang C, Liu X, Peng S, Xu Q, Yuan X, Feng Y, Yu H, Wang Y, Wei X. Development of novel microsatellite markers for the BBCC Oryza genome (Poaceae) using high-throughput sequencing technology. PLoS One 2014; 9:e91826. [PMID: 24632997 PMCID: PMC3954799 DOI: 10.1371/journal.pone.0091826] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 02/14/2014] [Indexed: 11/24/2022] Open
Abstract
Wild species of Oryza are extremely valuable sources of genetic material that can be used to broaden the genetic background of cultivated rice, and to increase its resistance to abiotic and biotic stresses. Until recently, there was no sequence information for the BBCC Oryza genome; therefore, no special markers had been developed for this genome type. The lack of suitable markers made it difficult to search for valuable genes in the BBCC genome. The aim of this study was to develop microsatellite markers for the BBCC genome. We obtained 13,991 SSR-containing sequences and designed 14,508 primer pairs. The most abundant was hexanuclelotide (31.39%), followed by trinucleotide (27.67%) and dinucleotide (19.04%). 600 markers were selected for validation in 23 accessions of Oryza species with the BBCC genome. A set of 495 markers produced clear amplified fragments of the expected sizes. The average number of alleles per locus (Na) was 2.5, ranging from 1 to 9. The genetic diversity per locus (He) ranged from 0 to 0.844 with a mean of 0.333. The mean polymorphism information content (PIC) was 0.290, and ranged from 0 to 0.825. Of the 495 markers, 12 were only found in the BB genome, 173 were unique to the CC genome, and 198 were also present in the AA genome. These microsatellite markers could be used to evaluate the phylogenetic relationships among different Oryza genomes, and to construct a genetic linkage map for locating and identifying valuable genes in the BBCC genome, and would also for marker-assisted breeding programs that included accessions with the AA genome, especially Oryza sativa.
Collapse
Affiliation(s)
- Caihong Wang
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | - Xiaojiao Liu
- College of Agricultural Sciences, Shanxi Agricultural University, Taigu, China
| | - Suotang Peng
- College of Agricultural Sciences, Shanxi Agricultural University, Taigu, China
| | - Qun Xu
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | - Xiaoping Yuan
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | - Yue Feng
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | - Hanyong Yu
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | - Yiping Wang
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| | - Xinghua Wei
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China
| |
Collapse
|
47
|
Kavakiotis I, Triantafyllidis A, Samaras P, Voulgaridis A, Karaiskou N, Konstantinidis E, Vlahavas I. Pattern discovery for microsatellite genome analysis. Comput Biol Med 2014; 46:71-8. [DOI: 10.1016/j.compbiomed.2014.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Revised: 12/06/2013] [Accepted: 01/07/2014] [Indexed: 10/25/2022]
|
48
|
Giordano A, Cogan NOI, Kaur S, Drayton M, Mouradov A, Panter S, Schrauf GE, Mason JG, Spangenberg GC. Gene discovery and molecular marker development, based on high-throughput transcript sequencing of Paspalum dilatatum Poir. PLoS One 2014; 9:e85050. [PMID: 24520314 PMCID: PMC3919698 DOI: 10.1371/journal.pone.0085050] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 11/21/2013] [Indexed: 12/23/2022] Open
Abstract
Background Paspalum dilatatum Poir. (common name dallisgrass) is a native grass species of South America, with special relevance to dairy and red meat production. P. dilatatum exhibits higher forage quality than other C4 forage grasses and is tolerant to frost and water stress. This species is predominantly cultivated in an apomictic monoculture, with an inherent high risk that biotic and abiotic stresses could potentially devastate productivity. Therefore, advanced breeding strategies that characterise and use available genetic diversity, or assess germplasm collections effectively are required to deliver advanced cultivars for production systems. However, there are limited genomic resources available for this forage grass species. Results Transcriptome sequencing using second-generation sequencing platforms has been employed using pooled RNA from different tissues (stems, roots, leaves and inflorescences) at the final reproductive stage of P. dilatatum cultivar Primo. A total of 324,695 sequence reads were obtained, corresponding to c. 102 Mbp. The sequences were assembled, generating 20,169 contigs of a combined length of 9,336,138 nucleotides. The contigs were BLAST analysed against the fully sequenced grass species of Oryza sativa subsp. japonica, Brachypodium distachyon, the closely related Sorghum bicolor and foxtail millet (Setaria italica) genomes as well as against the UniRef 90 protein database allowing a comprehensive gene ontology analysis to be performed. The contigs generated from the transcript sequencing were also analysed for the presence of simple sequence repeats (SSRs). A total of 2,339 SSR motifs were identified within 1,989 contigs and corresponding primer pairs were designed. Empirical validation of a cohort of 96 SSRs was performed, with 34% being polymorphic between sexual and apomictic biotypes. Conclusions The development of genetic and genomic resources for P. dilatatum will contribute to gene discovery and expression studies. Association of gene function with agronomic traits will significantly enable molecular breeding and advance germplasm enhancement.
Collapse
Affiliation(s)
- Andrea Giordano
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Molecular Plant Breeding Cooperative Research Centre, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
- La Trobe University, Bundoora, Victoria, Australia
| | - Noel O. I. Cogan
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Molecular Plant Breeding Cooperative Research Centre, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
| | - Sukhjiwan Kaur
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
| | - Michelle Drayton
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Molecular Plant Breeding Cooperative Research Centre, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
| | - Aidyn Mouradov
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Molecular Plant Breeding Cooperative Research Centre, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
| | - Stephen Panter
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Molecular Plant Breeding Cooperative Research Centre, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
| | - Gustavo E. Schrauf
- Facultad de Agronomia, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - John G. Mason
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
- La Trobe University, Bundoora, Victoria, Australia
| | - German C. Spangenberg
- Department of Environment and Primary Industries, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia
- Molecular Plant Breeding Cooperative Research Centre, Bundoora, Victoria, Australia
- Dairy Futures Cooperative Research Centre, Bundoora, Victoria, Australia
- La Trobe University, Bundoora, Victoria, Australia
- * E-mail:
| |
Collapse
|
49
|
Li X, Luo J, Yan T, Xiang L, Jin F, Qin D, Sun C, Xie M. Deep sequencing-based analysis of the Cymbidium ensifolium floral transcriptome. PLoS One 2013; 8:e85480. [PMID: 24392013 PMCID: PMC3877369 DOI: 10.1371/journal.pone.0085480] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Accepted: 11/27/2013] [Indexed: 12/13/2022] Open
Abstract
Cymbidium ensifolium is a Chinese Cymbidium with an elegant shape, beautiful appearance, and a fragrant aroma. C. ensifolium has a long history of cultivation in China and it has excellent commercial value as a potted plant and cut flower. The development of C. ensifolium genomic resources has been delayed because of its large genome size. Taking advantage of technical and cost improvement of RNA-Seq, we extracted total mRNA from flower buds and mature flowers and obtained a total of 9.52 Gb of filtered nucleotides comprising 98,819,349 filtered reads. The filtered reads were assembled into 101,423 isotigs, representing 51,696 genes. Of the 101,423 isotigs, 41,873 were putative homologs of annotated sequences in the public databases, of which 158 were associated with floral development and 119 were associated with flowering. The isotigs were categorized according to their putative functions. In total, 10,212 of the isotigs were assigned into 25 eukaryotic orthologous groups (KOGs), 41,690 into 58 gene ontology (GO) terms, and 9,830 into 126 Arabidopsis Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and 9,539 isotigs into 123 rice pathways. Comparison of the isotigs with those of the two related orchid species P. equestris and C. sinense showed that 17,906 isotigs are unique to C. ensifolium. In addition, a total of 7,936 SSRs and 16,676 putative SNPs were identified. To our knowledge, this transcriptome database is the first major genomic resource for C. ensifolium and the most comprehensive transcriptomic resource for genus Cymbidium. These sequences provide valuable information for understanding the molecular mechanisms of floral development and flowering. Sequences predicted to be unique to C. ensifolium would provide more insights into C. ensifolium gene diversity. The numerous SNPs and SSRs identified in the present study will contribute to marker development for C. ensifolium.
Collapse
Affiliation(s)
- Xiaobai Li
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, People’s Republic of China
| | - Jie Luo
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, People’s Republic of China
| | - Tianlian Yan
- Department of Gastroenterology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, People’s Republic of China
| | - Lin Xiang
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, People’s Republic of China
| | - Feng Jin
- College of Life Sciences, Hubei University, Wuhan, People's Republic of China
| | - Dehui Qin
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, People’s Republic of China
| | - Chongbo Sun
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, People’s Republic of China
| | - Ming Xie
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou, People’s Republic of China
| |
Collapse
|
50
|
Colasuonno P, Maria MA, Blanco A, Gadaleta A. Description of durum wheat linkage map and comparative sequence analysis of wheat mapped DArT markers with rice and Brachypodium genomes. BMC Genet 2013; 14:114. [PMID: 24304553 PMCID: PMC3866978 DOI: 10.1186/1471-2156-14-114] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Accepted: 11/25/2013] [Indexed: 01/29/2023] Open
Abstract
Background The importance of wheat to the world economy, together with progresses in high-throughput next-generation DNA sequencing, have accelerated initiatives of genetic research for wheat improvement. The availability of high density linkage maps is crucial to identify genotype-phenotype associations, but also for anchoring BAC contigs to genetic maps, a strategy followed for sequencing the wheat genome. Results Here we report a genetic linkage map in a durum wheat segregating population and the study of mapped DArT markers. The linkage map consists of 126 gSSR, 31 EST-SSR and 351 DArT markers distributed in 24 linkage groups for a total length of 1,272 cM. Through bioinformatic approaches we have analysed 327 DArT clones to reveal their redundancy, syntenic and functional aspects. The DNA sequences of 174 DArT markers were assembled into a non-redundant set of 60 marker clusters. This explained the generation of clusters in very small chromosome regions across genomes. Of these DArT markers, 61 showed highly significant (Expectation < E-10) BLAST similarity to gene sequences in public databases of model species such as Brachypodium and rice. Based on sequence alignments, the analysis revealed a mosaic gene conservation, with 54 and 72 genes present in rice and Brachypodium species, respectively. Conclusions In the present manuscript we provide a detailed DArT markers characterization and the basis for future efforts in durum wheat map comparing.
Collapse
Affiliation(s)
| | | | | | - Agata Gadaleta
- Department of Soil, Plant and Food Sciences, University of Bari "Aldo Moro", Via Amendola 165/A, Bari 70126, Italy.
| |
Collapse
|