1
|
Savadi S, Muralidhara BM, Venkataravanappa V, Adiga JD. Genome-wide survey and characterization of microsatellites in cashew and design of a web-based microsatellite database: CMDB. FRONTIERS IN PLANT SCIENCE 2023; 14:1242025. [PMID: 37670858 PMCID: PMC10475544 DOI: 10.3389/fpls.2023.1242025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 07/31/2023] [Indexed: 09/07/2023]
Abstract
The cashew is an edible tree nut crop having a wide range of food and industrial applications. Despite great economic importance, the genome-wide characterization of microsatellites [simple sequence repeats (SSRs)] in cashew is lacking. In this study, we carried out the first comprehensive genome-wide microsatellites/SSRs characterization in cashew and developed polymorphic markers and a web-based microsatellite database. A total of 54526 SSRs were discovered in the cashew genome, with a mean frequency of 153 SSRs/Mb. Among the mined genome-wide SSRs (2-6 bp size motifs), the dinucleotide repeat motifs were dominant (68.98%) followed by the trinucleotides (24.56%). The Class I type of SSRs (≥20 bp) were 45.10%, while Class II repeat motifs (≥12-<20 bp) were 54.89% of the total genomic SSRs discovered here. Further, the AT-rich SSRs occurred more frequently in the cashew genome (84%) compared to the GC-rich SSRs. The validation of the in silico-mined genome-wide SSRs by PCR screening in cashew genotypes resulted in the development of 59 polymorphic SSR markers, and the polymorphism information content (PIC) of the polymorphic SSR markers ranged from 0.19 to 0.84. Further, a web-based database, "Cashew Microsatellite Database (CMDB)," was constructed to provide access to the genome-wide SSRs mined in this study as well as transcriptome-based SSRs from our previous study to the research community through a user-friendly searchable interface. Besides, CMDB provides information on experimentally validated SSRs. CMDB permits the retrieval of SSR markers information with the customized search options. Altogether, the genome-wide SSRs characterization, the polymorphic markers and CMDB database developed in this study would serve as valuable marker resources for DNA fingerprinting, germplasm characterization, genetic studies, and molecular breeding in cashew and related Anacardium species.
Collapse
Affiliation(s)
- Siddanna Savadi
- ICAR- Directorate of Cashew Research (DCR), Puttur, Karnataka, India
| | - B. M. Muralidhara
- ICAR-Indian Institute of Horticultural Research (IIHR), CHES, Madikeri, Karnataka, India
| | - V. Venkataravanappa
- ICAR-Indian Institute of Horticultural Research (IIHR), CHES, Madikeri, Karnataka, India
| | - J. D. Adiga
- ICAR- Directorate of Cashew Research (DCR), Puttur, Karnataka, India
| |
Collapse
|
2
|
Microsatellite analysis and polymorphic marker development based on the full-length transcriptome of Camellia chekiangoleosa. Sci Rep 2022; 12:18906. [PMID: 36344600 PMCID: PMC9640616 DOI: 10.1038/s41598-022-23333-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 10/29/2022] [Indexed: 11/09/2022] Open
Abstract
Camellia chekiangoleosa is a popular variety of Oil-camellia that has high oil production and ornamental value. Microsatellite (SSR) markers are the preferred tool for the molecular marker-assisted breeding of C. chekiangoleosa. By focusing on the problems of the low development efficiency of polymorphic SSR markers and the lack of available functional markers in Oil-camellia, we identified 97,510 SSR loci based on the full-length transcriptome sequence of C. chekiangoleosa. An analysis of SSR characteristics showed that mononucleotide (51.29%) and dinucleotide (34.36%) SSRs were the main repeat types. The main SSR distribution areas based on proportion covered were ordered as follows: 5'UTR > 3'UTR > CDS. By comparing our data with those in databases such as GO and KEGG, we obtained functional annotations of unigene sequences containing SSR sites. The data showed that the amplification efficiency of the SSR primers was 51.72%, and the development efficiency of polymorphic SSR primers was 26.72%. Experiments verified that dinucleotide and pentanucleotide SSRs located in UTR regions could produce more polymorphic markers. An investigation into the genetic diversity of several C. chekiangoleosa populations also suggested that the developed SSR markers had higher levels of polymorphism. This study will provide a reference and high-quality markers for the large-scale development of functional SSR markers and genetic research in Oil-camellia.
Collapse
|
3
|
The Diversity of Melia azedarach L. from China Based on Transcriptome-Developed SSR Marker. FORESTS 2022. [DOI: 10.3390/f13071011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Melia azedarach L. is a native tree species that can be used in a comprehensive way and is widely distributed in all provinces south of the Yellow River in China. Genetic diversity analysis of different M. azedarach germplasm sources is an important basic work for the selection, evaluation, and genetic improvement of M. azedarach germplasm resources. In this study, 100 pairs of SSR primers were designed and synthesized based on M. azedarach transcriptome data, and 16 pairs of reliable SSR primers were finally selected. The developed primers were used to analyze the genetic diversity of M. azedarach from 15 sources in 10 provinces in East, Central, and South China. The results showed that the frequency of the M. azedarach transcriptome SSR loci was high, and the distribution density was high. There were 15 sources of M. azedarach genetic diversity at a moderate level, and genetic variation was mainly present within the sources. The present study further enriches the existing SSR marker database of the M. azedarach family and can provide a reference for genetic diversity analysis and molecularly assisted breeding of M. azedarach plants at the genomic level.
Collapse
|
4
|
Genic microsatellite marker characterization and development in little millet (Panicum sumatrense) using transcriptome sequencing. Sci Rep 2021; 11:20620. [PMID: 34663808 PMCID: PMC8523711 DOI: 10.1038/s41598-021-00100-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 09/29/2021] [Indexed: 11/08/2022] Open
Abstract
Little millet is a climate-resilient and high-nutrient value plant. The lack of molecular markers severely limits the adoption of modern genomic approaches in millet breeding studies. Here the transcriptome of three samples were sequenced. A total of 4443 genic-SSR motifs were identified in 30,220 unigene sequences. SSRs were found at a rate of 12.25 percent, with an average of one SSR locus per 10 kb. Among different repeat motifs, tri-nucleotide repeat (66.67) was the most abundant one, followed by di- (27.39P), and tetra- (3.83P) repeats. CDS contained fewer motifs with the majority of tri-nucleotides, while 3' and 5' UTR carry more motifs but have shorter repeats. Functional annotation of unigenes containing microsatellites, revealed that most of them were linked to metabolism, gene expression regulation, and response to environmental stresses. Fifty primers were randomly chosen and validated in five little millet and 20 minor millet genotypes; 48% showed polymorphism, with a high transferability (70%) rate. Identified microsatellites can be a noteworthy resource for future research into QTL-based breeding, genetic resource conservation, MAS selection, and evolutionary genetics.
Collapse
|
5
|
Development and Characterization of Simple Sequence Repeat Markers for, and Genetic Diversity Analysis of Liquidambar formosana. FORESTS 2020. [DOI: 10.3390/f11020203] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Liquidambar formosana (Hamamelidaceae) is a relatively fast-growing deciduous tree of high ornamental value that is indigenous to China. However, few molecular markers are available for the species or its close relatives; this has hindered genomic and genetic studies. Here, we develop a series of transferable expressed sequence tag-simple sequence repeats (EST-SSRs) for genomic analysis of L. formosana. We downloaded the sequence of the L. formosana transcriptome from the National Center of Biotechnology Information Database and identified SSR loci in the Unigene library. We found 3284 EST-SSRs by mining 34,491 assembled unigenes. We synthesized 100 random primer pairs for validation of eight L. formosana individuals; of the 100 pairs, 32 were polymorphic. We successfully transferred 12 EST-SSR markers across three related Liquidambar species; the markers exhibited excellent cross-species transferability and will facilitate genetic studies and breeding of Liquidambar. A total of 72 clones of three Liquidambar species were uniquely divided into three main clusters; principal coordinate analysis (PCoA) supported this division. Additionally, a set of 20 SSR markers that did not exhibit nonspecific amplification were used to genotype more than 53 L. formosana trees. The mean number of alleles (Na) was 5.75 and the average polymorphism information content (PIC) was 0.578, which was higher than that of the natural L. formosana population (0.390). In other words, the genetic diversity of the plus L. formosana population increased, but excellent phenotypic features were maintained. The primers will be valuable for genomic mapping, germplasm characterization, gene tagging, and further genetic studies. Analyses of genetic diversity in L. formosana will provide a basis for efficient application of genetic materials and rational management of L. formosana breeding programs.
Collapse
|
6
|
Genome-Wide SNP discovery and genomic characterization in avocado (Persea americana Mill.). Sci Rep 2019; 9:20137. [PMID: 31882769 PMCID: PMC6934854 DOI: 10.1038/s41598-019-56526-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 12/13/2019] [Indexed: 11/23/2022] Open
Abstract
Modern crop breeding is based on the use of genetically and phenotypically diverse plant material and, consequently, a proper understanding of population structure and genetic diversity is essential for the effective development of breeding programs. An example is avocado, a woody perennial fruit crop native to Mesoamerica with an increasing popularity worldwide. Despite its commercial success, there are important gaps in the molecular tools available to support on-going avocado breeding programs. In order to fill this gap, in this study, an avocado 'Hass' draft assembly was developed and used as reference to study 71 avocado accessions which represent the three traditionally recognized avocado horticultural races or subspecies (Mexican, Guatemalan and West Indian). An average of 5.72 M reads per individual and a total of 7,108 single nucleotide polymorphism (SNP) markers were produced for the 71 accessions analyzed. These molecular markers were used in a study of genetic diversity and population structure. The results broadly separate the accessions studied according to their botanical race in four main groups: Mexican, Guatemalan, West Indian and an additional group of Guatemalan × Mexican hybrids. The high number of SNP markers developed in this study will be a useful genomic resource for the avocado community.
Collapse
|
7
|
Ge Y, Dong X, Wu B, Wang N, Chen D, Chen H, Zou M, Xu Z, Tan L, Zhan R. Evolutionary analysis of six chloroplast genomes from three Persea americana ecological races: Insights into sequence divergences and phylogenetic relationships. PLoS One 2019; 14:e0221827. [PMID: 31532782 PMCID: PMC6750585 DOI: 10.1371/journal.pone.0221827] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2019] [Accepted: 08/15/2019] [Indexed: 11/19/2022] Open
Abstract
Chloroplasts significantly influence species phylogenies because of their maternal inheritance and the moderate evolutionary rate of their genomes. Avocado, which is a member of the family Lauraceae, has received considerable attention from botanists, likely because of its position as a basal angiosperm. However, there is relatively little avocado genomic information currently available. In this study, six complete avocado chloroplast genomes from three ecological races were assembled to examine the sequence diversity among the three avocado ecological races. A comparative genomic analysis revealed that 515 simple sequence repeat loci and 176 repeats belonging to four other types were polymorphic across the six chloroplast genomes. Three highly variable regions (trnC-GCA-petN, petN-psbM, and petA-psbJ) were identified as highly informative markers. A phylogenetic analysis based on 79 common protein-coding genes indicated that the six examined avocado accessions from three ecological races form a monophyletic clade. The other three genera belonging to the Persea group clustered to form a sister clade with a high bootstrap value. These chloroplast genomes provide important genetic information for future attempts at identifying avocado races and for the related biological research.
Collapse
Affiliation(s)
- Yu Ge
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Xiangshu Dong
- College of Agriculture, Yunnan University, Yunnan, China
| | - Bin Wu
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Nan Wang
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Di Chen
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Haihong Chen
- College of Agriculture, Guangxi Vocational and Technical College, Nanning, China
| | - Minghong Zou
- South Subtropical Crops Research Institute, Chinese Academy of Tropical Agricultural Sciences, Zhanjiang, China
| | - Zining Xu
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Lin Tan
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Rulin Zhan
- Haikou Experimental Station, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| |
Collapse
|
8
|
Transcriptome Profiling Provides Insight into the Genes in Carotenoid Biosynthesis during the Mesocarp and Seed Developmental Stages of Avocado ( Persea americana). Int J Mol Sci 2019; 20:ijms20174117. [PMID: 31450745 PMCID: PMC6747375 DOI: 10.3390/ijms20174117] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 08/20/2019] [Accepted: 08/21/2019] [Indexed: 11/17/2022] Open
Abstract
Avocado (Persea americana Mill.) is an economically important crop because of its high nutritional value. However, the absence of a sequenced avocado reference genome has hindered investigations of secondary metabolism. For next-generation high-throughput transcriptome sequencing, we obtained 365,615,152 and 348,623,402 clean reads as well as 109.13 and 104.10 Gb of sequencing data for avocado mesocarp and seed, respectively, during five developmental stages. High-quality reads were assembled into 100,837 unigenes with an average length of 847.40 bp (N50 = 1725 bp). Additionally, 16,903 differentially expressed genes (DEGs) were detected, 17 of which were related to carotenoid biosynthesis. The expression levels of most of these 17 DEGs were higher in the mesocarp than in the seed during five developmental stages. In this study, the avocado mesocarp and seed transcriptome were also sequenced using single-molecule long-read sequencing to acquired 25.79 and 17.67 Gb clean data, respectively. We identified 233,014 and 238,219 consensus isoforms in avocado mesocarp and seed, respectively. Furthermore, 104 and 59 isoforms were found to correspond to the putative 11 carotenoid biosynthetic-related genes in the avocado mesocarp and seed, respectively. The isoform numbers of 10 out of the putative 11 genes involved in the carotenoid biosynthetic pathway were higher in the mesocarp than those in the seed. Besides, alpha- and beta-carotene contents in the avocado mesocarp and seed during five developmental stages were also measured, and they were higher in the mesocarp than in the seed, which validated the results of transcriptome profiling. Gene expression changes and the associated variations in gene dosage could influence carotenoid biosynthesis. These results will help to further elucidate carotenoid biosynthesis in avocado.
Collapse
|