1
|
Li T, Cai S, Cai Z, Fu Y, Liu W, Zhu X, Lai C, Cui L, Pan W, Li Y. TriticeaeSSRdb: a comprehensive database of simple sequence repeats in Triticeae. FRONTIERS IN PLANT SCIENCE 2024; 15:1412953. [PMID: 38841284 PMCID: PMC11150838 DOI: 10.3389/fpls.2024.1412953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Accepted: 05/08/2024] [Indexed: 06/07/2024]
Abstract
Microsatellites, known as simple sequence repeats (SSRs), are short tandem repeats of 1 to 6 nucleotide motifs found in all genomes, particularly eukaryotes. They are widely used as co-dominant markers in genetic analyses and molecular breeding. Triticeae, a tribe of grasses, includes major cereal crops such as bread wheat, barley, and rye, as well as abundant forage and lawn grasses, playing a crucial role in global food production and agriculture. To enhance genetic work and expedite the improvement of Triticeae crops, we have developed TriticeaeSSRdb, an integrated and user-friendly database. It contains 3,891,705 SSRs from 21 species and offers browsing options based on genomic regions, chromosomes, motif types, and repeat motif sequences. Advanced search functions allow personalized searches based on chromosome location and length of SSR. Users can also explore the genes associated with SSRs, design customized primer pairs for PCR validation, and utilize practical tools for whole-genome browsing, sequence alignment, and in silico SSR prediction from local sequences. We continually update TriticeaeSSRdb with additional species and practical utilities. We anticipate that this database will greatly facilitate trait genetic analyses and enhance molecular breeding strategies for Triticeae crops. Researchers can freely access the database at http://triticeaessrdb.com/.
Collapse
Affiliation(s)
- Tingting Li
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production, Northwest A&F University, Yangling, Shaanxi, China
| | - Shaoshuai Cai
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Zhibo Cai
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production, Northwest A&F University, Yangling, Shaanxi, China
| | - Yi Fu
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Wenqiang Liu
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Xiangdong Zhu
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Chongde Lai
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
- The Public Instrument Platform of Jiangxi Agricultural University, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Licao Cui
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Wenqiu Pan
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production, Northwest A&F University, Yangling, Shaanxi, China
| | - Yihan Li
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| |
Collapse
|
2
|
Lu R, Hu K, Sun X, Chen M. Low-coverage whole genome sequencing of diverse Dioscorea bulbifera accessions for plastome resource development, polymorphic nuclear SSR identification, and phylogenetic analyses. FRONTIERS IN PLANT SCIENCE 2024; 15:1373297. [PMID: 38510439 PMCID: PMC10950973 DOI: 10.3389/fpls.2024.1373297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 02/22/2024] [Indexed: 03/22/2024]
Abstract
Dioscorea bulbifera (Dioscoreaceae), a versatile herbaceous climber native to Africa and Asia, holds significant nutritional and medicinal value. Despite extensive characterization and genetic variability analyses of African accessions, studies on the genetic variation of this species in China are limited. To address this gap, we conducted low-coverage whole genome sequencing on D. bulbifera accessions from diverse regions across mainland China and Taiwan island. Our initial investigation encompassed comprehensive comparative plastome analyses of these D. bulbifera accessions, and developing plastome resources (including plastome-derived repetitive sequences, SSRs, and divergent hotspots). We also explored polymorphic nuclear SSRs and elucidated the intraspecific phylogeny of these accessions. Comparative plastome analyses revealed that D. bulbifera plastomes exhibited a conserved quadripartite structure with minimal size variation mainly attributed to intergenic spacer regions, reinforcing prior observations of a high degree of conservation within a species. We identified 46 to 52 dispersed repeats and 151 to 163 plastome-derived SSRs, as well as highlighted eight key divergent hotspots in these D. bulbifera accessions. Furthermore, we developed 2731 high-quality candidate polymorphic nuclear SSRs for D. bulbifera. Intraspecific phylogenetic analysis revealed three distinct clades, where accessions from Southeast China formed a sister group to those from South China and Taiwan island, and collectively, these two clades formed a sister group to the remaining accessions, indicating potential regional genetic divergence. These findings not only contributed to the understanding of the genetic variation of D. bulbifera, but also offered valuable resources for future research, breeding efforts, and utilization of this economically important plant species.
Collapse
Affiliation(s)
- Ruisen Lu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
| | - Ke Hu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
| | - Xiaoqin Sun
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
- Jiangsu Provincial Science and Technology Resources Coordination Platform (Agricultural Germplasm Resources) Germplasm Resources Nursery of Medicinal Plants, Nanjing, China
| | - Min Chen
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
| |
Collapse
|
3
|
Gao Q, Tong W, Li F, Wang Y, Wu Q, Wan X, Xia E. TPIA2: an updated tea plant information archive for Camellia genomics. Nucleic Acids Res 2024; 52:D1661-D1667. [PMID: 37650644 PMCID: PMC10767884 DOI: 10.1093/nar/gkad701] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 08/08/2023] [Accepted: 08/14/2023] [Indexed: 09/01/2023] Open
Abstract
The genus Camellia consists of about 200 species, which include many economically important species widely used for making tea, ornamental flowers and edible oil. Here, we present an updated tea plant information archive for Camellia genomics (TPIA2; http://tpia.teaplants.cn) by integrating more novel large-scale genomic, transcriptomic, metabolic and genetic variation datasets as well as a variety of useful tools. Specifically, TPIA2 hosts all currently available and well assembled 10 Camellia genomes and their comprehensive annotations from three major sections of Camellia. A collection of 15 million SNPs and 950 950 small indels from large-scale genome resequencing of 350 diverse tea accessions were newly incorporated, followed by the implementation of a novel 'Variation' module to facilitate data retrieval and analysis of the functionally annotated variome. Moreover, 116 Camellia transcriptomes were newly assembled and added, leading to a significant extension of expression profiles of Camellia genes to 13 developmental stages and eight abiotic/biotic treatments. An updated 'Expression' function has also been implemented to provide a comprehensive gene expression atlas for Camellia. Two novel analytic tools (e.g. Gene ID Convert and Population Genetic Analysis) were specifically designed to facilitate the data exchange and population genomics in Camellia. Collectively, TPIA2 provides diverse updated valuable genomic resources and powerful functions, and will continue to be an important gateway for functional genomics and population genetic studies in Camellia.
Collapse
Affiliation(s)
- Qijuan Gao
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
- School of Computer and Artificial Intelligence, Hefei Normal University, Hefei 230061, China
| | - Wei Tong
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Fangdong Li
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
- School of Science, Anhui Agricultural University, Hefei 230036, China
| | - Yanli Wang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Qiong Wu
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
- Tea Research Institute, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Xiaochun Wan
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Enhua Xia
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| |
Collapse
|
4
|
Hu K, Chen M, Li P, Sun X, Lu R. Intraspecific phylogeny and genomic resources development for an important medical plant Dioscorea nipponica, based on low-coverage whole genome sequencing data. FRONTIERS IN PLANT SCIENCE 2023; 14:1320473. [PMID: 38148859 PMCID: PMC10749966 DOI: 10.3389/fpls.2023.1320473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 11/27/2023] [Indexed: 12/28/2023]
Abstract
Dioscorea nipponica Makino, a perennial twining herb with medicinal importance, has a disjunctive distribution in the Sino-Japanese Floristic Region. It has a long history in traditional Chinese medicine, with demonstrated efficacy against various health conditions. However, the limited genomic data and knowledge of genetic variation have hindered its comprehensive exploration, utilization and conservation. In this study, we undertook low-coverage whole genome sequencing of diverse D. nipponica accessions to develop both plastome (including whole plastome sequences, plastome-derived SSRs and plastome-divergent hotspots) and nuclear genomic resources (including polymorphic nuclear SSRs and single-copy nuclear genes), as well as elucidate the intraspecific phylogeny of this species. Our research revealed 639 plastome-derived SSRs and highlighted six key mutational hotspots (namely CDS ycf1, IGS trnL-rpl32, IGS trnE-trnT, IGS rps16-trnQ, Intron 1 of clpP, and Intron trnG) within these accessions. Besides, three IGS regions (i.e., ndhD-cssA, trnL-rpl32, trnD-trnY), and the intron rps16 were identified as potential markers for distinguishing D. nipponica from its closely related species. In parallel, we successfully developed 988 high-quality candidate polymorphic nuclear SSRs and identified 17 single-copy nuclear genes for D. nipponica, all of which empower us to conduct in-depth investigations into phylogenetics and population genetics of this species. Although our phylogenetic analyses, based on plastome sequences and single-copy nuclear genes revealed cytonuclear discordance within D. nipponica, both findings challenged the current subspecies classification. In summary, this study developed a wealth of genomic resources for D. nipponica and enhanced our understanding of the intraspecific phylogeny of this species, offering valuable insights that can be instrumental in the conservation and strategic utilization of this economically significant plant.
Collapse
Affiliation(s)
- Ke Hu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
| | - Min Chen
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
| | - Pan Li
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Xiaoqin Sun
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
- Jiangsu Provincial Science and Technology Resources Coordination Platform (Agricultural Germplasm Resources) Germplasm Resources Nursery of Medicinal Plants, Nanjing, China
| | - Ruisen Lu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
| |
Collapse
|
5
|
Hu K, Sun XQ, Chen M, Lu RS. Low-coverage whole genome sequencing of eleven species/subspecies in Dioscorea sect. Stenophora (Dioscoreaceae): comparative plastome analyses, molecular markers development and phylogenetic inference. FRONTIERS IN PLANT SCIENCE 2023; 14:1196176. [PMID: 37346115 PMCID: PMC10281252 DOI: 10.3389/fpls.2023.1196176] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 04/26/2023] [Indexed: 06/23/2023]
Abstract
Dioscorea sect. Stenophora (Dioscoreaceae) comprises about 30 species that are distributed in the temperate and subtropical regions of the Northern Hemisphere. Despite being evolutionarily "primitive" and medically valuable, genomic resources and molecular studies of this section are still scarce. Here, we conducted low-coverage whole genome sequencing of 11 Stenophora species/subspecies to retrieve their plastome information (whole plastome characteristics, plastome-divergent hotspots, plastome-derived SSRs, etc.) and polymorphic nuclear SSRs, as well as performed comparative plastome and phylogenetic analyses within this section. The plastomes of Stenophora species/subspecies ranged from 153,691 bp (D. zingiberensis) to 154,149 bp (D. biformifolia) in length, and they all contained the same 114 unique genes. All these plastomes were highly conserved in gene structure, gene order and GC content, although variations at the IR/SC borders contributed to the whole length differences among them. The number of plastome-derived SSRs among Stenophora species/subspecies varied from 74 (D. futschauensis) to 93 (D. zingiberensis), with A/T found to be the most frequent one. Seven highly variable regions and 12 polymorphic nuclear SSRs were identified in this section, thereby providing important information for further taxonomical, phylogenetic and population genetic studies. Phylogenomic analyses based on whole plastome sequences and 80 common protein coding genes strongly supported D. biformifolia and D. banzhuana constituted the successive sister species to the remaining sampled species, which could be furtherly divided into three clades. Overall, this study provided a new perspective for plastome evolution of Stenophora, and proved the role of plastome phylogenomic in improving the phylogenetic resolution in this section. These results also provided an important reference for the protection and utilization of this economically important section.
Collapse
Affiliation(s)
- Ke Hu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
- Jiangsu Provincial Science and Technology Resources Coordination Platform (Agricultural Germplasm Resources) Germplasm Resources Nursery of Medicinal Plants, Nanjing, China
| | - Xiao-Qin Sun
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
- Jiangsu Provincial Science and Technology Resources Coordination Platform (Agricultural Germplasm Resources) Germplasm Resources Nursery of Medicinal Plants, Nanjing, China
| | - Min Chen
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
- Jiangsu Provincial Science and Technology Resources Coordination Platform (Agricultural Germplasm Resources) Germplasm Resources Nursery of Medicinal Plants, Nanjing, China
| | - Rui-Sen Lu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
- Jiangsu Key Laboratory for the Research and Utilization of Plant Resources, Nanjing, China
- Jiangsu Provincial Science and Technology Resources Coordination Platform (Agricultural Germplasm Resources) Germplasm Resources Nursery of Medicinal Plants, Nanjing, China
| |
Collapse
|
6
|
Wang Y, Yang T, Wang X, Sun X, Liu H, Wang D, Wang H, Zhang G, Li Y, Wang X, Wei Z. Develop a preliminary core germplasm with the novel polymorphism EST-SSRs derived from three transcriptomes of colored calla lily ( Zantedeschia hybrida). FRONTIERS IN PLANT SCIENCE 2023; 14:1055881. [PMID: 36818854 PMCID: PMC9933510 DOI: 10.3389/fpls.2023.1055881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 01/06/2023] [Indexed: 06/18/2023]
Abstract
The development of high-throughput sequencing technology has made it possible to develop molecular markers such as EST-SSR from transcriptome sequences in non-model plants such as bulbous flowers. However, the EST-SSR markers that have been developed are weakly validated and low polymorphic due to the short read size and poor quality of the assembled sequences. This study therefore used the CandiSSR pipeline to identify 550 potential polymorphic SSR loci among 487 homologous unigenes based on the transcriptomic sequences of three varieties of colored calla lily, and 460 of these loci with appropriate flanking sequences were suitable for primer pairs design. A further validation with 200 randomly selected EST-SSRs demonstrated an increase of more than 30% and 100% in amplification validity and polymorphism, respectively, in comparison with our previous study. In addition, since most of the current varieties of colored calla lily are hybridized from a few species, which have low genetic diversity, we subsequently identified primary core germplasm for 160 colored calla lily accessions using the aforementioned 40 polymorphic EST-SSRs. It was concluded that the core germplasm containing 42 accessions derived from the M strategy incorporated into the software Power Core was the most representative of all 160 original germplasm, as evidenced by the preservation of 100% of the EST-SSR variation, with a higher level of genetic diversity and heterogeneity (Nei = 0.40, I = 0.66, PIC = 0.43). This study provides a practical example of polymorphism EST-SSR markers developed from multiple transcriptomes for non-model plants. A future breeding program for colored calla lily will also benefit from the core germplasm defined by those molecular markers.
Collapse
Affiliation(s)
- Yi Wang
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
- College of Horticulture, China Agricultural University, Beijing, China
| | - Tuo Yang
- College of Horticulture, China Agricultural University, Beijing, China
| | - Xue Wang
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
- Hebei Key Laboratory of Horticultural Germplasm Excavation and Innovative Utilization, College of Horticultural Science & Technology, Hebei Normal University of Science & Technology, Qinhuangdao, China
| | - Xuan Sun
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
- Hebei Key Laboratory of Horticultural Germplasm Excavation and Innovative Utilization, College of Horticultural Science & Technology, Hebei Normal University of Science & Technology, Qinhuangdao, China
| | - Hongyan Liu
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
- Hebei Key Laboratory of Horticultural Germplasm Excavation and Innovative Utilization, College of Horticultural Science & Technology, Hebei Normal University of Science & Technology, Qinhuangdao, China
| | - Di Wang
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
- Hebei Key Laboratory of Horticultural Germplasm Excavation and Innovative Utilization, College of Horticultural Science & Technology, Hebei Normal University of Science & Technology, Qinhuangdao, China
| | - Huanxiao Wang
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
- Hebei Key Laboratory of Horticultural Germplasm Excavation and Innovative Utilization, College of Horticultural Science & Technology, Hebei Normal University of Science & Technology, Qinhuangdao, China
| | - Guojun Zhang
- Hebei Key Laboratory of Horticultural Germplasm Excavation and Innovative Utilization, College of Horticultural Science & Technology, Hebei Normal University of Science & Technology, Qinhuangdao, China
| | - Yanbing Li
- Landscape Engineering Technology Research Center, Zhoukou Normal University, Zhoukou, China
| | - Xian Wang
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Zunzheng Wei
- Institute of Grassland, Flowers and Ecology, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| |
Collapse
|
7
|
Wang P, Bai J, Li X, Liu T, Yan Y, Yang Y, Li H. Phylogenetic relationship and comparative analysis of the main Bupleuri Radix species in China. PeerJ 2023; 11:e15157. [PMID: 37077311 PMCID: PMC10108860 DOI: 10.7717/peerj.15157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 03/10/2023] [Indexed: 04/21/2023] Open
Abstract
Background Bupleuri Radix (Chaihu) is a famous traditional Chinese medicine derived from Bupleurum, Apiaceae. The origin of cultivated Chaihu germplasm in China is unclear, which has led to unstable Chaihu quality. In this study, we reconstructed the phylogeny of the main Chaihu germplasm species in China and identified potential molecular markers to authenticate its origin. Methods Three Bupleurum species (eight individuals), B. bicaule, B. chinense, and B. scorzonerifolium, were selected for genome skimming. Published genomes from B. falcatum and B. marginatum var. stenophyllum were used for comparative analysis. Results Sequences of the complete plastid genomes were conserved with 113 identical genes ranging from 155,540 to 155,866 bp in length. Phylogenetic reconstruction based on complete plastid genomes resolved intrageneric relationships of the five Bupleurum species with high support. Conflicts between the plastid and nuclear phylogenies were observed, which were mainly ascribed to introgressive hybridization. Comparative analysis showed that noncoding regions of the plastomes had most of the variable sequences. Eight regions (atpF-atpH, petN-psbM, rps16-psbK, petA-psbJ, ndhC-trnV/UAC and ycf1) had high divergence values in Bupleurum species and could be promising DNA barcodes for Chaihu authentication. A total of seven polymorphic cpSSRs and 438 polymorphic nSSRs were detected across the five Chaihu germplasms. Three photosynthesis-related genes were under positive selection, of which accD reflected the adaptation fingerprint of B. chinense to different ecological habitats. Our study provides valuable genetic information for phylogenetic investigation, germplasm authentication, and molecular breeding of Chaihu species.
Collapse
Affiliation(s)
- Ping Wang
- Xianyang Normal University, Xianyang, China
| | - Jiqing Bai
- Shaanxi University of Chinese Medicine, Xianyang, China
| | - Xue Li
- Xianyang Food and Drug Administration, Xianyang, China
| | | | - Yumeng Yan
- Xianyang Normal University, Xianyang, China
| | | | - Huaizhu Li
- Xianyang Normal University, Xianyang, China
| |
Collapse
|
8
|
Polymorphic Microsatellite Development, Genetic Diversity, Population Differentiation and Sexual State of Phytophthora capsici on Commercial Peppers in Three Provinces of Southwest China. Appl Environ Microbiol 2022; 88:e0161122. [PMID: 36354348 PMCID: PMC9746301 DOI: 10.1128/aem.01611-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Pepper blight, caused by the oomycete pathogen Phytophthora capsici (P. capsici), is one of the most destructive soilborne diseases worldwide. Between 2019 and 2020, 235 single spore isolates of P. capsici were collected from 36 commercial pepper planting areas in Sichuan, Chongqing, and Guizhou provinces in China. A novel full set of 323 high-quality polymorphic microsatellites was obtained by resequencing 10 isolates. In total, 163 isolates with two alleles per microsatellite locus were used for population analysis and resulted in 156 genotypes on 10 microsatellite loci. The genetic diversity, population differentiation, principal component, genetic structure, and genetic relationships analyses showed an extensive variety of the P. capsici in Sichuan and Guizhou with clonal lineages, two shared genotypes, and no geographic differentiation. The population from Chongqing was differentiated from that of Sichuan and Guizhou and had the highest genetic diversity. There was no significant distinction between the populations of the two sampling years, but there was a small differentiation between the populations from bell peppers and hot peppers. The isolates from Southwest China were largely distant from the two reference isolates from the USA. The analysis of molecular variance showed that the major variance of the populations was within populations. The linkage equilibrium test, mating type composition, and oospore detection indicated that only P. capsici from the Jiulongpo district of Chongqing had appeared in sexual recombination. Overall, this study revealed that the high and complex genetic diversity population of P. capsici in Sichuan, Chongqing, and Guizhou with uneven geographic variation and limited sexual reproductive behavior in Chongqing, potentially driven by differences in the geographical environment, reproductive patterns, different cultivars, and artificial long-distance transfers. IMPORTANCE Phytophthora capsici, a notorious soilborne and rapidly evolving pathogen with a wide range of hosts, is a huge threat to pepper production worldwide. However, the detailed genetic structure and dynamics of P. capsici in most Chinese provinces are still unclear, even though China is the world's largest producer and consumer of peppers. Here, a novel full set of high-quality polymorphic microsatellites, obtained by genome resequencing data of 10 isolates from Southwest China, was provided for future population analyses. In this study, we further investigated and established the genetic structure, sexual recombination, geographic subdivisions, interannual stability, differentiation in different types of host peppers, and member relationships of P. capsici from three provinces in Southwest China. These results reveal the genetic structure and dynamics of P. capsici in three provinces of Southwest China and help us to execute more effective management strategies in the future.
Collapse
|
9
|
Zhou QY, Cai HX, Liu ZH, Yuan LX, Yang L, Yang T, Li B, Li P. Development of genomic resources for Wenchengia alternifolia (Lamiaceae) based on genome skimming data. PLANT DIVERSITY 2022; 44:542-551. [PMID: 36540711 PMCID: PMC9751079 DOI: 10.1016/j.pld.2021.09.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Revised: 08/27/2021] [Accepted: 09/28/2021] [Indexed: 06/17/2023]
Abstract
Wenchengia alternifolia (Lamiaceae), the sole species of the genus Wenchengia is extremely rare and is currently listed as Critically Endangered (CR) on the IUCN Red List. The species had long been considered endemic to Hainan Island, China and was once believed to be extinct until a small remnant population was rediscovered at the type locality in 2010. Four more populations were later found on Hainan and in Vietnam. In order to develop genomic resources for further studies on population genetics and conservation biology of this rare species, we identified infraspecific molecular markers in the present study, using genome skimming data of five individuals collected from two populations on Hainan Island and three populations in Vietnam respectively. The length of plastome of the five individuals varied from 152,961 bp to 150,204 bp, and exhibited a typical angiosperm quadripartite structure. Six plastid hotspot regions with the Pi > 0.01 (trnH-psbA, psbA-trnK, rpl22, ndhE, ndhG-ndhI and rps15-ycf1), 1621 polymorphic gSSRs, as well as 1657 candidate SNPs in 237 variant nuclear genes were identified, thereby providing important information for further genetic studies.
Collapse
Affiliation(s)
- Qi-Yue Zhou
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Hui-Xia Cai
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zi-Han Liu
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | | | - Lei Yang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Tuo Yang
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Orchid Conservation & Research Center of Shenzhen, Shenzhen, 518114, China
| | - Bo Li
- Research Centre of Ecological Sciences, College of Agronomy, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Pan Li
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| |
Collapse
|
10
|
Wang S, Wang Y, Zhou J, Li P, Lin H, Peng Y, Yu L, Zhang Y, Wang Z. Genetic Diversity and Population Structure of an Arctic Tertiary Relict Tree Endemic to China ( Sassafras tzumu) Revealed by Novel Nuclear Microsatellite (nSSR) Markers. PLANTS (BASEL, SWITZERLAND) 2022; 11:plants11202706. [PMID: 36297730 PMCID: PMC9610890 DOI: 10.3390/plants11202706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 10/09/2022] [Accepted: 10/10/2022] [Indexed: 05/11/2023]
Abstract
Sassafras tzumu (Hemsl.) Hemsl., as an Arctic Tertiary relict woody species, is an ecologically and economically important deciduous tree endemic to southern China. Nonetheless, the genetic resources and backgrounds of S. tzumu are still lacking and remain largely unclear. Here, we predicted 16,215 candidate polymorphic nuclear microsatellite (nSSR) loci from the assembled nucleus databases of six geographic-distant individuals of S. tzumu via CandiSSR. Among these nSSRs, the di- (75.53%) and tri-nucleotide (19.75%) repeats were the most abundant, and 27 new polymorphic SSRs were developed and characterized in 136 individuals from six natural populations of S. tzumu. The majority of the above 27 SSRs (24 loci, 88.89%) presented moderate polymorphism (mean PIC = 0.356), and the transferability of these markers in other Sassafras species was high (85.19%). A moderately low level of genetic diversity and a high variation (FST = 0.286) of six wild populations of S. tzumu were illuminated by 16 selected polymorphic nSSRs, with the average expected heterozygosity (HE) of 0.430 at the species level and HE ranging from 0.195 to 0.387 at the population level. Meanwhile, a bottleneck effect was shown in two populations. Consistent with the results of the principal coordinate analysis (PCoA) and phylogenetic trees, structure analysis optimally divided these six S. tzumu populations into two clusters, and the further strong population subdivision appeared from K = 2 to K = 5, which corresponded to two evolutionarily significant units (ESUs). Moreover, the significant correlation between genetic and geographic distance was tested by the Mantel test (r = 0.742, p = 0.006), clarifying the effect about isolation by distance (IBD), which could be possibly explained by the low gene flow (Nm = 0.625), a relatively high degree of inbreeding (FIS = 0.166), a relatively large distribution, and mountainous barriers. Above all, our research not only enlarged the useful genetic resources for future studies of population genetics, molecular breeding, and germplasm management of S. tzumu and its siblings but also contributed to proposing scientific conservation strategies and schemes for the better preservation of S. tzumu and other Sassafras (Lauraceae) species.
Collapse
Affiliation(s)
- Shuang Wang
- College of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Ying Wang
- College of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Jingbo Zhou
- College of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Pan Li
- Systematic & Evolutionary Botany and Biodiversity Group, MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Hungwei Lin
- College of Life Sciences, Nanjing University, Nanjing 210023, China
| | - Ye Peng
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China
| | - Lipeng Yu
- Mount Longwang Nature Reserve, Huzhou 313300, China
| | - Yunyan Zhang
- College of Life Sciences, Nanjing University, Nanjing 210023, China
- Correspondence: (Y.Z.); (Z.W.); Tel.: +86-15261868978 (Y.Z.); +86-13770650868 (Z.W.)
| | - Zhongsheng Wang
- College of Life Sciences, Nanjing University, Nanjing 210023, China
- Correspondence: (Y.Z.); (Z.W.); Tel.: +86-15261868978 (Y.Z.); +86-13770650868 (Z.W.)
| |
Collapse
|
11
|
Liu K, Xie N. Pipeline for developing polymorphic microsatellites in species without reference genomes. 3 Biotech 2022; 12:248. [PMID: 36039078 PMCID: PMC9418399 DOI: 10.1007/s13205-022-03313-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 08/16/2022] [Indexed: 11/01/2022] Open
Abstract
Microsatellites, also known as simple sequence repeats (SSRs), are the preferred type of marker for many genetic applications. In conjunction with the ongoing development of next-generation sequencing, several bioinformatic tools have been developed for identifying SSRs from genomic or transcriptomic sequences. Although these tools are handy for generating polymorphic SSRs, their application almost always depends on an existing reference genome or self-assembly of the reference genome. With this in mind, we propose a pipeline for developing polymorphic SSRs that may be applied to species without reference genomes. Using a species without a reference genome (black Amur bream; Megalobrama terminalis Richardson, 1846) as a model, our pipeline was able to effectively discover polymorphic SSRs. Under different R parameters of a reference-free single nucleotide polymorphisms (SNPs) caller (ebwt2InDel), a total of 258, 208, 102, and 11 polymorphic SSRs were mined. To quantify the accuracy of the polymorphic SSRs detected using our pipeline, we analyzed 25 SSRs with PCR experiments. All primers were successfully amplified, and most SSRs (23 SSRs, 92%) were polymorphic. From the 36 individual black Amur bream, we acquired an average of 3.36 alleles per locus, ranging from one to 11. This demonstrates the effectiveness of our pipeline in identifying polymorphic SSRs and designing primers for SSR genotyping. Ultimately, our pipeline can effectively mine polymorphic SSRs for species without reference genomes, complementing SSR mining approaches based on reference genomes and helping to resolve biological issues that accompany these methods. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-022-03313-0.
Collapse
Affiliation(s)
- Kai Liu
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, Zhejiang China
| | - Nan Xie
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, Zhejiang China
| |
Collapse
|
12
|
AutomAted RepeaT Identifier (AARTI): A tool to identify common, polymorphic, and unique microsatellites. Mitochondrion 2022; 65:161-165. [PMID: 35738354 DOI: 10.1016/j.mito.2022.06.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 06/19/2022] [Indexed: 11/21/2022]
Abstract
Here we are presenting an automated computational pipeline used to mine 5976 mitochondrial genomes to identify common, polymorphic, and unique microsatellites also known as simple sequence repeats (SSRs). Microsatellites are repetitive motifs of 1-6 bases in a DNA sequence. Due to their abundance and highly polymorphic nature, microsatellites have become one of the widely used molecular/genetic markers valuable for many studies including gene tagging, genetic diversity, and species identification. Several computational tools dedicated to mine and categorize microsatellites in nucleotide sequences were developed; however, there is no tool which can identify unique, common and polymorphic microsatellites between each pair of nucleotide sequences. To explore such microsatellites, we have developed a fully automated computational pipeline named AutomAted RepeaT Identifier (AARTI). The AARTI is the only tool till date, which identifies common, polymorphic, and unique microsatellites between each pair of nucleotide sequences. The computational pipeline was constructed using the PERL programming language and the web server for the pipeline was developed with the help of PHP, HTML, CSS, and JavaScript. It was successfully tested to reproduce the results of previous study on 7 mitochondrial genomes of genus Orthotrichum. Moreover, the pipeline was also applied on 5846 (Metazoa) and 130 (Viridiplantae) mitochondrial genomes. The AARTI is freely available at https://lms.snu.edu.in/aarti/ and will certainly accelerate the studies of length variation in microsatellites between species. Additionally, it will be useful in comparative genomic studies, for the computational characterization of microsatellites, and has the potential to be a routine genome analysis pipeline for mitochondrial genomes.
Collapse
|
13
|
Chromosome-scale genome assembly of an important medicinal plant honeysuckle. Sci Data 2022; 9:226. [PMID: 35610245 PMCID: PMC9130202 DOI: 10.1038/s41597-022-01385-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 05/10/2022] [Indexed: 11/12/2022] Open
Abstract
Lonicera japonica (honeysuckle) is one of the most important medicinal plants and widely utilized in traditional Chinese medicine. At present, there are many varieties of honeysuckle used in cultivation, among which Sijihua variety are widely cultivated due to its wide adaptability, stress resistance, early flowering and high yield. In this study, we assembled the genome of Sijihua, which was approximately 886.04 Mb in size with a scaffold N50 of 79.5 Mb. 93.28% of the total assembled sequences were anchored to 9 pseudo-chromosomes by using PacBio long reads and Hi-C sequencing data. We predicted 39,320 protein-coding genes and 92.87% of them could be annotated in NR, GO, KOG, KEGG and other databases. In addition, we identified 644 tRNAs, 2,156 rRNAs, 109 miRNAs and 5,502 pseudogenes from the genome. The chromosome-scale genome of Sijihua will be a significant resource for understanding the genetic basis of high stress-resistance, which will facilitate further study of the genetic diversity and accelerate the genetic improvement and breeding of L. japonica. Measurement(s) | Lonicera japonica • RNA sequencing • genome assembly • sequence annotation | Technology Type(s) | SMRT Sequencing • RNA sequencing • Hi-C • biomolecular annotation design | Factor Type(s) | Genotype | Sample Characteristic - Organism | Lonicera japonica | Sample Characteristic - Environment | occurrence | Sample Characteristic - Location | Shandong Province |
Collapse
|
14
|
Automating microsatellite screening and primer design from multi-individual libraries using Micro-Primers. Sci Rep 2022; 12:295. [PMID: 34997147 PMCID: PMC8741888 DOI: 10.1038/s41598-021-04275-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 12/10/2021] [Indexed: 11/08/2022] Open
Abstract
Analysis of intra- and inter-population diversity has become important for defining the genetic status and distribution patterns of a species and a powerful tool for conservation programs, as high levels of inbreeding could lead into whole population extinction in few generations. Microsatellites (SSR) are commonly used in population studies but discovering highly variable regions across species' genomes requires demanding computation and laboratorial optimization. In this work, we combine next generation sequencing (NGS) with automatic computing to develop a genomic-oriented tool for characterizing SSRs at the population level. Herein, we describe a new Python pipeline, named Micro-Primers, designed to identify, and design PCR primers for amplification of SSR loci from a multi-individual microsatellite library. By combining commonly used programs for data cleaning and microsatellite mining, this pipeline easily generates, from a fastq file produced by high-throughput sequencing, standard information about the selected microsatellite loci, including the number of alleles in the population subset, and the melting temperature and respective PCR product of each primer set. Additionally, potential polymorphic loci can be identified based on the allele ranges observed in the population, to easily guide the selection of optimal markers for the species. Experimental results show that Micro-Primers significantly reduces processing time in comparison to manual analysis while keeping the same quality of the results. The elapsed times at each step can be longer depending on the number of sequences to analyze and, if not assisted, the selection of polymorphic loci from multiple individuals can represent a major bottleneck in population studies.
Collapse
|
15
|
Liu L, Low SL, Sakaguchi S, Feng Y, Ge B, Konowalik K, Li P. Development of nuclear and chloroplast polymorphic microsatellites for Crossostephium chinense (Asteraceae). Mol Biol Rep 2021; 48:6259-6267. [PMID: 34392450 DOI: 10.1007/s11033-021-06590-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 07/23/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Crossostephium chinense is a traditional Chinese medicinal herb and it is often cultivated as an ornamental plant. Previous studies on this species mainly focused on its chemical composition and it was rarely represented in genetic studies, and thus genomic resources remain scarce. METHODS AND RESULTS Both chloroplast and nuclear polymorphic microsatellites of C. chinense were screened from genome skimming data of two individuals. 64 and 63 cpSSR markers were identified from two chloroplast genomes of C. chinense. A total of 133 polymorphic nSSRs were developed. Ten nSSRs were randomly selected to test their transferability across 35 individuals from three populations of C. chinense, and 20 individuals each of Artemisia stolonifera and A. argyi. Cross-amplifications were successfully done for C. chinense and were partially amplified for both Artemisia species. The number of alleles varied from two to nine. The observed heterozygosity and expected heterozygosity per locus ranged from 0.000 to 0.286 and from 0.029 to 0.755, respectively. CONCLUSIONS In this study, we developed polymorphic cpSSRs and nSSRs markers for C. chinense based on genome skimming sequencing. These genomic resources will be valuable for population genetics and conservation studies in C. chinense and Artemisia.
Collapse
Affiliation(s)
- Luxian Liu
- Key Laboratory of Plant Stress Biology, Laboratory of Plant Germplasm and Genetic Engineering, School of Life Sciences, Henan University, Kaifeng, 475000, China
| | | | - Shota Sakaguchi
- Division of Forest and Biomaterials Science, Graduate School of Agriculture, Kyoto University, Kyoto, 6068502, Japan
| | - Yu Feng
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Binjie Ge
- Eastern China Conservation Center for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
| | - Kamil Konowalik
- Institute of Environmental Biology, Wrocław University of Environmental and Life Sciences, 51-631, Wrocław, Poland.
| | - Pan Li
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China.
| |
Collapse
|
16
|
Liu L, Zhang Y, Li P. Development of genomic resources for the genus Celtis (Cannabaceae) based on genome skimming data. PLANT DIVERSITY 2021; 43:43-53. [PMID: 33778224 PMCID: PMC7987720 DOI: 10.1016/j.pld.2020.09.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Revised: 09/22/2020] [Accepted: 09/24/2020] [Indexed: 06/02/2023]
Abstract
Celtis is a Cannabaceae genus of 60-70 species of trees, or rarely shrubs, commonly known as hackberries. This woody genus consists of very valuable forest plants that provide important wildlife habitat for birds and mammals. Although previous studies have identified its phylogenetic position, interspecific relationships within Celtis remain unclear. In this study, we generated genome skimming data from five Celtis species to analyze phylogenetic relationships within the genus and develop genome resources. The plastomes of Celtis ranged in length from 158,989 bp to 159,082 bp, with a typical angiosperm quadripartite structure, and encoded a total of 132 genes with 20 duplicated in the IRs. Comparative analyses showed that plastome content and structure were relatively conserved. Whole plastomes showed no signs of gene loss, translocations, inversions, or genome rearrangement. Six plastid hotspot regions (trnH-psbA, psbA-trnK, trnG-trnR, psbC-trnS, cemA-petA and rps8-rpl14), 4097 polymorphic nuclear SSRs, as well as 62 low or single-copy gene fragments were identified within Celtis. Moreover, the phylogenetic relationships based on the complete plastome sequences strongly endorse the placement of C. biondii as sister to the ((((C. koraiensis, C. sinensis), C. tetrandra), C. julianae), C. cerasifera) clade. These findings and the genetic resources developed here will be conducive to further studies on the genus Celtis involving phylogeny, population genetics, and conservation biology.
Collapse
Affiliation(s)
- Luxian Liu
- Key Laboratory of Plant Stress Biology, School of Life Sciences, Henan University, Kaifeng, 475000, China
| | - Yonghua Zhang
- College of Life and Environmental Sciences, Wenzhou University, Wenzhou, 325035, China
| | - Pan Li
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| |
Collapse
|
17
|
Gou X, Shi H, Yu S, Wang Z, Li C, Liu S, Ma J, Chen G, Liu T, Liu Y. SSRMMD: A Rapid and Accurate Algorithm for Mining SSR Feature Loci and Candidate Polymorphic SSRs Based on Assembled Sequences. Front Genet 2020; 11:706. [PMID: 32849772 PMCID: PMC7398111 DOI: 10.3389/fgene.2020.00706] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Accepted: 06/10/2020] [Indexed: 12/16/2022] Open
Abstract
Microsatellites or simple sequence repeats (SSRs) are short tandem repeats of DNA widespread in genomes and transcriptomes of diverse organisms and are used in various genetic studies. Few software programs that mine SSRs can be further used to mine polymorphic SSRs, and these programs have poor portability, have slow computational speed, are highly dependent on other programs, and have low marker development rates. In this study, we develop an algorithm named Simple Sequence Repeat Molecular Marker Developer (SSRMMD), which uses improved regular expressions to rapidly and exhaustively mine perfect SSR loci from any size of assembled sequence. To mine polymorphic SSRs, SSRMMD uses a novel three-stage method to assess the conservativeness of SSR flanking sequences and then uses the sliding window method to fragment each assembled sequence to assess its uniqueness. Furthermore, molecular biology assays support the polymorphic SSRs identified by SSRMMD. SSRMMD is implemented using the Perl programming language and can be downloaded from https://github.com/GouXiangJian/SSRMMD.
Collapse
Affiliation(s)
- Xiangjian Gou
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China.,Maize Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Haoran Shi
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Shifan Yu
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Zhiqiang Wang
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Caixia Li
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Shihang Liu
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Jian Ma
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China
| | - Guangdeng Chen
- College of Resources, Sichuan Agricultural University, Chengdu, China
| | - Tao Liu
- College of Information Engineering, Sichuan Agricultural University, Ya'an, China
| | - Yaxi Liu
- Triticeae Research Institute, Sichuan Agricultural University, Chengdu, China.,State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Chengdu, China
| |
Collapse
|
18
|
Mokhtar MM, Atia MAM. SSRome: an integrated database and pipelines for exploring microsatellites in all organisms. Nucleic Acids Res 2020; 47:D244-D252. [PMID: 30365025 PMCID: PMC6323889 DOI: 10.1093/nar/gky998] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 10/14/2018] [Indexed: 11/23/2022] Open
Abstract
Over the past decade, many databases focusing on microsatellite mining on a genomic scale were released online with at least one of the following major deficiencies: (i) lacking the classification of microsatellites as genic or non-genic, (ii) not comparing microsatellite motifs at both genic and non-genic levels in order to identify unique motifs for each class or (iii) missing SSR marker development. In this study, we have developed ‘SSRome’ as a web-based, user-friendly, comprehensive and dynamic database with pipelines for exploring microsatellites in 6533 organisms. In the SSRome database, 158 million microsatellite motifs are identified across all taxa, in addition to all the mitochondrial and chloroplast genomes and expressed sequence tags available from NCBI. Moreover, 45.1 million microsatellite markers were developed and classified as genic or non-genic. All the stored motif and marker datasets can be downloaded freely. In addition, SSRome provides three user-friendly tools to identify, classify and compare motifs on either a genome- or transcriptome-wide scale. With the implementation of PHP, HTML and JavaScript, users can upload their data for analysis via a user-friendly GUI. SSRome represents a powerful database and mega-tool that will assist researchers in developing and dissecting microsatellite markers on a high-throughput scale.
Collapse
Affiliation(s)
- Morad M Mokhtar
- Molecular Genetics and Genome Mapping Laboratory, Genome Mapping Department, Agricultural Genetic Engineering Research Institute (AGERI), ARC, Giza, 12619, Egypt
| | - Mohamed A M Atia
- Molecular Genetics and Genome Mapping Laboratory, Genome Mapping Department, Agricultural Genetic Engineering Research Institute (AGERI), ARC, Giza, 12619, Egypt
| |
Collapse
|
19
|
Hina F, Yisilam G, Wang S, Li P, Fu C. De novo Transcriptome Assembly, Gene Annotation and SSR Marker Development in the Moon Seed Genus Menispermum (Menispermaceae). Front Genet 2020; 11:380. [PMID: 32457795 PMCID: PMC7227793 DOI: 10.3389/fgene.2020.00380] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 03/27/2020] [Indexed: 12/27/2022] Open
Abstract
The moonseed genus Menispermum L. (Menispermaceae) is disjunctly distributed in East Asia and eastern North America. Although Menispermum has important medicinal value, genetic and genomic information is scarce, with very few available molecular markers. In the current study, we used Illumina transcriptome sequencing and de novo assembly of the two Menispermum species to obtain in-depth genetic knowledge. From de novo assembly, 53,712 and 78,921 unigenes were generated for M. canadense and M. dauricum, with 37,527 (69.87%) and 55,211 (69.96%) showing significant similarities against the six functional databases, respectively. Moreover, 521 polymorphic EST-SSRs were identified. Of them, 23 polymorphic EST-SSR markers were selected to investigate the population genetic diversity within the genus. The newly developed EST-SSR markers also revealed high transferability among the three examined Menispermaceae species. Overall, we provide the very first transcriptomic analyses of this important medicinal genus. In addition, the novel microsatellite markers developed here will aid future studies on the population genetics and phylogeographic patterns of Menispermum at the intercontinental geographical scale.
Collapse
Affiliation(s)
- Faiza Hina
- Laboratory of Systematic and Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Gulbar Yisilam
- Laboratory of Systematic and Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Shenyi Wang
- Department of Botany, University of Wisconsin–Madison, Madison, WI, United States
| | - Pan Li
- Laboratory of Systematic and Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Chengxin Fu
- Laboratory of Systematic and Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
20
|
Dang Z, Huang L, Jia Y, Lockhart PJ, Fong Y, Tian Y. Identification of Genic SSRs Provide a Perspective for Studying Environmental Adaptation in the Endemic Shrub Tetraena mongolica. Genes (Basel) 2020; 11:E322. [PMID: 32197402 PMCID: PMC7140860 DOI: 10.3390/genes11030322] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2020] [Revised: 03/10/2020] [Accepted: 03/16/2020] [Indexed: 02/03/2023] Open
Abstract
Tetraena mongolica is a xerophytic shrub endemic to desert regions in Inner Mongolia. This species has evolved distinct survival strategies that allow it to adapt to hyper-drought and heterogeneous habitats. Simple sequence repeats (SSRs) may provide a molecular basis in plants for fast adaptation to environmental change. Thus, identifying SSRs and their possible effects on gene behavior has the potential to provide valuable information for studies of adaptation. In this study, we sequenced six individual transcriptomes of T. mongolica from heterogeneous habitats, focused on SSRs located in genes, and identified 811 polymorphic SSRs. Of the identified SSRs, 172, 470, and 76 were located in 5' UTRs, CDSs, and 3' UTRs in 591 transcripts; and AG/CT, AAC/GTT, and AT/AT were the most abundant repeats in each gene region. Functional annotation showed that many of the identified polymorphic SSRs were in genes that were enriched in several GO terms and KEGG pathways, suggesting the functional significance of these genes in the environmental adaptation process. The identification of polymorphic genic SSRs in our study lays a foundation for future studies investigating the contribution of SSRs to regulation of genes in natural populations of T. mongolica and their importance for adaptive evolution of this species.
Collapse
Affiliation(s)
- Zhenhua Dang
- Inner Mongolia Key Laboratory of Grassland Ecology & Ministry of Education Key Laboratory of Ecology and Resource Use of the Mongolian Plateau, School of Ecology and Environment, Inner Mongolia University, Hohhot 010021, China; (Z.D.); (L.H.); (Y.J.)
| | - Lei Huang
- Inner Mongolia Key Laboratory of Grassland Ecology & Ministry of Education Key Laboratory of Ecology and Resource Use of the Mongolian Plateau, School of Ecology and Environment, Inner Mongolia University, Hohhot 010021, China; (Z.D.); (L.H.); (Y.J.)
| | - Yuanyuan Jia
- Inner Mongolia Key Laboratory of Grassland Ecology & Ministry of Education Key Laboratory of Ecology and Resource Use of the Mongolian Plateau, School of Ecology and Environment, Inner Mongolia University, Hohhot 010021, China; (Z.D.); (L.H.); (Y.J.)
| | - Peter J. Lockhart
- School of Fundamental Sciences, College of Sciences, Massey University, Palmerston North 4442, New Zealand; (P.J.L.); (Y.F.)
| | - Yang Fong
- School of Fundamental Sciences, College of Sciences, Massey University, Palmerston North 4442, New Zealand; (P.J.L.); (Y.F.)
| | - Yunyun Tian
- Ministry of Education Key Laboratory of Herbage & Endemic Crop Biotechnology, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
| |
Collapse
|
21
|
Guo L, Yang Q, Yang JW, Zhang N, Liu BS, Zhu KC, Guo HY, Jiang SG, Zhang DC. MultiplexSSR: A pipeline for developing multiplex SSR-PCR assays from resequencing data. Ecol Evol 2020; 10:3055-3067. [PMID: 32211176 PMCID: PMC7083706 DOI: 10.1002/ece3.6121] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 02/02/2020] [Accepted: 02/05/2020] [Indexed: 12/15/2022] Open
Abstract
Next-generation sequencing has greatly promoted the investigation of single nucleotide polymorphisms, while studies of simple sequence repeats are sharply decreasing. However, simple sequence repeats still present some advantages in conservation genetics. In this study, an end-to-end pipeline referred to as MultiplexSSR was established to develop multiplex PCR assays in batches with highly polymorphic simple sequence repeats for capillary platforms from resequencing data. The distribution of single sequence repeats in the genome, the error profiles of genotypes and allelotypes, and the increase in the allele length range depending on the number of individuals were investigated. A total of 98% of single sequence repeats presented lengths of less than 100 bp. The error rate of the genotyping and allelotyping of dimeric patterns was ten times higher than those for other patterns. The error rate of allelotyping was less than that of genotyping. The allele length range reached approximate saturation with 10 individuals. This pipeline uses allele numbers to select highly polymorphic loci, masks loci with variation, and applies in silico PCR to improve primer specificity. The application of the developed multiplex SSR-PCR assays validated the pipeline's robustness, showing higher polymorphism and stability for the developed simple sequence repeats and a lower cost for genotyping and providing low-depth resequencing data from less than a dozen individuals for the development of markers. This pipeline fills the gap between next-generation sequencing and multiplex SSR-PCR.
Collapse
Affiliation(s)
- Liang Guo
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Quan Yang
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
- National Demonstration Center for Experimental Fisheries Science Education Shanghai Ocean University Shanghai China
| | - Jing-Wen Yang
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Nan Zhang
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Bao-Suo Liu
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Ke-Cheng Zhu
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Hua-Yang Guo
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Shi-Gui Jiang
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| | - Dian-Chang Zhang
- Key Laboratory of South China Sea Fishery Resources Exploitation and Utilization Ministry of Agriculture and Rural Affairs South China Sea Fisheries Research Institute Chinese Academy of Fishery Sciences Guangzhou China
- Guangdong Provincial Engineer Technology Research Center of Marine Biological Seed Industry Guangzhou China
| |
Collapse
|
22
|
Dubey H, Rawal HC, Rohilla M, Lama U, Kumar PM, Bandyopadhyay T, Gogoi M, Singh NK, Mondal TK. TeaMiD: a comprehensive database of simple sequence repeat markers of tea. Database (Oxford) 2020; 2020:baaa013. [PMID: 32159215 PMCID: PMC7065459 DOI: 10.1093/database/baaa013] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 01/05/2020] [Accepted: 01/25/2020] [Indexed: 12/05/2022]
Abstract
Tea is a highly cross-pollinated, woody, perennial tree. High heterozygosity combined with a long gestational period makes conventional breeding a cumbersome process. Therefore, marker-assisted breeding is a better alternative approach when compared with conventional breeding. Considering the large genome size of tea (~3 Gb), information about simple sequence repeat (SSR) is scanty. Thus, we have taken advantage of the recently published tea genomes to identify large numbers of SSR markers in the tea. Besides the genomic sequences, we identified SSRs from the other publicly available sequences such as RNA-seq, GSS, ESTs and organelle genomes (chloroplasts and mitochondrial) and also searched published literature to catalog validated set of tea SSR markers. The complete exercise yielded a total of 935 547 SSRs. Out of the total, 82 SSRs were selected for validation among a diverse set of tea genotypes. Six primers (each with four to six alleles, an average of five alleles per locus) out of the total 27 polymorphic primers were used for a diversity analysis in 36 tea genotypes with mean polymorphic information content of 0.61-0.76. Finally, using all the information generated in this study, we have developed a user-friendly database (TeaMiD; http://indianteagenome.in:8080/teamid/) that hosts SSR from all the six resources including three nuclear genomes of tea and transcriptome sequences of 17 Camellia wild species. Database URL: http://indianteagenome.in:8080/teamid/.
Collapse
Affiliation(s)
- Himanshu Dubey
- Indian Council Agricultural Research-National Institute for Plant Biotechnology, Lal Bahadur Sashtri Centre, Indian Agricultural Research Institute, Pusa, New Delhi 110012, India
| | - Hukam C Rawal
- Indian Council Agricultural Research-National Institute for Plant Biotechnology, Lal Bahadur Sashtri Centre, Indian Agricultural Research Institute, Pusa, New Delhi 110012, India
| | - Megha Rohilla
- Indian Council Agricultural Research-National Institute for Plant Biotechnology, Lal Bahadur Sashtri Centre, Indian Agricultural Research Institute, Pusa, New Delhi 110012, India
| | - Urvashi Lama
- Darjeeling Tea Research and Development Centre, Tea Board, Ministry of Commerce, B.T.M. Sarani (Brabourne Road), Kolkata, West Bengal 700001, India
| | - P Mohan Kumar
- Darjeeling Tea Research and Development Centre, Tea Board, Ministry of Commerce, B.T.M. Sarani (Brabourne Road), Kolkata, West Bengal 700001, India
| | - Tanoy Bandyopadhyay
- Department of Biotechnology, Tocklai Experimental Station, Tea Research Association, Jorhat, Assam, India
| | - Madhurjya Gogoi
- Department of Biotechnology, Tocklai Experimental Station, Tea Research Association, Jorhat, Assam, India
| | - Nagendra Kumar Singh
- Indian Council Agricultural Research-National Institute for Plant Biotechnology, Lal Bahadur Sashtri Centre, Indian Agricultural Research Institute, Pusa, New Delhi 110012, India
| | - Tapan Kumar Mondal
- Indian Council Agricultural Research-National Institute for Plant Biotechnology, Lal Bahadur Sashtri Centre, Indian Agricultural Research Institute, Pusa, New Delhi 110012, India
| |
Collapse
|
23
|
Liu LX, Du YX, Folk RA, Wang SY, Soltis DE, Shang FD, Li P. Plastome Evolution in Saxifragaceae and Multiple Plastid Capture Events Involving Heuchera and Tiarella. FRONTIERS IN PLANT SCIENCE 2020; 11:361. [PMID: 32391025 PMCID: PMC7193090 DOI: 10.3389/fpls.2020.00361] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 03/12/2020] [Indexed: 05/02/2023]
Abstract
Saxifragaceae, a family of over 600 species and approximately 30 genera of herbaceous perennials, is well-known for intergeneric hybridization. Of the main lineages in this family, the Heuchera group represents a valuable model for the analysis of plastid capture and its impact on phylogeny reconstruction. In this study, we investigated plastome evolution across the family, reconstructed the phylogeny of the Heuchera group and examined putative plastid capture between Heuchera and Tiarella. Seven species (11 individuals) representing Tiarella, as well as Mitella and Heuchera, were selected for genome skimming. We assembled the plastomes, and then compared these to six others published for Saxifragaceae; the plastomes were found to be highly similar in overall size, structure, gene order and content. Moreover, ycf15 was lost due to pseudogenization and rpl2 lost its only intron for all the analyzed plastomes. Comparative plastome analysis revealed that size variations of the plastomes are purely ascribed to the length differences of LSC, SSC, and IRs regions. Using nuclear ITS + ETS and the complete plastome, we fully resolved the species relationships of Tiarella, finding that the genus is monophyletic and the Asian species is most closely related to the western North American species. However, the position of the Heuchera species was highly incongruent between nuclear and plastid data. Comparisons of nuclear and plastid phylogenies revealed that multiple plastid capture events have occurred between Heuchera and Tiarella, through putative ancient hybridization. Moreover, we developed numerous molecular markers for Tiarella (e.g., plastid hotspot and polymorphic nuclear SSRs), which will be useful for future studies on the population genetics and phylogeography of this disjunct genus.
Collapse
Affiliation(s)
- Lu-Xian Liu
- Key Laboratory of Plant Stress Biology, School of Life Sciences, Henan University, Kaifeng, China
| | - Ying-Xue Du
- Key Laboratory of Plant Stress Biology, School of Life Sciences, Henan University, Kaifeng, China
| | - Ryan A. Folk
- Department of Biological Sciences, Mississippi State University, Starkville, MS, United States
| | - Shen-Yi Wang
- Department of Botany, University of Wisconsin-Madison, Madison, WI, United States
| | - Douglas E. Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
- Department of Biology, University of Florida, Gainesville, FL, United States
| | - Fu-De Shang
- Key Laboratory of Plant Stress Biology, School of Life Sciences, Henan University, Kaifeng, China
- *Correspondence: Fu-De Shang,
| | - Pan Li
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, China
- Pan Li,
| |
Collapse
|
24
|
Xia E, Li F, Tong W, Li P, Wu Q, Zhao H, Ge R, Li R, Li Y, Zhang Z, Wei C, Wan X. Tea Plant Information Archive: a comprehensive genomics and bioinformatics platform for tea plant. PLANT BIOTECHNOLOGY JOURNAL 2019; 17:1938-1953. [PMID: 30913342 PMCID: PMC6737018 DOI: 10.1111/pbi.13111] [Citation(s) in RCA: 156] [Impact Index Per Article: 31.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 03/04/2019] [Accepted: 03/07/2019] [Indexed: 05/05/2023]
Abstract
Tea is the world's widely consumed nonalcohol beverage with essential economic and health benefits. Confronted with the increasing large-scale omics-data set particularly the genome sequence released in tea plant, the construction of a comprehensive knowledgebase is urgently needed to facilitate the utilization of these data sets towards molecular breeding. We hereby present the first integrative and specially designed web-accessible database, Tea Plant Information Archive (TPIA; http://tpia.teaplant.org). The current release of TPIA employs the comprehensively annotated tea plant genome as framework and incorporates with abundant well-organized transcriptomes, gene expressions (across species, tissues and stresses), orthologs and characteristic metabolites determining tea quality. It also hosts massive transcription factors, polymorphic simple sequence repeats, single nucleotide polymorphisms, correlations, manually curated functional genes and globally collected germplasm information. A variety of versatile analytic tools (e.g. JBrowse, blast, enrichment analysis, etc.) are established helping users to perform further comparative, evolutionary and functional analysis. We show a case application of TPIA that provides novel and interesting insights into the phytochemical content variation of section Thea of genus Camellia under a well-resolved phylogenetic framework. The constructed knowledgebase of tea plant will serve as a central gateway for global tea community to better understand the tea plant biology that largely benefits the whole tea industry.
Collapse
Affiliation(s)
- En‐Hua Xia
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Fang‐Dong Li
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Wei Tong
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Peng‐Hui Li
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Qiong Wu
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Hui‐Juan Zhao
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Ruo‐Heng Ge
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Ruo‐Pei Li
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Ye‐Yun Li
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Zheng‐Zhu Zhang
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Chao‐Ling Wei
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| | - Xiao‐Chun Wan
- State Key Laboratory of Tea Plant Biology and UtilizationAnhui Agricultural UniversityHefei230036China
| |
Collapse
|
25
|
Liao R, Luo Y, Yisilam G, Lu R, Wang Y, Li P. Development and characterization of SSR markers for Sanguinaria canadensis based on genome skimming. APPLICATIONS IN PLANT SCIENCES 2019; 7:e11289. [PMID: 31572630 PMCID: PMC6764431 DOI: 10.1002/aps3.11289] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Accepted: 07/14/2019] [Indexed: 05/31/2023]
Abstract
PREMISE Polymorphic nuclear simple sequence repeat (nSSR) markers were developed for Sanguinaria canadensis (Papaveraceae), a spring ephemeral native to eastern North America. METHODS AND RESULTS Based on the genome skimming data of S. canadensis, a total of 240 nSSR primer pairs were designed for 80 loci from the assembled nuclear contigs. Of these primer pairs, 19 were selected for initial validation in four populations (80 individuals). All 19 loci produced heterologous amplification. The numbers of alleles per locus ranged from one to 21; the levels of observed and expected heterozygosity per locus ranged from 0.000 to 1.000 and from 0.000 to 0.847, respectively. Transferability of the loci was tested in the related species Eomecon chionantha. CONCLUSIONS The developed nSSR markers revealed polymorphism in the four studied populations and may contribute to investigations of the genetic diversity of S. canadensis and E. chionantha.
Collapse
Affiliation(s)
- Renyu Liao
- Laboratory of Systematic and Evolutionary Botany and BiodiversityCollege of Life SciencesZhejiang UniversityHangzhouZhejiang310058People's Republic of China
| | - Yuxin Luo
- Laboratory of Systematic and Evolutionary Botany and BiodiversityCollege of Life SciencesZhejiang UniversityHangzhouZhejiang310058People's Republic of China
| | - Gulbar Yisilam
- Laboratory of Systematic and Evolutionary Botany and BiodiversityCollege of Life SciencesZhejiang UniversityHangzhouZhejiang310058People's Republic of China
| | - Ruisen Lu
- Laboratory of Systematic and Evolutionary Botany and BiodiversityCollege of Life SciencesZhejiang UniversityHangzhouZhejiang310058People's Republic of China
| | - Yuguo Wang
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological EngineeringInstitute of Biodiversity ScienceFudan UniversityShanghai200433People's Republic of China
| | - Pan Li
- Laboratory of Systematic and Evolutionary Botany and BiodiversityCollege of Life SciencesZhejiang UniversityHangzhouZhejiang310058People's Republic of China
| |
Collapse
|
26
|
IDSSR: An Efficient Pipeline for Identifying Polymorphic Microsatellites from a Single Genome Sequence. Int J Mol Sci 2019; 20:ijms20143497. [PMID: 31315288 PMCID: PMC6678329 DOI: 10.3390/ijms20143497] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 06/25/2019] [Accepted: 07/15/2019] [Indexed: 12/02/2022] Open
Abstract
Simple sequence repeats (SSRs) are known as microsatellites, and consist of tandem 1–6-base motifs. They have become one of the most popular molecular markers, and are widely used in molecular ecology, conservation biology, molecular breeding, and many other fields. Previously reported methods identify monomorphic and polymorphic SSRs and determine the polymorphic SSRs via experimental validation, which is potentially time-consuming and costly. Herein, we present a new strategy named insertion/deletion (INDEL) SSR (IDSSR) to identify polymorphic SSRs by integrating SSRs with nucleotide insertions/deletions (INDEL) solely based on a single genome sequence and the sequenced pair-end reads. These INDEL indexes and polymorphic SSRs were identified, as well as the number of repeats, repeat motifs, chromosome location, annealing temperature, and primer sequences, enabling future experimental approaches to determine the correctness and polymorphism. Experimental validation with the giant panda demonstrated that our method has high reliability and stability. The efficient SSR pipeline would help researchers obtain high-quality genetic markers for plants and animals of interest, save labor, and reduce costly marker-screening experiments. IDSSR is freely available at https://github.com/Allsummerking/IDSSR.
Collapse
|
27
|
Mining and characterization of novel EST-SSR markers of Parrotia subaequalis (Hamamelidaceae) from the first Illumina-based transcriptome datasets. PLoS One 2019; 14:e0215874. [PMID: 31059560 PMCID: PMC6502335 DOI: 10.1371/journal.pone.0215874] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Accepted: 04/09/2019] [Indexed: 11/28/2022] Open
Abstract
Parrotia subaequalis is an endangered Tertiary relict tree from eastern China. Despite its important ecological and horticultural value, no transcriptomic data and limited molecular markers are currently available in this species. In this study, we first performed high-throughput transcriptome sequencing of two individuals representing the northernmost (TX) and southernmost (SJD) population of P. subaequalis on the Illumina HiSeq 2500 platform. We gathered a total of 69,135 unigenes for P. subaequalis (TX) and 84,009 unigenes for P. subaequalis (SJD). From two unigenes datasets, 497 candidate polymorphic novel expressed sequence tag-simple sequence repeats (EST-SSRs) were identified using CandiSSR. Among these repeats, di-nucleotide repeats were the most abundant repeat type (62.78%) followed by tri-, tetra- and hexa-nucleotide repeats. We then randomly selected 54 primer pairs for polymorphism validation, of which 27 (50%) were successfully amplified and showed polymorphisms in 96 individuals from six natural populations of P. subaequalis. The average number of alleles per locus and the polymorphism information content values were 3.70 and 0.343; the average observed and expected heterozygosity were 0.378 and 0.394. A relatively high level of genetic diversity (HT = 0.393) and genetic differentiation level (FST = 0.171) were surveyed, indicating P. subaequalis maintained high levels of species diversity in the long-term evolutionary history. Additionally, a high level of cross-transferability (92.59%) was displayed in five congeneric Hamamelidaceae species. Therefore, these new transcriptomic data and novel polymorphic EST-SSR markers will pinpoint genetic resources and facilitate future studies on population genetics and molecular breeding of P. subaequalis and other Hamamelidaceae species.
Collapse
|
28
|
Das R, Arora V, Jaiswal S, Iquebal MA, Angadi UB, Fatma S, Singh R, Shil S, Rai A, Kumar D. PolyMorphPredict: A Universal Web-Tool for Rapid Polymorphic Microsatellite Marker Discovery From Whole Genome and Transcriptome Data. FRONTIERS IN PLANT SCIENCE 2019; 9:1966. [PMID: 30687361 PMCID: PMC6337687 DOI: 10.3389/fpls.2018.01966] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 12/18/2018] [Indexed: 06/09/2023]
Abstract
Microsatellites are ubiquitously distributed, polymorphic repeat sequence valuable for association, selection, population structure and identification. They can be mined by genomic library, probe hybridization and sequencing of selected clones. Such approach has many limitations like biased hybridization and selection of larger repeats. In silico mining of polymorphic markers using data of various genotypes can be rapid and economical. Available tools lack in some or other aspects like: targeted user defined primer generation, polymorphism discovery using multiple sequence, size and number limits of input sequence, no option for primer generation and e-PCR evaluation, transferability, lack of complete automation and user-friendliness. They also lack the provision to evaluate published primers in e-PCR mode to generate additional allelic data using re-sequenced data of various genotypes for judicious utilization of previously generated data. We developed the tool (PolyMorphPredict) using Perl, R, Java and launched at Apache which is available at http://webtom.cabgrid.res.in/polypred/. It mines microsatellite loci and computes primers from genome/transcriptome data of any species. It can perform e-PCR using published primers for polymorphism discovery and across species transferability of microsatellite loci. Present tool has been evaluated using five species of different genome size having 21 genotypes. Though server is equipped with genomic data of three species for test run with gel simulation, but can be used for any species. Further, polymorphism predictability has been validated using in silico and in vitro PCR of four rice genotypes. This tool can accelerate the in silico microsatellite polymorphism discovery in re-sequencing projects of any species of plant and animal for their diversity estimation along with variety/breed identification, population structure, MAS, QTL and gene discovery, traceability, parentage testing, fungal diagnostics and genome finishing.
Collapse
Affiliation(s)
- Ritwika Das
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Vasu Arora
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sarika Jaiswal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - MA Iquebal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - UB Angadi
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Samar Fatma
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Rakesh Singh
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Sandip Shil
- Research Center, ICAR-Central Plantation Crops Research Institute, Jalpaiguri, India
| | - Anil Rai
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Dinesh Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| |
Collapse
|
29
|
Wang S, Yang C, Zhao X, Chen S, Qu GZ. Complete chloroplast genome sequence of Betula platyphylla: gene organization, RNA editing, and comparative and phylogenetic analyses. BMC Genomics 2018; 19:950. [PMID: 30572840 PMCID: PMC6302522 DOI: 10.1186/s12864-018-5346-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 11/30/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Betula platyphylla is a common tree species in northern China that has high economic and medicinal value. Our laboratory has been devoted to genome research on B. platyphylla for approximately 10 years. As primary organelle genomes, the complete genome sequences of chloroplasts are important to study the divergence of species, RNA editing and phylogeny. In this study, we sequenced and analyzed the complete chloroplast (cp) genome sequence of B. platyphylla. RESULTS The complete cp genome of B. platyphylla was 160,518 bp in length, which included a pair of inverted repeats (IRs) of 26,056 bp that separated a large single copy (LSC) region of 89,397 bp and a small single copy (SSC) region of 19,009 bp. The annotation contained a total of 129 genes, including 84 protein-coding genes, 37 tRNA genes and 8 rRNA genes. There were 3 genes using alternative initiation codons. Comparative genomics showed that the sequence of the Fagales species cp genome was relatively conserved, but there were still some high variation regions that could be used as molecular markers. The IR expansion event of B. platyphylla resulted in larger cp genomes and rps19 pseudogene formation. The simple sequence repeat (SSR) analysis showed that there were 105 SSRs in the cp genome of B. platyphylla. RNA editing sites recognition indicated that at least 80 RNA editing events occurred in the cp genome. Most of the substitutions were C to U, while a small proportion of them were not. In particular, three editing loci on the rRNA were converted to more than two other bases that had never been reported. For synonymous conversion, most of them increased the relative synonymous codon usage (RSCU) value of the codons. The phylogenetic analysis suggested that B. platyphylla had a closer evolutionary relationship with B. pendula than B. nana. CONCLUSIONS In this study, we not only obtained and annotated the complete cp genome sequence of B. platyphylla, but we also identified new RNA editing sites and predicted the phylogenetic relationships among Fagales species. These findings will facilitate genomic, genetic engineering and phylogenetic studies of this important species.
Collapse
Affiliation(s)
- Sui Wang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040 China
| | - Chuanping Yang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040 China
| | - Xiyang Zhao
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040 China
| | - Su Chen
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040 China
| | - Guan-Zheng Qu
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040 China
| |
Collapse
|
30
|
Liu L, Wang Y, He P, Li P, Lee J, Soltis DE, Fu C. Chloroplast genome analyses and genomic resource development for epilithic sister genera Oresitrophe and Mukdenia (Saxifragaceae), using genome skimming data. BMC Genomics 2018; 19:235. [PMID: 29618324 PMCID: PMC5885378 DOI: 10.1186/s12864-018-4633-x] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2017] [Accepted: 03/27/2018] [Indexed: 11/13/2022] Open
Abstract
Background Epilithic sister genera Oresitrophe and Mukdenia (Saxifragaceae) have an epilithic habitat (rocky slopes) and a parapatric distribution in East Asia, which makes them an ideal model for a more comprehensive understanding of the demographic and divergence history and the influence of climate changes in East Asia. However, the genetic background and resources for these two genera are scarce. Results The complete chloroplast (cp) genomes of two Oresitrophe rupifraga and one Mukdenia rossii individuals were reconstructed and comparative analyses were conducted to examine the evolutionary pattern of chloroplast genomes in Saxifragaceae. The cp genomes ranged from 156,738 bp to 156,960 bp in length and had a typical quadripartite structure with a conserved genome arrangement. Comparative analysis revealed the intron of rpl2 has been lost in Heuchera parviflora, Tiarella polyphylla, M. rossii and O. rupifraga but presents in the reference genome of Penthorum chinense. Seven cp hotspot regions (trnH-psbA, trnR-atpA, atpI-rps2, rps2-rpoC2, petN-psbM, rps4-trnT and rpl33-rps18) were identified between Oresitrophe and Mukdenia, while four hotspots (trnQ-psbK, trnR-atpA, trnS-psbZ and rpl33-rps18) were identified within Oresitrophe. In addition, 24 polymorphic cpSSR loci were found between Oresitrophe and Mukdenia. Most importantly, we successfully developed 126 intergeneric polymorphic gSSR markers between Oresitrophe and Mukdenia, as well as 452 intrageneric ones within Oresitrophe. Twelve randomly selected intergeneric gSSRs have shown that these two genera exhibit a significant genetic structure. Conclusions In this study, we conducted genome skimming for Oresitrophe rupifraga and Mukdenia rossii. Using these data, we were able to not only assemble their complete chloroplast genomes, but also develop abundant genetic resources (cp hotspots, cpSSRs, polymorphic gSSRs). The genomic patterns and genetic resources presented here will contribute to further studies on population genetics, phylogeny and conservation biology in Saxifragaceae. Electronic supplementary material The online version of this article (10.1186/s12864-018-4633-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Luxian Liu
- Key Laboratory of Plant Stress Biology, Laboratory of Plant Germplasm and Genetic Engineering, College of Life Sciences, Henan University, Kaifeng, 475000, China
| | - Yuewen Wang
- Key Laboratory of Plant Stress Biology, Laboratory of Plant Germplasm and Genetic Engineering, College of Life Sciences, Henan University, Kaifeng, 475000, China
| | - Peizi He
- Key Laboratory of Plant Stress Biology, Laboratory of Plant Germplasm and Genetic Engineering, College of Life Sciences, Henan University, Kaifeng, 475000, China
| | - Pan Li
- Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, and Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China.
| | - Joongku Lee
- Department of Environment and Forest Resources, Chungnam National University, Daejeon, 34134, South Korea
| | - Douglas E Soltis
- Department of Biology, University of Florida, Gainesville, FL, 32611, USA
| | - Chengxin Fu
- Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, and Laboratory of Systematic & Evolutionary Botany and Biodiversity, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| |
Collapse
|
31
|
Zhang YY, Shi E, Yang ZP, Geng QF, Qiu YX, Wang ZS. Development and Application of Genomic Resources in an Endangered Palaeoendemic Tree, Parrotia subaequalis (Hamamelidaceae) From Eastern China. FRONTIERS IN PLANT SCIENCE 2018; 9:246. [PMID: 29545814 PMCID: PMC5838013 DOI: 10.3389/fpls.2018.00246] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Accepted: 02/12/2018] [Indexed: 05/14/2023]
Abstract
Parrotia subaequalis is an endangered palaeoendemic tree from disjunct montane sites in eastern China. Due to the lack of effective genomic resources, the genetic diversity and population structure of this endangered species are not clearly understood. In this study, we conducted paired-end shotgun sequencing (2 × 125 bp) of genomic DNA for two individuals of P. subaequalis on the Illumina HiSeq platform. Based on the resulting sequences, we have successfully assembled the complete chloroplast genome of P. subaequalis, as well as identified the polymorphic chloroplast microsatellites (cpSSRs), nuclear microsatellites (nSSRs) and mutational hotspots of chloroplast. Ten polymorphic cpSSR loci and 12 polymorphic nSSR loci were used to genotype 96 individuals of P. subaequalis from six populations to estimate genetic diversity and population structure. Our results revealed that P. subaequalis exhibited abundant genetic diversity (e.g., cpSSRs: Hcp = 0.862; nSSRs: HT = 0.559) and high genetic differentiation (e.g., cpSSRs: RST = 0.652; nSSRs: RST = 0.331), and characterized by a low pollen-to-seed migration ratio (r ≈ 1.78). These genetic patterns are attributable to its long evolutionary histories and low levels of contemporary inter-population gene flow by pollen and seed. In addition, lack of isolation-by-distance pattern and strong population genetic structuring in both marker systems, suggests that long-term isolation and/or habitat fragmentation as well as genetic drift may have also contributed to the geographic differentiation of P. subaequalis. Therefore, long-term habitat protection is the most important methods to prevent further loss of genetic variation and a decrease in effective population size. Furthermore, both cpSSRs and nSSRs revealed that P. subaequalis populations consisted of three genetic clusters, which should be considered as separated conservation units.
Collapse
Affiliation(s)
- Yun-Yan Zhang
- College of Life Sciences, Nanjing University, Nanjing, China
| | - En Shi
- College of Life Sciences, Nanjing University, Nanjing, China
| | - Zhao-Ping Yang
- Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, China
- College of Life Sciences, Tarim University, Alaer, China
| | - Qi-Fang Geng
- College of Life Sciences, Nanjing University, Nanjing, China
- Asian Natural Environmental Science Center, The University of Tokyo, Tokyo, Japan
| | - Ying-Xiong Qiu
- Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, China
| | | |
Collapse
|
32
|
Huang H, Xia EH, Zhang HB, Yao QY, Gao LZ. De novo transcriptome sequencing of Camellia sasanqua and the analysis of major candidate genes related to floral traits. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2017; 120:103-111. [PMID: 28992542 DOI: 10.1016/j.plaphy.2017.08.028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Revised: 08/28/2017] [Accepted: 08/29/2017] [Indexed: 06/07/2023]
Abstract
Camellia sasanqua is one of the most famous horticultural plants in Camellia (Theaceae) due to its aesthetic appeal as landscape plant. Knowledge regarding the genetic basis of flowering time, floral aroma and color in C. sasanqua is limited, but is essential to breed new varieties with desired floral traits. Here, we described the de novo transcriptome of young leaves, flower buds and flowers of C. sasanqua. A total of 60,127 unigenes were functionally annotated based on the sequence similarity. After analysis, we found that two floral integrator genes, SOC1 and AP1, in flowering time pathway showed evidence of gene family expansion. Compared with 1-deoxy-D-xylulose-5-phosphate pathway, some genes in the mevalonate pathway were most highly expressed, suggesting that this might represent the major pathway for terpenoid biosynthesis related to floral aroma in C. sasanqua. In flavonoid biosynthesis pathway, PAL, CHI, DFR and ANS showing significantly higher expression levels in flowers and flower buds might have important role in regulation of floral color. The top five most transcription factors (TFs) families in C. sasanqua transcriptome were MYB, MIKC, C3H, FAR1 and HD-ZIP, many of which have a direct relationship with floral traits. In addition, we also identified 33,540 simple sequence repeats (SSRs) in the C. sasanqua transcriptome. Collectively, the C. sasanqua transcriptome dataset generated from this study along with the SSR markers provide a new resource for the identification of novel regulatory transcripts and will accelerate the genetic improvement of C. sasanqua breeding programs.
Collapse
Affiliation(s)
- Hui Huang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - En-Hua Xia
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Hai-Bin Zhang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Qiu-Yang Yao
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Li-Zhi Gao
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China; Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou 510642, China.
| |
Collapse
|
33
|
Evangelistella C, Valentini A, Ludovisi R, Firrincieli A, Fabbrini F, Scalabrin S, Cattonaro F, Morgante M, Mugnozza GS, Keurentjes JJB, Harfouche A. De novo assembly, functional annotation, and analysis of the giant reed ( Arundo donax L.) leaf transcriptome provide tools for the development of a biofuel feedstock. BIOTECHNOLOGY FOR BIOFUELS 2017; 10:138. [PMID: 28572841 PMCID: PMC5450047 DOI: 10.1186/s13068-017-0828-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Accepted: 05/23/2017] [Indexed: 05/07/2023]
Abstract
BACKGROUND Arundo donax has attracted renewed interest as a potential candidate energy crop for use in biomass-to-liquid fuel conversion processes and biorefineries. This is due to its high productivity, adaptability to marginal land conditions, and suitability for biofuel and biomaterial production. Despite its importance, the genomic resources currently available for supporting the improvement of this species are still limited. RESULTS We used RNA sequencing (RNA-Seq) to de novo assemble and characterize the A. donax leaf transcriptome. The sequencing generated 1249 million clean reads that were assembled using single-k-mer and multi-k-mer approaches into 62,596 unique sequences (unitranscripts) with an N50 of 1134 bp. TransDecoder and Trinotate software suites were used to obtain putative coding sequences and annotate them by mapping to UniProtKB/Swiss-Prot and UniRef90 databases, searching for known transcripts, proteins, protein domains, and signal peptides. Furthermore, the unitranscripts were annotated by mapping them to the NCBI non-redundant, GO and KEGG pathway databases using Blast2GO. The transcriptome was also characterized by BLAST searches to investigate homologous transcripts of key genes involved in important metabolic pathways, such as lignin, cellulose, purine, and thiamine biosynthesis and carbon fixation. Moreover, a set of homologous transcripts of key genes involved in stomatal development and of genes coding for stress-associated proteins (SAPs) were identified. Additionally, 8364 simple sequence repeat (SSR) markers were identified and surveyed. SSRs appeared more abundant in non-coding regions (63.18%) than in coding regions (36.82%). This SSR dataset represents the first marker catalogue of A. donax. 53 SSRs (PolySSRs) were then predicted to be polymorphic between ecotype-specific assemblies, suggesting genetic variability in the studied ecotypes. CONCLUSIONS This study provides the first publicly available leaf transcriptome for the A. donax bioenergy crop. The functional annotation and characterization of the transcriptome will be highly useful for providing insight into the molecular mechanisms underlying its extreme adaptability. The identification of homologous transcripts involved in key metabolic pathways offers a platform for directing future efforts in genetic improvement of this species. Finally, the identified SSRs will facilitate the harnessing of untapped genetic diversity. This transcriptome should be of value to ongoing functional genomics and genetic studies in this crop of paramount economic importance.
Collapse
Affiliation(s)
- Chiara Evangelistella
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
| | - Alessio Valentini
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
| | - Riccardo Ludovisi
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
| | - Andrea Firrincieli
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
| | - Francesco Fabbrini
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
- Alasia Franco Vivai s.s., Strada Solerette, 5/A, 12038 Savigliano, Italy
| | - Simone Scalabrin
- IGA Technology Services, Via J. Linussio, 51-Z.I.U, 33100 Udine, Italy
| | | | - Michele Morgante
- Department of Agricultural and Environmental Sciences, University of Udine, Via delle Scienze, 206, 33100 Udine, Italy
- Institute of Applied Genomics, Via J. Linussio, 51-Z.I.U, 33100 Udine, Italy
| | - Giuseppe Scarascia Mugnozza
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
| | - Joost J. B. Keurentjes
- Laboratory of Genetics, Wageningen University, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands
| | - Antoine Harfouche
- Department for Innovation in Biological, Agro-food and Forest Systems, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
| |
Collapse
|
34
|
Wang X, Wang L. GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing. FRONTIERS IN PLANT SCIENCE 2016; 7:1350. [PMID: 27679641 PMCID: PMC5020087 DOI: 10.3389/fpls.2016.01350] [Citation(s) in RCA: 105] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Accepted: 08/23/2016] [Indexed: 05/19/2023]
Abstract
Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar.
Collapse
Affiliation(s)
- Xuewen Wang
- Germplasm Bank of Wild Species in China, Kunming Institute of Botany, Chinese Academy of SciencesKunming, China
- *Correspondence: Xuewen Wang
| | - Le Wang
- Key Laboratory of Forensic Genetics and Beijing Engineering Research Center of Crime Scene Evidence Examination, Institute of Forensic Science, Ministry of Public SecurityBeijing, China
| |
Collapse
|