1
|
Meena RK, Kashyap P, Shamoon A, Dhyani P, Sharma H, Bhandari MS, Barthwal S, Ginwal HS. Genome survey sequencing-based SSR marker development and their validation in Dendrocalamus longispathus. Funct Integr Genomics 2023; 23:103. [PMID: 36973584 DOI: 10.1007/s10142-023-01033-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 03/16/2023] [Accepted: 03/16/2023] [Indexed: 03/29/2023]
Abstract
Bamboo is an important genetic resource in India, supporting rural livelihood and industries. Unfortunately, most Indian bamboo taxa are devoid of basic genomic or marker information required to comprehend the genetic processes for further conservation and management. In this study, we perform genome survey sequencing for development of de novo genomic SSRs in Dendrocalamus longispathus, a socioeconomically important bamboo species of northeast India. Using Illumina platform, 69.49 million raw reads were generated and assembled into 1,145,321 contig with GC content 43% and N50 1228 bp. In total, 46,984 microsatellite repeats were mined-out wherein di-nucleotide repeats were most abundant (54.71%) followed by mono- (31.91%) and tri-repeats (9.85%). Overall, AT-rich repeats were predominant in the genome, but GC-rich motifs were more frequent in tri-repeats. Afterwards, 21,596 SSR loci were successfully tagged with the primer pairs, and a subset of 50 were validated through polymerase chain reaction amplification. Of these, 36 SSR loci were successfully amplified, and 16 demonstrated polymorphism. Using 13 polymorphic SSRs, a moderate level of gene diversity (He = 0.480; Ar = 3.52) was recorded in the analysed populations of D. longispathus. Despite the high gene flow (Nm = 4.928) and low genetic differentiation (FST = 0.119), severe inbreeding (FIS = 0.407) was detected. Further, genetic clustering and STRUCTURE analysis revealed that the entire genetic variability is captured under two major gene pools. Conclusively, we present a comprehensive set of novel SSR markers in D. longispathus as well as other taxa of tropical woody bamboos.
Collapse
Affiliation(s)
- Rajendra K Meena
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India.
| | - Priyanka Kashyap
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India
| | - Arzoo Shamoon
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India
| | - Payal Dhyani
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India
| | - Hansraj Sharma
- ICFRE - Bamboo & Rattan Centre, Aizawl, 796007, Mizoram, India
- ICFRE-Rain Forest Research Institute, Jorhat, 785001, Assam, India
| | - Maneesh S Bhandari
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India
| | - Santan Barthwal
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India
| | - Harish S Ginwal
- Division of Genetics & Tree Improvement, ICFRE-Forest Research Institute, Dehradun, 248 195, Uttarakhand, India
| |
Collapse
|
2
|
Jamdade RA, Mahmoud T, Gairola S. Prospects of genomic resources available at the global databases for the flora of United Arab Emirates. 3 Biotech 2019; 9:333. [PMID: 31475085 PMCID: PMC6702620 DOI: 10.1007/s13205-019-1855-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 08/01/2019] [Indexed: 10/26/2022] Open
Abstract
This article emphasizes available genomic resources at the global databases National Center for Biotechnology Information (NCBI) GenBank, Gramene and Phytozome for the selected 378 plant taxa of the United Arab Emirates (UAE). Germplasm of these species was collected and banked at the Sharjah Seed Bank and Herbarium (SSBH) along with their related information on habit, habitat and occurrence. The occurrence statistics exhibits almost 19.84% species under rare-to-very rare category, the GenBank search statistics for this category indicates 17.72% species as studied and 2.11% as not studied. Overall, from the global search statistics for 378 plant species, it seems that about 40 (10.58%) species remained unstudied. Most of the unstudied species were herbaceous plants belonging to the mountainous habitat. Moreover, full genomes were recorded for 7 species at NCBI GenBank, 2 species at Phytozome and 1 species at Gramene database. The local search statistics (for UAE) exhibits about 10.58% of the flora that still remained unstudied and only 11 (2.90%) of the recorded species were having genomic information at NCBI GenBank. It is necessary to prioritize studies on such species that could provide valuable insight on their genetic composition in order to understand their adaptation to the natural environment. At present, the SSBH is cataloguing UAE's flora using core barcode and assisted markers that could provide a robust DNA barcode library for native plants of UAE. Our study appeals researchers to recognize and prioritize the species that need attention to enrich their genomic resources at the global databases by supporting nucleotide libraries with their conspecifics. At present, genomic resources for UAE plant taxa are limited, but with the advent of low-cost sequencing technologies these resources would flourish in the near future. Nevertheless, the information generated through genomic studies could be utilized for conservation and management of threatened and endangered plant species, Crop Wild Relatives and medicinal plants. We hope this article will promote interest in conducting additional studies in genomics of desert plants by encouraging researchers to participate in this emerging field.
Collapse
Affiliation(s)
- Rahul A. Jamdade
- Plant Biotechnology Laboratory, Sharjah Research Academy, P. Box 60999, Sharjah, UAE
| | - Tamer Mahmoud
- Sharjah Seed Bank and Herbarium, Sharjah Research Academy, P. Box 60999, Sharjah, UAE
| | - Sanjay Gairola
- Sharjah Seed Bank and Herbarium, Sharjah Research Academy, P. Box 60999, Sharjah, UAE
| |
Collapse
|
3
|
Moisy C, Schulman AH, Kalendar R, Buchmann JP, Pelsy F. The Tvv1 retrotransposon family is conserved between plant genomes separated by over 100 million years. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2014; 127:1223-35. [PMID: 24590356 DOI: 10.1007/s00122-014-2293-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Accepted: 02/21/2014] [Indexed: 05/18/2023]
Abstract
Combining several different approaches, we have examined the structure, variability, and distribution of Tvv1 retrotransposons. Tvv1 is an unusual example of a low-copy retrotransposon metapopulation dispersed unevenly among very distant species and is promising for the development of molecular markers. Retrotransposons are ubiquitous throughout the genomes of the vascular plants, but individual retrotransposon families tend to be confined to the level of plant genus or at most family. This restricts the general applicability of a family as molecular markers. Here, we characterize a new plant retrotransposon named Tvv1_Sdem, a member of the Copia superfamily of LTR retrotransposons, from the genome of the wild potato Solanum demissum. Comparative analyses based on structure and sequence showed a high level of similarity of Tvv1_Sdem with Tvv1-VB, a retrotransposon previously described in the grapevine genome Vitis vinifera. Extending the analysis to other species by in silico and in vitro approaches revealed the presence of Tvv1 family members in potato, tomato, and poplar genomes, and led to the identification of full-length copies of Tvv1 in these species. We were also able to identify polymorphism in UTL sequences between Tvv1_Sdem copies from wild and cultivated potatoes that are useful as molecular markers. Combining different approaches, our results suggest that the Tvv1 family of retrotransposons has a monophyletic origin and has been maintained in both the rosids and the asterids, the major clades of dicotyledonous plants, since their divergence about 100 MYA. To our knowledge, Tvv1 represents an unusual plant retrotransposon metapopulation comprising highly similar members disjointedly dispersed among very distant species. The twin features of Tvv1 presence in evolutionarily distant genomes and the diversity of its UTL region in each species make it useful as a source of robust molecular markers for diversity studies and breeding.
Collapse
Affiliation(s)
- Cédric Moisy
- MTT/BI Plant Genomics Lab, Institute of Biotechnology, University of Helsinki, P.O. Box 65, Biocenter 3, Viikinkaari 1, 00014, Helsinki, Finland,
| | | | | | | | | |
Collapse
|
4
|
Nadeau NJ, Whibley A, Jones RT, Davey JW, Dasmahapatra KK, Baxter SW, Quail MA, Joron M, ffrench-Constant RH, Blaxter ML, Mallet J, Jiggins CD. Genomic islands of divergence in hybridizing Heliconius butterflies identified by large-scale targeted sequencing. Philos Trans R Soc Lond B Biol Sci 2012; 367:343-53. [PMID: 22201164 DOI: 10.1098/rstb.2011.0198] [Citation(s) in RCA: 274] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Heliconius butterflies represent a recent radiation of species, in which wing pattern divergence has been implicated in speciation. Several loci that control wing pattern phenotypes have been mapped and two were identified through sequencing. These same gene regions play a role in adaptation across the whole Heliconius radiation. Previous studies of population genetic patterns at these regions have sequenced small amplicons. Here, we use targeted next-generation sequence capture to survey patterns of divergence across these entire regions in divergent geographical races and species of Heliconius. This technique was successful both within and between species for obtaining high coverage of almost all coding regions and sufficient coverage of non-coding regions to perform population genetic analyses. We find major peaks of elevated population differentiation between races across hybrid zones, which indicate regions under strong divergent selection. These 'islands' of divergence appear to be more extensive between closely related species, but there is less clear evidence for such islands between more distantly related species at two further points along the 'speciation continuum'. We also sequence fosmid clones across these regions in different Heliconius melpomene races. We find no major structural rearrangements but many relatively large (greater than 1 kb) insertion/deletion events (including gain/loss of transposable elements) that are variable between races.
Collapse
Affiliation(s)
- Nicola J Nadeau
- Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
Janicki M, Rooke R, Yang G. Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes. Chromosome Res 2012; 19:787-808. [PMID: 21850457 DOI: 10.1007/s10577-011-9230-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
A major portion of most eukaryotic genomes are transposable elements (TEs). During evolution, TEs have introduced profound changes to genome size, structure, and function. As integral parts of genomes, the dynamic presence of TEs will continue to be a major force in reshaping genomes. Early computational analyses of TEs in genome sequences focused on filtering out "junk" sequences to facilitate gene annotation. When the high abundance and diversity of TEs in eukaryotic genomes were recognized, these early efforts transformed into the systematic genome-wide categorization and classification of TEs. The availability of genomic sequence data reversed the classical genetic approaches to discovering new TE families and superfamilies. Curated TE databases and their accurate annotation of genome sequences in turn facilitated the studies on TEs in a number of frontiers including: (1) TE-mediated changes of genome size and structure, (2) the influence of TEs on genome and gene functions, (3) TE regulation by host, (4) the evolution of TEs and their population dynamics, and (5) genomic scale studies of TE activity. Bioinformatics and genomic approaches have become an integral part of large-scale studies on TEs to extract information with pure in silico analyses or to assist wet lab experimental studies. The current revolution in genome sequencing technology facilitates further progress in the existing frontiers of research and emergence of new initiatives. The rapid generation of large-sequence datasets at record low costs on a routine basis is challenging the computing industry on storage capacity and manipulation speed and the bioinformatics community for improvement in algorithms and their implementations.
Collapse
Affiliation(s)
- Mateusz Janicki
- Department of Biology, University of Toronto at Mississauga, 3359 Mississauga Road, Mississauga, ON L5L1C6, Canada
| | | | | |
Collapse
|
6
|
Ferguson L, Lee SF, Chamberlain N, Nadeau N, Joron M, Baxter S, Wilkinson P, Papanicolaou A, Kumar S, Kee TJ, Clark R, Davidson C, Glithero R, Beasley H, Vogel H, Ffrench-Constant R, Jiggins C. Characterization of a hotspot for mimicry: assembly of a butterfly wing transcriptome to genomic sequence at theHmYb/Sblocus. Mol Ecol 2010; 19 Suppl 1:240-54. [PMID: 20331783 DOI: 10.1111/j.1365-294x.2009.04475.x] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
7
|
Genomic hotspots for adaptation: the population genetics of Müllerian mimicry in the Heliconius melpomene clade. PLoS Genet 2010; 6:e1000794. [PMID: 20140188 PMCID: PMC2816687 DOI: 10.1371/journal.pgen.1000794] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2009] [Accepted: 11/30/2009] [Indexed: 11/19/2022] Open
Abstract
Wing patterning in Heliconius butterflies is a longstanding example of both Müllerian mimicry and phenotypic radiation under strong natural selection. The loci controlling such patterns are “hotspots” for adaptive evolution with great allelic diversity across different species in the genus. We characterise nucleotide variation, genotype-by-phenotype associations, linkage disequilibrium, and candidate gene expression at two loci and across multiple hybrid zones in Heliconius melpomene and relatives. Alleles at HmB control the presence or absence of the red forewing band, while alleles at HmYb control the yellow hindwing bar. Across HmYb two regions, separated by ∼100 kb, show significant genotype-by-phenotype associations that are replicated across independent hybrid zones. In contrast, at HmB a single peak of association indicates the likely position of functional sites at three genes, encoding a kinesin, a G-protein coupled receptor, and an mRNA splicing factor. At both HmYb and HmB there is evidence for enhanced linkage disequilibrium (LD) between associated sites separated by up to 14 kb, suggesting that multiple sites are under selection. However, there was no evidence for reduced variation or deviations from neutrality that might indicate a recent selective sweep, consistent with these alleles being relatively old. Of the three genes showing an association with the HmB locus, the kinesin shows differences in wing disc expression between races that are replicated in the co-mimic, Heliconius erato, providing striking evidence for parallel changes in gene expression between Müllerian co-mimics. Wing patterning loci in Heliconius melpomene therefore show a haplotype structure maintained by selection, but no evidence for a recent selective sweep. The complex genetic pattern contrasts with the simple genetic basis of many adaptive traits studied previously, but may provide a better model for most adaptation in natural populations that has arisen over millions rather than tens of years. The diversity of wing patterns in Heliconius butterflies is a longstanding example of both Müllerian mimicry and adaptive radiation. The genetic regions controlling such patterns are “hotspots” for adaptive evolution, with small regions of the genome controlling major changes in wing pattern. Across multiple hybrid zones in Heliconius melpomene and related species, we no find no strong population signal of recent selection. Nonetheless, we find significant associations between genetic variation and wing pattern at multiple sites. This suggests patterning alleles are relatively old, and might be a better model for most natural adaptation, in contrast to the simple genetic basis of recent human-induced selection such as pesticide resistance. Strikingly, across the region controlling the red forewing band, a very strong association with phenotype implicates three genes as potentially being involved in control of wing pattern. One of these, a kinesin gene, shows parallel differences in expression levels between divergent forms in the two mimetic species, making it a strong candidate for control of wing pattern. These results show that mimicry involves parallel changes in gene expression and strongly suggest a role for this gene in control of wing pattern.
Collapse
|
8
|
Identifying repeats and transposable elements in sequenced genomes: how to find your way through the dense forest of programs. Heredity (Edinb) 2009; 104:520-33. [PMID: 19935826 DOI: 10.1038/hdy.2009.165] [Citation(s) in RCA: 137] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The production of genome sequences has led to another important advance in their annotation, which is closely linked to the exact determination of their content in terms of repeats, among which are transposable elements (TEs). The evolutionary implications and the presence of coding regions in some TEs can confuse gene annotation, and also hinder the process of genome assembly, making particularly crucial to be able to annotate and classify them correctly in genome sequences. This review is intended to provide an overview as comprehensive as possible of the automated methods currently used to annotate and classify TEs in sequenced genomes. Different categories of programs exist according to their methodology and the repeat, which they can identify. I describe here the main characteristics of the programs, their main goals and the difficulties they can entail. The drawbacks of the different methods are also highlighted to help biologists who are unfamiliar with algorithmic methods to understand this methodology better. Globally, using several different programs and carrying out a cross comparison of their results has the best chance of finding reliable results as any single program. However, this makes it essential to verify the results provided by each program independently. The ideal solution would be to test all programs against the same data set to obtain a true comparison of their actual performance.
Collapse
|
9
|
Rasmussen DA, Noor MAF. What can you do with 0.1x genome coverage? A case study based on a genome survey of the scuttle fly Megaselia scalaris (Phoridae). BMC Genomics 2009; 10:382. [PMID: 19689807 PMCID: PMC2735751 DOI: 10.1186/1471-2164-10-382] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2009] [Accepted: 08/18/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The declining cost of DNA sequencing is making genome sequencing a feasible option for more organisms, including many of interest to ecologists and evolutionary biologists. While obtaining high-depth, completely assembled genome sequences for most non-model organisms remains challenging, low-coverage genome survey sequences (GSS) can provide a wealth of biologically useful information at low cost. Here, using a random pyrosequencing approach, we sequence the genome of the scuttle fly Megaselia scalaris and evaluate the utility of our low-coverage GSS approach. RESULTS Random pyrosequencing of the M. scalaris genome provided a depth of coverage (0.05x0.1x) much lower than typical GSS studies. We demonstrate that, even with extremely low-coverage sequencing, bioinformatics approaches can yield extensive information about functional and repetitive elements. We also use our GSS data to develop genomic resources such as a nearly complete mitochondrial genome sequence and microsatellite markers for M. scalaris. CONCLUSION We conclude that low-coverage genome surveys are effective at generating useful information about organisms currently lacking genomic sequence data.
Collapse
|
10
|
Koressaar T, Jõers K, Remm M. Automatic identification of species-specific repetitive DNA sequences and their utilization for detecting microbial organisms. Bioinformatics 2009; 25:1349-55. [PMID: 19357101 PMCID: PMC2682524 DOI: 10.1093/bioinformatics/btp241] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2009] [Revised: 03/16/2009] [Accepted: 04/03/2009] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION The concentration of pathogen DNA in biological samples is often very low. Therefore, the sensitivity of diagnostic tests is always a critical factor. RESULTS We have developed a novel computational method that identifies species-specific repeats from microbial organisms and automatically designs species-specific PCR primers for these repeats. We tested the methodology on 30 randomly chosen microbial species and we demonstrate that species-specific repeats longer than 300 bp exist in all these genomes. We also used our methodology to design species-specific PCR primers for 86 repeats from five medically relevant microbial species. These PCR primers were tested experimentally. We demonstrate that using species-specific repeats as a PCR template region can increase the sensitivity of PCR in diagnostic tests. AVAILABILITY AND IMPLEMENTATION A web version of the method called MultiMPrimer3 was implemented and is freely available at (http://bioinfo.ut.ee/multimprimer3/).
Collapse
Affiliation(s)
- Triinu Koressaar
- Department of Bioinformatics, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | | | | |
Collapse
|