51
|
Wang H, Walla JA, Zhong S, Huang D, Dai W. Development and cross-species/genera transferability of microsatellite markers discovered using 454 genome sequencing in chokecherry (Prunus virginiana L.). PLANT CELL REPORTS 2012; 31:2047-2055. [PMID: 22837059 DOI: 10.1007/s00299-012-1315-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2012] [Revised: 06/25/2012] [Accepted: 07/11/2012] [Indexed: 06/01/2023]
Abstract
Chokecherry (Prunus virginiana L.) (2n = 4x = 32) is a unique Prunus species for both genetics and disease-resistance research due to its tetraploid nature and X-disease resistance. However, no genetic and genomic information on chokecherry is available. A partial chokecherry genome was sequenced using Roche 454 sequencing technology. A total of 145,094 reads covering 4.8 Mbp of the chokecherry genome were generated and 15,113 contigs were assembled, of which 11,675 contigs were larger than 100 bp in size. A total of 481 SSR loci were identified from 234 (out of 11,675) contigs and 246 polymerase chain reaction (PCR) primer pairs were designed. Of 246 primers, 212 (86.2 %) effectively produced amplification from the genomic DNA of chokecherry. All 212 amplifiable chokecherry primers were used to amplify genomic DNA from 11 other rosaceous species (sour cherry, sweet cherry, black cherry, peach, apricot, plum, apple, crabapple, pear, juneberry, and raspberry). Thus, chokecherry SSR primers can be transferable across Prunus species and other rosaceous species. An average of 63.2 and 58.7 % of amplifiable chokecherry primers amplified DNA from cherry and other Prunus species, respectively, while 47.2 % of amplifiable chokecherry primers amplified DNA from other rosaceous species. Using random genome sequence data generated from next-generation sequencing technology to identify microsatellite loci appears to be rapid and cost-efficient, particularly for species with no sequence information available. Sequence information and confirmed transferability of the identified chokecherry SSRs among species will be valuable for genetic research in Prunus and other rosaceous species. Key message A total of 246 SSR primers were identified from chokecherry genome sequences. Of which, 212 were confirmed amplifiable both in chokecherry and other 11 other rosaceous species.
Collapse
Affiliation(s)
- Hongxia Wang
- Department of Plant Sciences, North Dakota State University, Fargo, ND, USA
| | | | | | | | | |
Collapse
|
52
|
Khajudparn P, Prajongjai T, Poolsawat O, Tantasawat PA. Application of ISSR markers for verification of F₁ hybrids in mungbean (Vigna radiata). GENETICS AND MOLECULAR RESEARCH 2012; 11:3329-38. [PMID: 23079826 DOI: 10.4238/2012.september.17.3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Mungbean improvement via hybridization requires the identification of true F(1) hybrids from controlled crosses before further generations of selfing/crossing and selection. We utilized inter-simple sequence repeat (ISSR) markers for identifying putative F(1) hybrids from six cross combinations whose morphological characteristics were very similar to those of their respective female parents and could not be visually discriminated from the self-pollinated progeny. Based on 10 ISSR primers, polymorphisms were found between female and male parents of all six cross combinations. The highest value of genetic differentiation (21.4%) was found between male and female parents of the SUT3 x M5-1 cross. These 10 ISSR primers gave 2.8-25.0% polymorphism between male and female parents, with a mean of 12.1%, and 0-13.0% polymorphism between F(1) hybrid and female parents, with a mean of 4.8%. F(1) hybrids of all six cross combinations could be differentiated from the self-pollinated progeny of their female parents by using only either ISSR 841 or 857 primers, together with the ISSR 835 primer. We conclude that ISSR markers are useful and efficient for identifying mungbean F(1) hybrids in controlled crosses from different genetic background.
Collapse
Affiliation(s)
- P Khajudparn
- School of Crop Production Technology, Institute of Agricultural Technology, Suranaree University of Technology, Nakhon Ratchasima, Thailand
| | | | | | | |
Collapse
|
53
|
Guimarães PM, Brasileiro ACM, Morgante CV, Martins ACQ, Pappas G, Silva OB, Togawa R, Leal-Bertioli SCM, Araujo ACG, Moretzsohn MC, Bertioli DJ. Global transcriptome analysis of two wild relatives of peanut under drought and fungi infection. BMC Genomics 2012; 13:387. [PMID: 22888963 PMCID: PMC3496627 DOI: 10.1186/1471-2164-13-387] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 06/05/2012] [Indexed: 11/25/2022] Open
Abstract
Background Cultivated peanut (Arachis hypogaea) is one of the most widely grown grain legumes in the world, being valued for its high protein and unsaturated oil contents. Worldwide, the major constraints to peanut production are drought and fungal diseases. Wild Arachis species, which are exclusively South American in origin, have high genetic diversity and have been selected during evolution in a range of environments and biotic stresses, constituting a rich source of allele diversity. Arachis stenosperma harbors resistances to a number of pests, including fungal diseases, whilst A. duranensis has shown improved tolerance to water limited stress. In this study, these species were used for the creation of an extensive databank of wild Arachis transcripts under stress which will constitute a rich source for gene discovery and molecular markers development. Results Transcriptome analysis of cDNA collections from A. stenosperma challenged with Cercosporidium personatum (Berk. and M.A. Curtis) Deighton, and A. duranensis submitted to gradual water limited stress was conducted using 454 GS FLX Titanium generating a total of 7.4 x 105 raw sequence reads covering 211 Mbp of both genomes. High quality reads were assembled to 7,723 contigs for A. stenosperma and 12,792 for A. duranensis and functional annotation indicated that 95% of the contigs in both species could be appointed to GO annotation categories. A number of transcription factors families and defense related genes were identified in both species. Additionally, the expression of five A. stenosperma Resistance Gene Analogs (RGAs) and four retrotransposon (FIDEL-related) sequences were analyzed by qRT-PCR. This data set was used to design a total of 2,325 EST-SSRs, of which a subset of 584 amplified in both species and 214 were shown to be polymorphic using ePCR. Conclusions This study comprises one of the largest unigene dataset for wild Arachis species and will help to elucidate genes involved in responses to biological processes such as fungal diseases and water limited stress. Moreover, it will also facilitate basic and applied research on the genetics of peanut through the development of new molecular markers and the study of adaptive variation across the genus.
Collapse
Affiliation(s)
- Patricia M Guimarães
- EMBRAPA Recursos Genéticos e Biotecnologia, Parque Estação Biológica, CP 02372 Final W5 Norte, Brasília, DF, Brazil.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
54
|
Iorizzo M, Senalik D, Szklarczyk M, Grzebelus D, Spooner D, Simon P. De novo assembly of the carrot mitochondrial genome using next generation sequencing of whole genomic DNA provides first evidence of DNA transfer into an angiosperm plastid genome. BMC PLANT BIOLOGY 2012; 12:61. [PMID: 22548759 PMCID: PMC3413510 DOI: 10.1186/1471-2229-12-61] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2011] [Accepted: 05/01/2012] [Indexed: 05/02/2023]
Abstract
BACKGROUND Sequence analysis of organelle genomes has revealed important aspects of plant cell evolution. The scope of this study was to develop an approach for de novo assembly of the carrot mitochondrial genome using next generation sequence data from total genomic DNA. RESULTS Sequencing data from a carrot 454 whole genome library were used to develop a de novo assembly of the mitochondrial genome. Development of a new bioinformatic tool allowed visualizing contig connections and elucidation of the de novo assembly. Southern hybridization demonstrated recombination across two large repeats. Genome annotation allowed identification of 44 protein coding genes, three rRNA and 17 tRNA. Identification of the plastid genome sequence allowed organelle genome comparison. Mitochondrial intergenic sequence analysis allowed detection of a fragment of DNA specific to the carrot plastid genome. PCR amplification and sequence analysis across different Apiaceae species revealed consistent conservation of this fragment in the mitochondrial genomes and an insertion in Daucus plastid genomes, giving evidence of a mitochondrial to plastid transfer of DNA. Sequence similarity with a retrotransposon element suggests a possibility that a transposon-like event transferred this sequence into the plastid genome. CONCLUSIONS This study confirmed that whole genome sequencing is a practical approach for de novo assembly of higher plant mitochondrial genomes. In addition, a new aspect of intercompartmental genome interaction was reported providing the first evidence for DNA transfer into an angiosperm plastid genome. The approach used here could be used more broadly to sequence and assemble mitochondrial genomes of diverse species. This information will allow us to better understand intercompartmental interactions and cell evolution.
Collapse
Affiliation(s)
- Massimo Iorizzo
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI 53706, USA
| | - Douglas Senalik
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI 53706, USA
- USDA-Agricultural Research Service, Vegetable Crops Research Unit, University of Wisconsin, 1575 Linden Drive, Madison, WI 53706, USA
| | - Marek Szklarczyk
- Department of Genetics, Plant Breeding and Seed Science, University of Agriculture Krakow, Al. 29 Listopada 54, 31-425, Krakow, Poland
| | - Dariusz Grzebelus
- Department of Genetics, Plant Breeding and Seed Science, University of Agriculture Krakow, Al. 29 Listopada 54, 31-425, Krakow, Poland
| | - David Spooner
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI 53706, USA
- USDA-Agricultural Research Service, Vegetable Crops Research Unit, University of Wisconsin, 1575 Linden Drive, Madison, WI 53706, USA
| | - Philipp Simon
- Department of Horticulture, University of Wisconsin-Madison, 1575 Linden Drive, Madison, WI 53706, USA
- USDA-Agricultural Research Service, Vegetable Crops Research Unit, University of Wisconsin, 1575 Linden Drive, Madison, WI 53706, USA
| |
Collapse
|
55
|
Wang Y, Zeng X, Iyer NJ, Bryant DW, Mockler TC, Mahalingam R. Exploring the switchgrass transcriptome using second-generation sequencing technology. PLoS One 2012; 7:e34225. [PMID: 22479570 PMCID: PMC3315583 DOI: 10.1371/journal.pone.0034225] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2011] [Accepted: 02/27/2012] [Indexed: 01/19/2023] Open
Abstract
Background Switchgrass (Panicum virgatum L.) is a C4 perennial grass and widely popular as an important bioenergy crop. To accelerate the pace of developing high yielding switchgrass cultivars adapted to diverse environmental niches, the generation of genomic resources for this plant is necessary. The large genome size and polyploid nature of switchgrass makes whole genome sequencing a daunting task even with current technologies. Exploring the transcriptional landscape using next generation sequencing technologies provides a viable alternative to whole genome sequencing in switchgrass. Principal Findings Switchgrass cDNA libraries from germinating seedlings, emerging tillers, flowers, and dormant seeds were sequenced using Roche 454 GS-FLX Titanium technology, generating 980,000 reads with an average read length of 367 bp. De novo assembly generated 243,600 contigs with an average length of 535 bp. Using the foxtail millet genome as a reference greatly improved the assembly and annotation of switchgrass ESTs. Comparative analysis of the 454-derived switchgrass EST reads with other sequenced monocots including Brachypodium, sorghum, rice and maize indicated a 70–80% overlap. RPKM analysis demonstrated unique transcriptional signatures of the four tissues analyzed in this study. More than 24,000 ESTs were identified in the dormant seed library. In silico analysis indicated that there are more than 2000 EST-SSRs in this collection. Expression of several orphan ESTs was confirmed by RT-PCR. Significance We estimate that about 90% of the switchgrass gene space has been covered in this analysis. This study nearly doubles the amount of EST information for switchgrass currently in the public domain. The celerity and economical nature of second-generation sequencing technologies provide an in-depth view of the gene space of complex genomes like switchgrass. Sequence analysis of closely related members of the NAD+-malic enzyme type C4 grasses such as the model system Setaria viridis can serve as a viable proxy for the switchgrass genome.
Collapse
Affiliation(s)
- Yixing Wang
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, Oklahoma, United States of America
| | - Xin Zeng
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, Oklahoma, United States of America
| | - Niranjani J. Iyer
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, Oklahoma, United States of America
| | - Douglas W. Bryant
- Donald Danforth Plant Science Center, St. Louis, Missouri, United States of America
| | - Todd C. Mockler
- Donald Danforth Plant Science Center, St. Louis, Missouri, United States of America
| | - Ramamurthy Mahalingam
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, Oklahoma, United States of America
- * E-mail:
| |
Collapse
|
56
|
Sompong U, Somta P, Raboy V, Srinives P. Mapping of quantitative trait loci for phytic acid and phosphorus contents in seed and seedling of mungbean (Vigna radiata (L.) Wilczek). BREEDING SCIENCE 2012; 62:87-92. [PMID: 23136518 PMCID: PMC3405952 DOI: 10.1270/jsbbs.62.87] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2011] [Accepted: 11/21/2011] [Indexed: 05/03/2023]
Abstract
Phytic acid (PA) is the storage form of phosphorus (P) in seeds and plays an important role in the nutritional quality of food crops. There is little information on the genetics of seed and seedling PA in mungbean [Vigna radiata (L.) Wilczek]. Quantitative trait loci (QTL) were identified for phytic acid P (PAP), total P (TP), and inorganic P (IP) in mungbean seeds and seedlings, and for flowering, maturity and seed weight, in an F(2) population developed from a cross between low PAP cultivated mungbean (V1725BG) and high PAP wild mungbean (AusTRCF321925). Seven QTLs were detected for P compounds in seed; two for PAP, four for IP and one for TP. Six QTLs were identified for P compounds in seedling; three for PAP, two for TP and one for IP. Only one QTL co-localized between P compounds in seed and seedling suggesting that low PAP seed and low PAP seedling must be selected for at different QTLs. Seed PAP and TP were positively correlated with days to flowering and maturity, indicating the importance of plant phenology to seed P content.
Collapse
Affiliation(s)
- Utumporn Sompong
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| | - Prakit Somta
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| | - Victor Raboy
- USDA-ARS Small Grains and Potato Research Unit, 1691 South 2700 West, Aberdeen, Idaho 83210, USA
| | - Peerasak Srinives
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
- Corresponding author (e-mail: )
| |
Collapse
|
57
|
Kongjaimun A, Kaga A, Tomooka N, Somta P, Shimizu T, Shu Y, Isemura T, Vaughan DA, Srinives P. An SSR-based linkage map of yardlong bean (Vigna unguiculata (L.) Walp. subsp. unguiculata Sesquipedalis Group) and QTL analysis of pod length. Genome 2012; 55:81-92. [DOI: 10.1139/g11-078] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Yardlong bean (Vigna unguiculata (L.) Walp. subsp. unguiculata Sesquipedalis Group) (2n = 2x = 22) is one of the most important vegetable legumes of Asia. The objectives of this study were to develop a genetic linkage map of yardlong bean using SSR makers from related Vigna species and to identify QTLs for pod length. The map was constructed from 226 simple sequence repeat (SSR) markers from cowpea (Vigna unguiculata (L.) Walp. subsp. unguiculata Unguiculata Group), azuki bean (Vigna angularis (Willd.) Ohwi & Ohashi), and mungbean (Vigna radiata (L.) Wilczek) in a BC1F1 ((JP81610 × TVnu457) × JP81610) population derived from the cross between yardlong bean accession JP81610 and wild cowpea (Vigna unguiculata subsp. unguiculata var. spontanea) accession TVnu457. The markers were clustered into 11 linkage groups (LGs) spanning 852.4 cM in total length with a mean distance between adjacent markers of 3.96 cM. All markers on LG11 showed segregation distortion towards the homozygous yardlong bean JP81610 genotype. The markers on LG11 were also distorted in the rice bean (Vigna umbellata (Thunb.) Ohwi & Ohashi) map, suggesting the presence of common segregation distortion factors in Vigna species on this LG. One major and six minor QTLs were identified for pod length variation between yardlong bean and wild cowpea. Using flanking markers, six of the seven QTLs were confirmed in an F2 population of JP81610 × TVnu457. The molecular linkage map developed and markers linked to pod length QTLs would be potentially useful for yardlong bean and cowpea breeding.
Collapse
Affiliation(s)
- Alisa Kongjaimun
- Program in Plant Breeding, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| | - Akito Kaga
- Genebank, National Institute of Agrobiological Sciences, Kannondai 2-1-2, Tsukuba, Ibaraki 305-8602, Japan
| | - Norihiko Tomooka
- Genebank, National Institute of Agrobiological Sciences, Kannondai 2-1-2, Tsukuba, Ibaraki 305-8602, Japan
| | - Prakit Somta
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| | - Takehiko Shimizu
- Institute of Society for Techno-Innovation of Agriculture, Forestry and Fisheries, Kamiyokoba Ippaizuka 446-1, Tsukuba, Ibaraki 305-0854, Japan
| | - Yujian Shu
- Genebank, National Institute of Agrobiological Sciences, Kannondai 2-1-2, Tsukuba, Ibaraki 305-8602, Japan
| | - Takehisa Isemura
- Genebank, National Institute of Agrobiological Sciences, Kannondai 2-1-2, Tsukuba, Ibaraki 305-8602, Japan
| | - Duncan A. Vaughan
- Genebank, National Institute of Agrobiological Sciences, Kannondai 2-1-2, Tsukuba, Ibaraki 305-8602, Japan
- FAO Regional Office for Asia and the Pacific, Maliwan Mansion, 39 Phra Atit Road, Bangkok 10200, Thailand
| | - Peerasak Srinives
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| |
Collapse
|
58
|
Zalapa JE, Cuevas H, Zhu H, Steffan S, Senalik D, Zeldin E, McCown B, Harbut R, Simon P. Using next-generation sequencing approaches to isolate simple sequence repeat (SSR) loci in the plant sciences. AMERICAN JOURNAL OF BOTANY 2012; 99:193-208. [PMID: 22186186 DOI: 10.3732/ajb.1100394] [Citation(s) in RCA: 228] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
The application of next-generation sequencing (NGS) technologies for the development of simple sequence repeat (SSR) or microsatellite loci for genetic research in the botanical sciences is described. Microsatellite markers are one of the most informative and versatile DNA-based markers used in plant genetic research, but their development has traditionally been a difficult and costly process. NGS technologies allow the efficient identification of large numbers of microsatellites at a fraction of the cost and effort of traditional approaches. The major advantage of NGS methods is their ability to produce large amounts of sequence data from which to isolate and develop numerous genome-wide and gene-based microsatellite loci. The two major NGS technologies with emergent application in SSR isolation are 454 and Illumina. A review is provided of several recent studies demonstrating the efficient use of 454 and Illumina technologies for the discovery of microsatellites in plants. Additionally, important aspects during NGS isolation and development of microsatellites are discussed, including the use of computational tools and high-throughput genotyping methods. A data set of microsatellite loci in the plastome and mitochondriome of cranberry (Vaccinium macrocarpon Ait.) is provided to illustrate a successful application of 454 sequencing for SSR discovery. In the future, NGS technologies will massively increase the number of SSRs and other genetic markers available to conduct genetic research in understudied but economically important crops such as cranberry.
Collapse
Affiliation(s)
- Juan E Zalapa
- U. S. Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, 1575 Linden Drive, Madison, Wisconsin 53706, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
59
|
Wang H, Huan P, Lu X, Liu B. Mining of EST-SSR markers in clam Meretrix meretrix larvae from 454 shotgun transcriptome. Genes Genet Syst 2012; 86:197-205. [PMID: 21952209 DOI: 10.1266/ggs.86.197] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A total of 2,970 EST-SSRs (2.38%) were identified by transcriptome sequencing of clam Meretrix meretrix (751,970 reads, ~310.82 Mbp), using 454 Genome Sequencer FLX next-generation sequencing platform. Dinucleotide SSR was the dominant repeat type (40.2%), followed by trinucleotide (37.8%), tetranuleotide (12.0%) and pentanucleotide (2.0%) SSR. The dominant repeat motif was AT (71.3%) in the dinucleotide SSR type and AAC (45.6%) in the trinucleotide SSR type. Nearly 79% of all microsatellites had flanking sequences suitable for PCR primer design. Half of PAL were found to be polymorphic in a subset of 40 primer pairs randomly selected. Specifically, the density of dinucleotide, trinucleotide and tetranucleotide repeats showed significant variation among four development stages (trochophore, D-veliger, pediveliger and postlarva). The results suggested that dinucleotide, trinucleotide and tetranucleotide SSRs may play an important role in contributing to the different expression profiles in larval stages.
Collapse
Affiliation(s)
- Hongxia Wang
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Quingdao, China
| | | | | | | |
Collapse
|
60
|
Zhu H, Senalik D, McCown BH, Zeldin EL, Speers J, Hyman J, Bassil N, Hummer K, Simon PW, Zalapa JE. Mining and validation of pyrosequenced simple sequence repeats (SSRs) from American cranberry (Vaccinium macrocarpon Ait.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:87-96. [PMID: 21904845 DOI: 10.1007/s00122-011-1689-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2011] [Accepted: 08/17/2011] [Indexed: 05/02/2023]
Abstract
The American cranberry (Vaccinium macrocarpon Ait.) is a major commercial fruit crop in North America, but limited genetic resources have been developed for the species. Furthermore, the paucity of codominant DNA markers has hampered the advance of genetic research in cranberry and the Ericaceae family in general. Therefore, we used Roche 454 sequencing technology to perform low-coverage whole genome shotgun sequencing of the cranberry cultivar 'HyRed'. After de novo assembly, the obtained sequence covered 266.3 Mb of the estimated 540-590 Mb in cranberry genome. A total of 107,244 SSR loci were detected with an overall density across the genome of 403 SSR/Mb. The AG repeat was the most frequent motif in cranberry accounting for 35% of all SSRs and together with AAG and AAAT accounted for 46% of all loci discovered. To validate the SSR loci, we designed 96 primer-pairs using contig sequence data containing perfect SSR repeats, and studied the genetic diversity of 25 cranberry genotypes. We identified 48 polymorphic SSR loci with 2-15 alleles per locus for a total of 323 alleles in the 25 cranberry genotypes. Genetic clustering by principal coordinates and genetic structure analyzes confirmed the heterogeneous nature of cranberries. The parentage composition of several hybrid cultivars was evident from the structure analyzes. Whole genome shotgun 454 sequencing was a cost-effective and efficient way to identify numerous SSR repeats in the cranberry sequence for marker development.
Collapse
Affiliation(s)
- H Zhu
- Department of Horticulture, University of Wisconsin, Madison, WI, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
61
|
Couceiro L, Maneiro I, Mauger S, Valero M, Ruiz JM, Barreiro R. MICROSATELLITE DEVELOPMENT IN RHODOPHYTA USING HIGH-THROUGHPUT SEQUENCE DATA(1). JOURNAL OF PHYCOLOGY 2011; 47:1258-1265. [PMID: 27020349 DOI: 10.1111/j.1529-8817.2011.01075.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Shotgun genome sequencing is rapidly emerging as the method of choice for the identification of microsatellite loci in nonmodel organisms. However, to the best of our knowledge, this approach has not been applied to marine algae so far. Herein, we report the results of using the 454 next-generation sequencing (NGS) platform to randomly sample 36.0 and 40.9 Mbp (139,786 and 139,795 reads, respectively) of the genome of two red algae from the northwest Iberian Peninsula [Grateloupia lanceola (J. Agardh) J. Agardh and a still undescribed new member of the family Cruoriaceae]. Using data mining tools, we identified 4,766 and 5,174 perfect microsatellite loci in 4,344 and 4,504 sequences/contigs from G. lanceola and the Cruoriaceae, respectively. After conservative removal of potentially problematic loci (redundant sequences, mobile elements), primer design was possible for 1,371 and 1,366 loci, respectively. A survey of the literature indicates that microsatellite density in our Rhodophyta is at the low end of the values reported for other organisms investigated with the same technology (land plants and animals). A limited number of loci were successfully tested for PCR amplification and polymorphism finding that they may be suitable for population genetic studies. This study demonstrates that random genome sequencing is a rapid, effective alternative to develop useful microsatellite loci in previously unstudied red algae.
Collapse
Affiliation(s)
- Lucía Couceiro
- Área de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain CNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, SpainCNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain
| | - Isabel Maneiro
- Área de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain CNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, SpainCNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain
| | - Stéphane Mauger
- Área de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain CNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, SpainCNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain
| | - Myriam Valero
- Área de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain CNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, SpainCNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain
| | - José Miguel Ruiz
- Área de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain CNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, SpainCNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain
| | - Rodolfo Barreiro
- Área de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain CNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, SpainCNRS-UPMC, UMR 7144, Équipe BEDIM, Station Biologique de Roscoff, Place Georges Teissier, F-29682 Roscoff Cedex, FranceÁrea de Ecoloxía, Facultade de Ciencias, Universidade da Coruña, Campus de A Zapateira s/n, 1571-A Coruña, Spain
| |
Collapse
|
62
|
Triwitayakorn K, Chatkulkawin P, Kanjanawattanawong S, Sraphet S, Yoocha T, Sangsrakru D, Chanprasert J, Ngamphiw C, Jomchai N, Therawattanasuk K, Tangphatsornruang S. Transcriptome sequencing of Hevea brasiliensis for development of microsatellite markers and construction of a genetic linkage map. DNA Res 2011; 18:471-82. [PMID: 22086998 PMCID: PMC3223080 DOI: 10.1093/dnares/dsr034] [Citation(s) in RCA: 95] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
To obtain more information on the Hevea brasiliensis genome, we sequenced the transcriptome from the vegetative shoot apex yielding 2 311 497 reads. Clustering and assembly of the reads produced a total of 113 313 unique sequences, comprising 28 387 isotigs and 84 926 singletons. Also, 17 819 expressed sequence tag (EST)-simple sequence repeats (SSRs) were identified from the data set. To demonstrate the use of this EST resource for marker development, primers were designed for 430 of the EST-SSRs. Three hundred and twenty-three primer pairs were amplifiable in H. brasiliensis clones. Polymorphic information content values of selected 47 SSRs among 20 H. brasiliensis clones ranged from 0.13 to 0.71, with an average of 0.51. A dendrogram of genetic similarities between the 20 H. brasiliensis clones using these 47 EST-SSRs suggested two distinct groups that correlated well with clone pedigree. These novel EST-SSRs together with the published SSRs were used for the construction of an integrated parental linkage map of H. brasiliensis based on 81 lines of an F1 mapping population. The map consisted of 97 loci, consisting of 37 novel EST-SSRs and 60 published SSRs, distributed on 23 linkage groups and covered 842.9 cM with a mean interval of 11.9 cM and ∼4 loci per linkage group. Although the numbers of linkage groups exceed the haploid number (18), but with several common markers between homologous linkage groups with the previous map indicated that the F1 map in this study is appropriate for further study in marker-assisted selection.
Collapse
|
63
|
Somta P, Chankaew S, Rungnoi O, Srinives P. Genetic diversity of the Bambara groundnut (Vigna subterranea (L.) Verdc.) as assessed by SSR markers. Genome 2011; 54:898-910. [DOI: 10.1139/g11-056] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Bambara groundnut ( Vigna subterranea (L.) Verdc.) is an important African legume crop. In this study, a collection consisting of 240 accessions was analyzed using 22 simple sequence repeat (SSR) markers. In total, 166 alleles were detected, with a mean of 7.59 alleles per locus. Allelic and gene diversities were higher in the west African and Cameroon/Nigeria regions with 6.68 and 6.18 alleles per locus, and 0.601 and 0.571, respectively. The genetic distance showed high similarity between west African and Cameroon/Nigeria accessions. Principal coordinate analyses and neighbor-joining analysis consistently revealed that the majority of west African accessions were grouped with Cameroon/Nigeria accessions, but they were differentiated from east African, central African, and southeast Asian accessions. Population structure analysis showed that two subpopulations existed, and most of the east African accessions were restricted to one subpopulation with some Cameroon/Nigeria accessions, whereas most of the west African accessions were associated with most of the Cameroon/Nigeria accessions in the other subpopulation. Comparison with SSR analysis of other Vigna cultigens, i.e., cultivated azuki bean ( Vigna angularis ) and mungbean ( Vigna radiata ), reveals that the mean gene diversity of Bambara groundnut was lower than azuki bean but higher than mungbean.
Collapse
Affiliation(s)
- P. Somta
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| | - S. Chankaew
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| | - O. Rungnoi
- Division of Plant Production Technology, Faculty of Agricultural Technology, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand
| | - P. Srinives
- Department of Agronomy, Faculty of Agriculture at Kamphaeng Saen, Kasetsart University, Nakhon Pathom 73140, Thailand
| |
Collapse
|
64
|
Hammesfahr B, Odronitz F, Hellkamp M, Kollmar M. diArk 2.0 provides detailed analyses of the ever increasing eukaryotic genome sequencing data. BMC Res Notes 2011; 4:338. [PMID: 21906294 PMCID: PMC3180467 DOI: 10.1186/1756-0500-4-338] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2011] [Accepted: 09/09/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Nowadays, the sequencing of even the largest mammalian genomes has become a question of days with current next-generation sequencing methods. It comes as no surprise that dozens of genome assemblies are released per months now. Since the number of next-generation sequencing machines increases worldwide and new major sequencing plans are announced, a further increase in the speed of releasing genome assemblies is expected. Thus it becomes increasingly important to get an overview as well as detailed information about available sequenced genomes. The different sequencing and assembly methods have specific characteristics that need to be known to evaluate the various genome assemblies before performing subsequent analyses. RESULTS diArk has been developed to provide fast and easy access to all sequenced eukaryotic genomes worldwide. Currently, diArk 2.0 contains information about more than 880 species and more than 2350 genome assembly files. Many meta-data like sequencing and read-assembly methods, sequencing coverage, GC-content, extended lists of alternatively used scientific names and common species names, and various kinds of statistics are provided. To intuitively approach the data the web interface makes extensive usage of modern web techniques. A number of search modules and result views facilitate finding and judging the data of interest. Subscribing to the RSS feed is the easiest way to stay up-to-date with the latest genome data. CONCLUSIONS diArk 2.0 is the most up-to-date database of sequenced eukaryotic genomes compared to databases like GOLD, NCBI Genome, NHGRI, and ISC. It is different in that only those projects are stored for which genome assembly data or considerable amounts of cDNA data are available. Projects in planning stage or in the process of being sequenced are not included. The user can easily search through the provided data and directly access the genome assembly files of the sequenced genome of interest. diArk 2.0 is available at http://www.diark.org.
Collapse
Affiliation(s)
- Björn Hammesfahr
- Abteilung NMR basierte Strukturbiologie, Max-Planck-Institut für Biophysikalische Chemie, Am Fassberg 11, D-37077 Göttingen, Germany
| | - Florian Odronitz
- Abteilung NMR basierte Strukturbiologie, Max-Planck-Institut für Biophysikalische Chemie, Am Fassberg 11, D-37077 Göttingen, Germany
| | - Marcel Hellkamp
- Abteilung NMR basierte Strukturbiologie, Max-Planck-Institut für Biophysikalische Chemie, Am Fassberg 11, D-37077 Göttingen, Germany
| | - Martin Kollmar
- Abteilung NMR basierte Strukturbiologie, Max-Planck-Institut für Biophysikalische Chemie, Am Fassberg 11, D-37077 Göttingen, Germany
| |
Collapse
|
65
|
Lepais O, Bacles CFE. Comparison of random and SSR-enriched shotgun pyrosequencing for microsatellite discovery and single multiplex PCR optimization in Acacia harpophylla F. Muell. Ex Benth. Mol Ecol Resour 2011; 11:711-24. [PMID: 21466660 DOI: 10.1111/j.1755-0998.2011.03002.x] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Streamlining the development and genotyping of microsatellites in species for which no genetic information is available represents an important technical challenge to overcome in order to enable mainstream application of state-of-the-art population genetic analysis techniques in nonmodel organisms. Using the example of Acacia harpophylla, an acacia tree endemic of north-eastern Australia, we show that high-throughput shotgun pyrosequencing technology, so-called second-generation sequencing, reduces time and cost of microsatellite marker discovery in nonmodel organisms and of their large-scale typing in natural populations. We found that 0.5% of short sequence reads generated on 454 Genome Sequencer FLX Titanium from random genome sampling and 2.2% of reads generated with prior microsatellite enrichment yielded microsatellite markers with designed polymerase chain reaction (PCR) primers, suggesting that enrichment increases efficiency of pyrosequencing when microsatellite discovery is the primary goal. Using stringent selection criteria to facilitate downstream PCR multiplex design, we identified 1435 microsatellite loci with designed primers from a total of 200,908 short sequence reads. From a subset of 96 loci tested for amplification, 38 were validated for population genetics applications, leading to the optimization of a cost-effective multiplex PCR protocol for the simultaneous typing of nine microsatellites in natural populations of A. harpophylla.
Collapse
Affiliation(s)
- Olivier Lepais
- Institute of Biological and Environmental Sciences, School of Natural Sciences, University of Stirling, Stirling FK9 4LA, UK
| | | |
Collapse
|
66
|
Moe KT, Chung JW, Cho YI, Moon JK, Ku JH, Jung JK, Lee J, Park YJ. Sequence information on simple sequence repeats and single nucleotide polymorphisms through transcriptome analysis of mungbean. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2011; 53:63-73. [PMID: 21205180 DOI: 10.1111/j.1744-7909.2010.01012.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Mungbean (Vigna radiata (L.) Wilczek) is a unique species in its ability to fix atmospheric nitrogen, with early maturity, and relatively good drought resistance. We used 454 sequencing technology for transcriptome sequencing. A total of 150 159 and 142 993 reads produced 5 254 and 6 374 large contigs (≥ 500 bp) with an average length of 833 and 853 for Sunhwa and Jangan, respectively. Functional annotation to known sequences yielded 41.34% and 41.74% unigenes for Jangan and Sunhwa. A higher number of simple sequence repeat (SSR) motifs was identified in Jangan (1 630) compared with that of Sunhwa (1 334). A similar SSR distribution pattern was observed in both varieties. A total of 8 249 single nucleotide polymorphisms (SNPs) and indels with 2 098 high-confidence candidates were identified in the two mungbean varieties. The average distance between individual SNPs was approximately 860 bp. Our report demonstrates the utility of transcriptomic data for implementing a functional annotation and development of genetic markers. We also provide large resource sequence data for mungbean improvement programs.
Collapse
Affiliation(s)
- Kyaw Thu Moe
- Department of Palnt Resources, College of Industrial Sciences, Kongju National University, Yesan 304-702, Republic of Korea
| | | | | | | | | | | | | | | |
Collapse
|
67
|
Bräutigam A, Gowik U. What can next generation sequencing do for you? Next generation sequencing as a valuable tool in plant research. PLANT BIOLOGY (STUTTGART, GERMANY) 2010; 12:831-41. [PMID: 21040298 DOI: 10.1111/j.1438-8677.2010.00373.x] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Next generation sequencing (NGS) technologies have opened fascinating opportunities for the analysis of plants with and without a sequenced genome on a genomic scale. During the last few years, NGS methods have become widely available and cost effective. They can be applied to a wide variety of biological questions, from the sequencing of complete eukaryotic genomes and transcriptomes, to the genome-scale analysis of DNA-protein interactions. In this review, we focus on the use of NGS for plant transcriptomics, including gene discovery, transcript quantification and marker discovery for non-model plants, as well as transcript annotation and quantification, small RNA discovery and antisense transcription analysis for model plants. We discuss the experimental design for analysis of plants with and without a sequenced genome, including considerations on sampling, RNA preparation, sequencing platforms and bioinformatics tools for data analysis. NGS technologies offer exciting new opportunities for the plant sciences, especially for work on plants without a sequenced genome, since large sequence resources can be generated at moderate cost.
Collapse
Affiliation(s)
- A Bräutigam
- Institute of Plant Biochemistry, Heinrich-Heine University, Düsseldorf, Germany.
| | | |
Collapse
|
68
|
Abstract
Next generation sequencing has already been used for genomic analysis of microorganis, human being, animals, and plants. Sample preparation is prerequisite and most important for large-scale sequencing. There are two major interferences for large-scale sequencing, polyA and abundant genes' concealment for rare genes. In order to solve these problems, we used total RNA extracted from violaceae leaves to produce double stranded cDNA. DSN nuclease was used to treat the ds cDNA prior to removing the polyA. Randomly sequencing 100 clones of the treated cDNA showed that there were 94 independent clones in the treated sample, and the sequences did not contained polyA. However, only 62 independent clones were found in the untreated sample, and 15 of the sequencing files were affected by polyA. By randomly sequencing of the treated cDNA, we also found two clones encoded two interested genes. We failed to isolate these genes although the protein mass peaks of them had been found in the MALDI-TOF trace. Furthermore, we designed primers from two known genes with different expression abundances. The PCR yields were approaching similar using the treated cDNAs as templates. These results showed that, removal of the polyA and enrichment of rare genes with DSN can meet the requirements of large-scale sequencing and discovery of new genes.
Collapse
|
69
|
Durand J, Bodénès C, Chancerel E, Frigerio JM, Vendramin G, Sebastiani F, Buonamici A, Gailing O, Koelewijn HP, Villani F, Mattioni C, Cherubini M, Goicoechea PG, Herrán A, Ikaran Z, Cabané C, Ueno S, Alberto F, Dumoulin PY, Guichoux E, de Daruvar A, Kremer A, Plomion C. A fast and cost-effective approach to develop and map EST-SSR markers: oak as a case study. BMC Genomics 2010; 11:570. [PMID: 20950475 PMCID: PMC3091719 DOI: 10.1186/1471-2164-11-570] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 10/15/2010] [Indexed: 08/14/2023] Open
Abstract
Background Expressed Sequence Tags (ESTs) are a source of simple sequence repeats (SSRs) that can be used to develop molecular markers for genetic studies. The availability of ESTs for Quercus robur and Quercus petraea provided a unique opportunity to develop microsatellite markers to accelerate research aimed at studying adaptation of these long-lived species to their environment. As a first step toward the construction of a SSR-based linkage map of oak for quantitative trait locus (QTL) mapping, we describe the mining and survey of EST-SSRs as well as a fast and cost-effective approach (bin mapping) to assign these markers to an approximate map position. We also compared the level of polymorphism between genomic and EST-derived SSRs and address the transferability of EST-SSRs in Castanea sativa (chestnut). Results A catalogue of 103,000 Sanger ESTs was assembled into 28,024 unigenes from which 18.6% presented one or more SSR motifs. More than 42% of these SSRs corresponded to trinucleotides. Primer pairs were designed for 748 putative unigenes. Overall 37.7% (283) were found to amplify a single polymorphic locus in a reference full-sib pedigree of Quercus robur. The usefulness of these loci for establishing a genetic map was assessed using a bin mapping approach. Bin maps were constructed for the male and female parental tree for which framework linkage maps based on AFLP markers were available. The bin set consisting of 14 highly informative offspring selected based on the number and position of crossover sites. The female and male maps comprised 44 and 37 bins, with an average bin length of 16.5 cM and 20.99 cM, respectively. A total of 256 EST-SSRs were assigned to bins and their map position was further validated by linkage mapping. EST-SSRs were found to be less polymorphic than genomic SSRs, but their transferability rate to chestnut, a phylogenetically related species to oak, was higher. Conclusion We have generated a bin map for oak comprising 256 EST-SSRs. This resource constitutes a first step toward the establishment of a gene-based map for this genus that will facilitate the dissection of QTLs affecting complex traits of ecological importance.
Collapse
|
70
|
Feldmeyer B, Hoffmeier K, Pfenninger M. The complete mitochondrial genome of Radix balthica (Pulmonata, Basommatophora), obtained by low coverage shot gun next generation sequencing. Mol Phylogenet Evol 2010; 57:1329-33. [PMID: 20875865 DOI: 10.1016/j.ympev.2010.09.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2010] [Revised: 09/14/2010] [Accepted: 09/20/2010] [Indexed: 02/05/2023]
Abstract
A 454-FLX low-coverage sequencing approach was used to assemble the mitochondrial genome of Radix balthica. The mtDNA sequence is 13,993 nt long and contains 37 genes (13 protein coding genes, two rRNAs and 22 tRNAs). Four genes, the 12S RNA and seven tRNAs are transcribed in reverse order. The sequence is AT rich (71.3%), similar to other basommatophoran species. Comparison with the most closely related mt genomes available (Biomphalaria glabrata and Biomphalaria tenagophila) revealed identical gene orders except for five tRNAs. Next generation sequencing proved to be a fast and easy method for sequencing an entire mitochondrial genome.
Collapse
Affiliation(s)
- B Feldmeyer
- Biodiversity and Climate Research Center (BiK-F), Siesmayerstr. 70A, 60323 Frankfurt am Main, Germany.
| | | | | |
Collapse
|
71
|
Novák P, Neumann P, Macas J. Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data. BMC Bioinformatics 2010; 11:378. [PMID: 20633259 PMCID: PMC2912890 DOI: 10.1186/1471-2105-11-378] [Citation(s) in RCA: 297] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2010] [Accepted: 07/15/2010] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND The investigation of plant genome structure and evolution requires comprehensive characterization of repetitive sequences that make up the majority of higher plant nuclear DNA. Since genome-wide characterization of repetitive elements is complicated by their high abundance and diversity, novel approaches based on massively-parallel sequencing are being adapted to facilitate the analysis. It has recently been demonstrated that the low-pass genome sequencing provided by a single 454 sequencing reaction is sufficient to capture information about all major repeat families, thus providing the opportunity for efficient repeat investigation in a wide range of species. However, the development of appropriate data mining tools is required in order to fully utilize this sequencing data for repeat characterization. RESULTS We adapted a graph-based approach for similarity-based partitioning of whole genome 454 sequence reads in order to build clusters made of the reads derived from individual repeat families. The information about cluster sizes was utilized for assessing the proportion and composition of repeats in the genomes of two model species, Pisum sativum and Glycine max, differing in genome size and 454 sequencing coverage. Moreover, statistical analysis and visual inspection of the topology of the cluster graphs using a newly developed program tool, SeqGrapheR, were shown to be helpful in distinguishing basic types of repeats and investigating sequence variability within repeat families. CONCLUSIONS Repetitive regions of plant genomes can be efficiently characterized by the presented graph-based analysis and the graph representation of repeats can be further used to assess the variability and evolutionary divergence of repeat families, discover and characterize novel elements, and aid in subsequent assembly of their consensus sequences.
Collapse
Affiliation(s)
- Petr Novák
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovska 31, Ceske Budejovice, CZ-37005, Czech Republic
| | - Pavel Neumann
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovska 31, Ceske Budejovice, CZ-37005, Czech Republic
| | - Jiří Macas
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovska 31, Ceske Budejovice, CZ-37005, Czech Republic
| |
Collapse
|
72
|
Zhang C, Xing D. Single-Molecule DNA Amplification and Analysis Using Microfluidics. Chem Rev 2010; 110:4910-47. [DOI: 10.1021/cr900081z] [Citation(s) in RCA: 115] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Chunsun Zhang
- MOE Key Laboratory of Laser Life Science & Institute of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou 510631, China
| | - Da Xing
- MOE Key Laboratory of Laser Life Science & Institute of Laser Life Science, College of Biophotonics, South China Normal University, Guangzhou 510631, China
| |
Collapse
|
73
|
Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA. Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics 2010; 11:180. [PMID: 20233449 PMCID: PMC2851599 DOI: 10.1186/1471-2164-11-180] [Citation(s) in RCA: 341] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2010] [Accepted: 03/16/2010] [Indexed: 01/05/2023] Open
Abstract
Background Massively parallel sequencing of cDNA is now an efficient route for generating enormous sequence collections that represent expressed genes. This approach provides a valuable starting point for characterizing functional genetic variation in non-model organisms, especially where whole genome sequencing efforts are currently cost and time prohibitive. The large and complex genomes of pines (Pinus spp.) have hindered the development of genomic resources, despite the ecological and economical importance of the group. While most genomic studies have focused on a single species (P. taeda), genomic level resources for other pines are insufficiently developed to facilitate ecological genomic research. Lodgepole pine (P. contorta) is an ecologically important foundation species of montane forest ecosystems and exhibits substantial adaptive variation across its range in western North America. Here we describe a sequencing study of expressed genes from P. contorta, including their assembly and annotation, and their potential for molecular marker development to support population and association genetic studies. Results We obtained 586,732 sequencing reads from a 454 GS XLR70 Titanium pyrosequencer (mean length: 306 base pairs). A combination of reference-based and de novo assemblies yielded 63,657 contigs, with 239,793 reads remaining as singletons. Based on sequence similarity with known proteins, these sequences represent approximately 17,000 unique genes, many of which are well covered by contig sequences. This sequence collection also included a surprisingly large number of retrotransposon sequences, suggesting that they are highly transcriptionally active in the tissues we sampled. We located and characterized thousands of simple sequence repeats and single nucleotide polymorphisms as potential molecular markers in our assembled and annotated sequences. High quality PCR primers were designed for a substantial number of the SSR loci, and a large number of these were amplified successfully in initial screening. Conclusions This sequence collection represents a major genomic resource for P. contorta, and the large number of genetic markers characterized should contribute to future research in this and other pines. Our results illustrate the utility of next generation sequencing as a basis for marker development and population genomics in non-model species.
Collapse
Affiliation(s)
- Thomas L Parchman
- Department of Botany, University of Wyoming, Laramie, WY 82071, USA.
| | | | | | | | | |
Collapse
|
74
|
Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics 2010. [PMID: 20233449 DOI: 10.1186/1471‐2164‐11‐180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Massively parallel sequencing of cDNA is now an efficient route for generating enormous sequence collections that represent expressed genes. This approach provides a valuable starting point for characterizing functional genetic variation in non-model organisms, especially where whole genome sequencing efforts are currently cost and time prohibitive. The large and complex genomes of pines (Pinus spp.) have hindered the development of genomic resources, despite the ecological and economical importance of the group. While most genomic studies have focused on a single species (P. taeda), genomic level resources for other pines are insufficiently developed to facilitate ecological genomic research. Lodgepole pine (P. contorta) is an ecologically important foundation species of montane forest ecosystems and exhibits substantial adaptive variation across its range in western North America. Here we describe a sequencing study of expressed genes from P. contorta, including their assembly and annotation, and their potential for molecular marker development to support population and association genetic studies. RESULTS We obtained 586,732 sequencing reads from a 454 GS XLR70 Titanium pyrosequencer (mean length: 306 base pairs). A combination of reference-based and de novo assemblies yielded 63,657 contigs, with 239,793 reads remaining as singletons. Based on sequence similarity with known proteins, these sequences represent approximately 17,000 unique genes, many of which are well covered by contig sequences. This sequence collection also included a surprisingly large number of retrotransposon sequences, suggesting that they are highly transcriptionally active in the tissues we sampled. We located and characterized thousands of simple sequence repeats and single nucleotide polymorphisms as potential molecular markers in our assembled and annotated sequences. High quality PCR primers were designed for a substantial number of the SSR loci, and a large number of these were amplified successfully in initial screening. CONCLUSIONS This sequence collection represents a major genomic resource for P. contorta, and the large number of genetic markers characterized should contribute to future research in this and other pines. Our results illustrate the utility of next generation sequencing as a basis for marker development and population genomics in non-model species.
Collapse
|