101
|
Bokszczanin KL, Fragkostefanakis S. Perspectives on deciphering mechanisms underlying plant heat stress response and thermotolerance. FRONTIERS IN PLANT SCIENCE 2013; 4:315. [PMID: 23986766 PMCID: PMC3750488 DOI: 10.3389/fpls.2013.00315] [Citation(s) in RCA: 168] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2013] [Accepted: 07/27/2013] [Indexed: 05/17/2023]
Abstract
Global warming is a major threat for agriculture and food safety and in many cases the negative effects are already apparent. The current challenge of basic and applied plant science is to decipher the molecular mechanisms of heat stress response (HSR) and thermotolerance in detail and use this information to identify genotypes that will withstand unfavorable environmental conditions. Nowadays X-omics approaches complement the findings of previous targeted studies and highlight the complexity of HSR mechanisms giving information for so far unrecognized genes, proteins and metabolites as potential key players of thermotolerance. Even more, roles of epigenetic mechanisms and the involvement of small RNAs in thermotolerance are currently emerging and thus open new directions of yet unexplored areas of plant HSR. In parallel it is emerging that although the whole plant is vulnerable to heat, specific organs are particularly sensitive to elevated temperatures. This has redirected research from the vegetative to generative tissues. The sexual reproduction phase is considered as the most sensitive to heat and specifically pollen exhibits the highest sensitivity and frequently an elevation of the temperature just a few degrees above the optimum during pollen development can have detrimental effects for crop production. Compared to our knowledge on HSR of vegetative tissues, the information on pollen is still scarce. Nowadays, several techniques for high-throughput X-omics approaches provide major tools to explore the principles of pollen HSR and thermotolerance mechanisms in specific genotypes. The collection of such information will provide an excellent support for improvement of breeding programs to facilitate the development of tolerant cultivars. The review aims at describing the current knowledge of thermotolerance mechanisms and the technical advances which will foster new insights into this process.
Collapse
Affiliation(s)
- Kamila L. Bokszczanin
- GenXPro GmbH, Frankfurt am MainGermany
- *Correspondence: Kamila L. Bokszczanin, GenXPro GmbH, Altenhöferallee 3, Frankfurt am Main 60438, Germany e-mail: ; Sotirios Fragkostefanakis, Department of Biosciences, Molecular Cell Biology of Plants, Goethe University, Max-von-Laue-Street 9, Frankfurt am Main 60438, Germany e-mail:
| | | | - Sotirios Fragkostefanakis
- Department of Biosciences, Molecular Cell Biology of Plants, Goethe University, Frankfurt am MainGermany
- *Correspondence: Kamila L. Bokszczanin, GenXPro GmbH, Altenhöferallee 3, Frankfurt am Main 60438, Germany e-mail: ; Sotirios Fragkostefanakis, Department of Biosciences, Molecular Cell Biology of Plants, Goethe University, Max-von-Laue-Street 9, Frankfurt am Main 60438, Germany e-mail:
| |
Collapse
|
102
|
Edwards D, Batley J, Snowdon RJ. Accessing complex crop genomes with next-generation sequencing. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2013; 126:1-11. [PMID: 22948437 DOI: 10.1007/s00122-012-1964-x] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2012] [Accepted: 08/08/2012] [Indexed: 05/02/2023]
Abstract
Many important crop species have genomes originating from ancestral or recent polyploidisation events. Multiple homoeologous gene copies, chromosomal rearrangements and amplification of repetitive DNA within large and complex crop genomes can considerably complicate genome analysis and gene discovery by conventional, forward genetics approaches. On the other hand, ongoing technological advances in molecular genetics and genomics today offer unprecedented opportunities to analyse and access even more recalcitrant genomes. In this review, we describe next-generation sequencing and data analysis techniques that vastly improve our ability to dissect and mine genomes for causal genes underlying key traits and allelic variation of interest to breeders. We focus primarily on wheat and oilseed rape, two leading examples of major polyploid crop genomes whose size or complexity present different, significant challenges. In both cases, the latest DNA sequencing technologies, applied using quite different approaches, have enabled considerable progress towards unravelling the respective genomes. Our ability to discover the extent and distribution of genetic diversity in crop gene pools, and its relationship to yield and quality-related traits, is swiftly gathering momentum as DNA sequencing and the bioinformatic tools to deal with growing quantities of genomic data continue to develop. In the coming decade, genomic and transcriptomic sequencing, discovery and high-throughput screening of single nucleotide polymorphisms, presence-absence variations and other structural chromosomal variants in diverse germplasm collections will give detailed insight into the origins, domestication and available trait-relevant variation of polyploid crops, in the process facilitating novel approaches and possibilities for genomics-assisted breeding.
Collapse
Affiliation(s)
- David Edwards
- Australian Centre for Plant Functional Genomics, School of Agriculture and Food Sciences, University of Queensland, Brisbane, QLD 4072, Australia
| | | | | |
Collapse
|
103
|
Monteiro F, Romeiras MM, Batista D, Duarte MC. Biodiversity Assessment of Sugar Beet Species and Its Wild Relatives: Linking Ecological Data with New Genetic Approaches. ACTA ACUST UNITED AC 2013. [DOI: 10.4236/ajps.2013.48a003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
104
|
Kumar S, You FM, Cloutier S. Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries. BMC Genomics 2012; 13:684. [PMID: 23216845 PMCID: PMC3557168 DOI: 10.1186/1471-2164-13-684] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 11/29/2012] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Flax (Linum usitatissimum L.) is a significant fibre and oilseed crop. Current flax molecular markers, including isozymes, RAPDs, AFLPs and SSRs are of limited use in the construction of high density linkage maps and for association mapping applications due to factors such as low reproducibility, intense labour requirements and/or limited numbers. We report here on the use of a reduced representation library strategy combined with next generation Illumina sequencing for rapid and large scale discovery of SNPs in eight flax genotypes. SNP discovery was performed through in silico analysis of the sequencing data against the whole genome shotgun sequence assembly of flax genotype CDC Bethune. Genotyping-by-sequencing of an F6-derived recombinant inbred line population provided validation of the SNPs. RESULTS Reduced representation libraries of eight flax genotypes were sequenced on the Illumina sequencing platform resulting in sequence coverage ranging from 4.33 to 15.64X (genome equivalents). Depending on the relatedness of the genotypes and the number and length of the reads, between 78% and 93% of the reads mapped onto the CDC Bethune whole genome shotgun sequence assembly. A total of 55,465 SNPs were discovered with the largest number of SNPs belonging to the genotypes with the highest mapping coverage percentage. Approximately 84% of the SNPs discovered were identified in a single genotype, 13% were shared between any two genotypes and the remaining 3% in three or more. Nearly a quarter of the SNPs were found in genic regions. A total of 4,706 out of 4,863 SNPs discovered in Macbeth were validated using genotyping-by-sequencing of 96 F6 individuals from a recombinant inbred line population derived from a cross between CDC Bethune and Macbeth, corresponding to a validation rate of 96.8%. CONCLUSIONS Next generation sequencing of reduced representation libraries was successfully implemented for genome-wide SNP discovery from flax. The genotyping-by-sequencing approach proved to be efficient for validation. The SNP resources generated in this work will assist in generating high density maps of flax and facilitate QTL discovery, marker-assisted selection, phylogenetic analyses, association mapping and anchoring of the whole genome shotgun sequence.
Collapse
Affiliation(s)
- Santosh Kumar
- Cereal Research Centre, Agriculture and Agri-Food Canada, 195 Dafoe Road, Winnipeg, Manitoba, R3T 2M9, Canada
- Department of Plant Science, University of Manitoba, 66 Dafoe Road, Winnipeg, Manitoba, R3T 2N2, Canada
| | - Frank M You
- Cereal Research Centre, Agriculture and Agri-Food Canada, 195 Dafoe Road, Winnipeg, Manitoba, R3T 2M9, Canada
| | - Sylvie Cloutier
- Cereal Research Centre, Agriculture and Agri-Food Canada, 195 Dafoe Road, Winnipeg, Manitoba, R3T 2M9, Canada
- Department of Plant Science, University of Manitoba, 66 Dafoe Road, Winnipeg, Manitoba, R3T 2N2, Canada
| |
Collapse
|
105
|
Gagnaire PA, Normandeau E, Pavey SA, Bernatchez L. Mapping phenotypic, expression and transmission ratio distortion QTL using RAD markers in the Lake Whitefish(Coregonus clupeaformis). Mol Ecol 2012. [DOI: 10.1111/mec.12127] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Pierre-Alexandre Gagnaire
- Institut de Biologie Intégrative et des Systèmes (IBIS); Département de Biologie; Université Laval; Pavillon Charles-Eugène-Marchand Québec G1V 0A6 Canada
| | - Eric Normandeau
- Institut de Biologie Intégrative et des Systèmes (IBIS); Département de Biologie; Université Laval; Pavillon Charles-Eugène-Marchand Québec G1V 0A6 Canada
| | - Scott A. Pavey
- Institut de Biologie Intégrative et des Systèmes (IBIS); Département de Biologie; Université Laval; Pavillon Charles-Eugène-Marchand Québec G1V 0A6 Canada
| | - Louis Bernatchez
- Institut de Biologie Intégrative et des Systèmes (IBIS); Département de Biologie; Université Laval; Pavillon Charles-Eugène-Marchand Québec G1V 0A6 Canada
| |
Collapse
|
106
|
Yang M, Han Y, VanBuren R, Ming R, Xu L, Han Y, Liu Y. Genetic linkage maps for Asian and American lotus constructed using novel SSR markers derived from the genome of sequenced cultivar. BMC Genomics 2012; 13:653. [PMID: 23170872 PMCID: PMC3564711 DOI: 10.1186/1471-2164-13-653] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2012] [Accepted: 11/07/2012] [Indexed: 01/18/2023] Open
Abstract
Background The genus Nelumbo Adans. comprises two living species, N. nucifera Gaertan. (Asian lotus) and N. lutea Pers. (American lotus). A genetic linkage map is an essential resource for plant genetic studies and crop improvement but has not been generated for Nelumbo. We aimed to develop genomic simple sequence repeat (SSR) markers from the genome sequence and construct two genetic maps for Nelumbo to assist genome assembly and integration of a genetic map with the genome sequence. Results A total of 86,089 SSR motifs were identified from the genome sequences. Di- and tri-nucleotide repeat motifs were the most abundant, and accounted for 60.73% and 31.66% of all SSRs, respectively. AG/GA repeats constituted 51.17% of dinucleotide repeat motifs, followed by AT/TA (44.29%). Of 500 SSR primers tested, 386 (77.20%) produced scorable alleles with an average of 2.59 per primer, and 185 (37.00%) showed polymorphism among two parental genotypes, N. nucifera ‘Chinese Antique’ and N. lutea ‘AL1’, and six progenies of their F1 population. The normally segregating markers, which comprised 268 newly developed SSRs, 37 previously published SSRs and 53 sequence-related amplified polymorphism markers, were used for genetic map construction. The map for Asian lotus was 365.67 cM with 47 markers distributed in seven linkage groups. The map for American lotus was 524.51 cM, and contained 177 markers distributed in 11 genetic linkage groups. The number of markers per linkage group ranged from three to 34 with an average genetic distance of 3.97 cM between adjacent markers. Moreover, 171 SSR markers contained in linkage groups were anchored to 97 genomic DNA sequence contigs of ‘Chinese Antique’. The 97 contigs were merged into 60 scaffolds. Conclusion Genetic mapping of SSR markers derived from sequenced contigs in Nelumbo enabled the associated contigs to be anchored in the linkage map and facilitated assembly of the genome sequences of ‘Chinese Antique’. The present study reports the first construction of genetic linkage maps for Nelumbo, which can serve as reference linkage maps to accelerate characterization germplasm, genetic mapping for traits of economic interest, and molecular breeding with marker-assisted selection.
Collapse
Affiliation(s)
- Mei Yang
- Key Laboratory of Aquatic Plant and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, Hubei, 430074, China
| | | | | | | | | | | | | |
Collapse
|
107
|
Shinozuka H, Cogan NOI, Spangenberg GC, Forster JW. Quantitative Trait Locus (QTL) meta-analysis and comparative genomics for candidate gene prediction in perennial ryegrass (Lolium perenne L.). BMC Genet 2012; 13:101. [PMID: 23137269 PMCID: PMC3532372 DOI: 10.1186/1471-2156-13-101] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2012] [Accepted: 11/04/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In crop species, QTL analysis is commonly used for identification of factors contributing to variation of agronomically important traits. As an important pasture species, a large number of QTLs have been reported for perennial ryegrass based on analysis of biparental mapping populations. Further characterisation of those QTLs is, however, essential for utilisation in varietal improvement programs. RESULTS A bibliographic survey of perennial ryegrass trait-dissection studies identified a total of 560 QTLs from previously published papers, of which 189, 270 and 101 were classified as morphology-, physiology- and resistance/tolerance-related loci, respectively. The collected dataset permitted a subsequent meta-QTL study and implementation of a cross-species candidate gene identification approach. A meta-QTL analysis based on use of the BioMercator software was performed to identify two consensus regions for pathogen resistance traits. Genes that are candidates for causal polymorphism underpinning perennial ryegrass QTLs were identified through in silico comparative mapping using rice databases, and 7 genes were assigned to the p150/112 reference map. Markers linked to the LpDGL1, LpPh1 and LpPIPK1 genes were located close to plant size, leaf extension time and heading date-related QTLs, respectively, suggesting that these genes may be functionally associated with important agronomic traits in perennial ryegrass. CONCLUSIONS Functional markers are valuable for QTL meta-analysis and comparative genomics. Enrichment of such genetic markers may permit further detailed characterisation of QTLs. The outcomes of QTL meta-analysis and comparative genomics studies may be useful for accelerated development of novel perennial ryegrass cultivars with desirable traits.
Collapse
Affiliation(s)
- Hiroshi Shinozuka
- Department of Primary Industries, Biosciences Research Division, Victorian AgriBiosciences Centre, 1 Park Drive, La Trobe University Research and Development Park, Bundoora, Victoria 3083, Australia
| | | | | | | |
Collapse
|
108
|
Davey JW, Cezard T, Fuentes-Utrilla P, Eland C, Gharbi K, Blaxter ML. Special features of RAD Sequencing data: implications for genotyping. Mol Ecol 2012; 22:3151-64. [PMID: 23110438 PMCID: PMC3712469 DOI: 10.1111/mec.12084] [Citation(s) in RCA: 224] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2012] [Revised: 09/07/2012] [Accepted: 09/12/2012] [Indexed: 12/17/2022]
Abstract
Restriction site-associated DNA Sequencing (RAD-Seq) is an economical and efficient method for SNP discovery and genotyping. As with other sequencing-by-synthesis methods, RAD-Seq produces stochastic count data and requires sensitive analysis to develop or genotype markers accurately. We show that there are several sources of bias specific to RAD-Seq that are not explicitly addressed by current genotyping tools, namely restriction fragment bias, restriction site heterozygosity and PCR GC content bias. We explore the performance of existing analysis tools given these biases and discuss approaches to limiting or handling biases in RAD-Seq data. While these biases need to be taken seriously, we believe RAD loci affected by them can be excluded or processed with relative ease in most cases and that most RAD loci will be accurately genotyped by existing tools.
Collapse
Affiliation(s)
- John W Davey
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, West Mains Road, Edinburgh, EH9 3JT, UK.
| | | | | | | | | | | |
Collapse
|
109
|
Increasing Food Production in Africa by Boosting the Productivity of Understudied Crops. AGRONOMY-BASEL 2012. [DOI: 10.3390/agronomy2040240] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
110
|
Everett MV, Miller MR, Seeb JE. Meiotic maps of sockeye salmon derived from massively parallel DNA sequencing. BMC Genomics 2012; 13:521. [PMID: 23031582 PMCID: PMC3563581 DOI: 10.1186/1471-2164-13-521] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Accepted: 09/23/2012] [Indexed: 12/21/2022] Open
Abstract
Background Meiotic maps are a key tool for comparative genomics and association mapping studies. Next-generation sequencing and genotyping by sequencing are speeding the processes of SNP discovery and the development of new genetic tools, including meiotic maps for numerous species. Currently there are limited genetic resources for sockeye salmon, Oncorhynchus nerka. We develop the first dense meiotic map for sockeye salmon using a combination of novel SNPs found in restriction site associated DNA (RAD tags) and SNPs available from existing expressed sequence tag (EST) based assays. Results We discovered and genotyped putative SNPs in 3,430 RAD tags. We removed paralogous sequence variants leaving 1,672 SNPs; these were combined with 53 EST-based SNP genotypes for linkage mapping. The map contained 29 male and female linkage groups, consistent with the haploid chromosome number expected for sockeye salmon. The female map contains 1,057 loci spanning 4,896 cM, and the male map contains 1,118 loci spanning 4,220 cM. Regions of conservation with rainbow trout and synteny between the RAD based rainbow trout map and the sockeye salmon map were established. Conclusions Using RAD sequencing and EST-based SNP assays we successfully generated the first high density linkage map for sockeye salmon.
Collapse
Affiliation(s)
- Meredith V Everett
- School of Aquatic and Fishery Sciences, University of Washington, Box 355020, Seattle, WA 98195-5020, USA.
| | | | | |
Collapse
|
111
|
Deschamps S, Llaca V, May GD. Genotyping-by-Sequencing in Plants. BIOLOGY 2012; 1:460-83. [PMID: 24832503 PMCID: PMC4009820 DOI: 10.3390/biology1030460] [Citation(s) in RCA: 152] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Revised: 08/07/2012] [Accepted: 09/13/2012] [Indexed: 12/12/2022]
Abstract
The advent of next-generation DNA sequencing (NGS) technologies has led to the development of rapid genome-wide Single Nucleotide Polymorphism (SNP) detection applications in various plant species. Recent improvements in sequencing throughput combined with an overall decrease in costs per gigabase of sequence is allowing NGS to be applied to not only the evaluation of small subsets of parental inbred lines, but also the mapping and characterization of traits of interest in much larger populations. Such an approach, where sequences are used simultaneously to detect and score SNPs, therefore bypassing the entire marker assay development stage, is known as genotyping-by-sequencing (GBS). This review will summarize the current state of GBS in plants and the promises it holds as a genome-wide genotyping application.
Collapse
Affiliation(s)
- Stéphane Deschamps
- DuPont Agricultural Biotechnology, Experimental Station, PO Box 80353, 200 Powder Mill Road, Wilmington, DE 19880-0353, USA.
| | - Victor Llaca
- DuPont Agricultural Biotechnology, Experimental Station, PO Box 80353, 200 Powder Mill Road, Wilmington, DE 19880-0353, USA.
| | - Gregory D May
- DuPont Pioneer, 7300 NW 62nd Ave., P.O. Box 1004, Johnston, IA 50131-1004, USA.
| |
Collapse
|
112
|
Wang N, Fang L, Xin H, Wang L, Li S. Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing. BMC PLANT BIOLOGY 2012; 12:148. [PMID: 22908993 PMCID: PMC3528476 DOI: 10.1186/1471-2229-12-148] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2012] [Accepted: 07/18/2012] [Indexed: 05/18/2023]
Abstract
BACKGROUND Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. RESULTS An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. CONCLUSIONS The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison.
Collapse
Affiliation(s)
- Nian Wang
- Key Laboratory of Plant Germplasm Enhancement and Speciality Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
| | - Linchuan Fang
- Key Laboratory of Plant Germplasm Enhancement and Speciality Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
- Graduate School of Chinese Academy of Sciences, Beijing, 100049, China
| | - Haiping Xin
- Key Laboratory of Plant Germplasm Enhancement and Speciality Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
| | - Lijun Wang
- Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Shaohua Li
- Key Laboratory of Plant Germplasm Enhancement and Speciality Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China
| |
Collapse
|
113
|
Wang N, Fang L, Xin H, Wang L, Li S. Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing. BMC PLANT BIOLOGY 2012. [PMID: 22908993 DOI: 10.1186/1471-2229-12148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
BACKGROUND Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. RESULTS An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. CONCLUSIONS The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison.
Collapse
Affiliation(s)
- Nian Wang
- Key Laboratory of Plant Germplasm Enhancement and Speciality Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | | | | | | | | |
Collapse
|
114
|
Willing EM, Dreyer C, van Oosterhout C. Estimates of genetic differentiation measured by F(ST) do not necessarily require large sample sizes when using many SNP markers. PLoS One 2012; 7:e42649. [PMID: 22905157 PMCID: PMC3419229 DOI: 10.1371/journal.pone.0042649] [Citation(s) in RCA: 267] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 07/11/2012] [Indexed: 01/16/2023] Open
Abstract
Population genetic studies provide insights into the evolutionary processes that influence the distribution of sequence variants within and among wild populations. FST is among the most widely used measures for genetic differentiation and plays a central role in ecological and evolutionary genetic studies. It is commonly thought that large sample sizes are required in order to precisely infer FST and that small sample sizes lead to overestimation of genetic differentiation. Until recently, studies in ecological model organisms incorporated a limited number of genetic markers, but since the emergence of next generation sequencing, the panel size of genetic markers available even in non-reference organisms has rapidly increased. In this study we examine whether a large number of genetic markers can substitute for small sample sizes when estimating FST. We tested the behavior of three different estimators that infer FST and that are commonly used in population genetic studies. By simulating populations, we assessed the effects of sample size and the number of markers on the various estimates of genetic differentiation. Furthermore, we tested the effect of ascertainment bias on these estimates. We show that the population sample size can be significantly reduced (as small as n = 4–6) when using an appropriate estimator and a large number of bi-allelic genetic markers (k>1,000). Therefore, conservation genetic studies can now obtain almost the same statistical power as studies performed on model organisms using markers developed with next-generation sequencing.
Collapse
Affiliation(s)
- Eva-Maria Willing
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany.
| | | | | |
Collapse
|
115
|
Yang H, Tao Y, Zheng Z, Li C, Sweetingham MW, Howieson JG. Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L. BMC Genomics 2012; 13:318. [PMID: 22805587 PMCID: PMC3430595 DOI: 10.1186/1471-2164-13-318] [Citation(s) in RCA: 88] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2011] [Accepted: 06/04/2012] [Indexed: 11/29/2022] Open
Abstract
Background In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Results Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for marker-assisted selection in the Australian national lupin breeding program. Conclusions We demonstrated that more than 30 molecular markers linked to a target gene of agronomic trait of interest can be identified from a small portion (1/8) of one sequencing run on HiSeq2000 by applying NGS based RAD sequencing in marker development. The markers developed by the strategy described in this study are all co-dominant SNP markers, which can readily be converted into high throughput multiplex format or low-cost, simple PCR-based markers desirable for large scale marker implementation in plant breeding programs. The high density and closely linked molecular markers associated with a target trait help to overcome a major bottleneck for implementation of molecular markers on a wide range of germplasm in breeding programs. We conclude that application of NGS based RAD sequencing as DNA fingerprinting is a very rapid and cost-effective strategy for marker development in molecular plant breeding. The strategy does not require any prior genome knowledge or molecular information for the species under investigation, and it is applicable to other plant species.
Collapse
Affiliation(s)
- Huaan Yang
- Department of Agriculture and Food Western Australia, 3 Baron-Hay Court, South Perth, 6151, Australia.
| | | | | | | | | | | |
Collapse
|
116
|
Bus A, Hecht J, Huettel B, Reinhardt R, Stich B. High-throughput polymorphism detection and genotyping in Brassica napus using next-generation RAD sequencing. BMC Genomics 2012; 13:281. [PMID: 22726880 PMCID: PMC3442993 DOI: 10.1186/1471-2164-13-281] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Accepted: 06/24/2012] [Indexed: 11/28/2022] Open
Abstract
Background The complex genome of rapeseed (Brassica napus) is not well understood despite the economic importance of the species. Good knowledge of sequence variation is needed for genetics approaches and breeding purposes. We used a diversity set of B. napus representing eight different germplasm types to sequence genome-wide distributed restriction-site associated DNA (RAD) fragments for polymorphism detection and genotyping. Results More than 113,000 RAD clusters with more than 20,000 single nucleotide polymorphisms (SNPs) and 125 insertions/deletions were detected and characterized. About one third of the RAD clusters and polymorphisms mapped to the Brassica rapa reference sequence. An even distribution of RAD clusters and polymorphisms was observed across the B. rapa chromosomes, which suggests that there might be an equal distribution over the Brassica oleracea chromosomes, too. The representation of Gene Ontology (GO) terms for unigenes with RAD clusters and polymorphisms revealed no signature of selection with respect to the distribution of polymorphisms within genes belonging to a specific GO category. Conclusions Considering the decreasing costs for next-generation sequencing, the results of our study suggest that RAD sequencing is not only a simple and cost-effective method for high-density polymorphism detection but also an alternative to SNP genotyping from transcriptome sequencing or SNP arrays, even for species with complex genomes such as B. napus.
Collapse
Affiliation(s)
- Anja Bus
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | | | | | | | | |
Collapse
|
117
|
Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE. Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS One 2012; 7:e37135. [PMID: 22675423 PMCID: PMC3365034 DOI: 10.1371/journal.pone.0037135] [Citation(s) in RCA: 1921] [Impact Index Per Article: 160.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Accepted: 04/13/2012] [Indexed: 12/14/2022] Open
Abstract
The ability to efficiently and accurately determine genotypes is a keystone technology in modern genetics, crucial to studies ranging from clinical diagnostics, to genotype-phenotype association, to reconstruction of ancestry and the detection of selection. To date, high capacity, low cost genotyping has been largely achieved via “SNP chip” microarray-based platforms which require substantial prior knowledge of both genome sequence and variability, and once designed are suitable only for those targeted variable nucleotide sites. This method introduces substantial ascertainment bias and inherently precludes detection of rare or population-specific variants, a major source of information for both population history and genotype-phenotype association. Recent developments in reduced-representation genome sequencing experiments on massively parallel sequencers (commonly referred to as RAD-tag or RADseq) have brought direct sequencing to the problem of population genotyping, but increased cost and procedural and analytical complexity have limited their widespread adoption. Here, we describe a complete laboratory protocol, including a custom combinatorial indexing method, and accompanying software tools to facilitate genotyping across large numbers (hundreds or more) of individuals for a range of markers (hundreds to hundreds of thousands). Our method requires no prior genomic knowledge and achieves per-site and per-individual costs below that of current SNP chip technology, while requiring similar hands-on time investment, comparable amounts of input DNA, and downstream analysis times on the order of hours. Finally, we provide empirical results from the application of this method to both genotyping in a laboratory cross and in wild populations. Because of its flexibility, this modified RADseq approach promises to be applicable to a diversity of biological questions in a wide range of organisms.
Collapse
Affiliation(s)
- Brant K Peterson
- Department of Organismic & Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts, United States of America.
| | | | | | | | | |
Collapse
|
118
|
Pauly L, Flajoulot S, Garon J, Julier B, Béguier V, Barre P. Detection of favorable alleles for plant height and crown rust tolerance in three connected populations of perennial ryegrass (Lolium perenne L.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:1139-1153. [PMID: 22234605 DOI: 10.1007/s00122-011-1775-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 12/15/2011] [Indexed: 05/31/2023]
Abstract
Plant height, which is an estimator of vegetative yield, and crown rust tolerance are major criteria for perennial ryegrass breeding. Genetic improvement has been achieved through phenotypic selection but it should be speeded up using marker-assisted selection, especially in this heterozygous species suffering from inbreeding depression. Using connected multiparental populations should increase the diversity studied and could substantially increase the power of quantitative trait loci (QTL) detection. The objective of this study was to detect the best alleles for plant height and rust tolerance among three connected populations derived from elite material by comparing an analysis per parent and a multipopulation connected analysis. For the studied traits, 17 QTL were detected with the analysis per parent while the additive and dominance models of the multipopulation connected analysis made it possible to detect 33 and 21 QTL, respectively. Favorable alleles have been detected in all parents. Only a few dominance effects were detected and they generally had lower values than the additive effects. The additive model of the multipopulation connected analysis was the most powerful as it made it possible to detect most of the QTL identified in the other analyses and 11 additional QTL. Using this model, plant growth QTL and rust tolerance QTL explained up to 19 and 38.6% of phenotypic variance, respectively. This example involving three connected populations is promising for an application on polycross progenies, traditionally used in breeding programs. Indeed, polycross progenies actually are a set of several connected populations.
Collapse
Affiliation(s)
- Laurence Pauly
- Jouffray-Drillaud, Station expérimentale La Litière, 86 600, Saint Sauvant, France.
| | | | | | | | | | | |
Collapse
|
119
|
Harrison RJ. Understanding genetic variation and function- the applications of next generation sequencing. Semin Cell Dev Biol 2012; 23:230-6. [DOI: 10.1016/j.semcdb.2012.01.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Revised: 01/10/2012] [Accepted: 01/12/2012] [Indexed: 12/31/2022]
|
120
|
Gao Q, Yue G, Li W, Wang J, Xu J, Yin Y. Recent progress using high-throughput sequencing technologies in plant molecular breeding. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2012; 54:215-27. [PMID: 22409591 DOI: 10.1111/j.1744-7909.2012.01115.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
High-throughput sequencing is a revolutionary technological innovation in DNA sequencing. This technology has an ultra-low cost per base of sequencing and an overwhelmingly high data output. High-throughput sequencing has brought novel research methods and solutions to the research fields of genomics and post-genomics. Furthermore, this technology is leading to a new molecular breeding revolution that has landmark significance for scientific research and enables us to launch multi-level, multi-faceted, and multi-extent studies in the fields of crop genetics, genomics, and crop breeding. In this paper, we review progress in the application of high-throughput sequencing technologies to plant molecular breeding studies.
Collapse
Affiliation(s)
- Qiang Gao
- Beijing Genomics Institute-Shenzhen, Shenzhen 518083, China
| | | | | | | | | | | |
Collapse
|
121
|
Lerceteau-Köhler E, Moing A, Guérin G, Renaud C, Petit A, Rothan C, Denoyes B. Genetic dissection of fruit quality traits in the octoploid cultivated strawberry highlights the role of homoeo-QTL in their control. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:1059-77. [PMID: 22215248 PMCID: PMC3304055 DOI: 10.1007/s00122-011-1769-3] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 12/08/2011] [Indexed: 05/18/2023]
Abstract
Fruit quality traits are major breeding targets in the Rosaceae. Several of the major Rosaceae species are current or ancient polyploids. To dissect the inheritance of fruit quality traits in polyploid fleshy fruit species, we used a cultivated strawberry segregating population comprising a 213 full-sibling F1 progeny from a cross between the variety 'Capitola' and the genotype 'CF1116'. We previously developed the most comprehensive strawberry linkage map, which displays seven homoeology groups (HG), including each four homoeology linkage groups (Genetics 179:2045-2060, 2008). The map was used to identify quantitative trait loci (QTL) for 19 fruit traits related to fruit development, texture, colour, anthocyanin, sugar and organic acid contents. Analyses were carried out over two or three successive years on field-grown plants. QTL were detected for all the analysed traits. Because strawberry is an octopolyploid species, QTL controlling a given trait and located at orthologous positions on different homoeologous linkage groups within one HG are considered as homoeo-QTL. We found that, for various traits, about one-fourth of QTL were putative homoeo-QTL and were localised on two linkage groups. Several homoeo-QTL could be detected the same year, suggesting that several copies of the gene underlying the QTL are functional. The detection of some other homoeo-QTL was year-dependent. Therefore, changes in allelic expression could take place in response to environmental changes. We believe that, in strawberry as in other polyploid fruit species, the mechanisms unravelled in the present study may play a crucial role in the variations of fruit quality.
Collapse
Affiliation(s)
- E. Lerceteau-Köhler
- Ciref, Maison Jeannette, 24140 Douville, France
- Present Address: Department of Plant Biology and Forest Genetics, Uppsala BioCenter, Swedish University of Agricultural Sciences, Box 7080, 750 07 Uppsala, Sweden
| | - A. Moing
- UMR 1332 Biologie du Fruit et Pathologie, INRA, 33140 Villenave d’Ornon, France
| | - G. Guérin
- UMR 1332 Biologie du Fruit et Pathologie, INRA, 33140 Villenave d’Ornon, France
| | - C. Renaud
- UMR 1332 Biologie du Fruit et Pathologie, INRA, 33140 Villenave d’Ornon, France
| | - A. Petit
- Ciref, Maison Jeannette, 24140 Douville, France
| | - C. Rothan
- UMR 1332 Biologie du Fruit et Pathologie, INRA, 33140 Villenave d’Ornon, France
| | - Béatrice Denoyes
- UMR 1332 Biologie du Fruit et Pathologie, INRA, 33140 Villenave d’Ornon, France
| |
Collapse
|
122
|
Ma XF, Jensen E, Alexandrov N, Troukhan M, Zhang L, Thomas-Jones S, Farrar K, Clifton-Brown J, Donnison I, Swaller T, Flavell R. High resolution genetic mapping by genome sequencing reveals genome duplication and tetraploid genetic structure of the diploid Miscanthus sinensis. PLoS One 2012; 7:e33821. [PMID: 22439001 PMCID: PMC3306302 DOI: 10.1371/journal.pone.0033821] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Accepted: 02/17/2012] [Indexed: 11/19/2022] Open
Abstract
We have created a high-resolution linkage map of Miscanthus sinensis, using genotyping-by-sequencing (GBS), identifying all 19 linkage groups for the first time. The result is technically significant since Miscanthus has a very large and highly heterozygous genome, but has no or limited genomics information to date. The composite linkage map containing markers from both parental linkage maps is composed of 3,745 SNP markers spanning 2,396 cM on 19 linkage groups with a 0.64 cM average resolution. Comparative genomics analyses of the M. sinensis composite linkage map to the genomes of sorghum, maize, rice, and Brachypodium distachyon indicate that sorghum has the closest syntenic relationship to Miscanthus compared to other species. The comparative results revealed that each pair of the 19 M. sinensis linkages aligned to one sorghum chromosome, except for LG8, which mapped to two sorghum chromosomes (4 and 7), presumably due to a chromosome fusion event after genome duplication. The data also revealed several other chromosome rearrangements relative to sorghum, including two telomere-centromere inversions of the sorghum syntenic chromosome 7 in LG8 of M. sinensis and two paracentric inversions of sorghum syntenic chromosome 4 in LG7 and LG8 of M. sinensis. The results clearly demonstrate, for the first time, that the diploid M. sinensis is tetraploid origin consisting of two sub-genomes. This complete and high resolution composite linkage map will not only serve as a useful resource for novel QTL discoveries, but also enable informed deployment of the wealth of existing genomics resources of other species to the improvement of Miscanthus as a high biomass energy crop. In addition, it has utility as a reference for genome sequence assembly for the forthcoming whole genome sequencing of the Miscanthus genus.
Collapse
Affiliation(s)
- Xue-Feng Ma
- Ceres, Inc., Thousand Oaks, California, United States of America
| | - Elaine Jensen
- Institute of Biological, Environmental & Rural Sciences (IBERS), Aberystwyth University, Gogerddan, United Kingdom
| | | | - Maxim Troukhan
- Ceres, Inc., Thousand Oaks, California, United States of America
| | - Liping Zhang
- Ceres, Inc., Thousand Oaks, California, United States of America
| | - Sian Thomas-Jones
- Institute of Biological, Environmental & Rural Sciences (IBERS), Aberystwyth University, Gogerddan, United Kingdom
| | - Kerrie Farrar
- Institute of Biological, Environmental & Rural Sciences (IBERS), Aberystwyth University, Gogerddan, United Kingdom
| | - John Clifton-Brown
- Institute of Biological, Environmental & Rural Sciences (IBERS), Aberystwyth University, Gogerddan, United Kingdom
| | - Iain Donnison
- Institute of Biological, Environmental & Rural Sciences (IBERS), Aberystwyth University, Gogerddan, United Kingdom
| | - Timothy Swaller
- Ceres, Inc., Thousand Oaks, California, United States of America
- * E-mail:
| | - Richard Flavell
- Ceres, Inc., Thousand Oaks, California, United States of America
| |
Collapse
|
123
|
Monson-Miller J, Sanchez-Mendez DC, Fass J, Henry IM, Tai TH, Comai L. Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing. BMC Genomics 2012; 13:72. [PMID: 22333298 PMCID: PMC3305632 DOI: 10.1186/1471-2164-13-72] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2011] [Accepted: 02/14/2012] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND The availability of low cost sequencing has spurred its application to discovery and typing of variation, including variation induced by mutagenesis. Mutation discovery is challenging as it requires a substantial amount of sequencing and analysis to detect very rare changes and distinguish them from noise. Also challenging are the cases when the organism of interest has not been sequenced or is highly divergent from the reference. RESULTS We describe the development of a simple method for reduced representation sequencing. Input DNA was digested with a single restriction enzyme and ligated to Y adapters modified to contain a sequence barcode and to provide a compatible overhang for ligation. We demonstrated the efficiency of this method at SNP discovery using rice and arabidopsis. To test its suitability for the discovery of very rare SNP, one control and three mutagenized rice individuals (1, 5 and 10 mM sodium azide) were used to prepare genomic libraries for Illumina sequencers by ligating barcoded adapters to NlaIII restriction sites. For genome-dependent discovery 15-30 million of 80 base reads per individual were aligned to the reference sequence achieving individual sequencing coverage from 7 to 15×. We identified high-confidence base changes by comparing sequences across individuals and identified instances consistent with mutations, i.e. changes that were found in a single treated individual and were solely GC to AT transitions. For genome-independent discovery 70-mers were extracted from the sequence of the control individual and single-copy sequence was identified by comparing the 70-mers across samples to evaluate copy number and variation. This de novo "genome" was used to align the reads and identify mutations as above. Covering approximately 1/5 of the 380 Mb genome of rice we detected mutation densities ranging from 0.6 to 4 per Mb of diploid DNA depending on the mutagenic treatment. CONCLUSIONS The combination of a simple and cost-effective library construction method, with Illumina sequencing, and the use of a bioinformatic pipeline allows practical SNP discovery regardless of whether a genomic reference is available.
Collapse
Affiliation(s)
| | - Diana C Sanchez-Mendez
- Crops Pathology and Genetics Research Unit, U.S. Department of Agriculture, Agricultural Research Service, Davis, California 95616, USA
| | - Joseph Fass
- Bioinformatics Core, Genome Center, UC Davis, Davis, California 95616, USA
| | - Isabelle M Henry
- Department of Plant Biology and Genome Center, UC Davis, Davis, California 95616, USA
| | - Thomas H Tai
- Crops Pathology and Genetics Research Unit, U.S. Department of Agriculture, Agricultural Research Service, Davis, California 95616, USA
| | - Luca Comai
- Department of Plant Biology and Genome Center, UC Davis, Davis, California 95616, USA
| |
Collapse
|
124
|
Cronn R, Knaus BJ, Liston A, Maughan PJ, Parks M, Syring JV, Udall J. Targeted enrichment strategies for next-generation plant biology. AMERICAN JOURNAL OF BOTANY 2012; 99:291-311. [PMID: 22312117 DOI: 10.3732/ajb.1100356] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
PREMISE OF THE STUDY The dramatic advances offered by modern DNA sequencers continue to redefine the limits of what can be accomplished in comparative plant biology. Even with recent achievements, however, plant genomes present obstacles that can make it difficult to execute large-scale population and phylogenetic studies on next-generation sequencing platforms. Factors like large genome size, extensive variation in the proportion of organellar DNA in total DNA, polyploidy, and gene number/redundancy contribute to these challenges, and they demand flexible targeted enrichment strategies to achieve the desired goals. METHODS In this article, we summarize the many available targeted enrichment strategies that can be used to target partial-to-complete organellar genomes, as well as known and anonymous nuclear targets. These methods fall under four categories: PCR-based enrichment, hybridization-based enrichment, restriction enzyme-based enrichment, and enrichment of expressed gene sequences. KEY RESULTS Examples of plant-specific applications exist for nearly all methods described. While some methods are well established (e.g., transcriptome sequencing), other promising methods are in their infancy (hybridization enrichment). A direct comparison of methods shows that PCR-based enrichment may be a reasonable strategy for accessing small genomic targets (e.g., ≤50 kbp), but that hybridization and transcriptome sequencing scale more efficiently if larger targets are desired. CONCLUSIONS While the benefits of targeted sequencing are greatest in plants with large genomes, nearly all comparative projects can benefit from the improved throughput offered by targeted multiplex DNA sequencing, particularly as the amount of data produced from a single instrument approaches a trillion bases per run.
Collapse
Affiliation(s)
- Richard Cronn
- Pacific Northwest Research Station, USDA Forest Service, Corvallis, Oregon 97331, USA.
| | | | | | | | | | | | | |
Collapse
|
125
|
Abstract
The first North American RAD Sequencing and Genomics Symposium, sponsored by Floragenex (http://www.floragenex.com/radmeeting/), took place in Portland, Oregon (USA) on 19 April 2011. This symposium was convened to promote and discuss the use of restriction-site-associated DNA (RAD) sequencing technologies. RAD sequencing is one of several strategies recently developed to increase the power of data generated via short-read sequencing technologies by reducing their complexity (Baird et al. 2008; Huang et al. 2009; Andolfatto et al. 2011; Elshire et al. 2011). RAD sequencing, as a form of genotyping by sequencing, has been effectively applied in genetic mapping and quantitative trait loci (QTL) analyses in a range of organisms including nonmodel, genetically highly heterogeneous organisms (Table 1; Baird et al. 2008; Baxter et al. 2011; Chutimanitsakun et al. 2011; Pfender et al. 2011). RAD sequencing has recently found applications in phylogeography (Emerson et al. 2010) and population genomics (Hohenlohe et al. 2010). Considering the diversity of talks presented during this meeting, more developments are to be expected in the very near future.
Collapse
Affiliation(s)
- H C Rowe
- Department of Botany, University of British Columbia, Vancouver, BC V6T 1Z4, Canada.
| | | | | |
Collapse
|
126
|
Scaglione D, Acquadro A, Portis E, Tirone M, Knapp SJ, Lanteri S. RAD tag sequencing as a source of SNP markers in Cynara cardunculus L. BMC Genomics 2012; 13:3. [PMID: 22214349 PMCID: PMC3269995 DOI: 10.1186/1471-2164-13-3] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2011] [Accepted: 01/03/2012] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. RESULTS RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mers distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicots species, provided a further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. CONCLUSION The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach permitted to generate a large and robust SNP datasets by the adoption of optimized filtering criteria.
Collapse
Affiliation(s)
- Davide Scaglione
- Di.Va.P.R.A. Plant Genetics and Breeding, University of Torino, via L. da Vinci 44, 10095 Grugliasco (Torino), Italy
| | - Alberto Acquadro
- Di.Va.P.R.A. Plant Genetics and Breeding, University of Torino, via L. da Vinci 44, 10095 Grugliasco (Torino), Italy
| | - Ezio Portis
- Di.Va.P.R.A. Plant Genetics and Breeding, University of Torino, via L. da Vinci 44, 10095 Grugliasco (Torino), Italy
| | - Matteo Tirone
- Di.Va.P.R.A. Plant Genetics and Breeding, University of Torino, via L. da Vinci 44, 10095 Grugliasco (Torino), Italy
| | - Steven J Knapp
- Institute for Plant Breeding, Genetics, and Genomics, University of Georgia, 111 Riverbend Rd., 30602 Athens, Georgia USA
| | - Sergio Lanteri
- Di.Va.P.R.A. Plant Genetics and Breeding, University of Torino, via L. da Vinci 44, 10095 Grugliasco (Torino), Italy
| |
Collapse
|
127
|
Willing EM, Hoffmann M, Klein JD, Weigel D, Dreyer C. Paired-end RAD-seq for de novo assembly and marker design without available reference. ACTA ACUST UNITED AC 2011; 27:2187-93. [PMID: 21712251 DOI: 10.1093/bioinformatics/btr346] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
MOTIVATION Next-generation sequencing technologies have facilitated the study of organisms on a genome-wide scale. A recent method called restriction site associated DNA sequencing (RAD-seq) allows to sample sequence information at reduced complexity across a target genome using the Illumina platform. Single-end RAD-seq has proven to provide a large number of informative genetic markers in reference as well as non-reference organisms. RESULTS Here, we present a method for de novo assembly of paired-end RAD-seq data in order to produce extended contigs flanking a restriction site. We were able to reconstruct one-tenth of the guppy genome represented by 200-500 bp contigs associated to EcoRI recognition sites. In addition, these contigs were used as reference allowing the detection of thousands of new polymorphic markers that are informative for mapping and population genetic studies in the guppy. AVAILABILITY A perl and C++ implementation of the method demonstrated in this article is available under http://guppy.weigelworld.org/weigeldatabases/radMarkers/ as package RApiD. CONTACT christine.dreyer@tuebingen.mpg.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Eva-Maria Willing
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, University of Tübingen, Sand 14, 72076 Tübingen, Germany
| | | | | | | | | |
Collapse
|
128
|
Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet 2011; 12:499-510. [PMID: 21681211 DOI: 10.1038/nrg3012] [Citation(s) in RCA: 1431] [Impact Index Per Article: 110.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The advent of next-generation sequencing (NGS) has revolutionized genomic and transcriptomic approaches to biology. These new sequencing tools are also valuable for the discovery, validation and assessment of genetic markers in populations. Here we review and discuss best practices for several NGS methods for genome-wide genetic marker development and genotyping that use restriction enzyme digestion of target genomes to reduce the complexity of the target. These new methods -- which include reduced-representation sequencing using reduced-representation libraries (RRLs) or complexity reduction of polymorphic sequences (CRoPS), restriction-site-associated DNA sequencing (RAD-seq) and low coverage genotyping -- are applicable to both model organisms with high-quality reference genome sequences and, excitingly, to non-model species with no existing genomic data.
Collapse
|
129
|
Baxter SW, Davey JW, Johnston JS, Shelton AM, Heckel DG, Jiggins CD, Blaxter ML. Linkage mapping and comparative genomics using next-generation RAD sequencing of a non-model organism. PLoS One 2011; 6:e19315. [PMID: 21541297 PMCID: PMC3082572 DOI: 10.1371/journal.pone.0019315] [Citation(s) in RCA: 241] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Accepted: 03/30/2011] [Indexed: 11/19/2022] Open
Abstract
Restriction-site associated DNA (RAD) sequencing is a powerful new method for targeted sequencing across the genomes of many individuals. This approach has broad potential for genetic analysis of non-model organisms including genotype-phenotype association mapping, phylogeography, population genetics and scaffolding genome assemblies through linkage mapping. We constructed a RAD library using genomic DNA from a Plutella xylostella (diamondback moth) backcross that segregated for resistance to the insecticide spinosad. Sequencing of 24 individuals was performed on a single Illumina GAIIx lane (51 base paired-end reads). Taking advantage of the lack of crossing over in homologous chromosomes in female Lepidoptera, 3,177 maternally inherited RAD alleles were assigned to the 31 chromosomes, enabling identification of the spinosad resistance and W/Z sex chromosomes. Paired-end reads for each RAD allele were assembled into contigs and compared to the genome of Bombyx mori (n = 28) using BLAST, revealing 28 homologous matches plus 3 expected fusion/breakage events which account for the difference in chromosome number. A genome-wide linkage map (1292 cM) was inferred with 2,878 segregating RAD alleles inherited from the backcross father, producing chromosome and location specific sequenced RAD markers. Here we have used RAD sequencing to construct a genetic linkage map de novo for an organism that has no previous genome data. Comparative analysis of P. xyloxtella linkage groups with B. mori chromosomes shows for the first time, genetic synteny appears common beyond the Macrolepidoptera. RAD sequencing is a powerful system capable of rapidly generating chromosome specific data for non-model organisms.
Collapse
Affiliation(s)
- Simon W Baxter
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom.
| | | | | | | | | | | | | |
Collapse
|