51
|
Chen W, Hou L, Zhang Z, Pang X, Li Y. Genetic Diversity, Population Structure, and Linkage Disequilibrium of a Core Collection of Ziziphus jujuba Assessed with Genome-wide SNPs Developed by Genotyping-by-sequencing and SSR Markers. FRONTIERS IN PLANT SCIENCE 2017; 8:575. [PMID: 28458680 PMCID: PMC5394126 DOI: 10.3389/fpls.2017.00575] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2016] [Accepted: 03/30/2017] [Indexed: 05/24/2023]
Abstract
Chinese jujube (Ziziphus jujuba Mill) is an economically important fruit species native to China with high nutritious and medicinal value. Genotyping-by-sequencing was used to detect and genotype single nucleotide polymorphisms (SNPs) in a core collection of 150 Chinese jujube accessions and further to characterize their genetic diversity, population structure, and linkage disequilibrium (LD). A total of 4,680 high-quality SNPs were identified, of which 38 sets of tri-allelic SNPs were detected. The average polymorphism information content (PIC) values based on bi-allelic SNPs and tri-allelic SNPs were 0.27 and 0.38, respectively. STRUCTURE and principal coordinate analyses based on SNPs revealed that the 150 accessions could be clustered into two groups. However, neighbor-joining trees indicated the accessions should be grouped into three major clusters. Our data confirm that the resolving power for genetic diversity was similar for the SSRs and SNPs. In contrast, regarding population structure, the resolving power was higher for SSRs than for SNPs. The LD pattern in Chinese jujube was investigated for the first time. We observed a relatively rapid LD decay with a short range (∼10 kb) for all pseudo-chromosomes and for individual pseudo-chromosomes. Our findings provide important information for future genome-wide association analyses and marker-assisted selective breeding of Chinese jujube.
Collapse
Affiliation(s)
- Wu Chen
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry UniversityBeijing, China
| | - Lu Hou
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry UniversityBeijing, China
| | - Zhiyong Zhang
- Beijing Key Laboratory of Ornamental Plants Germplasm Innovation and Molecular Breeding, National Engineering Research Center for Floriculture, Beijing Laboratory of Urban and Rural Ecological Environment, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants of Ministry of Education, School of Landscape Architecture, Beijing Forestry UniversityBeijing, China
| | - Xiaoming Pang
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry UniversityBeijing, China
| | - Yingyue Li
- National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry UniversityBeijing, China
| |
Collapse
|
52
|
Gao L, Zhao G, Huang D, Jia J. Candidate loci involved in domestication and improvement detected by a published 90K wheat SNP array. Sci Rep 2017; 7:44530. [PMID: 28327671 PMCID: PMC5361097 DOI: 10.1038/srep44530] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Accepted: 02/10/2017] [Indexed: 11/12/2022] Open
Abstract
Selection is one of the most important forces in crop evolution. Common wheat is a major world food crop and a typical allopolyploid with a huge and complex genome. We applied four approaches to detect loci selected in wheat during domestication and improvement. A total of 7,984 candidate loci were detected, accounting for 23.3% of all 34,317 SNPs analysed, a much higher proportion than estimated in previous reports. We constructed a first generation wheat selection map which revealed the following new insights on genome-wide selection: (1) diversifying selection acted by increasing, decreasing or not affecting gene frequencies; (2) the number of loci under selection during domestication was much higher than that during improvement; (3) the contribution to wheat improvement by the D sub-genome was relatively small due to the bottleneck of hexaploidisation and diversity can be expanded by using synthetic wheat and introgression lines; and (4) clustered selection regions occur throughout the wheat genome, including the centromere regions. This study will not only help future wheat breeding and evolutionary studies, but will also accelerate study of other crops, especially polyploids.
Collapse
Affiliation(s)
- Lifeng Gao
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, MOA, the National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, CAAS, Beijing, 100081, China
| | - Guangyao Zhao
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, MOA, the National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, CAAS, Beijing, 100081, China
| | - Dawei Huang
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Jizeng Jia
- Key Laboratory of Crop Gene Resources and Germplasm Enhancement, MOA, the National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Science, CAAS, Beijing, 100081, China
| |
Collapse
|
53
|
Berdugo-Cely J, Valbuena RI, Sánchez-Betancourt E, Barrero LS, Yockteng R. Genetic diversity and association mapping in the Colombian Central Collection of Solanum tuberosum L. Andigenum group using SNPs markers. PLoS One 2017; 12:e0173039. [PMID: 28257509 PMCID: PMC5336250 DOI: 10.1371/journal.pone.0173039] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2016] [Accepted: 02/14/2017] [Indexed: 01/03/2023] Open
Abstract
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs.
Collapse
Affiliation(s)
- Jhon Berdugo-Cely
- Colombian Agricultural Research Corporation (CORPOICA)-Mosquera, Cundinamarca, Colombia
| | - Raúl Iván Valbuena
- Colombian Agricultural Research Corporation (CORPOICA)-Mosquera, Cundinamarca, Colombia
| | | | - Luz Stella Barrero
- Colombian Agricultural Research Corporation (CORPOICA)-Mosquera, Cundinamarca, Colombia
| | - Roxana Yockteng
- Colombian Agricultural Research Corporation (CORPOICA)-Mosquera, Cundinamarca, Colombia
- Muséum National d’Histoire Naturelle, UMR-CNRS 7205, Paris, France
| |
Collapse
|
54
|
Sehgal D, Autrique E, Singh R, Ellis M, Singh S, Dreisigacker S. Identification of genomic regions for grain yield and yield stability and their epistatic interactions. Sci Rep 2017; 7:41578. [PMID: 28145508 PMCID: PMC5286416 DOI: 10.1038/srep41578] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2015] [Accepted: 12/21/2016] [Indexed: 12/04/2022] Open
Abstract
The task of identifying genomic regions conferring yield stability is challenging in any crop and requires large experimental data sets in conjunction with complex analytical approaches. We report findings of a first attempt to identify genomic regions with stable expression and their individual epistatic interactions for grain yield and yield stability in a large elite panel of wheat under multiple environments via a genome wide association mapping (GWAM) approach. Seven hundred and twenty lines were genotyped using genotyping-by-sequencing technology and phenotyped for grain yield and phenological traits. High gene diversity (0.250) and a moderate genetic structure (five groups) in the panel provided an excellent base for GWAM. The mixed linear model and multi-locus mixed model analyses identified key genomic regions on chromosomes 2B, 3A, 4A, 5B, 7A and 7B. Further, significant epistatic interactions were observed among loci with and without main effects that contributed to additional variation of up to 10%. Simple stepwise regression provided the most significant main effect and epistatic markers resulting in up to 20% variation for yield stability and up to 17% gain in yield with the best allelic combination.
Collapse
Affiliation(s)
- Deepmala Sehgal
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera Méx-Veracruz, El Batán, Texcoco, CP 56237, México
| | - Enrique Autrique
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera Méx-Veracruz, El Batán, Texcoco, CP 56237, México
| | - Ravi Singh
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera Méx-Veracruz, El Batán, Texcoco, CP 56237, México
| | - Marc Ellis
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera Méx-Veracruz, El Batán, Texcoco, CP 56237, México
| | - Sukhwinder Singh
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera Méx-Veracruz, El Batán, Texcoco, CP 56237, México
| | - Susanne Dreisigacker
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera Méx-Veracruz, El Batán, Texcoco, CP 56237, México
| |
Collapse
|
55
|
Sehgal D, Dreisigacker S, Belen S, Küçüközdemir Ü, Mert Z, Özer E, Morgounov A. Mining Centuries Old In situ Conserved Turkish Wheat Landraces for Grain Yield and Stripe Rust Resistance Genes. Front Genet 2016; 7:201. [PMID: 27917192 PMCID: PMC5114521 DOI: 10.3389/fgene.2016.00201] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Accepted: 10/31/2016] [Indexed: 11/13/2022] Open
Abstract
Wheat landraces in Turkey are an important genetic resource for wheat improvement. An exhaustive 5-year (2009-2014) effort made by the International Winter Wheat Improvement Programme (IWWIP), a cooperative program between the Ministry of Food, Agriculture and Livestock of Turkey, the International Center for Maize and Wheat Improvement (CIMMYT) and the International Center for Agricultural Research in the Dry Areas (ICARDA), led to the collection and documentation of around 2000 landrace populations from 55 provinces throughout Turkey. This study reports the genetic characterization of a subset of bread wheat landraces collected in 2010 from 11 diverse provinces using genotyping-by-sequencing (GBS) technology. The potential of this collection to identify loci determining grain yield and stripe rust resistance via genome-wide association (GWA) analysis was explored. A high genetic diversity (diversity index = 0.260) and a moderate population structure based on highly inherited spike traits was revealed in the panel. The linkage disequilibrium decayed at 10 cM across the whole genome and was slower as compared to other landrace collections. In addition to previously reported QTL, GWA analysis also identified new candidate genomic regions for stripe rust resistance, grain yield, and spike productivity components. New candidate genomic regions reflect the potential of this landrace collection to further increase genetic diversity in elite germplasm.
Collapse
Affiliation(s)
- Deepmala Sehgal
- International Center for Maize and Wheat Improvement Texcoco, Mexico
| | | | - Savaş Belen
- Crop Breeding Department, Transitional Zone Agricultural Research Institute Eskisehir, Turkey
| | - Ümran Küçüközdemir
- Crop Breeding Department, Eastern Anatolia Agricultural Research Institute Erzurum, Turkey
| | - Zafer Mert
- Central Field Crops Research Institute Ankara, Turkey
| | - Emel Özer
- Crop Breeding Department, Bahri Dagdas International Agricultural Research Institute Konya, Turkey
| | - Alexey Morgounov
- Crop Pathology Department, International Center for Maize and Wheat Improvement Ankara, Turkey
| |
Collapse
|
56
|
Breeding Value of Primary Synthetic Wheat Genotypes for Grain Yield. PLoS One 2016; 11:e0162860. [PMID: 27656893 PMCID: PMC5033409 DOI: 10.1371/journal.pone.0162860] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Accepted: 08/30/2016] [Indexed: 12/23/2022] Open
Abstract
To introduce new genetic diversity into the bread wheat gene pool from its progenitor, Aegilops tauschii (Coss.) Schmalh, 33 primary synthetic hexaploid wheat genotypes (SYN) were crossed to 20 spring bread wheat (BW) cultivars at the International Wheat and Maize Improvement Center. Modified single seed descent was used to develop 97 populations with 50 individuals per population using first back-cross, biparental, and three-way crosses. Individuals from each cross were selected for short stature, early heading, flowering and maturity, minimal lodging, and free threshing. Yield trials were conducted under irrigated, drought, and heat-stress conditions from 2011 to 2014 in Ciudad Obregon, Mexico. Genomic estimated breeding values (GEBVs) of parents and synthetic derived lines (SDLs) were estimated using a genomic best linear unbiased prediction (GBLUP) model with markers in each trial. In each environment, there were SDLs that had higher GEBVs than their recurrent BW parent for yield. The GEBVs of BW parents for yield ranged from -0.32 in heat to 1.40 in irrigated trials. The range of the SYN parent GEBVs for yield was from -2.69 in the irrigated to 0.26 in the heat trials and were mostly negative across environments. The contribution of the SYN parents to improved grain yield of the SDLs was highest under heat stress, with an average GEBV for the top 10% of the SDLs of 0.55 while the weighted average GEBV of their corresponding recurrent BW parents was 0.26. Using the pedigree-based model, the accuracy of genomic prediction for yield was 0.42, 0.43, and 0.49 in the drought, heat and irrigated trials, respectively, while for the marker-based model these values were 0.43, 0.44, and 0.55. The SYN parents introduced novel diversity into the wheat gene pool. Higher GEBVs of progenies were due to introgression and retention of some positive alleles from SYN parents.
Collapse
|
57
|
Torkamaneh D, Laroche J, Belzile F. Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies. PLoS One 2016; 11:e0161333. [PMID: 27547936 PMCID: PMC4993469 DOI: 10.1371/journal.pone.0161333] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2016] [Accepted: 08/03/2016] [Indexed: 11/18/2022] Open
Abstract
Next-generation sequencing (NGS) has revolutionized plant and animal research in many ways including new methods of high throughput genotyping. Genotyping-by-sequencing (GBS) has been demonstrated to be a robust and cost-effective genotyping method capable of producing thousands to millions of SNPs across a wide range of species. Undoubtedly, the greatest barrier to its broader use is the challenge of data analysis. Herein we describe a comprehensive comparison of seven GBS bioinformatics pipelines developed to process raw GBS sequence data into SNP genotypes. We compared five pipelines requiring a reference genome (TASSEL-GBS v1& v2, Stacks, IGST, and Fast-GBS) and two de novo pipelines that do not require a reference genome (UNEAK and Stacks). Using Illumina sequence data from a set of 24 re-sequenced soybean lines, we performed SNP calling with these pipelines and compared the GBS SNP calls with the re-sequencing data to assess their accuracy. The number of SNPs called without a reference genome was lower (13k to 24k) than with a reference genome (25k to 54k SNPs) while accuracy was high (92.3 to 98.7%) for all but one pipeline (TASSEL-GBSv1, 76.1%). Among pipelines offering a high accuracy (>95%), Fast-GBS called the greatest number of polymorphisms (close to 35,000 SNPs + Indels) and yielded the highest accuracy (98.7%). Using Ion Torrent sequence data for the same 24 lines, we compared the performance of Fast-GBS with that of TASSEL-GBSv2. It again called more polymorphisms (25.8K vs 22.9K) and these proved more accurate (95.2 vs 91.1%). Typically, SNP catalogues called from the same sequencing data using different pipelines resulted in highly overlapping SNP catalogues (79-92% overlap). In contrast, overlap between SNP catalogues obtained using the same pipeline but different sequencing technologies was less extensive (~50-70%).
Collapse
Affiliation(s)
- Davoud Torkamaneh
- Département de Phytologie, Université Laval, Quebec City, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC, Canada
| | - Jérôme Laroche
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC, Canada
| | - François Belzile
- Département de Phytologie, Université Laval, Quebec City, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC, Canada
| |
Collapse
|
58
|
Jarquin D, Specht J, Lorenz A. Prospects of Genomic Prediction in the USDA Soybean Germplasm Collection: Historical Data Creates Robust Models for Enhancing Selection of Accessions. G3 (BETHESDA, MD.) 2016; 6:2329-41. [PMID: 27247288 PMCID: PMC4978888 DOI: 10.1534/g3.116.031443] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2015] [Accepted: 05/23/2016] [Indexed: 02/05/2023]
Abstract
The identification and mobilization of useful genetic variation from germplasm banks for use in breeding programs is critical for future genetic gain and protection against crop pests. Plummeting costs of next-generation sequencing and genotyping is revolutionizing the way in which researchers and breeders interface with plant germplasm collections. An example of this is the high density genotyping of the entire USDA Soybean Germplasm Collection. We assessed the usefulness of 50K single nucleotide polymorphism data collected on 18,480 domesticated soybean (Glycine max) accessions and vast historical phenotypic data for developing genomic prediction models for protein, oil, and yield. Resulting genomic prediction models explained an appreciable amount of the variation in accession performance in independent validation trials, with correlations between predicted and observed reaching up to 0.92 for oil and protein and 0.79 for yield. The optimization of training set design was explored using a series of cross-validation schemes. It was found that the target population and environment need to be well represented in the training set. Second, genomic prediction training sets appear to be robust to the presence of data from diverse geographical locations and genetic clusters. This finding, however, depends on the influence of shattering and lodging, and may be specific to soybean with its presence of maturity groups. The distribution of 7608 nonphenotyped accessions was examined through the application of genomic prediction models. The distribution of predictions of phenotyped accessions was representative of the distribution of predictions for nonphenotyped accessions, with no nonphenotyped accessions being predicted to fall far outside the range of predictions of phenotyped accessions.
Collapse
Affiliation(s)
- Diego Jarquin
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Nebraska 68583-0915
| | - James Specht
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Nebraska 68583-0915
| | - Aaron Lorenz
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| |
Collapse
|
59
|
Genome-wide identification of novel genetic markers from RNA sequencing assembly of diverse Aegilops tauschii accessions. Mol Genet Genomics 2016; 291:1681-94. [DOI: 10.1007/s00438-016-1211-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2016] [Accepted: 04/21/2016] [Indexed: 02/03/2023]
|