301
|
He J, Zhao X, Laroche A, Lu ZX, Liu H, Li Z. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding. FRONTIERS IN PLANT SCIENCE 2014; 5:484. [PMID: 25324846 DOI: 10.3389/fpls.2014.00484/full] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 09/02/2014] [Indexed: 05/23/2023]
Abstract
Marker-assisted selection (MAS) refers to the use of molecular markers to assist phenotypic selections in crop improvement. Several types of molecular markers, such as single nucleotide polymorphism (SNP), have been identified and effectively used in plant breeding. The application of next-generation sequencing (NGS) technologies has led to remarkable advances in whole genome sequencing, which provides ultra-throughput sequences to revolutionize plant genotyping and breeding. To further broaden NGS usages to large crop genomes such as maize and wheat, genotyping-by-sequencing (GBS) has been developed and applied in sequencing multiplexed samples that combine molecular marker discovery and genotyping. GBS is a novel application of NGS protocols for discovering and genotyping SNPs in crop genomes and populations. The GBS approach includes the digestion of genomic DNA with restriction enzymes followed by the ligation of barcode adapter, PCR amplification and sequencing of the amplified DNA pool on a single lane of flow cells. Bioinformatic pipelines are needed to analyze and interpret GBS datasets. As an ultimate MAS tool and a cost-effective technique, GBS has been successfully used in implementing genome-wide association study (GWAS), genomic diversity study, genetic linkage analysis, molecular marker discovery and genomic selection under a large scale of plant breeding programs.
Collapse
Affiliation(s)
- Jiangfeng He
- Inner Mongolia Academy of Agriculture and Husbandry Science Hohhot, China ; Lethbridge Research Centre, Agriculture and Agri-Food Canada Lethbridge, AB, Canada
| | - Xiaoqing Zhao
- Inner Mongolia Academy of Agriculture and Husbandry Science Hohhot, China
| | - André Laroche
- Lethbridge Research Centre, Agriculture and Agri-Food Canada Lethbridge, AB, Canada
| | - Zhen-Xiang Lu
- Lethbridge Research Centre, Agriculture and Agri-Food Canada Lethbridge, AB, Canada
| | - HongKui Liu
- Inner Mongolia Academy of Agriculture and Husbandry Science Hohhot, China
| | - Ziqin Li
- Inner Mongolia Academy of Agriculture and Husbandry Science Hohhot, China
| |
Collapse
|
302
|
Unterseer S, Bauer E, Haberer G, Seidel M, Knaak C, Ouzunova M, Meitinger T, Strom TM, Fries R, Pausch H, Bertani C, Davassi A, Mayer KF, Schön CC. A powerful tool for genome analysis in maize: development and evaluation of the high density 600 k SNP genotyping array. BMC Genomics 2014; 15:823. [PMID: 25266061 PMCID: PMC4192734 DOI: 10.1186/1471-2164-15-823] [Citation(s) in RCA: 169] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2014] [Accepted: 09/23/2014] [Indexed: 02/07/2023] Open
Abstract
Background High density genotyping data are indispensable for genomic analyses of complex traits in animal and crop species. Maize is one of the most important crop plants worldwide, however a high density SNP genotyping array for analysis of its large and highly dynamic genome was not available so far. Results We developed a high density maize SNP array composed of 616,201 variants (SNPs and small indels). Initially, 57 M variants were discovered by sequencing 30 representative temperate maize lines and then stringently filtered for sequence quality scores and predicted conversion performance on the array resulting in the selection of 1.2 M polymorphic variants assayed on two screening arrays. To identify high-confidence variants, 285 DNA samples from a broad genetic diversity panel of worldwide maize lines including the samples used for sequencing, important founder lines for European maize breeding, hybrids, and proprietary samples with European, US, semi-tropical, and tropical origin were used for experimental validation. We selected 616 k variants according to their performance during validation, support of genotype calls through sequencing data, and physical distribution for further analysis and for the design of the commercially available Affymetrix® Axiom® Maize Genotyping Array. This array is composed of 609,442 SNPs and 6,759 indels. Among these are 116,224 variants in coding regions and 45,655 SNPs of the Illumina® MaizeSNP50 BeadChip for study comparison. In a subset of 45,974 variants, apart from the target SNP additional off-target variants are detected, which show only a minor bias towards intermediate allele frequencies. We performed principal coordinate and admixture analyses to determine the ability of the array to detect and resolve population structure and investigated the extent of LD within a worldwide validation panel. Conclusions The high density Affymetrix® Axiom® Maize Genotyping Array is optimized for European and American temperate maize and was developed based on a diverse sample panel by applying stringent quality filter criteria to ensure its suitability for a broad range of applications. With 600 k variants it is the largest currently publically available genotyping array in crop species. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-823) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Eva Bauer
- Plant Breeding, Centre of Life and Food Sciences Weihenstephan, Technische Universität München, 85354 Freising, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
303
|
Linkage disequilibrium with linkage analysis of multiline crosses reveals different multiallelic QTL for hybrid performance in the flint and dent heterotic groups of maize. Genetics 2014; 198:1717-34. [PMID: 25271305 DOI: 10.1534/genetics.114.169367] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Multiparental designs combined with dense genotyping of parents have been proposed as a way to increase the diversity and resolution of quantitative trait loci (QTL) mapping studies, using methods combining linkage disequilibrium information with linkage analysis (LDLA). Two new nested association mapping designs adapted to European conditions were derived from the complementary dent and flint heterotic groups of maize (Zea mays L.). Ten biparental dent families (N = 841) and 11 biparental flint families (N = 811) were genotyped with 56,110 single nucleotide polymorphism markers and evaluated as test crosses with the central line of the reciprocal design for biomass yield, plant height, and precocity. Alleles at candidate QTL were defined as (i) parental alleles, (ii) haplotypic identity by descent, and (iii) single-marker groupings. Between five and 16 QTL were detected depending on the model, trait, and genetic group considered. In the flint design, a major QTL (R(2) = 27%) with pleiotropic effects was detected on chromosome 10, whereas other QTL displayed milder effects (R(2) < 10%). On average, the LDLA models detected more QTL but generally explained lower percentages of variance, consistent with the fact that most QTL display complex allelic series. Only 15% of the QTL were common to the two designs. A joint analysis of the two designs detected between 15 and 21 QTL for the five traits. Of these, between 27 for silking date and 41% for tasseling date were significant in both groups. Favorable allelic effects detected in both groups open perspectives for improving biomass production.
Collapse
|
304
|
A foundation for provitamin A biofortification of maize: genome-wide association and genomic prediction models of carotenoid levels. Genetics 2014; 198:1699-716. [PMID: 25258377 PMCID: PMC4256781 DOI: 10.1534/genetics.114.169979] [Citation(s) in RCA: 115] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Efforts are underway for development of crops with improved levels of provitamin A carotenoids to help combat dietary vitamin A deficiency. As a global staple crop with considerable variation in kernel carotenoid composition, maize (Zea mays L.) could have a widespread impact. We performed a genome-wide association study (GWAS) of quantified seed carotenoids across a panel of maize inbreds ranging from light yellow to dark orange in grain color to identify some of the key genes controlling maize grain carotenoid composition. Significant associations at the genome-wide level were detected within the coding regions of zep1 and lut1, carotenoid biosynthetic genes not previously shown to impact grain carotenoid composition in association studies, as well as within previously associated lcyE and crtRB1 genes. We leveraged existing biochemical and genomic information to identify 58 a priori candidate genes relevant to the biosynthesis and retention of carotenoids in maize to test in a pathway-level analysis. This revealed dxs2 and lut5, genes not previously associated with kernel carotenoids. In genomic prediction models, use of markers that targeted a small set of quantitative trait loci associated with carotenoid levels in prior linkage studies were as effective as genome-wide markers for predicting carotenoid traits. Based on GWAS, pathway-level analysis, and genomic prediction studies, we outline a flexible strategy involving use of a small number of genes that can be selected for rapid conversion of elite white grain germplasm, with minimal amounts of carotenoids, to orange grain versions containing high levels of provitamin A.
Collapse
|
305
|
Pace J, Lee N, Naik HS, Ganapathysubramanian B, Lübberstedt T. Analysis of maize (Zea mays L.) seedling roots with the high-throughput image analysis tool ARIA (Automatic Root Image Analysis). PLoS One 2014; 9:e108255. [PMID: 25251072 PMCID: PMC4176968 DOI: 10.1371/journal.pone.0108255] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2014] [Accepted: 08/26/2014] [Indexed: 11/18/2022] Open
Abstract
The maize root system is crucial for plant establishment as well as water and nutrient uptake. There is substantial genetic and phenotypic variation for root architecture, which gives opportunity for selection. Root traits, however, have not been used as selection criterion mainly due to the difficulty in measuring them, as well as their quantitative mode of inheritance. Seedling root traits offer an opportunity to study multiple individuals and to enable repeated measurements per year as compared to adult root phenotyping. We developed a new software framework to capture various traits from a single image of seedling roots. This framework is based on the mathematical notion of converting images of roots into an equivalent graph. This allows automated querying of multiple traits simply as graph operations. This framework is furthermore extendable to 3D tomography image data. In order to evaluate this tool, a subset of the 384 inbred lines from the Ames panel, for which extensive genotype by sequencing data are available, was investigated. A genome wide association study was applied to this panel for two traits, Total Root Length and Total Surface Area, captured from seedling root images from WinRhizo Pro 9.0 and the current framework (called ARIA) for comparison using 135,311 single nucleotide polymorphism markers. The trait Total Root Length was found to have significant SNPs in similar regions of the genome when analyzed by both programs. This high-throughput trait capture software system allows for large phenotyping experiments and can help to establish relationships between developmental stages between seedling and adult traits in the future.
Collapse
Affiliation(s)
- Jordon Pace
- Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Nigel Lee
- Department of Mechanical Engineering, Iowa State University, Ames, Iowa, United States of America
| | - Hsiang Sing Naik
- Department of Mechanical Engineering, Iowa State University, Ames, Iowa, United States of America
| | | | - Thomas Lübberstedt
- Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| |
Collapse
|
306
|
Huang P, Feldman M, Schroder S, Bahri BA, Diao X, Zhi H, Estep M, Baxter I, Devos KM, Kellogg EA. Population genetics of Setaria viridis, a new model system. Mol Ecol 2014; 23:4912-25. [PMID: 25185718 DOI: 10.1111/mec.12907] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Revised: 08/27/2014] [Accepted: 08/29/2014] [Indexed: 02/03/2023]
Abstract
An extensive survey of the standing genetic variation in natural populations is among the priority steps in developing a species into a model system. In recent years, green foxtail (Setaria viridis), along with its domesticated form foxtail millet (S. italica), has rapidly become a promising new model system for C4 grasses and bioenergy crops, due to its rapid life cycle, large amount of seed production and small diploid genome, among other characters. However, remarkably little is known about the genetic diversity in natural populations of this species. In this study, we survey the genetic diversity of a worldwide sample of more than 200 S. viridis accessions, using the genotyping-by-sequencing technique. Two distinct genetic groups in S. viridis and a third group resembling S. italica were identified, with considerable admixture among the three groups. We find the genetic variation of North American S. viridis correlates with both geography and climate and is representative of the total genetic diversity in this species. This pattern may reflect several introduction/dispersal events of S. viridis into North America. We also modelled demographic history and show signal of recent population decline in one subgroup. Finally, we show linkage disequilibrium decay is rapid (<45 kb) in our total sample and slow in genetic subgroups. These results together provide an in-depth understanding of the pattern of genetic diversity of this new model species on a broad geographic scale. They also provide key guidelines for on-going and future work including germplasm preservation, local adaptation, crossing designs and genomewide association studies.
Collapse
Affiliation(s)
- Pu Huang
- Donald Danforth Plant Science Center, 975 North Warson Rd., St. Louis, MO, 63132, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
307
|
Jarquín D, Kocak K, Posadas L, Hyma K, Jedlicka J, Graef G, Lorenz A. Genotyping by sequencing for genomic prediction in a soybean breeding population. BMC Genomics 2014; 15:740. [PMID: 25174348 PMCID: PMC4176594 DOI: 10.1186/1471-2164-15-740] [Citation(s) in RCA: 112] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 08/22/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Advances in genotyping technology, such as genotyping by sequencing (GBS), are making genomic prediction more attractive to reduce breeding cycle times and costs associated with phenotyping. Genomic prediction and selection has been studied in several crop species, but no reports exist in soybean. The objectives of this study were (i) evaluate prospects for genomic selection using GBS in a typical soybean breeding program and (ii) evaluate the effect of GBS marker selection and imputation on genomic prediction accuracy. To achieve these objectives, a set of soybean lines sampled from the University of Nebraska Soybean Breeding Program were genotyped using GBS and evaluated for yield and other agronomic traits at multiple Nebraska locations. RESULTS Genotyping by sequencing scored 16,502 single nucleotide polymorphisms (SNPs) with minor-allele frequency (MAF) > 0.05 and percentage of missing values ≤ 5% on 301 elite soybean breeding lines. When SNPs with up to 80% missing values were included, 52,349 SNPs were scored. Prediction accuracy for grain yield, assessed using cross validation, was estimated to be 0.64, indicating good potential for using genomic selection for grain yield in soybean. Filtering SNPs based on missing data percentage had little to no effect on prediction accuracy, especially when random forest imputation was used to impute missing values. The highest accuracies were observed when random forest imputation was used on all SNPs, but differences were not significant. A standard additive G-BLUP model was robust; modeling additive-by-additive epistasis did not provide any improvement in prediction accuracy. The effect of training population size on accuracy began to plateau around 100, but accuracy steadily climbed until the largest possible size was used in this analysis. Including only SNPs with MAF > 0.30 provided higher accuracies when training populations were smaller. CONCLUSIONS Using GBS for genomic prediction in soybean holds good potential to expedite genetic gain. Our results suggest that standard additive G-BLUP models can be used on unfiltered, imputed GBS data without loss in accuracy.
Collapse
Affiliation(s)
- Diego Jarquín
- />Department of Agronomy and Horticulture, University of Nebraska, 363 Keim Hall, Lincoln, NE 68583 USA
| | - Kyle Kocak
- />Department of Agronomy and Horticulture, University of Nebraska, 363 Keim Hall, Lincoln, NE 68583 USA
| | - Luis Posadas
- />Department of Agronomy and Horticulture, University of Nebraska, 363 Keim Hall, Lincoln, NE 68583 USA
| | - Katie Hyma
- />Institute of Genomic Diversity, Cornell University, Ithaca, NY USA
| | - Joseph Jedlicka
- />Department of Agronomy and Horticulture, University of Nebraska, 363 Keim Hall, Lincoln, NE 68583 USA
| | - George Graef
- />Department of Agronomy and Horticulture, University of Nebraska, 363 Keim Hall, Lincoln, NE 68583 USA
| | - Aaron Lorenz
- />Department of Agronomy and Horticulture, University of Nebraska, 363 Keim Hall, Lincoln, NE 68583 USA
| |
Collapse
|
308
|
Olukolu BA, Wang GF, Vontimitta V, Venkata BP, Marla S, Ji J, Gachomo E, Chu K, Negeri A, Benson J, Nelson R, Bradbury P, Nielsen D, Holland JB, Balint-Kurti PJ, Johal G. A genome-wide association study of the maize hypersensitive defense response identifies genes that cluster in related pathways. PLoS Genet 2014; 10:e1004562. [PMID: 25166276 PMCID: PMC4148229 DOI: 10.1371/journal.pgen.1004562] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2014] [Accepted: 06/27/2014] [Indexed: 02/04/2023] Open
Abstract
Much remains unknown of molecular events controlling the plant hypersensitive defense response (HR), a rapid localized cell death that limits pathogen spread and is mediated by resistance (R-) genes. Genetic control of the HR is hard to quantify due to its microscopic and rapid nature. Natural modifiers of the ectopic HR phenotype induced by an aberrant auto-active R-gene (Rp1-D21), were mapped in a population of 3,381 recombinant inbred lines from the maize nested association mapping population. Joint linkage analysis was conducted to identify 32 additive but no epistatic quantitative trait loci (QTL) using a linkage map based on more than 7000 single nucleotide polymorphisms (SNPs). Genome-wide association (GWA) analysis of 26.5 million SNPs was conducted after adjusting for background QTL. GWA identified associated SNPs that colocalized with 44 candidate genes. Thirty-six of these genes colocalized within 23 of the 32 QTL identified by joint linkage analysis. The candidate genes included genes predicted to be in involved programmed cell death, defense response, ubiquitination, redox homeostasis, autophagy, calcium signalling, lignin biosynthesis and cell wall modification. Twelve of the candidate genes showed significant differential expression between isogenic lines differing for the presence of Rp1-D21. Low but significant correlations between HR-related traits and several previously-measured disease resistance traits suggested that the genetic control of these traits was substantially, though not entirely, independent. This study provides the first system-wide analysis of natural variation that modulates the HR response in plants. The hypersensitive pathogen defense response (HR) in plants typically consists of a rapid, localized cell death around the point of attempted pathogen penetration. It is found in all plant species and is associated with high levels of resistance to a wide range of pathogens and pests including bacteria, fungi, viruses, nematodes, parasitic plants and insects. Little is known about the control of HR after initiation, largely because it is so rapid and localized and therefore difficult to quantify. Here we use a mutant maize gene conferring an exaggerated HR to quantify HR levels in a set of 3,381 mapping lines characterised at 26.5 million loci to identify genes associated with naturally-occurring variation in HR. Many of these genes seem to be involved in a set of connected regulatory pathways including protein degradation, control of programmed cell death, recycling of cellular components and regulation of oxidative stress. We have also shown that several of these genes show high levels of expression induction during HR. The study provides the first comprehensive list of natural variants in maize genes that modulate HR and cluster within reported pathways underlying molecular events during HR.
Collapse
Affiliation(s)
- Bode A Olukolu
- Department of Plant Pathology, NC State University, Raleigh, North Carolina, United States of America
| | - Guan-Feng Wang
- Department of Plant Pathology, NC State University, Raleigh, North Carolina, United States of America
| | - Vijay Vontimitta
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| | - Bala P Venkata
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| | - Sandeep Marla
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| | - Jiabing Ji
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| | - Emma Gachomo
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| | - Kevin Chu
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| | - Adisu Negeri
- Department of Plant Pathology, NC State University, Raleigh, North Carolina, United States of America
| | - Jacqueline Benson
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Ithaca, New York, United States of America
| | - Rebecca Nelson
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Ithaca, New York, United States of America
| | - Peter Bradbury
- Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America
| | - Dahlia Nielsen
- Department of Biological Sciences, NC State University, Raleigh, North Carolina, United States of America
| | - James B Holland
- USDA-ARS Plant Science Research Unit, Raleigh, North Carolina, United States of America; Department of Crop Science, NC State University, Raleigh, North Carolina, United States of America
| | - Peter J Balint-Kurti
- Department of Plant Pathology, NC State University, Raleigh, North Carolina, United States of America; USDA-ARS Plant Science Research Unit, Raleigh, North Carolina, United States of America
| | - Gurmukh Johal
- Department of Botany and Plant Pathology, Purdue University, Lilly Hall, West Lafayette, Indiana, United States of America
| |
Collapse
|
309
|
Huang YF, Poland JA, Wight CP, Jackson EW, Tinker NA. Using genotyping-by-sequencing (GBS) for genomic discovery in cultivated oat. PLoS One 2014; 9:e102448. [PMID: 25047601 PMCID: PMC4105502 DOI: 10.1371/journal.pone.0102448] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 06/19/2014] [Indexed: 01/23/2023] Open
Abstract
Advances in next-generation sequencing offer high-throughput and cost-effective genotyping alternatives, including genotyping-by-sequencing (GBS). Results have shown that this methodology is efficient for genotyping a variety of species, including those with complex genomes. To assess the utility of GBS in cultivated hexaploid oat (Avena sativa L.), seven bi-parental mapping populations and diverse inbred lines from breeding programs around the world were studied. We examined technical factors that influence GBS SNP calls, established a workflow that combines two bioinformatics pipelines for GBS SNP calling, and provided a nomenclature for oat GBS loci. The high-throughput GBS system enabled us to place 45,117 loci on an oat consensus map, thus establishing a positional reference for further genomic studies. Using the diversity lines, we estimated that a minimum density of one marker per 2 to 2.8 cM would be required for genome-wide association studies (GWAS), and GBS markers met this density requirement in most chromosome regions. We also demonstrated the utility of GBS in additional diagnostic applications related to oat breeding. We conclude that GBS is a powerful and useful approach, which will have many additional applications in oat breeding and genomic studies.
Collapse
Affiliation(s)
- Yung-Fen Huang
- Eastern Cereal and Oilseed Research Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario, Canada
| | - Jesse A. Poland
- Department of Plant Pathology, Kansas State University, Manhattan, Kansas, United States of America
| | - Charlene P. Wight
- Eastern Cereal and Oilseed Research Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario, Canada
| | - Eric W. Jackson
- General Mills Crop Biosciences, Manhattan, Kansas, United States of America
| | - Nicholas A. Tinker
- Eastern Cereal and Oilseed Research Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario, Canada
- * E-mail:
| |
Collapse
|
310
|
Fast and cost-effective genetic mapping in apple using next-generation sequencing. G3-GENES GENOMES GENETICS 2014; 4:1681-7. [PMID: 25031181 PMCID: PMC4169160 DOI: 10.1534/g3.114.011023] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Next-generation DNA sequencing (NGS) produces vast amounts of DNA sequence data, but it is not specifically designed to generate data suitable for genetic mapping. Recently developed DNA library preparation methods for NGS have helped solve this problem, however, by combining the use of reduced representation libraries with DNA sample barcoding to generate genome-wide genotype data from a common set of genetic markers across a large number of samples. Here we use such a method, called genotyping-by-sequencing (GBS), to produce a data set for genetic mapping in an F1 population of apples (Malus × domestica) segregating for skin color. We show that GBS produces a relatively large, but extremely sparse, genotype matrix: over 270,000 SNPs were discovered but most SNPs have too much missing data across samples to be useful for genetic mapping. After filtering for genotype quality and missing data, only 6% of the 85 million DNA sequence reads contributed to useful genotype calls. Despite this limitation, using existing software and a set of simple heuristics, we generated a final genotype matrix containing 3967 SNPs from 89 DNA samples from a single lane of Illumina HiSeq and used it to create a saturated genetic linkage map and to identify a known QTL underlying apple skin color. We therefore demonstrate that GBS is a cost-effective method for generating genome-wide SNP data suitable for genetic mapping in a highly diverse and heterozygous agricultural species. We anticipate future improvements to the GBS analysis pipeline presented here that will enhance the utility of next-generation DNA sequence data for the purposes of genetic mapping across diverse species.
Collapse
|
311
|
Abstract
Multiple disease resistance has important implications for plant fitness, given the selection pressure that many pathogens exert directly on natural plant populations and indirectly via crop improvement programs. Evidence of a locus conditioning resistance to multiple pathogens was found in bin 1.06 of the maize genome with the allele from inbred line "Tx303" conditioning quantitative resistance to northern leaf blight (NLB) and qualitative resistance to Stewart's wilt. To dissect the genetic basis of resistance in this region and to refine candidate gene hypotheses, we mapped resistance to the two diseases. Both resistance phenotypes were localized to overlapping regions, with the Stewart's wilt interval refined to a 95.9-kb segment containing three genes and the NLB interval to a 3.60-Mb segment containing 117 genes. Regions of the introgression showed little to no recombination, suggesting structural differences between the inbred lines Tx303 and "B73," the parents of the fine-mapping population. We examined copy number variation across the region using next-generation sequencing data, and found large variation in read depth in Tx303 across the region relative to the reference genome of B73. In the fine-mapping region, association mapping for NLB implicated candidate genes, including a putative zinc finger and pan1. We tested mutant alleles and found that pan1 is a susceptibility gene for NLB and Stewart's wilt. Our data strongly suggest that structural variation plays an important role in resistance conditioned by this region, and pan1, a gene conditioning susceptibility for NLB, may underlie the QTL.
Collapse
|
312
|
Mascher M, Gerlach N, Gahrtz M, Bucher M, Scholz U, Dresselhaus T. Sequence and ionomic analysis of divergent strains of maize inbred line B73 with an altered growth phenotype. PLoS One 2014; 9:e96782. [PMID: 24804793 PMCID: PMC4013074 DOI: 10.1371/journal.pone.0096782] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2013] [Accepted: 04/11/2014] [Indexed: 11/19/2022] Open
Abstract
Maize (Zea mays) is the most widely grown crop species in the world and a classical model organism for plant research. The completion of a high-quality reference genome sequence and the advent of high-throughput sequencing have greatly empowered re-sequencing studies in maize. In this study, plants of maize inbred line B73 descended from two different sets of seed material grown for several generations either in the field or in the greenhouse were found to show a different growth phenotype and ionome under phosphate starvation conditions and moreover a different responsiveness towards mycorrhizal fungi of the species Glomus intraradices (syn: Rhizophagus irregularis). Whole genome re-sequencing of individuals from both sets and comparison to the B73 reference sequence revealed three cryptic introgressions on chromosomes 1, 5 and 10 in the line grown in the greenhouse summing up to a total of 5,257 single-nucleotide polymorphisms (SNPs). Transcriptome sequencing of three individuals from each set lent further support to the location of the introgression intervals and confirmed them to be fixed in all sequenced individuals. Moreover, we identified >120 genes differentially expressed between the two B73 lines. We thus have found a nearly-isogenic line (NIL) of maize inbred line B73 that is characterized by an altered growth phenotype under phosphate starvation conditions and an improved responsiveness towards symbiosis with mycorrhizal fungi. Through next-generation sequencing of the genomes and transcriptomes we were able to delineate exact introgression intervals. Putative de novo mutations appeared approximately uniformly distributed along the ten maize chromosomes mainly representing G:C -> A:T transitions. The plant material described in this study will be a valuable tool both for functional studies of genes differentially expressed in both B73 lines and for research on growth behavior especially in response to symbiosis between maize and mycorrhizal fungi.
Collapse
Affiliation(s)
- Martin Mascher
- Department of Cytogenetics and Genome Analysis, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraβe 3, Stadt Seeland, Germany
| | - Nina Gerlach
- Botanical Institute, Cologne Biocenter, Cluster of Excellence on Plant Sciences (CEPLAS), University of Cologne, Zülpicherstrasse 47b, Cologne, Germany
| | - Manfred Gahrtz
- Cell Biology and Plant Biochemistry, Biochemie-Zentrum Regensburg, University of Regensburg, Universitätsstraβe 31, Regensburg, Germany
| | - Marcel Bucher
- Botanical Institute, Cologne Biocenter, Cluster of Excellence on Plant Sciences (CEPLAS), University of Cologne, Zülpicherstrasse 47b, Cologne, Germany
| | - Uwe Scholz
- Department of Cytogenetics and Genome Analysis, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraβe 3, Stadt Seeland, Germany
| | - Thomas Dresselhaus
- Cell Biology and Plant Biochemistry, Biochemie-Zentrum Regensburg, University of Regensburg, Universitätsstraβe 31, Regensburg, Germany
- * E-mail:
| |
Collapse
|
313
|
Rincent R, Moreau L, Monod H, Kuhn E, Melchinger AE, Malvar RA, Moreno-Gonzalez J, Nicolas S, Madur D, Combes V, Dumas F, Altmann T, Brunel D, Ouzunova M, Flament P, Dubreuil P, Charcosset A, Mary-Huard T. Recovering power in association mapping panels with variable levels of linkage disequilibrium. Genetics 2014; 197:375-87. [PMID: 24532779 PMCID: PMC4012494 DOI: 10.1534/genetics.113.159731] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 02/09/2014] [Indexed: 11/18/2022] Open
Abstract
Association mapping has permitted the discovery of major QTL in many species. It can be applied to existing populations and, as a consequence, it is generally necessary to take into account structure and relatedness among individuals in the statistical model to control false positives. We analytically studied power in association studies by computing noncentrality parameter of the tests and its relationship with parameters characterizing diversity (genetic differentiation between groups and allele frequencies) and kinship between individuals. Investigation of three different maize diversity panels genotyped with the 50k SNPs array highlighted contrasted average power among panels and revealed gaps of power of classical mixed models in regions with high linkage disequilibrium (LD). These gaps could be related to the fact that markers are used for both testing association and estimating relatedness. We thus considered two alternative approaches to estimating the kinship matrix to recover power in regions of high LD. In the first one, we estimated the kinship with all the markers that are not located on the same chromosome than the tested SNP. In the second one, correlation between markers was taken into account to weight the contribution of each marker to the kinship. Simulations revealed that these two approaches were efficient to control false positives and were more powerful than classical models.
Collapse
Affiliation(s)
- Renaud Rincent
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
- Biogemma, Genetics and Genomics in Cereals, 63720 Chappes, France
- Kleinwanzlebener Saatzucht Saat AG, 37555 Einbeck, Germany
- Limagrain, site d’Ulice, BP173, 63204 Riom Cedex, France
| | - Laurence Moreau
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
| | - Hervé Monod
- Institut National de la Recherche Agronomique, Unité de Mathématique et Informatique Appliquées, 78352 Jouy-en-Josas, France
| | - Estelle Kuhn
- Institut National de la Recherche Agronomique, Unité de Mathématique et Informatique Appliquées, 78352 Jouy-en-Josas, France
| | - Albrecht E. Melchinger
- Institute of Plant Breeding, Seed Science, and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany
| | - Rosa A. Malvar
- Misión Biológica de Galicia, Spanish National Research Council, 36080 Pontevedra, Spain
| | | | - Stéphane Nicolas
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
| | - Delphine Madur
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
| | - Valérie Combes
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
| | - Fabrice Dumas
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
| | - Thomas Altmann
- Max-Planck Institute for Molecular Plant Physiology, 14476 Potsdam-Golm and Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Gatersleben, Germany
| | - Dominique Brunel
- Institut National de la Recherche Agronomique, Etude du Polymorphisme des Génomes Végétaux, Commissariat à l'Energie Atomique Institut de Génomique, Centre National de Génotypage, 91057 Evry, France
| | | | - Pascal Flament
- Limagrain, site d’Ulice, BP173, 63204 Riom Cedex, France
| | - Pierre Dubreuil
- Biogemma, Genetics and Genomics in Cereals, 63720 Chappes, France
| | - Alain Charcosset
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
| | - Tristan Mary-Huard
- Unité Mixte de Recherche de Génétique Végétale, Institut National de la Recherche Agronomique, Université Paris-Sud, Centre National de la Recherche Scientifique, 91190 Gif-sur-Yvette, France
- Institut National de la Recherche Agronomique/AgroParisTech, Unité Mixte de Recherche 518, 75231, Paris, France
| |
Collapse
|
314
|
TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One 2014; 9:e90346. [PMID: 24587335 PMCID: PMC3938676 DOI: 10.1371/journal.pone.0090346] [Citation(s) in RCA: 957] [Impact Index Per Article: 87.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2013] [Accepted: 01/28/2014] [Indexed: 12/30/2022] Open
Abstract
Genotyping by sequencing (GBS) is a next generation sequencing based method that takes advantage of reduced representation to enable high throughput genotyping of large numbers of individuals at a large number of SNP markers. The relatively straightforward, robust, and cost-effective GBS protocol is currently being applied in numerous species by a large number of researchers. Herein we describe a bioinformatics pipeline, tassel-gbs, designed for the efficient processing of raw GBS sequence data into SNP genotypes. The tassel-gbs pipeline successfully fulfills the following key design criteria: (1) Ability to run on the modest computing resources that are typically available to small breeding or ecological research programs, including desktop or laptop machines with only 8–16 GB of RAM, (2) Scalability from small to extremely large studies, where hundreds of thousands or even millions of SNPs can be scored in up to 100,000 individuals (e.g., for large breeding programs or genetic surveys), and (3) Applicability in an accelerated breeding context, requiring rapid turnover from tissue collection to genotypes. Although a reference genome is required, the pipeline can also be run with an unfinished “pseudo-reference” consisting of numerous contigs. We describe the tassel-gbs pipeline in detail and benchmark it based upon a large scale, species wide analysis in maize (Zea mays), where the average error rate was reduced to 0.0042 through application of population genetic-based SNP filters. Overall, the GBS assay and the tassel-gbs pipeline provide robust tools for studying genomic diversity.
Collapse
|
315
|
Abstract
Height is one of the most heritable and easily measured traits in maize (Zea mays L.). Given a pedigree or estimates of the genomic identity-by-state among related plants, height is also accurately predictable. But, mapping alleles explaining natural variation in maize height remains a formidable challenge. To address this challenge, we measured the plant height, ear height, flowering time, and node counts of plants grown in >64,500 plots across 13 environments. These plots contained >7300 inbreds representing most publically available maize inbreds in the United States and families of the maize Nested Association Mapping (NAM) panel. Joint-linkage mapping of quantitative trait loci (QTL), fine mapping in near isogenic lines (NILs), genome-wide association studies (GWAS), and genomic best linear unbiased prediction (GBLUP) were performed. The heritability of maize height was estimated to be >90%. Mapping NAM family-nested QTL revealed the largest explained 2.1 ± 0.9% of height variation. The effects of two tropical alleles at this QTL were independently validated by fine mapping in NIL families. Several significant associations found by GWAS colocalized with established height loci, including brassinosteroid-deficient dwarf1, dwarf plant1, and semi-dwarf2. GBLUP explained >80% of height variation in the panels and outperformed bootstrap aggregation of family-nested QTL models in evaluations of prediction accuracy. These results revealed maize height was under strong genetic control and had a highly polygenic genetic architecture. They also showed that multiple models of genetic architecture differing in polygenicity and effect sizes can plausibly explain a population’s variation in maize height, but they may vary in predictive efficacy.
Collapse
|
316
|
Mezmouk S, Ross-Ibarra J. The pattern and distribution of deleterious mutations in maize. G3 (BETHESDA, MD.) 2014; 4:163-71. [PMID: 24281428 PMCID: PMC3887532 DOI: 10.1534/g3.113.008870] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2013] [Accepted: 11/19/2013] [Indexed: 12/19/2022]
Abstract
Most nonsynonymous mutations are thought to be deleterious because of their effect on protein sequence and are expected to be removed or kept at low frequency by the action of natural selection. Nonetheless, the effect of positive selection on linked sites or drift in small or inbred populations may also impact the evolution of deleterious alleles. Despite their potential to affect complex trait phenotypes, deleterious alleles are difficult to study precisely because they are often at low frequency. Here, we made use of genome-wide genotyping data to characterize deleterious variants in a large panel of maize inbred lines. We show that, despite small effective population sizes and inbreeding, most putatively deleterious SNPs are indeed at low frequencies within individual genetic groups. We find that genes associated with a number of complex traits are enriched for deleterious variants. Together, these data are consistent with the dominance model of heterosis, in which complementation of numerous low-frequency, weak deleterious variants contribute to hybrid vigor.
Collapse
Affiliation(s)
- Sofiane Mezmouk
- Department of Plant Sciences, University of California–Davis, Davis, California 95616
| | - Jeffrey Ross-Ibarra
- Department of Plant Sciences, University of California–Davis, Davis, California 95616
- Center for Population Biology and Genome Center, University of California–Davis, Davis, California 95616
| |
Collapse
|
317
|
He J, Zhao X, Laroche A, Lu ZX, Liu H, Li Z. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding. FRONTIERS IN PLANT SCIENCE 2014; 5:484. [PMID: 25324846 PMCID: PMC4179701 DOI: 10.3389/fpls.2014.00484] [Citation(s) in RCA: 276] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 09/02/2014] [Indexed: 05/05/2023]
Abstract
Marker-assisted selection (MAS) refers to the use of molecular markers to assist phenotypic selections in crop improvement. Several types of molecular markers, such as single nucleotide polymorphism (SNP), have been identified and effectively used in plant breeding. The application of next-generation sequencing (NGS) technologies has led to remarkable advances in whole genome sequencing, which provides ultra-throughput sequences to revolutionize plant genotyping and breeding. To further broaden NGS usages to large crop genomes such as maize and wheat, genotyping-by-sequencing (GBS) has been developed and applied in sequencing multiplexed samples that combine molecular marker discovery and genotyping. GBS is a novel application of NGS protocols for discovering and genotyping SNPs in crop genomes and populations. The GBS approach includes the digestion of genomic DNA with restriction enzymes followed by the ligation of barcode adapter, PCR amplification and sequencing of the amplified DNA pool on a single lane of flow cells. Bioinformatic pipelines are needed to analyze and interpret GBS datasets. As an ultimate MAS tool and a cost-effective technique, GBS has been successfully used in implementing genome-wide association study (GWAS), genomic diversity study, genetic linkage analysis, molecular marker discovery and genomic selection under a large scale of plant breeding programs.
Collapse
Affiliation(s)
- Jiangfeng He
- Inner Mongolia Academy of Agriculture and Husbandry ScienceHohhot, China
- Lethbridge Research Centre, Agriculture and Agri-Food CanadaLethbridge, AB, Canada
| | - Xiaoqing Zhao
- Inner Mongolia Academy of Agriculture and Husbandry ScienceHohhot, China
| | - André Laroche
- Lethbridge Research Centre, Agriculture and Agri-Food CanadaLethbridge, AB, Canada
| | - Zhen-Xiang Lu
- Lethbridge Research Centre, Agriculture and Agri-Food CanadaLethbridge, AB, Canada
| | - HongKui Liu
- Inner Mongolia Academy of Agriculture and Husbandry ScienceHohhot, China
- *Correspondence: Hongkui Liu and Ziqin Li, Inner Mongolia Academy of Agriculture and Husbandry Science, Zhaojun Road 22, Hohhot, Inner Mongolia 010031, China e-mail: ;
| | - Ziqin Li
- Inner Mongolia Academy of Agriculture and Husbandry ScienceHohhot, China
- *Correspondence: Hongkui Liu and Ziqin Li, Inner Mongolia Academy of Agriculture and Husbandry Science, Zhaojun Road 22, Hohhot, Inner Mongolia 010031, China e-mail: ;
| |
Collapse
|
318
|
Huang X, Han B. Natural variations and genome-wide association studies in crop plants. ANNUAL REVIEW OF PLANT BIOLOGY 2014; 65:531-51. [PMID: 24274033 DOI: 10.1146/annurev-arplant-050213-035715] [Citation(s) in RCA: 381] [Impact Index Per Article: 34.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Natural variants of crops are generated from wild progenitor plants under both natural and human selection. Diverse crops that are able to adapt to various environmental conditions are valuable resources for crop improvements to meet the food demands of the increasing human population. With the completion of reference genome sequences, the advent of high-throughput sequencing technology now enables rapid and accurate resequencing of a large number of crop genomes to detect the genetic basis of phenotypic variations in crops. Comprehensive maps of genome variations facilitate genome-wide association studies of complex traits and functional investigations of evolutionary changes in crops. These advances will greatly accelerate studies on crop designs via genomics-assisted breeding. Here, we first discuss crop genome studies and describe the development of sequencing-based genotyping and genome-wide association studies in crops. We then review sequencing-based crop domestication studies and offer a perspective on genomics-driven crop designs.
Collapse
Affiliation(s)
- Xuehui Huang
- National Center for Gene Research, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, China; ,
| | | |
Collapse
|
319
|
Huang YF, Poland JA, Wight CP, Jackson EW, Tinker NA. Using genotyping-by-sequencing (GBS) for genomic discovery in cultivated oat. PLoS One 2014. [PMID: 25047601 DOI: 10.1371/journalpone0102448] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2023] Open
Abstract
Advances in next-generation sequencing offer high-throughput and cost-effective genotyping alternatives, including genotyping-by-sequencing (GBS). Results have shown that this methodology is efficient for genotyping a variety of species, including those with complex genomes. To assess the utility of GBS in cultivated hexaploid oat (Avena sativa L.), seven bi-parental mapping populations and diverse inbred lines from breeding programs around the world were studied. We examined technical factors that influence GBS SNP calls, established a workflow that combines two bioinformatics pipelines for GBS SNP calling, and provided a nomenclature for oat GBS loci. The high-throughput GBS system enabled us to place 45,117 loci on an oat consensus map, thus establishing a positional reference for further genomic studies. Using the diversity lines, we estimated that a minimum density of one marker per 2 to 2.8 cM would be required for genome-wide association studies (GWAS), and GBS markers met this density requirement in most chromosome regions. We also demonstrated the utility of GBS in additional diagnostic applications related to oat breeding. We conclude that GBS is a powerful and useful approach, which will have many additional applications in oat breeding and genomic studies.
Collapse
Affiliation(s)
- Yung-Fen Huang
- Eastern Cereal and Oilseed Research Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario, Canada
| | - Jesse A Poland
- Department of Plant Pathology, Kansas State University, Manhattan, Kansas, United States of America
| | - Charlene P Wight
- Eastern Cereal and Oilseed Research Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario, Canada
| | - Eric W Jackson
- General Mills Crop Biosciences, Manhattan, Kansas, United States of America
| | - Nicholas A Tinker
- Eastern Cereal and Oilseed Research Centre, Agriculture and Agri-Food Canada, Ottawa, Ontario, Canada
| |
Collapse
|
320
|
Genomic prediction in maize breeding populations with genotyping-by-sequencing. G3-GENES GENOMES GENETICS 2013; 3:1903-26. [PMID: 24022750 PMCID: PMC3815055 DOI: 10.1534/g3.113.008227] [Citation(s) in RCA: 147] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Genotyping-by-sequencing (GBS) technologies have proven capacity for delivering large numbers of marker genotypes with potentially less ascertainment bias than standard single nucleotide polymorphism (SNP) arrays. Therefore, GBS has become an attractive alternative technology for genomic selection. However, the use of GBS data poses important challenges, and the accuracy of genomic prediction using GBS is currently undergoing investigation in several crops, including maize, wheat, and cassava. The main objective of this study was to evaluate various methods for incorporating GBS information and compare them with pedigree models for predicting genetic values of lines from two maize populations evaluated for different traits measured in different environments (experiments 1 and 2). Given that GBS data come with a large percentage of uncalled genotypes, we evaluated methods using nonimputed, imputed, and GBS-inferred haplotypes of different lengths (short or long). GBS and pedigree data were incorporated into statistical models using either the genomic best linear unbiased predictors (GBLUP) or the reproducing kernel Hilbert spaces (RKHS) regressions, and prediction accuracy was quantified using cross-validation methods. The following results were found: relative to pedigree or marker-only models, there were consistent gains in prediction accuracy by combining pedigree and GBS data; there was increased predictive ability when using imputed or nonimputed GBS data over inferred haplotype in experiment 1, or nonimputed GBS and information-based imputed short and long haplotypes, as compared to the other methods in experiment 2; the level of prediction accuracy achieved using GBS data in experiment 2 is comparable to those reported by previous authors who analyzed this data set using SNP arrays; and GBLUP and RKHS models with pedigree with nonimputed and imputed GBS data provided the best prediction correlations for the three traits in experiment 1, whereas for experiment 2 RKHS provided slightly better prediction than GBLUP for drought-stressed environments, and both models provided similar predictions in well-watered environments.
Collapse
|
321
|
Zila CT, Samayoa LF, Santiago R, Butrón A, Holland JB. A genome-wide association study reveals genes associated with fusarium ear rot resistance in a maize core diversity panel. G3 (BETHESDA, MD.) 2013; 3:2095-104. [PMID: 24048647 PMCID: PMC3815068 DOI: 10.1534/g3.113.007328] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 09/11/2013] [Indexed: 12/28/2022]
Abstract
Fusarium ear rot is a common disease of maize that affects food and feed quality globally. Resistance to the disease is highly quantitative, and maize breeders have difficulty incorporating polygenic resistance alleles from unadapted donor sources into elite breeding populations without having a negative impact on agronomic performance. Identification of specific allele variants contributing to improved resistance may be useful to breeders by allowing selection of resistance alleles in coupling phase linkage with favorable agronomic characteristics. We report the results of a genome-wide association study to detect allele variants associated with increased resistance to Fusarium ear rot in a maize core diversity panel of 267 inbred lines evaluated in two sets of environments. We performed association tests with 47,445 single-nucleotide polymorphisms (SNPs) while controlling for background genomic relationships with a mixed model and identified three marker loci significantly associated with disease resistance in at least one subset of environments. Each associated SNP locus had relatively small additive effects on disease resistance (±1.1% on a 0-100% scale), but nevertheless were associated with 3 to 12% of the genotypic variation within or across environment subsets. Two of three identified SNPs colocalized with genes that have been implicated with programmed cell death. An analysis of associated allele frequencies within the major maize subpopulations revealed enrichment for resistance alleles in the tropical/subtropical and popcorn subpopulations compared with other temperate breeding pools.
Collapse
Affiliation(s)
- Charles T. Zila
- Department of Crop Science, North Carolina State University, Raleigh, North Carolina 27695
| | | | | | - Ana Butrón
- Misión Biológica de Galicia, CSIC, Pontevedra, Spain, 36080
| | - James B. Holland
- Department of Crop Science, North Carolina State University, Raleigh, North Carolina 27695
- U.S. Department of Agriculture—Agricultural Research Service, Plant Science Research Unit, Raleigh, North Carolina 27695
| |
Collapse
|
322
|
Bilsborough GD. Plant genomics: sowing the seeds of success. Genome Biol 2013; 14:404. [PMID: 23809627 PMCID: PMC3706952 DOI: 10.1186/gb4111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023] Open
|
323
|
|