1
|
Technow F, Schrag TA, Schipprack W, Melchinger AE. Identification of key ancestors of modern germplasm in a breeding program of maize. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2014; 127:2545-2553. [PMID: 25208647 DOI: 10.1007/s00122-014-2396-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2014] [Accepted: 08/28/2014] [Indexed: 06/03/2023]
Abstract
Probabilities of gene origin computed from the genomic kinships matrix can accurately identify key ancestors of modern germplasms Identifying the key ancestors of modern plant breeding populations can provide valuable insights into the history of a breeding program and provide reference genomes for next generation whole genome sequencing. In an animal breeding context, a method was developed that employs probabilities of gene origin, computed from the pedigree-based additive kinship matrix, for identifying key ancestors. Because reliable and complete pedigree information is often not available in plant breeding, we replaced the additive kinship matrix with the genomic kinship matrix. As a proof-of-concept, we applied this approach to simulated data sets with known ancestries. The relative contribution of the ancestral lines to later generations could be determined with high accuracy, with and without selection. Our method was subsequently used for identifying the key ancestors of the modern Dent germplasm of the public maize breeding program of the University of Hohenheim. We found that the modern germplasm can be traced back to six or seven key ancestors, with one or two of them having a disproportionately large contribution. These results largely corroborated conjectures based on early records of the breeding program. We conclude that probabilities of gene origin computed from the genomic kinships matrix can be used for identifying key ancestors in breeding programs and estimating the proportion of genes contributed by them.
Collapse
Affiliation(s)
- F Technow
- Department of Applied Genetics, Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, 70599, Germany,
| | | | | | | |
Collapse
|
2
|
Thirunavukkarasu N, Hossain F, Shiriga K, Mittal S, Arora K, Rathore A, Mohan S, Shah T, Sharma R, Namratha PM, Mithra ASV, Mohapatra T, Gupta HS. Unraveling the genetic architecture of subtropical maize (Zea mays L.) lines to assess their utility in breeding programs. BMC Genomics 2013; 14:877. [PMID: 24330649 PMCID: PMC3867671 DOI: 10.1186/1471-2164-14-877] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Accepted: 12/10/2013] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Maize is an increasingly important food crop in southeast Asia. The elucidation of its genetic architecture, accomplished by exploring quantitative trait loci and useful alleles in various lines across numerous breeding programs, is therefore of great interest. The present study aimed to characterize subtropical maize lines using high-quality SNPs distributed throughout the genome. RESULTS We genotyped a panel of 240 subtropical elite maize inbred lines and carried out linkage disequilibrium, genetic diversity, population structure, and principal component analyses on the generated SNP data. The mean SNP distance across the genome was 70 Kb. The genome had both high and low linkage disequilibrium (LD) regions; the latter were dominant in areas near the gene-rich telomeric portions where recombination is frequent. A total of 252 haplotype blocks, ranging in size from 1 to 15.8 Mb, were identified. Slow LD decay (200-300 Kb) at r(2) ≤ 0.1 across all chromosomes explained the selection of favorable traits around low LD regions in different breeding programs. The association mapping panel was characterized by strong population substructure. Genotypes were grouped into three distinct clusters with a mean genetic dissimilarity coefficient of 0.36. CONCLUSIONS The genotyped panel of subtropical maize lines characterized in this study should be useful for association mapping of agronomically important genes. The dissimilarity uncovered among genotypes provides an opportunity to exploit the heterotic potential of subtropical elite maize breeding lines.
Collapse
|
3
|
Technow F, Riedelsheimer C, Schrag TA, Melchinger AE. Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 125:1181-94. [PMID: 22733443 DOI: 10.1007/s00122-012-1905-8] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2012] [Accepted: 05/16/2012] [Indexed: 05/05/2023]
Abstract
Identifying high performing hybrids is an essential part of every maize breeding program. Genomic prediction of maize hybrid performance allows to identify promising hybrids, when they themselves or other hybrids produced from their parents were not tested in field trials. Using simulations, we investigated the effects of marker density (10, 1, 0.3 marker per mega base pair, Mbp(-1)), convergent or divergent parental populations, number of parents tested in other combinations (2, 1, 0), genetic model (including population-specific and/or dominance marker effects or not), and estimation method (GBLUP or BayesB) on the prediction accuracy. We based our simulations on marker genotypes of Central European flint and dent inbred lines from an ongoing maize breeding program. To simulate convergent or divergent parent populations, we generated phenotypes by assigning QTL to markers with similar or very different allele frequencies in both pools, respectively. Prediction accuracies increased with marker density and number of parents tested and were higher under divergent compared with convergent parental populations. Modeling marker effects as population-specific slightly improved prediction accuracy under lower marker densities (1 and 0.3 Mbp(-1)). This indicated that modeling marker effects as population-specific will be most beneficial under low linkage disequilibrium. Incorporating dominance effects improved prediction accuracies considerably for convergent parent populations, where dominance results in major contributions of SCA effects to the genetic variance among inter-population hybrids. While the general trends regarding the effects of the aforementioned influence factors on prediction accuracy were similar for GBLUP and BayesB, the latter method produced significantly higher accuracies for models incorporating dominance.
Collapse
Affiliation(s)
- Frank Technow
- Department of Applied Genetics, Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany
| | | | | | | |
Collapse
|
4
|
Technow F, Riedelsheimer C, Schrag TA, Melchinger AE. Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012. [PMID: 22733443 DOI: 10.1007/s00122‐012‐1905‐8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Identifying high performing hybrids is an essential part of every maize breeding program. Genomic prediction of maize hybrid performance allows to identify promising hybrids, when they themselves or other hybrids produced from their parents were not tested in field trials. Using simulations, we investigated the effects of marker density (10, 1, 0.3 marker per mega base pair, Mbp(-1)), convergent or divergent parental populations, number of parents tested in other combinations (2, 1, 0), genetic model (including population-specific and/or dominance marker effects or not), and estimation method (GBLUP or BayesB) on the prediction accuracy. We based our simulations on marker genotypes of Central European flint and dent inbred lines from an ongoing maize breeding program. To simulate convergent or divergent parent populations, we generated phenotypes by assigning QTL to markers with similar or very different allele frequencies in both pools, respectively. Prediction accuracies increased with marker density and number of parents tested and were higher under divergent compared with convergent parental populations. Modeling marker effects as population-specific slightly improved prediction accuracy under lower marker densities (1 and 0.3 Mbp(-1)). This indicated that modeling marker effects as population-specific will be most beneficial under low linkage disequilibrium. Incorporating dominance effects improved prediction accuracies considerably for convergent parent populations, where dominance results in major contributions of SCA effects to the genetic variance among inter-population hybrids. While the general trends regarding the effects of the aforementioned influence factors on prediction accuracy were similar for GBLUP and BayesB, the latter method produced significantly higher accuracies for models incorporating dominance.
Collapse
Affiliation(s)
- Frank Technow
- Department of Applied Genetics, Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany
| | | | | | | |
Collapse
|
5
|
QTL detection power of multi-parental RIL populations in Arabidopsis thaliana. Heredity (Edinb) 2012; 108:626-32. [PMID: 22334115 DOI: 10.1038/hdy.2011.133] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
A major goal of today's biology is to understand the genetic basis of quantitative traits. This can be achieved by statistical methods that evaluate the association between molecular marker variation and phenotypic variation in different types of mapping populations. The objective of this work was to evaluate the statistical power of quantitative trait loci (QTL) detection of various multi-parental mating designs, as well as to assess the reasons for the observed differences. Our study was based on an empirical data of 20 Arabidopsis thaliana accessions, which have been selected to capture the maximum genetic diversity. The examined mating designs differed strongly with respect to the statistical power to detect QTL. We observed the highest power to detect QTL for the diallel cross with random mating design. The results of our study suggested that performing sibling mating within subpopulations of joint-linkage mapping populations has the potential to considerably increase the power for QTL detection. Our results, however, revealed that using designs in which more than two parental alleles segregate in each subpopulation increases the power even more.
Collapse
|
6
|
Reif JC, Maurer HP, Korzun V, Ebmeyer E, Miedaner T, Würschum T. Mapping QTLs with main and epistatic effects underlying grain yield and heading time in soft winter wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 123:283-292. [PMID: 21476040 DOI: 10.1007/s00122-011-1583-y] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2010] [Accepted: 03/23/2011] [Indexed: 05/30/2023]
Abstract
There is increasing awareness that epistasis plays a role for the determination of complex traits. This study employed an association mapping approach in a large panel of 455 diverse European elite soft winter wheat lines. The genotypes were evaluated in multi-environment trials and fingerprinted with SSR markers to dissect the underlying genetic architecture of grain yield and heading time. A linear mixed model was applied to assess marker-trait associations incorporating information of covariance among relatives. Our findings indicate that main effects dominate the control of grain yield in wheat. In contrast, the genetic architecture underlying heading time is controlled by main and epistatic effects. Consequently, for heading time it is important to consider epistatic effects towards an increased selection gain in marker-assisted breeding.
Collapse
|
7
|
Abstract
Genotyping technology now allows the rapid and affordable generation of million-SNP profiles for humans, leading to considerable activity in association mapping. Similar activity is anticipated for many plant species, including Brassica. These plant association mapping activities will require the same care in quality control and quality assurance as for humans. The subsequent analyses may draw upon the same body of theory that is described here in the language of quantitative genetics.
Collapse
Affiliation(s)
- Bruce S Weir
- Department of Biostatistics, University of Washington, Box 357232, Seattle, WA 98195-7232, USA.
| |
Collapse
|
8
|
Application of association mapping to understanding the genetic diversity of plant germplasm resources. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2010; 2008:574927. [PMID: 18551188 PMCID: PMC2423417 DOI: 10.1155/2008/574927] [Citation(s) in RCA: 107] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2007] [Accepted: 04/18/2008] [Indexed: 02/05/2023]
Abstract
Compared to the conventional linkage mapping, linkage disequilibrium (LD)-mapping, using the nonrandom associations of loci in haplotypes, is a powerful high-resolution mapping tool for complex quantitative traits. The recent advances in the development of unbiased association mapping approaches for plant population with their successful applications in dissecting a number of simple to complex traits in many crop species demonstrate a flourish of the approach as a “powerful gene tagging” tool for crops in the plant genomics era of 21st century. The goal of this review is to provide nonexpert readers of crop breeding community with (1) the basic concept, merits, and simple description of existing methodologies for an association mapping with the recent improvements for plant populations, and (2) the details of some of pioneer and recent studies on association mapping in various crop species to demonstrate the feasibility, success, problems, and future perspectives of the efforts in plants. This should be helpful for interested readers of international plant research community as a guideline for the basic understanding, choosing the appropriate methods, and its application.
Collapse
|
9
|
Abstract
Efficient genomic selection in animals or crops requires the accurate prediction of the agronomic performance of individuals from their high-density molecular marker profiles. Using a training data set that contains the genotypic and phenotypic information of a large number of individuals, each marker or marker allele is associated with an estimated effect on the trait under study. These estimated marker effects are subsequently used for making predictions on individuals for which no phenotypic records are available. As most plant and animal breeding programs are currently still phenotype driven, the continuously expanding collection of phenotypic records can only be used to construct a genomic prediction model if a dense molecular marker fingerprint is available for each phenotyped individual. However, as the genotyping budget is generally limited, the genomic prediction model can only be constructed using a subset of the tested individuals and possibly a genome-covering subset of the molecular markers. In this article, we demonstrate how an optimal selection of individuals can be made with respect to the quality of their available phenotypic data. We also demonstrate how the total number of molecular markers can be reduced while a maximum genome coverage is ensured. The third selection problem we tackle is specific to the construction of a genomic prediction model for a hybrid breeding program where only molecular marker fingerprints of the homozygous parents are available. We show how to identify the set of parental inbred lines of a predefined size that has produced the highest number of progeny. These three selection approaches are put into practice in a simulation study where we demonstrate how the trade-off between sample size and sample quality affects the prediction accuracy of genomic prediction models for hybrid maize.
Collapse
|
10
|
Maenhout S, De Baets B, Haesaert G. Marker-based estimation of the coefficient of coancestry in hybrid breeding programmes. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2009; 118:1181-92. [PMID: 19224194 DOI: 10.1007/s00122-009-0972-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2008] [Accepted: 01/15/2009] [Indexed: 05/24/2023]
Abstract
Molecular markers allow to estimate the pairwise relatedness between the members of a breeding pool when their selection history is no longer available or has become too complex for a classical pedigree analysis. The field of population genetics has several estimation procedures at its disposal, but when the genotyped individuals are highly selected inbred lines, their application is not warranted as the theoretical assumptions on which these estimators were built, usually linkage equilibrium between marker loci or even Hardy-Weinberg equilibrium, are not met. An alternative approach requires the availability of a genotyped reference set of inbred lines, which allows to correct the observed marker similarities for their inherent upward bias when used as a coancestry measure. However, this approach does not guarantee that the resulting coancestry matrix is at least positive semi-definite (psd), a necessary condition for its use as a covariance matrix. In this paper we present the weighted alikeness in state (WAIS) estimator. This marker-based coancestry estimator is compared to several other commonly applied relatedness estimators under realistic hybrid breeding conditions in a number of simulations. We also fit a linear mixed model to phenotypical data from a commercial maize breeding programme and compare the likelihood of the different variance structures. WAIS is shown to be psd which makes it suitable for modelling the covariance between genetic components in linear mixed models involved in breeding value estimation or association studies. Results indicate that it generally produces a low root mean squared error under different breeding circumstances and provides a fit to the data that is comparable to that of several other marker-based alternatives. Recommendations for each of the examined coancestry measures are provided.
Collapse
Affiliation(s)
- S Maenhout
- Department of Biosciences and Landscape Architecture, University College Ghent, Voskenslaan 270, 9000, Ghent, Belgium.
| | | | | |
Collapse
|
11
|
Wang R, Yu Y, Zhao J, Shi Y, Song Y, Wang T, Li Y. Population structure and linkage disequilibrium of a mini core set of maize inbred lines in China. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008; 117:1141-53. [PMID: 18696041 DOI: 10.1007/s00122-008-0852-x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2008] [Accepted: 07/16/2008] [Indexed: 05/20/2023]
Abstract
Understanding genetic diversity, population structure, and the level and distribution of linkage disequilibrium (LD) in target populations are of great importance and the prerequisite for association mapping. In the present study, 145 genome-wide SSR markers were used to assess the genetic diversity, population structure, and LD of a set of 95 maize inbred lines which represented the Chinese maize inbred lines. Results showed that the population included a diverse genetic variation. A model-based population structure analysis subdivided the inbred lines into four subgroups that correspond to the four major empirical germplasm origins in China, i.e., Lancaster, Reid, Tangsipingtou and P. Among all of the inbred lines, 65.3% were assigned into the corresponding subgroups; others were assigned into a "mixed" subgroup. LD was significant at a 0.01 level between 63.89% of the SSR pairs in the entire sample and with a range of 18.75-40.28% in the subgroups. Among factors influencing LD, linkage was the major cause for LD of SSR loci. The results suggested that the population may be used in the detection of genome-wide SSR marker-phenotype association.
Collapse
Affiliation(s)
- Ronghuan Wang
- Institute of Crop Science, Chinese Academy of Agricultural Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement (NFCRI), 100081 Beijing, People's Republic of China.
| | | | | | | | | | | | | |
Collapse
|