1
|
Zhang J, Yue C, Zhang YM. Bias correction for estimated QTL effects using the penalized maximum likelihood method. Heredity (Edinb) 2012; 108:396-402. [PMID: 21934700 PMCID: PMC3313049 DOI: 10.1038/hdy.2011.86] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2011] [Revised: 08/05/2011] [Accepted: 08/12/2011] [Indexed: 01/22/2023] Open
Abstract
A penalized maximum likelihood method has been proposed as an important approach to the detection of epistatic quantitative trait loci (QTL). However, this approach is not optimal in two special situations: (1) closely linked QTL with effects in opposite directions and (2) small-effect QTL, because the method produces downwardly biased estimates of QTL effects. The present study aims to correct the bias by using correction coefficients and shifting from the use of a uniform prior on the variance parameter of a QTL effect to that of a scaled inverse chi-square prior. The results of Monte Carlo simulation experiments show that the improved method increases the power from 25 to 88% in the detection of two closely linked QTL of equal size in opposite directions and from 60 to 80% in the identification of QTL with small effects (0.5% of the total phenotypic variance). We used the improved method to detect QTL responsible for the barley kernel weight trait using 145 doubled haploid lines developed in the North American Barley Genome Mapping Project. Application of the proposed method to other shrinkage estimation of QTL effects is discussed.
Collapse
Affiliation(s)
- J Zhang
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - C Yue
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - Y-M Zhang
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|
2
|
Abstract
This chapter covers the procedure of mapping quantitative trait loci (QTLs) in an F(2) breeding design. I describe genetic design, general methods and software, and several commonly used approaches. The genetic design section includes F(2) population construction. Widely used methods and software are introduced in the section of general methods and software. Finally, composite interval mapping, penalized maximum likelihood, and empirical Bayes are described in detail. Some issues related to the F(2) design are discussed.
Collapse
|
3
|
Lü HY, Liu XF, Wei SP, Zhang YM. Epistatic association mapping in homozygous crop cultivars. PLoS One 2011; 6:e17773. [PMID: 21423630 PMCID: PMC3058038 DOI: 10.1371/journal.pone.0017773] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2010] [Accepted: 02/14/2011] [Indexed: 12/02/2022] Open
Abstract
The genetic dissection of complex traits plays a crucial role in crop breeding. However, genetic analysis and crop breeding have heretofore been performed separately. In this study, we designed a new approach that integrates epistatic association analysis in crop cultivars with breeding by design. First, we proposed an epistatic association mapping (EAM) approach in homozygous crop cultivars. The phenotypic values of complex traits, along with molecular marker information, were used to perform EAM. In our EAM, all the main-effect quantitative trait loci (QTLs), environmental effects, QTL-by-environment interactions and QTL-by-QTL interactions were included in a full model and estimated by empirical Bayes approach. A series of Monte Carlo simulations was performed to confirm the reliability of the new method. Next, the information from all detected QTLs was used to mine novel alleles for each locus and to design elite cross combination. Finally, the new approach was adopted to dissect the genetic basis of seed length in 215 soybean cultivars obtained, by stratified random sampling, from 6 geographic ecotypes in China. As a result, 19 main-effect QTLs and 3 epistatic QTLs were identified, more than 10 novel alleles were mined and 3 elite parental combinations, such as Daqingdou and Zhengzhou790034, were predicted.
Collapse
Affiliation(s)
- Hai-Yan Lü
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, Jiangsu, China
- College of Information and Management Science, Henan Agricultural University, Zhengzhou, Henan, China
| | - Xiao-Fen Liu
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Shi-Ping Wei
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Yuan-Ming Zhang
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, Jiangsu, China
| |
Collapse
|
4
|
He XH, Qin H, Hu Z, Zhang T, Zhang YM. Mapping of epistatic quantitative trait loci in four-way crosses. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 122:33-48. [PMID: 20827458 DOI: 10.1007/s00122-010-1420-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2009] [Accepted: 07/24/2010] [Indexed: 05/29/2023]
Abstract
Four-way crosses (4WC) involving four different inbred lines often appear in plant and animal commercial breeding programs. Direct mapping of quantitative trait loci (QTL) in these commercial populations is both economical and practical. However, the existing statistical methods for mapping QTL in a 4WC population are built on the single-QTL genetic model. This simple genetic model fails to take into account QTL interactions, which play an important role in the genetic architecture of complex traits. In this paper, therefore, we attempted to develop a statistical method to detect epistatic QTL in 4WC population. Conditional probabilities of QTL genotypes, computed by the multi-point single locus method, were used to sample the genotypes of all putative QTL in the entire genome. The sampled genotypes were used to construct the design matrix for QTL effects. All QTL effects, including main and epistatic effects, were simultaneously estimated by the penalized maximum likelihood method. The proposed method was confirmed by a series of Monte Carlo simulation studies and real data analysis of cotton. The new method will provide novel tools for the genetic dissection of complex traits, construction of QTL networks, and analysis of heterosis.
Collapse
Affiliation(s)
- Xiao-Hong He
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | | | | | | | | |
Collapse
|
5
|
Lü HY, Li M, Li GJ, Yao LL, Lin F, Zhang YM. Multiple loci in silico mapping in inbred lines. Heredity (Edinb) 2009; 103:346-54. [PMID: 19491924 DOI: 10.1038/hdy.2009.66] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The in silico mapping (ISM) technique and its extension represent major advances for novel gene discovery in germplasm resources of inbred lines. However, the techniques suffer from a relatively high false-positive rate (FPR) and they do not consider the effect of linkage disequilibrium (LD) markers around the identified quantitative trait locus (QTL). In addition, it has not yet been established whether it is optimal to use absolute trait differences as the response variable. To address these problems, this article presents the multiple loci ISM (MLISM) approach, which uses all markers on the entire genome, along with a penalized maximum likelihood. The method proposed here was verified by a series of simulation experiments with a maize pedigree population of inbred lines of known ancestry. Results from the simulated studies show that the best response variable is the trait product. The MLISM FPR is substantially decreased and the proportion of the number of false QTL to the number of LD markers around the identified QTL is adequately reduced. The MLISM method, with the trait product as the response variable, is an improvement on the existing methods for novel QTL mapping in germplasm resources of inbred lines.
Collapse
Affiliation(s)
- H-Y Lü
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Department of Crop Genetics and Breeding, Nanjing Agricultural University, Nanjing, China
| | | | | | | | | | | |
Collapse
|
6
|
Zhu M, Yu M, Zhao S. Understanding quantitative genetics in the systems biology era. Int J Biol Sci 2009; 5:161-70. [PMID: 19173038 PMCID: PMC2631226 DOI: 10.7150/ijbs.5.161] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2008] [Accepted: 01/21/2009] [Indexed: 01/06/2023] Open
Abstract
Biology is now entering the new era of systems biology and exerting a growing influence on the future development of various disciplines within life sciences. In early classical and molecular periods of Biology, the theoretical frames of classical and molecular quantitative genetics have been systematically established, respectively. With the new advent of systems biology, there is occurring a paradigm shift in the field of quantitative genetics. Where and how the quantitative genetics would develop after having undergone its classical and molecular periods? This is a difficult question to answer exactly. In this perspective article, the major effort was made to discuss the possible development of quantitative genetics in the systems biology era, and for which there is a high potentiality to develop towards "systems quantitative genetics". In our opinion, the systems quantitative genetics can be defined as a new discipline to address the generalized genetic laws of bioalleles controlling the heritable phenotypes of complex traits following a new dynamic network model. Other issues from quantitative genetic perspective relating to the genetical genomics, the updates of network model, and the future research prospects were also discussed.
Collapse
Affiliation(s)
| | | | - Shuhong Zhao
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Huazhong Agricultural University, Wuhan 430070, P. R. China
| |
Collapse
|
7
|
Zhang YM, Lü HY, Yao LL. Multiple quantitative trait loci Haseman-Elston regression using all markers on the entire genome. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008; 117:683-690. [PMID: 18563308 DOI: 10.1007/s00122-008-0809-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2007] [Accepted: 05/17/2008] [Indexed: 05/26/2023]
Abstract
The Haseman-Elston (HE) regression, developed in the 1970s, remains in common use to detect genetic linkage between a quantitative trait and a genetic marker. Although the technique has been improved in a number of ways, it predicts a high rate of false positive quantitative trait locus (QTL) because it is based on a single-QTL model. We have extended the origin HE regression to multi-QTL HE (MQHE) regression, so that all markers across the entire genome can be exploited simultaneously. The parameters have been estimated by the penalized maximum likelihood method, and several response variables for phenotypic difference have been compared in order to optimize the procedure. The method has been tested by simulation in a pedigree population of maize inbred lines of known ancestry. These simulations show that the trait product is the optimal response variable for phenotypic difference. The false positive rate produced by the MQHE regression is substantially lower than that generated by either variance component analysis or the origin HE regression. The MQHE regression, with the trait product as the response variable, represents a significant improvement on existing methods for QTL mapping in a set of inbred lines (or cultivars) of known ancestry.
Collapse
Affiliation(s)
- Yuan-Ming Zhang
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University, 1 Weigang Road, Nanjing 210095, China.
| | | | | |
Collapse
|
8
|
Zhang YM, Gai J. Methodologies for segregation analysis and QTL mapping in plants. Genetica 2008; 136:311-8. [PMID: 18726162 DOI: 10.1007/s10709-008-9313-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2008] [Accepted: 08/11/2008] [Indexed: 12/01/2022]
Abstract
Most characters of biological interest and economic importance are quantitative traits. To uncover the genetic architecture of quantitative traits, two approaches have become popular in China. One is the establishment of an analytical model for mixed major-gene plus polygenes inheritance and the other the discovery of quantitative trait locus (QTL). Here we review our progress employing these two approaches. First, we proposed joint segregation analysis of multiple generations for mixed major-gene plus polygenes inheritance. Second, we extended the multilocus method of Lander and Green (1987), Jiang and Zeng (1997) to a more generalized approach. Our methodology handles distorted, dominant and missing markers, including the effect of linked segregation distortion loci on the estimation of map distance. Finally, we developed several QTL mapping methods. In the Bayesian shrinkage estimation (BSE) method, we suggested a method to test the significance of QTL effects and studied the effect of the prior distribution of the variance of QTL effect on QTL mapping. To reduce running time, a penalized maximum likelihood method was adopted. To mine novel genes in crop inbred lines generated in the course of normal crop breeding work, three methods were introduced. If a well-documented genealogical history of the lines is available, two-stage variance component analysis and multi-QTL Haseman-Elston regression were suggested; if unavailable, multiple loci in silico mapping was proposed.
Collapse
Affiliation(s)
- Yuan-Ming Zhang
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement and National Center for Soybean Improvement, College of Agriculture, Nanjing Agricultural University, Nanjing, 210095, China.
| | | |
Collapse
|