1
|
Souza VFD, Pereira GDS, Pastina MM, Parrella RADC, Simeone MLF, Barros BDA, Noda RW, da Costa e Silva L, Magalhães JVD, Schaffert RE, Garcia AAF, Damasceno CMB. QTL mapping for bioenergy traits in sweet sorghum recombinant inbred lines. G3 GENES|GENOMES|GENETICS 2021; 11:6370150. [PMID: 34519766 PMCID: PMC8527507 DOI: 10.1093/g3journal/jkab314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 08/26/2021] [Indexed: 11/13/2022]
Abstract
Abstract
During the past decade, sweet sorghum (Sorghum bicolor Moench L.) has shown great potential for bioenergy production, especially biofuels. In this study, 223 recombinant inbred lines (RILs) derived from a cross between two sweet sorghum lines (Brandes × Wray) were evaluated in three trials. Single-nucleotide polymorphisms (SNPs) derived from genotyping by sequencing of 272 RILs were used to build a high-density genetic map comprising 3,767 SNPs spanning 1,368.83 cM. Multitrait multiple interval mapping (MT-MIM) was carried out to map quantitative trait loci (QTL) for eight bioenergy traits. A total of 33 QTLs were identified for flowering time, plant height, total soluble solids and sucrose (five QTLs each), fibers (four QTLs), and fresh biomass yield, juice extraction yield, and reducing sugars (three QTLs each). QTL hotspots were found on chromosomes 1, 3, 6, 9, and 10, in addition to other QTLs detected on chromosomes 4 and 8. We observed that 14 out of the 33 mapped QTLs were found in all three trials. Upon further development and validation in other crosses, the results provided by the present study have a great potential to be used in marker-assisted selection in sorghum breeding programs for biofuel production.
Collapse
Affiliation(s)
| | - Guilherme da Silva Pereira
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | | | | | | | | | | | | | | | | | - Antonio Augusto Franco Garcia
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | | |
Collapse
|
2
|
de Almeida CP, de Carvalho Paulino JF, Bonfante GFJ, Perseguini JMKC, Santos IL, Gonçalves JGR, Patrício FRA, Taniguti CH, Gesteira GDS, Garcia AAF, Song Q, Carbonell SAM, Chiorato AF, Benchimol-Reis LL. Angular Leaf Spot Resistance Loci Associated With Different Plant Growth Stages in Common Bean. FRONTIERS IN PLANT SCIENCE 2021; 12:647043. [PMID: 33927738 PMCID: PMC8078856 DOI: 10.3389/fpls.2021.647043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Accepted: 03/23/2021] [Indexed: 06/12/2023]
Abstract
Angular leaf spot (ALS) is a disease that causes major yield losses in the common bean crop. Studies based on different isolates and populations have already been carried out to elucidate the genetic mechanisms of resistance to ALS. However, understanding of the interaction of this resistance with the reproductive stages of common bean is lacking. The aim of the present study was to identify ALS resistance loci at different plant growth stages (PGS) by association and linkage mapping approaches. An BC2F3 inter-gene pool cross population (AND 277 × IAC-Milênio - AM population) profiled with 1,091 SNPs from genotyping by sequencing (GBS) was used for linkage mapping, and a carioca diversity panel (CDP) genotyped by 5,398 SNPs from BeadChip assay technology was used for association mapping. Both populations were evaluated for ALS resistance at the V2 and V3 PGSs (controlled conditions) and R8 PGS (field conditions). Different QTL (quantitative trait loci) were detected for the three PGSs and both populations, showing a different quantitative profile of the disease at different plant growth stages. For the three PGS, multiple interval mapping (MIM) identified seven significant QTL, and the Genome-wide association study (GWAS) identified fourteen associate SNPs. Several loci validated regions of previous studies, and Phg-1, Phg-2, Phg-4, and Phg-5, among the 5 loci of greatest effects reported in the literature, were detected in the CDP. The AND 277 cultivar contained both the Phg-1 and the Phg-5 QTL, which is reported for the first time in the descendant cultivar CAL143 as ALS10.1UC. The novel QTL named ALS11.1AM was located at the beginning of chromosome Pv11. Gene annotation revealed several putative resistance genes involved in the ALS response at the three PGSs, and with the markers and loci identified, new specific molecular markers can be developed, representing a powerful tool for common bean crop improvement and for gain in ALS resistance.
Collapse
Affiliation(s)
| | | | | | | | - Isabella Laporte Santos
- Centro de Pesquisa em Recursos Genéticos Vegetais, Instituto Agronômico - IAC, Campinas, Brazil
| | | | | | - Cristiane Hayumi Taniguti
- Departamento de Genética, Escola Superior de Agricultura “Luiz de Queiroz”, Universidade de São Paulo, Piracicaba, Brazil
| | - Gabriel de Siqueira Gesteira
- Departamento de Genética, Escola Superior de Agricultura “Luiz de Queiroz”, Universidade de São Paulo, Piracicaba, Brazil
| | - Antônio Augusto Franco Garcia
- Departamento de Genética, Escola Superior de Agricultura “Luiz de Queiroz”, Universidade de São Paulo, Piracicaba, Brazil
| | - Qijian Song
- USDA-ARS, Soybean Genomics and Improvement Lab, Beltsville, MD, United States
| | | | | | | |
Collapse
|
3
|
Multiple QTL Mapping in Autopolyploids: A Random-Effect Model Approach with Application in a Hexaploid Sweetpotato Full-Sib Population. Genetics 2020; 215:579-595. [PMID: 32371382 PMCID: PMC7337090 DOI: 10.1534/genetics.120.303080] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 04/26/2020] [Indexed: 11/18/2022] Open
Abstract
In developing countries, the sweetpotato, Ipomoea batatas (L.) Lam. [Formula: see text], is an important autopolyploid species, both socially and economically. However, quantitative trait loci (QTL) mapping has remained limited due to its genetic complexity. Current fixed-effect models can fit only a single QTL and are generally hard to interpret. Here, we report the use of a random-effect model approach to map multiple QTL based on score statistics in a sweetpotato biparental population ('Beauregard' × 'Tanzania') with 315 full-sibs. Phenotypic data were collected for eight yield component traits in six environments in Peru, and jointly adjusted means were obtained using mixed-effect models. An integrated linkage map consisting of 30,684 markers distributed along 15 linkage groups (LGs) was used to obtain the genotype conditional probabilities of putative QTL at every centiMorgan position. Multiple interval mapping was performed using our R package QTLpoly and detected a total of 13 QTL, ranging from none to four QTL per trait, which explained up to 55% of the total variance. Some regions, such as those on LGs 3 and 15, were consistently detected among root number and yield traits, and provided a basis for candidate gene search. In addition, some QTL were found to affect commercial and noncommercial root traits distinctly. Further best linear unbiased predictions were decomposed into additive allele effects and were used to compute multiple QTL-based breeding values for selection. Together with quantitative genotyping and its appropriate usage in linkage analyses, this QTL mapping methodology will facilitate the use of genomic tools in sweetpotato breeding as well as in other autopolyploids.
Collapse
|
4
|
Mollinari M, Olukolu BA, Pereira GDS, Khan A, Gemenet D, Yencho GC, Zeng ZB. Unraveling the Hexaploid Sweetpotato Inheritance Using Ultra-Dense Multilocus Mapping. G3 (BETHESDA, MD.) 2020; 10:281-292. [PMID: 31732504 PMCID: PMC6945028 DOI: 10.1534/g3.119.400620] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 11/12/2019] [Indexed: 12/18/2022]
Abstract
The hexaploid sweetpotato (Ipomoea batatas (L.) Lam., 2n = 6x = 90) is an important staple food crop worldwide and plays a vital role in alleviating famine in developing countries. Due to its high ploidy level, genetic studies in sweetpotato lag behind major diploid crops significantly. We built an ultra-dense multilocus integrated genetic map and characterized the inheritance system in a sweetpotato full-sib family using our newly developed software, MAPpoly. The resulting genetic map revealed 96.5% collinearity between I. batatas and its diploid relative I. trifida We computed the genotypic probabilities across the whole genome for all individuals in the mapping population and inferred their complete hexaploid haplotypes. We provide evidence that most of the meiotic configurations (73.3%) were resolved in bivalents, although a small portion of multivalent signatures (15.7%), among other inconclusive configurations (11.0%), were also observed. Except for low levels of preferential pairing in linkage group 2, we observed a hexasomic inheritance mechanism in all linkage groups. We propose that the hexasomic-bivalent inheritance promotes stability to the allelic transmission in sweetpotato.
Collapse
Affiliation(s)
- Marcelo Mollinari
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina,
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Bode A Olukolu
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Tennessee
| | - Guilherme da S Pereira
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Awais Khan
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, New York, and
| | - Dorcus Gemenet
- International Potato Center, ILRI Campus, Nairobi, Kenya
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Zhao-Bang Zeng
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| |
Collapse
|
5
|
Mollinari M, Olukolu BA, Pereira GDS, Khan A, Gemenet D, Yencho GC, Zeng ZB. Unraveling the Hexaploid Sweetpotato Inheritance Using Ultra-Dense Multilocus Mapping. G3 (BETHESDA, MD.) 2020. [PMID: 31732504 DOI: 10.25387/g3.10255844] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
The hexaploid sweetpotato (Ipomoea batatas (L.) Lam., 2n = 6x = 90) is an important staple food crop worldwide and plays a vital role in alleviating famine in developing countries. Due to its high ploidy level, genetic studies in sweetpotato lag behind major diploid crops significantly. We built an ultra-dense multilocus integrated genetic map and characterized the inheritance system in a sweetpotato full-sib family using our newly developed software, MAPpoly. The resulting genetic map revealed 96.5% collinearity between I. batatas and its diploid relative I. trifida We computed the genotypic probabilities across the whole genome for all individuals in the mapping population and inferred their complete hexaploid haplotypes. We provide evidence that most of the meiotic configurations (73.3%) were resolved in bivalents, although a small portion of multivalent signatures (15.7%), among other inconclusive configurations (11.0%), were also observed. Except for low levels of preferential pairing in linkage group 2, we observed a hexasomic inheritance mechanism in all linkage groups. We propose that the hexasomic-bivalent inheritance promotes stability to the allelic transmission in sweetpotato.
Collapse
Affiliation(s)
- Marcelo Mollinari
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina,
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Bode A Olukolu
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Tennessee
| | - Guilherme da S Pereira
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Awais Khan
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, New York, and
| | - Dorcus Gemenet
- International Potato Center, ILRI Campus, Nairobi, Kenya
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Zhao-Bang Zeng
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| |
Collapse
|
6
|
Genome-Wide Composite Interval Mapping (GCIM) of Expressional Quantitative Trait Loci in Backcross Population. Methods Mol Biol 2020; 2082:63-71. [PMID: 31849008 DOI: 10.1007/978-1-0716-0026-9_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
One of the most remarkable findings in expressional quantitative trait locus (eQTL) mapping is that trans (distal) eQTL has small effect. The widely used approaches have a low power in the detection of small-effect eQTL. To overcome this issue, we integrate polygenic background control with multi-locus genetic model to develop genome-wide composite interval mapping (GCIM). This chapter covers the GCIM procedure in a backcross or doubled haploid populations. We describe the genetic model, parameter estimation, multi-locus genetic model, hypothesis tests, and software. Finally, some issues related to the GCIM method are discussed.
Collapse
|
7
|
Linkage Analysis and Haplotype Phasing in Experimental Autopolyploid Populations with High Ploidy Level Using Hidden Markov Models. G3-GENES GENOMES GENETICS 2019; 9:3297-3314. [PMID: 31405891 PMCID: PMC6778803 DOI: 10.1534/g3.119.400378] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Modern SNP genotyping technologies allow measurement of the relative abundance of different alleles for a given locus and consequently estimation of their allele dosage, opening a new road for genetic studies in autopolyploids. Despite advances in genetic linkage analysis in autotetraploids, there is a lack of statistical models to perform linkage analysis in organisms with higher ploidy levels. In this paper, we present a statistical method to estimate recombination fractions and infer linkage phases in full-sib populations of autopolyploid species with even ploidy levels for a set of SNP markers using hidden Markov models. Our method uses efficient two-point procedures to reduce the search space for the best linkage phase configuration and reestimate the final parameters by maximizing the likelihood of the Markov chain. To evaluate the method, and demonstrate its properties, we rely on simulations of autotetraploid, autohexaploid and autooctaploid populations and on a real tetraploid potato data set. The results show the reliability of our approach, including situations with complex linkage phase scenarios in hexaploid and octaploid populations.
Collapse
|
8
|
Mollinari M, Garcia AAF. Linkage Analysis and Haplotype Phasing in Experimental Autopolyploid Populations with High Ploidy Level Using Hidden Markov Models. G3 (BETHESDA, MD.) 2019. [PMID: 31405891 DOI: 10.1101/415232v2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 04/14/2023]
Abstract
Modern SNP genotyping technologies allow measurement of the relative abundance of different alleles for a given locus and consequently estimation of their allele dosage, opening a new road for genetic studies in autopolyploids. Despite advances in genetic linkage analysis in autotetraploids, there is a lack of statistical models to perform linkage analysis in organisms with higher ploidy levels. In this paper, we present a statistical method to estimate recombination fractions and infer linkage phases in full-sib populations of autopolyploid species with even ploidy levels for a set of SNP markers using hidden Markov models. Our method uses efficient two-point procedures to reduce the search space for the best linkage phase configuration and reestimate the final parameters by maximizing the likelihood of the Markov chain. To evaluate the method, and demonstrate its properties, we rely on simulations of autotetraploid, autohexaploid and autooctaploid populations and on a real tetraploid potato data set. The results show the reliability of our approach, including situations with complex linkage phase scenarios in hexaploid and octaploid populations.
Collapse
Affiliation(s)
- Marcelo Mollinari
- Department of Horticultural Science, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, and
| | | |
Collapse
|
9
|
Zhao J, Han D, Shi K, Wang L, Gao J, Yang R. Influence of epistatic segregation distortion loci on genetic marker linkages in Japanese flounder. Genomics 2017; 110:59-66. [PMID: 28830780 DOI: 10.1016/j.ygeno.2017.08.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Revised: 08/11/2017] [Accepted: 08/18/2017] [Indexed: 11/28/2022]
Abstract
For genetic linkage analysis of Japanese flounder, 160 doubled haploids (DH) were artificially produced using mitotic gynogenesis and were genotyped for 458 simple sequence repeat (SSR) markers, 101 of which show distortional segregation. The genetic linkage map was constructed by modifying recombination fractions between the distorted markers. Between the corrected and uncorrected genetic maps, there were considerable differences in genetic distance, but not in relative locations among markers. Using a liability model, a segregation distortion locus (SDL), with an additive genetic effect of 1.772, was mapped between markers BDHYP387 and Poli56TUF of chromosome 24 in the corrected genetic map. Additionally, six pairs of epistatic SDLs were identified on chromosomes 1, 5, 8, 9, 23, and 24. Changes in genetic distances between markers did not occur on chromosome regions with main effect SDLs. However, most chromosome regions where genetic distances changed covered the detected epistatic SDLs. This study concluded that epistatic SDLs decrease linkages between markers and lengthen genetic distances in Japanese flounder. This finding has been partially validated in other DH populations derived from three female Japanese flounders.
Collapse
Affiliation(s)
- Jingli Zhao
- Key Laboratory of Aquatic Genomics, Ministry of Agriculture, Research Centre for Aquatic Biotechnology, Chinese Academy of Fishery Sciences, Beijing 100141, China
| | - Dandan Han
- Department of Biological Science and Technology, Heilongjiang Vocational College for Nationalities, Harbin 150066, China
| | - Kuntao Shi
- Division of Comprehensive Aquaculture, Shandong Weihai Institute of Ocean and Aquaculture, Weihai 264200, China
| | - Li Wang
- Division of Comprehensive Aquaculture, Shandong Weihai Institute of Ocean and Aquaculture, Weihai 264200, China
| | - Jin Gao
- Key Laboratory of Aquatic Genomics, Ministry of Agriculture, Research Centre for Aquatic Biotechnology, Chinese Academy of Fishery Sciences, Beijing 100141, China
| | - Runqing Yang
- Key Laboratory of Aquatic Genomics, Ministry of Agriculture, Research Centre for Aquatic Biotechnology, Chinese Academy of Fishery Sciences, Beijing 100141, China.
| |
Collapse
|
10
|
Ohyama A, Shirasawa K, Matsunaga H, Negoro S, Miyatake K, Yamaguchi H, Nunome T, Iwata H, Fukuoka H, Hayashi T. Bayesian QTL mapping using genome-wide SSR markers and segregating population derived from a cross of two commercial F 1 hybrids of tomato. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:1601-1616. [PMID: 28477044 DOI: 10.1007/s00122-017-2913-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2016] [Accepted: 04/27/2017] [Indexed: 06/07/2023]
Abstract
Using newly developed euchromatin-derived genomic SSR markers and a flexible Bayesian mapping method, 13 significant agricultural QTLs were identified in a segregating population derived from a four-way cross of tomato. So far, many QTL mapping studies in tomato have been performed for progeny obtained from crosses between two genetically distant parents, e.g., domesticated tomatoes and wild relatives. However, QTL information of quantitative traits related to yield (e.g., flower or fruit number, and total or average weight of fruits) in such intercross populations would be of limited use for breeding commercial tomato cultivars because individuals in the populations have specific genetic backgrounds underlying extremely different phenotypes between the parents such as large fruit in domesticated tomatoes and small fruit in wild relatives, which may not be reflective of the genetic variation in tomato breeding populations. In this study, we constructed F2 population derived from a cross between two commercial F1 cultivars in tomato to extract QTL information practical for tomato breeding. This cross corresponded to a four-way cross, because the four parental lines of the two F1 cultivars were considered to be the founders. We developed 2510 new expressed sequence tag (EST)-based (euchromatin-derived) genomic SSR markers and selected 262 markers from these new SSR markers and publicly available SSR markers to construct a linkage map. QTL analysis for ten agricultural traits of tomato was performed based on the phenotypes and marker genotypes of F2 plants using a flexible Bayesian method. As results, 13 QTL regions were detected for six traits by the Bayesian method developed in this study.
Collapse
Affiliation(s)
- Akio Ohyama
- National Agriculture and Food Research Organization (NARO), Institute of Vegetable and Floriculture Science (NIVFS), 3-1-1 Kannondai, Tsukuba, Ibaraki, 305-8519, Japan.
- NARO, NIVFS, 360 Kusawa, Ano, Tsu, Mie, 514-2392, Japan.
| | - Kenta Shirasawa
- Kazusa DNA Research Institute, 2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan
| | | | - Satomi Negoro
- NARO, NIVFS, 360 Kusawa, Ano, Tsu, Mie, 514-2392, Japan
| | - Koji Miyatake
- NARO, NIVFS, 360 Kusawa, Ano, Tsu, Mie, 514-2392, Japan
| | | | | | - Hiroyoshi Iwata
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo, Tokyo, 113-8657, Japan
| | - Hiroyuki Fukuoka
- NARO, Institute of Vegetable and Tea Science (NIVTS), 360 Kusawa, Ano, Tsu, Mie, 514-2392, Japan
- Takii & Company, Limited, 1360 Hari, Konan, Shiga, 520-3231, Japan
| | - Takeshi Hayashi
- NARO, Institute of Crop Science (NICS), 2-1-2 Kannondai, Tsukuba, Ibaraki, 305-8518, Japan.
| |
Collapse
|
11
|
Wang H, van Eeuwijk FA, Jansen J. The potential of probabilistic graphical models in linkage map construction. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:433-444. [PMID: 27921120 PMCID: PMC5263214 DOI: 10.1007/s00122-016-2824-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Accepted: 11/03/2016] [Indexed: 06/06/2023]
Abstract
Probabilistic graphical models show great potential for robust and reliable construction of linkage maps. We show how to use probabilistic graphical models to construct high-quality linkage maps in the face of data perturbations caused by genotyping errors and reciprocal translocations. It has been shown that linkage map construction can be hampered by the presence of genotyping errors and chromosomal rearrangements such as inversions and translocations. Here, we report a novel method for linkage map construction using probabilistic graphical models. The method is proven, both theoretically and practically, to be effective in filtering out markers that contain genotyping errors. In particular, it carries out marker filtering and ordering simultaneously, and is therefore superior to the standard post hoc filtering using nearest-neighbour stress. Furthermore, we demonstrate empirically that the proposed method offers a promising solution to linkage map construction in the case of a reciprocal translocation.
Collapse
Affiliation(s)
- Huange Wang
- Biometris, Wageningen University and Research Centre, P.O. Box 16, 6700 AA, Wageningen, The Netherlands.
| | - Fred A van Eeuwijk
- Biometris, Wageningen University and Research Centre, P.O. Box 16, 6700 AA, Wageningen, The Netherlands
| | - Johannes Jansen
- Biometris, Wageningen University and Research Centre, P.O. Box 16, 6700 AA, Wageningen, The Netherlands
| |
Collapse
|
12
|
Balsalobre TWA, da Silva Pereira G, Margarido GRA, Gazaffi R, Barreto FZ, Anoni CO, Cardoso-Silva CB, Costa EA, Mancini MC, Hoffmann HP, de Souza AP, Garcia AAF, Carneiro MS. GBS-based single dosage markers for linkage and QTL mapping allow gene mining for yield-related traits in sugarcane. BMC Genomics 2017; 18:72. [PMID: 28077090 PMCID: PMC5225503 DOI: 10.1186/s12864-016-3383-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 12/07/2016] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Sugarcane (Saccharum spp.) is predominantly an autopolyploid plant with a variable ploidy level, frequent aneuploidy and a large genome that hampers investigation of its organization. Genetic architecture studies are important for identifying genomic regions associated with traits of interest. However, due to the genetic complexity of sugarcane, the practical applications of genomic tools have been notably delayed in this crop, in contrast to other crops that have already advanced to marker-assisted selection (MAS) and genomic selection. High-throughput next-generation sequencing (NGS) technologies have opened new opportunities for discovering molecular markers, especially single nucleotide polymorphisms (SNPs) and insertion-deletion (indels), at the genome-wide level. The objectives of this study were to (i) establish a pipeline for identifying variants from genotyping-by-sequencing (GBS) data in sugarcane, (ii) construct an integrated genetic map with GBS-based markers plus target region amplification polymorphisms and microsatellites, (iii) detect QTLs related to yield component traits, and (iv) perform annotation of the sequences that originated the associated markers with mapped QTLs to search putative candidate genes. RESULTS We used four pseudo-references to align the GBS reads. Depending on the reference, from 3,433 to 15,906 high-quality markers were discovered, and half of them segregated as single-dose markers (SDMs) on average. In addition to 7,049 non-redundant SDMs from GBS, 629 gel-based markers were used in a subsequent linkage analysis. Of 7,678 SDMs, 993 were mapped. These markers were distributed throughout 223 linkage groups, which were clustered in 18 homo(eo)logous groups (HGs), with a cumulative map length of 3,682.04 cM and an average marker density of 3.70 cM. We performed QTL mapping of four traits and found seven QTLs. Our results suggest the presence of a stable QTL across locations. Furthermore, QTLs to soluble solid content (BRIX) and fiber content (FIB) traits had markers linked to putative candidate genes. CONCLUSIONS This study is the first to report the use of GBS for large-scale variant discovery and genotyping of a mapping population in sugarcane, providing several insights regarding the use of NGS data in a polyploid, non-model species. The use of GBS generated a large number of markers and still enabled ploidy and allelic dosage estimation. Moreover, we were able to identify seven QTLs, two of which had great potential for validation and future use for molecular breeding in sugarcane.
Collapse
Affiliation(s)
- Thiago Willian Almeida Balsalobre
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Guilherme da Silva Pereira
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Gabriel Rodrigues Alves Margarido
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Rodrigo Gazaffi
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| | - Fernanda Zatti Barreto
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| | - Carina Oliveira Anoni
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Cláudio Benício Cardoso-Silva
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Estela Araújo Costa
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Melina Cristina Mancini
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Hermann Paulo Hoffmann
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| | - Anete Pereira de Souza
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Avenida Monteiro Lobato 255, Campinas, CEP 13083-862 São Paulo Brazil
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas, Avenida Candido Rondon 400, Campinas, CEP 13083-875 São Paulo Brazil
| | - Antonio Augusto Franco Garcia
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias 11, Piracicaba, CEP 13418-900 São Paulo Brazil
| | - Monalisa Sampaio Carneiro
- Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Anhanguera, Km 174, Araras, CEP 13600-970 São Paulo Brazil
| |
Collapse
|
13
|
Wang SB, Wen YJ, Ren WL, Ni YL, Zhang J, Feng JY, Zhang YM. Mapping small-effect and linked quantitative trait loci for complex traits in backcross or DH populations via a multi-locus GWAS methodology. Sci Rep 2016; 6:29951. [PMID: 27435756 PMCID: PMC4951730 DOI: 10.1038/srep29951] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 06/24/2016] [Indexed: 11/09/2022] Open
Abstract
Composite interval mapping (CIM) is the most widely-used method in linkage analysis. Its main feature is the ability to control genomic background effects via inclusion of co-factors in its genetic model. However, the result often depends on how the co-factors are selected, especially for small-effect and linked quantitative trait loci (QTL). To address this issue, here we proposed a new method under the framework of genome-wide association studies (GWAS). First, a single-locus random-SNP-effect mixed linear model method for GWAS was used to scan each putative QTL on the genome in backcross or doubled haploid populations. Here, controlling background via selecting markers in the CIM was replaced by estimating polygenic variance. Then, all the peaks in the negative logarithm P-value curve were selected as the positions of multiple putative QTL to be included in a multi-locus genetic model, and true QTL were automatically identified by empirical Bayes. This called genome-wide CIM (GCIM). A series of simulated and real datasets was used to validate the new method. As a result, the new method had higher power in QTL detection, greater accuracy in QTL effect estimation, and stronger robustness under various backgrounds as compared with the CIM and empirical Bayes methods.
Collapse
Affiliation(s)
- Shi-Bo Wang
- Statistical Genomics Lab, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China.,State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Yang-Jun Wen
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Wen-Long Ren
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Yuan-Li Ni
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Jin Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Jian-Ying Feng
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China
| | - Yuan-Ming Zhang
- Statistical Genomics Lab, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
14
|
Zhu X, Zhong S, Chao S, Gu YQ, Kianian SF, Elias E, Cai X. Toward a better understanding of the genomic region harboring Fusarium head blight resistance QTL Qfhs.ndsu-3AS in durum wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2016; 129:31-43. [PMID: 26385373 DOI: 10.1007/s00122-015-2606-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 09/07/2015] [Indexed: 05/08/2023]
Abstract
New molecular markers were developed and mapped to the FHB resistance QTL region in high resolution. Micro-collinearity of the QTL region with rice and Brachypodium was revealed for a better understanding of the genomic region. The wild emmer wheat (Triticum dicoccoides)-derived Fusarium head blight (FHB) resistance quantitative trait locus (QTL) Qfhs.ndsu-3AS previously mapped to the short arm of chromosome 3A (3AS) in a population of recombinant inbred chromosome lines (RICLs). This study aimed to attain a better understanding of the genomic region harboring Qfhs.ndsu-3AS and to improve the utility of the QTL in wheat breeding. Micro-collinearity of the QTL region with rice chromosome 1 and Brachypodium chromosome 2 was identified and used for marker development in saturation mapping. A total of 42 new EST-derived sequence tagged site (STS) and simple sequence repeat (SSR) markers were developed and mapped to the QTL and nearby regions on 3AS. Further comparative analysis revealed a complex collinearity of the 3AS genomic region with their collinear counterparts of rice and Brachypodium. Fine mapping of the QTL region resolved five co-segregating markers (Xwgc1186/Xwgc716/Xwgc1143/Xwgc501/Xwgc1204) into three distinct loci proximal to Xgwm2, a marker previously reported to be closely linked to the QTL. Four other markers (Xwgc1226, Xwgc510, Xwgc1296, and Xwgc1301) mapped farther proximal to the above markers in the QTL region with a higher resolution. Five homozygous recombinants with shortened T. dicoccoides chromosomal segments in the QTL region were recovered by molecular marker analysis and evaluated for FHB resistance. Qfhs.ndsu-3AS was positioned to a 5.2 cM interval flanked by the marker Xwgc501 and Xwgc510. The recombinants containing Qfhs.ndsu-3AS and new markers defining the QTL will facilitate utilization of this resistance source in wheat breeding.
Collapse
Affiliation(s)
- Xianwen Zhu
- Departments of Plant Sciences, North Dakota State University, Fargo, ND, 58108, USA
| | - Shaobin Zhong
- Departments of Plant Pathology, North Dakota State University, Fargo, ND, 58108, USA
| | - Shiaoman Chao
- The Red River Valley Agricultural Research Center, USDA-ARS, Fargo, ND, 58102, USA
| | - Yong Qiang Gu
- The Western Regional Research Center, USDA-ARS, Albany, CA, 94710, USA
| | - Shahryar F Kianian
- The Cereal Disease Laboratory, USDA-ARS, 1551 Lindig Street, St. Paul, MN, 55108, USA
| | - Elias Elias
- Departments of Plant Sciences, North Dakota State University, Fargo, ND, 58108, USA
| | - Xiwen Cai
- Departments of Plant Sciences, North Dakota State University, Fargo, ND, 58108, USA.
| |
Collapse
|
15
|
Cui Y, Zhang F, Xu J, Li Z, Xu S. Mapping quantitative trait loci in selected breeding populations: A segregation distortion approach. Heredity (Edinb) 2015; 115:538-46. [PMID: 26126541 DOI: 10.1038/hdy.2015.56] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Revised: 05/07/2015] [Accepted: 05/26/2015] [Indexed: 11/09/2022] Open
Abstract
Quantitative trait locus (QTL) mapping is often conducted in line-crossing experiments where a sample of individuals is randomly selected from a pool of all potential progeny. QTLs detected from such an experiment are important for us to understand the genetic mechanisms governing a complex trait, but may not be directly relevant to plant breeding if they are not detected from the breeding population where selection is targeting for. QTLs segregating in one population may not necessarily segregate in another population. To facilitate marker-assisted selection, QTLs must be detected from the very population which the selection is targeting. However, selected breeding populations often have depleted genetic variation with small population sizes, resulting in low power in detecting useful QTLs. On the other hand, if selection is effective, loci controlling the selected trait will deviate from the expected Mendelian segregation ratio. In this study, we proposed to detect QTLs in selected breeding populations via the detection of marker segregation distortion in either a single population or multiple populations using the same selection scheme. Simulation studies showed that QTL can be detected in strong selected populations with selected population sizes as small as 25 plants. We applied the new method to detect QTLs in two breeding populations of rice selected for high grain yield. Seven QTLs were identified, four of which have been validated in advanced generations in a follow-up study. Cloned genes in the vicinity of the four QTLs were also reported in the literatures. This mapping-by-selection approach provides a new avenue for breeders to improve breeding progress. The new method can be applied to breeding programs not only in rice but also in other agricultural species including crops, trees and animals.
Collapse
Affiliation(s)
- Y Cui
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, China.,Department of Botany and Plant Sciences, University of California, Riverside, CA, USA
| | - F Zhang
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, China
| | - J Xu
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, China.,Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Z Li
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, China
| | - S Xu
- Department of Botany and Plant Sciences, University of California, Riverside, CA, USA
| |
Collapse
|
16
|
El-Soda M, Kruijer W, Malosetti M, Koornneef M, Aarts MGM. Quantitative trait loci and candidate genes underlying genotype by environment interaction in the response of Arabidopsis thaliana to drought. PLANT, CELL & ENVIRONMENT 2015; 38:585-99. [PMID: 25074022 DOI: 10.1111/pce.12418] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Revised: 07/07/2014] [Accepted: 07/14/2014] [Indexed: 05/21/2023]
Abstract
Drought stress was imposed on two sets of Arabidopsis thaliana genotypes grown in sand under short-day conditions and analysed for several shoot and root growth traits. The response to drought was assessed for quantitative trait locus (QTL) mapping in a genetically diverse set of Arabidopsis accessions using genome-wide association (GWA) mapping, and conventional linkage analysis of a recombinant inbred line (RIL) population. Results showed significant genotype by environment interaction (G×E) for all traits in response to different watering regimes. For the RIL population, the observed G×E was reflected in 17 QTL by environment interactions (Q×E), while 17 additional QTLs were mapped not showing Q×E. GWA mapping identified 58 single nucleotide polymorphism (SNPs) associated with loci displaying Q×E and an additional 16 SNPs associated with loci not showing Q×E. Many candidate genes potentially underlying these loci were suggested. The genes for RPS3C and YLS7 were found to contain conserved amino acid differences when comparing Arabidopsis accessions with strongly contrasting drought response phenotypes, further supporting their candidacy. One of these candidate genes co-located with a QTL mapped in the RIL population.
Collapse
Affiliation(s)
- Mohamed El-Soda
- Laboratory of Genetics, Wageningen University, Wageningen, 6708PB, The Netherlands
| | | | | | | | | |
Collapse
|
17
|
Wang H, Paulo J, Kruijer W, Boer M, Jansen H, Tikunov Y, Usadel B, van Heusden S, Bovy A, van Eeuwijk F. Genotype–phenotype modeling considering intermediate level of biological variation: a case study involving sensory traits, metabolites and QTLs in ripe tomatoes. MOLECULAR BIOSYSTEMS 2015; 11:3101-10. [DOI: 10.1039/c5mb00477b] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We integrate Gaussian graphical modelling and causal inference to infer dependency networks from multilevel phenotypic and omics data.
Collapse
Affiliation(s)
- Huange Wang
- Biometris
- Wageningen University and Research Centre
- 6700AA Wageningen
- The Netherlands
| | - Joao Paulo
- Biometris
- Wageningen University and Research Centre
- 6700AA Wageningen
- The Netherlands
| | - Willem Kruijer
- Biometris
- Wageningen University and Research Centre
- 6700AA Wageningen
- The Netherlands
| | - Martin Boer
- Biometris
- Wageningen University and Research Centre
- 6700AA Wageningen
- The Netherlands
| | - Hans Jansen
- Biometris
- Wageningen University and Research Centre
- 6700AA Wageningen
- The Netherlands
| | - Yury Tikunov
- Plant Research International
- 6700AJ Wageningen
- The Netherlands
| | - Björn Usadel
- Institute for Biology I
- RWTH Aachen University
- 52074 Aachen
- Germany
| | | | - Arnaud Bovy
- Plant Research International
- 6700AJ Wageningen
- The Netherlands
| | - Fred van Eeuwijk
- Biometris
- Wageningen University and Research Centre
- 6700AA Wageningen
- The Netherlands
| |
Collapse
|
18
|
Goldstein P, Korol AB, Reiner-Benaim A. Two-stage genome-wide search for epistasis with implementation to Recombinant Inbred Lines (RIL) populations. PLoS One 2014; 9:e115680. [PMID: 25536193 PMCID: PMC4275240 DOI: 10.1371/journal.pone.0115680] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 11/07/2014] [Indexed: 11/18/2022] Open
Abstract
OBJECTIVE AND METHODS This paper proposes an inegrative two-stage genome-wide search for pairwise epistasis on expression quantitative trait loci (eQTL). The traits are clustered into multi-trait complexes that account for correlations between them that may result from common epistasis effects. The search is done by first screening for epistatic regions and then using dense markers within the identified regions, resulting in substantial reduction in the number of tests for epistasis. The FDR is controlled using a hierarchical procedure that accounts for the search structure. Each combination of trait and marker-pair is tested using a model that accounts for both statistical and functional interpretations of epistasis and considers orthogonal effects, such that their contributions to heritability can be estimated individually. We examine the impact of using multi-trait complexes rather than single traits, and of using a hierarchical search for epistasis rather than skipping the initial screen for epistatic regions. We apply the proposed algorithm on Arabidopsis transcription data. PRINCIPAL FINDINGS Both epistasis detection power and heritability contributed by epistasis increased when using multi-trait complexes rather than single traits. Epistatic effects common to the eQTLs included in the complexes have higher chance of being identified by analysis of multi-trait complexes, particularly when epistatic effects on individual traits are small. Compared to direct testing for all potential epistatic effects, the hierarchical search was substantially more powerful in detecting epistasis, while controlling the FDR at the desired level. Association in functional roles within genomic regions was observed, supporting an initial screen for epistatic QTLs.
Collapse
Affiliation(s)
- Pavel Goldstein
- Department of Statistics, University of Haifa, Haifa, 3498838, Israel
| | - Abraham B. Korol
- Department of Evolutionary and Environmental Biology and Institute of Evolution, University of Haifa, Haifa, 3498838, Israel
| | - Anat Reiner-Benaim
- Department of Statistics, University of Haifa, Haifa, 3498838, Israel
- * E-mail:
| |
Collapse
|
19
|
Laurie C, Wang S, Carlini-Garcia LA, Zeng ZB. Mapping epistatic quantitative trait loci. BMC Genet 2014; 15:112. [PMID: 25367219 PMCID: PMC4226885 DOI: 10.1186/s12863-014-0112-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2014] [Accepted: 10/09/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND How to map quantitative trait loci (QTL) with epistasis efficiently and reliably has been a persistent problem for QTL mapping analysis. There are a number of difficulties for studying epistatic QTL. Linkage can impose a significant challenge for finding epistatic QTL reliably. If multiple QTL are in linkage and have interactions, searching for QTL can become a very delicate issue. A commonly used strategy that performs a two-dimensional genome scan to search for a pair of QTL with epistasis can suffer from low statistical power and also may lead to false identification due to complex linkage disequilibrium and interaction patterns. RESULTS To tackle the problem of complex interaction of multiple QTL with linkage, we developed a three-stage search strategy. In the first stage, main effect QTL are searched and mapped. In the second stage, epistatic QTL that interact significantly with other identified QTL are searched. In the third stage, new epistatic QTL are searched in pairs. This strategy is based on the consideration that most genetic variance is due to the main effects of QTL. Thus by first mapping those main-effect QTL, the statistical power for the second and third stages of analysis for mapping epistatic QTL can be maximized. The search for main effect QTL is robust and does not bias the search for epistatic QTL due to a genetic property associated with the orthogonal genetic model that the additive and additive by additive variances are independent despite of linkage. The model search criterion is empirically and dynamically evaluated by using a score-statistic based resampling procedure. We demonstrate through simulations that the method has good power and low false positive in the identification of QTL and epistasis. CONCLUSION This method provides an effective and powerful solution to map multiple QTL with complex epistatic pattern. The method has been implemented in the user-friendly computer software Windows QTL Cartographer. This will greatly facilitate the application of the method for QTL mapping data analysis.
Collapse
Affiliation(s)
- Cecelia Laurie
- Department of Mathematics, University of Alabama, Tuscaloosa AL, USA. .,Department of Biostatistics, University of Washington, Seattle WA, USA.
| | - Shengchu Wang
- Bioinformatics Research Center, Department of Statistics, North Carolina State University, Raleigh NC, 27695-7566, USA.
| | - Luciana Aparecida Carlini-Garcia
- Instituto Agronômico de Campinas, Centro de Grãos e Fibras, Campinas SP, Brazil. .,APTA Regional, Pólo Centro Sul, Piracicaba SP, Brazil.
| | - Zhao-Bang Zeng
- Bioinformatics Research Center, Department of Statistics, North Carolina State University, Raleigh NC, 27695-7566, USA. .,Department of Biological Sciences, North Carolina State University, Raleigh NC, USA.
| |
Collapse
|
20
|
Wu K, Liu H, Yang M, Tao Y, Ma H, Wu W, Zuo Y, Zhao Y. High-density genetic map construction and QTLs analysis of grain yield-related traits in sesame (Sesamum indicum L.) based on RAD-Seq techonology. BMC PLANT BIOLOGY 2014; 14:274. [PMID: 25300176 PMCID: PMC4200128 DOI: 10.1186/s12870-014-0274-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 10/03/2014] [Indexed: 05/20/2023]
Abstract
BACKGROUND Sesame (Sesamum indicum L., 2n = 26) is an important oilseed crop with an estimated genome size of 369 Mb. The genetic basis, including the number and locations of quantitative trait loci (QTLs) of sesame grain yield and quality remain poorly understood, due in part to the lack of reliable markers and genetic maps. Here we report on the construction of a hitherto most high-density genetic map of sesame using the restriction-site associated DNA sequencing (RAD-seq) combined with 89 PCR markers, and the identification of grain yield-related QTLs using a recombinant inbred line (RIL) population. RESULT In total, 3,769 single-nucleotide polymorphism (SNP) markers were identified from RAD-seq, and 89 polymorphic PCR markers were identified including 44 expressed sequence tag-simple sequence repeats (EST-SSRs), 10 genomic-SSRs and 35 Insertion-Deletion markers (InDels). The final map included 1,230 markers distributed on 14 linkage groups (LGs) and was 844.46 cM in length with an average of 0.69 cM between adjacent markers. Using this map and RIL population, we detected 13 QTLs on 7 LGs and 17 QTLs on 10 LGs for seven grain yield-related traits by the multiple interval mapping (MIM) and the mixed linear composite interval mapping (MCIM), respectively. Three major QTLs had been identified using MIM with R2 > 10.0% or MCIM with ha 2 > 5.0%. Two co-localized QTL groups were identified that partially explained the correlations among five yield-related traits. CONCLUSION Three thousand eight hundred and four pairs of new DNA markers including SNPs and InDels were developed by RAD-seq, and a so far most high-density genetic map was constructed based on these markers in combination with SSR markers. Several grain yield-related QTLs had been identified using this population and genetic map. We report here the first QTL mapping of yield-related traits with a high-density genetic map using a RIL population in sesame. Results of this study solidified the basis for studying important agricultural traits and implementing marker-assisted selection (MAS) toward genetic improvement in sesame.
Collapse
Affiliation(s)
- Kun Wu
- />Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Sesame Genetic Improvement Laboratory, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences (OCRI-CAAS), Wuhan, Hubei 430062 China
| | - Hongyan Liu
- />Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Sesame Genetic Improvement Laboratory, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences (OCRI-CAAS), Wuhan, Hubei 430062 China
| | - Minmin Yang
- />Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Sesame Genetic Improvement Laboratory, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences (OCRI-CAAS), Wuhan, Hubei 430062 China
| | - Ye Tao
- />Shanghai Major Biological Medicine Technology Co., Ltd., Shanghai, 201203 China
| | - Huihui Ma
- />Fuyang Academy of Agricultural Sciences, Fuyang, Anhui 236065 China
| | - Wenxiong Wu
- />Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Sesame Genetic Improvement Laboratory, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences (OCRI-CAAS), Wuhan, Hubei 430062 China
| | - Yang Zuo
- />Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Sesame Genetic Improvement Laboratory, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences (OCRI-CAAS), Wuhan, Hubei 430062 China
| | - Yingzhong Zhao
- />Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Sesame Genetic Improvement Laboratory, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences (OCRI-CAAS), Wuhan, Hubei 430062 China
| |
Collapse
|
21
|
Xie SQ, Feng JY, Zhang YM. Linkage group correction using epistatic distorted markers in F2 and backcross populations. Heredity (Edinb) 2014; 112:479-88. [PMID: 24595363 PMCID: PMC3998779 DOI: 10.1038/hdy.2013.127] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2013] [Revised: 09/29/2013] [Accepted: 10/28/2013] [Indexed: 01/09/2023] Open
Abstract
Epistasis has been frequently observed in all types of mapping populations. However, relatively little is known about the effect of epistatic distorted markers on linkage group construction. In this study, a new approach was proposed to correct the recombination fraction between epistatic distorted markers in backcross and F2 populations under the framework of fitness and liability models. The information for three or four markers flanking with an epistatic segregation distortion locus was used to estimate the recombination fraction by the maximum likelihood method, implemented via an expectation-maximisation algorithm. A set of Monte Carlo simulation experiments along with a real data analysis in rice was performed to validate the new method. The results showed that the estimates from the new method are unbiased. In addition, five statistical properties for the new method in a backcross were summarised and confirmed by theoretical, simulated and real data analyses.
Collapse
Affiliation(s)
- S-Q Xie
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement/Collaborative Innovation Center for Modern Crop Production, Department of Crop Genetics and Breeding, College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - J-Y Feng
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement/Collaborative Innovation Center for Modern Crop Production, Department of Crop Genetics and Breeding, College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - Y-M Zhang
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement/Collaborative Innovation Center for Modern Crop Production, Department of Crop Genetics and Breeding, College of Agriculture, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|
22
|
Demetrashvili N, Van den Heuvel ER, Wit EC. Probability genotype imputation method and integrated weighted lasso for QTL identification. BMC Genet 2013; 14:125. [PMID: 24378210 PMCID: PMC4126192 DOI: 10.1186/1471-2156-14-125] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 12/17/2013] [Indexed: 11/10/2022] Open
Abstract
Background Many QTL studies have two common features: (1) often there is missing marker information, (2) among many markers involved in the biological process only a few are causal. In statistics, the second issue falls under the headings “sparsity” and “causal inference”. The goal of this work is to develop a two-step statistical methodology for QTL mapping for markers with binary genotypes. The first step introduces a novel imputation method for missing genotypes. Outcomes of the proposed imputation method are probabilities which serve as weights to the second step, namely in weighted lasso. The sparse phenotype inference is employed to select a set of predictive markers for the trait of interest. Results Simulation studies validate the proposed methodology under a wide range of realistic settings. Furthermore, the methodology outperforms alternative imputation and variable selection methods in such studies. The methodology was applied to an Arabidopsis experiment, containing 69 markers for 165 recombinant inbred lines of a F8 generation. The results confirm previously identified regions, however several new markers are also found. On the basis of the inferred ROC behavior these markers show good potential for being real, especially for the germination trait Gmax. Conclusions Our imputation method shows higher accuracy in terms of sensitivity and specificity compared to alternative imputation method. Also, the proposed weighted lasso outperforms commonly practiced multiple regression as well as the traditional lasso and adaptive lasso with three weighting schemes. This means that under realistic missing data settings this methodology can be used for QTL identification.
Collapse
Affiliation(s)
- Nino Demetrashvili
- Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen, Groningen 9747 AG, The Netherlands.
| | | | | |
Collapse
|
23
|
Johnson HL, Hanson LM, Chen Y, Bieber AJ, Buono RJ, Ferraro TN, Pirko I, Johnson AJ. Quantitative trait loci analysis reveals candidate genes implicated in regulating functional deficit and CNS vascular permeability in CD8 T cell-initiated blood-brain barrier disruption. BMC Genomics 2013; 14:678. [PMID: 24090483 PMCID: PMC3850781 DOI: 10.1186/1471-2164-14-678] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2013] [Accepted: 09/30/2013] [Indexed: 01/27/2023] Open
Abstract
Background Blood–brain barrier (BBB) disruption is an integral feature of numerous neurological disorders. However, there is a relative lack of knowledge regarding the underlying molecular mechanisms of immune-mediated BBB disruption. We have previously shown that CD8 T cells and perforin play critical roles in initiating altered permeability of the BBB in the peptide-induced fatal syndrome (PIFS) model developed by our laboratory. Additionally, despite having indistinguishable CD8 T cell responses, C57BL/6J (B6) mice are highly susceptible to PIFS, exhibiting functional motor deficits, increased astrocyte activation, and severe CNS vascular permeability, while 129S1/SvImJ (129S1) mice remain resistant. Therefore, to investigate the potential role of genetic factors, we performed a comprehensive genetic analysis of (B6 x 129S1) F2 progeny to define quantitative trait loci (QTL) linked to the phenotypic characteristics stated above that mediate CD8 T cell-initiated BBB disruption. Results Using single nucleotide polymorphism (SNP) markers and a 95% confidence interval, we identified one QTL (PIFS1) on chromosome 12 linked to deficits in motor function (SNP markers rs6292954, rs13481303, rs3655057, and rs13481324, LOD score = 3.3). In addition we identified a second QTL (PIFS2) on chromosome 17 linked to changes in CNS vascular permeability (SNP markers rs6196216 and rs3672065, LOD score = 3.7). Conclusions The QTL critical intervals discovered have allowed for compilation of a list of candidate genes implicated in regulating functional deficit and CNS vascular permeability. These genes encode for factors that may be potential targets for therapeutic approaches to treat disorders characterized by CD8 T cell-mediated BBB disruption.
Collapse
|
24
|
Xie SQ, Wen J, Zhang YM. Multi-QTL mapping for quantitative traits using epistatic distorted markers. PLoS One 2013; 8:e68510. [PMID: 23874647 PMCID: PMC3706401 DOI: 10.1371/journal.pone.0068510] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2013] [Accepted: 05/31/2013] [Indexed: 11/18/2022] Open
Abstract
The interaction between segregation distortion loci (SDL) has been often observed in all kinds of mapping populations. However, little has been known about the effect of epistatic SDL on quantitative trait locus (QTL) mapping. Here we proposed a multi-QTL mapping approach using epistatic distorted markers. Using the corrected linkage groups, epistatic SDL was identified. Then, these SDL parameters were used to correct the conditional probabilities of QTL genotypes, and these corrections were further incorporated into the new QTL mapping approach. Finally, a set of simulated datasets and a real data in 304 mouse F2 individuals were used to validate the new method. As compared with the old method, the new one corrects genetic distance between distorted markers, and considers epistasis between two linked SDL. As a result, the power in the detection of QTL is higher for the new method than for the old one, and significant differences for estimates of QTL parameters between the two methods were observed, except for QTL position. Among two QTL for mouse weight, one significant difference for QTL additive effect between the above two methods was observed, because epistatic SDL between markers C66 and T93 exists (P = 2.94e-4).
Collapse
Affiliation(s)
- Shang-Qian Xie
- Statistical Genomics Group, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Department of Crop Genetics and Breeding, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Jia Wen
- Statistical Genomics Group, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Department of Crop Genetics and Breeding, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Yuan-Ming Zhang
- Statistical Genomics Group, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Department of Crop Genetics and Breeding, Nanjing Agricultural University, Nanjing, Jiangsu, China
| |
Collapse
|
25
|
Zhang H, Miao H, Wei L, Li C, Zhao R, Wang C. Genetic analysis and QTL mapping of seed coat color in sesame (Sesamum indicum L.). PLoS One 2013; 8:e63898. [PMID: 23704951 PMCID: PMC3660586 DOI: 10.1371/journal.pone.0063898] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2012] [Accepted: 04/09/2013] [Indexed: 11/19/2022] Open
Abstract
Seed coat color is an important agronomic trait in sesame, as it is associated with seed biochemical properties, antioxidant content and activity and even disease resistance of sesame. Here, using a high-density linkage map, we analyzed genetic segregation and quantitative trait loci (QTL) for sesame seed coat color in six generations (P1, P2, F1, BC1, BC2 and F2). Results showed that two major genes with additive-dominant-epistatic effects and polygenes with additive-dominant-epistatic effects were responsible for controlling the seed coat color trait. Average heritability of the major genes in the BC1, BC2 and F2 populations was 89.30%, 24.00%, and 91.11% respectively, while the heritability of polygenes was low in the BC1 (5.43%), in BC2 (0.00%) and in F2 (0.89%) populations. A high-density map was constructed using 724 polymorphic markers. 653 SSR, AFLP and RSAMPL loci were anchored in 14 linkage groups (LG) spanning a total of 1,216.00 cM. The average length of each LG was 86.86 cM and the marker density was 1.86 cM per marker interval. Four QTLs for seed coat color, QTL1-1, QTL11-1, QTL11-2 and QTL13-1, whose heritability ranged from 59.33%-69.89%, were detected in F3 populations using CIM and MCIM methods. Alleles at all QTLs from the black-seeded parent tended to increase the seed coat color. Results from QTLs mapping and classical genetic analysis among the P1, P2, F1, BC1, BC2 and F2 populations were comparatively consistent. This first QTL analysis and high-density genetic linkage map for sesame provided a good foundation for further research on sesame genetics and molecular marker-assisted selection (MAS).
Collapse
Affiliation(s)
- Haiyang Zhang
- Henan Sesame Research Center, Henan Academy of Agricultural Sciences, Zhengzhou, Henan, China.
| | | | | | | | | | | |
Collapse
|
26
|
Malosetti M, Ribaut JM, van Eeuwijk FA. The statistical analysis of multi-environment data: modeling genotype-by-environment interaction and its genetic basis. Front Physiol 2013; 4:44. [PMID: 23487515 PMCID: PMC3594989 DOI: 10.3389/fphys.2013.00044] [Citation(s) in RCA: 165] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2012] [Accepted: 02/25/2013] [Indexed: 12/04/2022] Open
Abstract
Genotype-by-environment interaction (GEI) is an important phenomenon in plant breeding. This paper presents a series of models for describing, exploring, understanding, and predicting GEI. All models depart from a two-way table of genotype by environment means. First, a series of descriptive and explorative models/approaches are presented: Finlay–Wilkinson model, AMMI model, GGE biplot. All of these approaches have in common that they merely try to group genotypes and environments and do not use other information than the two-way table of means. Next, factorial regression is introduced as an approach to explicitly introduce genotypic and environmental covariates for describing and explaining GEI. Finally, QTL modeling is presented as a natural extension of factorial regression, where marker information is translated into genetic predictors. Tests for regression coefficients corresponding to these genetic predictors are tests for main effect QTL expression and QTL by environment interaction (QEI). QTL models for which QEI depends on environmental covariables form an interesting model class for predicting GEI for new genotypes and new environments. For realistic modeling of genotypic differences across multiple environments, sophisticated mixed models are necessary to allow for heterogeneity of genetic variances and correlations across environments. The use and interpretation of all models is illustrated by an example data set from the CIMMYT maize breeding program, containing environments differing in drought and nitrogen stress. To help readers to carry out the statistical analyses, GenStat® programs, 15th Edition and Discovery® version, are presented as “Appendix.”
Collapse
Affiliation(s)
- Marcos Malosetti
- Biometris - Applied Statistics, Department of Plant Science, Wageningen University Wageningen, Netherlands
| | | | | |
Collapse
|
27
|
A decision rule for quantitative trait locus detection under the extended Bayesian LASSO model. Genetics 2012; 192:1483-91. [PMID: 22982577 DOI: 10.1534/genetics.111.130278] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Bayesian shrinkage analysis is arguably the state-of-the-art technique for large-scale multiple quantitative trait locus (QTL) mapping. However, when the shrinkage model does not involve indicator variables for marker inclusion, QTL detection remains heavily dependent on significance thresholds derived from phenotype permutation under the null hypothesis of no phenotype-to-genotype association. This approach is computationally intensive and more importantly, the hypothetical data generation at the heart of the permutation-based method violates the Bayesian philosophy. Here we propose a fully Bayesian decision rule for QTL detection under the recently introduced extended Bayesian LASSO for QTL mapping. Our new decision rule is free of any hypothetical data generation and relies on the well-established Bayes factors for evaluating the evidence for QTL presence at any locus. Simulation results demonstrate the remarkable performance of our decision rule. An application to real-world data is considered as well.
Collapse
|
28
|
Da Costa E Silva L, Wang S, Zeng ZB. Multiple trait multiple interval mapping of quantitative trait loci from inbred line crosses. BMC Genet 2012; 13:67. [PMID: 22852865 PMCID: PMC3778868 DOI: 10.1186/1471-2156-13-67] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2012] [Accepted: 06/28/2012] [Indexed: 12/02/2022] Open
Abstract
Background Although many experiments have measurements on multiple traits, most studies performed the analysis of mapping of quantitative trait loci (QTL) for each trait separately using single trait analysis. Single trait analysis does not take advantage of possible genetic and environmental correlations between traits. In this paper, we propose a novel statistical method for multiple trait multiple interval mapping (MTMIM) of QTL for inbred line crosses. We also develop a novel score-based method for estimating genome-wide significance level of putative QTL effects suitable for the MTMIM model. The MTMIM method is implemented in the freely available and widely used Windows QTL Cartographer software. Results Throughout the paper, we provide compelling empirical evidences that: (1) the score-based threshold maintains proper type I error rate and tends to keep false discovery rate within an acceptable level; (2) the MTMIM method can deliver better parameter estimates and power than single trait multiple interval mapping method; (3) an analysis of Drosophila dataset illustrates how the MTMIM method can better extract information from datasets with measurements in multiple traits. Conclusions The MTMIM method represents a convenient statistical framework to test hypotheses of pleiotropic QTL versus closely linked nonpleiotropic QTL, QTL by environment interaction, and to estimate the total genotypic variance-covariance matrix between traits and to decompose it in terms of QTL-specific variance-covariance matrices, therefore, providing more details on the genetic architecture of complex traits.
Collapse
Affiliation(s)
- Luciano Da Costa E Silva
- Department of Statistics & Bioinformatics Research Center, North Carolina State University, Raleigh 27695-7566, USA
| | | | | |
Collapse
|
29
|
Scott-Boyer MP, Imholte GC, Tayeb A, Labbe A, Deschepper CF, Gottardo R. An integrated hierarchical Bayesian model for multivariate eQTL mapping. Stat Appl Genet Mol Biol 2012; 11:/j/sagmb.2012.11.issue-4/1544-6115.1760/1544-6115.1760.xml. [PMID: 22850063 PMCID: PMC4627701 DOI: 10.1515/1544-6115.1760] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Recently, expression quantitative loci (eQTL) mapping studies, where expression levels of thousands of genes are viewed as quantitative traits, have been used to provide greater insight into the biology of gene regulation. Originally, eQTLs were detected by applying standard QTL detection tools (using a "one gene at-a-time" approach), but this method ignores many possible interactions between genes. Several other methods have proposed to overcome these limitations, but each of them has some specific disadvantages. In this paper, we present an integrated hierarchical Bayesian model that jointly models all genes and SNPs to detect eQTLs. We propose a model (named iBMQ) that is specifically designed to handle a large number G of gene expressions, a large number S of regressors (genetic markers) and a small number n of individuals in what we call a ``large G, large S, small n'' paradigm. This method incorporates genotypic and gene expression data into a single model while 1) specifically coping with the high dimensionality of eQTL data (large number of genes), 2) borrowing strength from all gene expression data for the mapping procedures, and 3) controlling the number of false positives to a desirable level. To validate our model, we have performed simulation studies and showed that it outperforms other popular methods for eQTL detection, including QTLBIM, R-QTL, remMap and M-SPLS. Finally, we used our model to analyze a real expression dataset obtained in a panel of mice BXD Recombinant Inbred (RI) strains. Analysis of these data with iBMQ revealed the presence of multiple hotspots showing significant enrichment in genes belonging to one or more annotation categories.
Collapse
|
30
|
Genotype by environment interaction of quantitative traits: a case study in barley. G3-GENES GENOMES GENETICS 2012; 2:779-88. [PMID: 22870401 PMCID: PMC3385984 DOI: 10.1534/g3.112.002980] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2012] [Accepted: 05/07/2012] [Indexed: 11/18/2022]
Abstract
Genotype by environment interaction is a phenomenon that a better genotype in one environment may perform poorly in another environment. When the genotype refers to a quantitative trait locus (QTL), this phenomenon is called QTL by environment interaction, denoted by Q×E. Using a recently developed new Bayesian method and genome-wide marker information, we estimated and tested QTL main effects and Q×E interactions for a well-known barley dataset produced by the North American Barley Genome Mapping Project. This dataset contained seven quantitative traits collected from 145 doubled-haploid (DH) lines evaluated in multiple environments, which derived from a cross between two Canadian two-row barley lines, Harrington and TR306. Numerous main effects and Q×E interaction effects have been detected for all seven quantitative traits. However, main effects seem to be more important than the Q×E interaction effects for all seven traits examined. The number of main effects detected varied from 26 for the maturity trait to 75 for the heading trait, with an average of 61.86. The heading trait has the most detected effects, with a total of 98 (75 main, 29 Q×E). Among the 98 effects, 6 loci had both the main and Q×E effects. Among the total number of detected loci, on average, 78.5% of the loci show the main effects whereas 34.9% of the loci show Q×E interactions. Overall, we detected many loci with either the main or the Q×E effects, and the main effects appear to be more important than the Q×E interaction effects for all the seven traits. This means that most detected loci have a constant effect across environments. Another discovery from this analysis is that Q×E interaction occurs independently, regardless whether the locus has main effects.
Collapse
|
31
|
Sukhwinder-Singh, Hernandez MV, Crossa J, Singh PK, Bains NS, Singh K, Sharma I. Multi-trait and multi-environment QTL analyses for resistance to wheat diseases. PLoS One 2012; 7:e38008. [PMID: 22679489 PMCID: PMC3367963 DOI: 10.1371/journal.pone.0038008] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2011] [Accepted: 04/30/2012] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Stripe rust, leaf rust, tan spot, and Karnal bunt are economically significant diseases impacting wheat production. The objectives of this study were to identify quantitative trait loci for resistance to these diseases in a recombinant inbred line (RIL) from a cross HD29/WH542, and to evaluate the evidence for the presence loci on chromosome region conferring multiple disease resistance. METHODOLOGY/PRINCIPAL FINDINGS The RIL population was evaluated for four diseases and genotyped with DNA markers. Multi-trait (MT) analysis revealed thirteen QTLs on nine chromosomes, significantly associated with resistance. Phenotypic variation explained by all significant QTLs for KB, TS, Yr, Lr diseases were 57%, 55%, 38% and 22%, respectively. Marginal trait analysis identified the most significant QTLs for resistance to KB on chromosomes 1BS, 2DS, 3BS, 4BL, 5BL, and 5DL. Chromosomes 3AS and 4BL showed significant association with TS resistance. Significant QTLs for Yr resistance were identified on chromosomes 2AS, 4BL and 5BL, while Lr was significant on 6DS. MT analysis revealed that all the QTLs except 3BL significantly reduce KB and was contributed from parent HD29 while all resistant QTLs for TS except on chromosomes 2DS.1, 2DS.2 and 3BL came from WH542. Five resistant QTLs for Yr and six for Lr were contributed from parents WH542 and HD29 respectively. Chromosome region on 4BL showed significant association to KB, TS, and Yr in the population. The multi environment analysis for KB identified three putative QTLs of which two new QTLs, mapped on chromosomes 3BS and 5DL explained 10 and 20% of the phenotypic variation, respectively. CONCLUSIONS/SIGNIFICANCE This study revealed that MT analysis is an effective tool for detection of multi-trait QTLs for disease resistance. This approach is a more effective and practical than individual QTL mapping analyses. MT analysis identified RILs that combine resistance to multiple diseases from parents WH542 and/or HD29.
Collapse
Affiliation(s)
- Sukhwinder-Singh
- International Maize and Wheat Improvement Center, CIMMYT, Mexico Distrito Federal, Mexico.
| | | | | | | | | | | | | |
Collapse
|
32
|
Abstract
In this chapter, we consider the problem of jointly analyzing multiple (correlated) complex traits in the context of identifying quantitative trait loci (QTL). The advantages of joint analysis (as opposed independent analysis) is the detection of pleiotropy and improved precision of estimates. The multivariate model is introduced along with a brief description of the setup. The model is evaluated in a Bayesian framework to perform model selection (strategy to identify QTL for each trait). A detailed vignette of a statistical software (R/qtlbim) which uses a Markov Chain Monte Carlo (MCMC) approach to draw samples from the posterior distribution is presented. Strategies of checking MCMC convergence, visualization of posterior samples, model building, and testing pleiotropy with the software are described. Relevant code to perform the analysis on an example (simulated) dataset is also provided.
Collapse
|
33
|
Zhao F, Xu S. An expectation and maximization algorithm for estimating Q X E interaction effects. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:1375-1387. [PMID: 22297562 DOI: 10.1007/s00122-012-1794-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Accepted: 01/05/2012] [Indexed: 05/31/2023]
Abstract
A Markov chain Monte Carlo (MCMC) implemented Bayesian method has been developed to detect quantitative trait loci (QTL) effects and Q x E interaction effects. However, the MCMC algorithm is time consuming due to repeated samplings of QTL parameters. We developed an expectation and maximization (EM) algorithm as an alternative method for detecting QTL and Q x E interaction. Simulation studies and real data analysis showed that the EM algorithm produced comparable result as the Bayesian method, but with a speed many magnitudes faster than the MCMC algorithm. We used the EM algorithm to analyze a well known barley dataset produced by the North American Barley Genome Mapping Project. The dataset contained eight quantitative traits collected from 150 doubled-haploid (DH) lines evaluated in multiple environments. Each line was genotyped for 495 polymorphic markers. The result showed that all eight traits exhibited QTL main effects and Q x E interaction effects. On average, the main effects and Q x E interaction effects contributed 34.56 and 16.23% of the total phenotypic variance, respectively. Furthermore, we found that whether or not a locus shows Q x E interaction does not depend on the presence of main effect.
Collapse
Affiliation(s)
- Fuping Zhao
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | | |
Collapse
|
34
|
Sabadin PK, Malosetti M, Boer MP, Tardin FD, Santos FG, Guimarães CT, Gomide RL, Andrade CLT, Albuquerque PEP, Caniato FF, Mollinari M, Margarido GRA, Oliveira BF, Schaffert RE, Garcia AAF, van Eeuwijk FA, Magalhaes JV. Studying the genetic basis of drought tolerance in sorghum by managed stress trials and adjustments for phenological and plant height differences. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:1389-402. [PMID: 22297563 DOI: 10.1007/s00122-012-1795-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2011] [Accepted: 01/05/2012] [Indexed: 05/25/2023]
Abstract
Managed environments in the form of well watered and water stressed trials were performed to study the genetic basis of grain yield and stay green in sorghum with the objective of validating previously detected QTL. As variations in phenology and plant height may influence QTL detection for the target traits, QTL for flowering time and plant height were introduced as cofactors in QTL analyses for yield and stay green. All but one of the flowering time QTL were detected near yield and stay green QTL. Similar co-localization was observed for two plant height QTL. QTL analysis for yield, using flowering time/plant height cofactors, led to yield QTL on chromosomes 2, 3, 6, 8 and 10. For stay green, QTL on chromosomes 3, 4, 8 and 10 were not related to differences in flowering time/plant height. The physical positions for markers in QTL regions projected on the sorghum genome suggest that the previously detected plant height QTL, Sb-HT9-1, and Dw2, in addition to the maturity gene, Ma5, had a major confounding impact on the expression of yield and stay green QTL. Co-localization between an apparently novel stay green QTL and a yield QTL on chromosome 3 suggests there is potential for indirect selection based on stay green to improve drought tolerance in sorghum. Our QTL study was carried out with a moderately sized population and spanned a limited geographic range, but still the results strongly emphasize the necessity of corrections for phenology in QTL mapping for drought tolerance traits in sorghum.
Collapse
Affiliation(s)
- P K Sabadin
- Embrapa Maize and Sorghum, Rod. MG 424, Km 65, Sete Lagoas, MG 35701-970, Brazil
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Che X, Xu S. Generalized linear mixed models for mapping multiple quantitative trait loci. Heredity (Edinb) 2012; 109:41-9. [PMID: 22415425 DOI: 10.1038/hdy.2012.10] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Many biological traits are discretely distributed in phenotype but continuously distributed in genetics because they are controlled by multiple genes and environmental variants. Due to the quantitative nature of the genetic background, these multiple genes are called quantitative trait loci (QTL). When the QTL effects are treated as random, they can be estimated in a single generalized linear mixed model (GLMM), even if the number of QTL may be larger than the sample size. The GLMM in its original form cannot be applied to QTL mapping for discrete traits if there are missing genotypes. We examined two alternative missing genotype-handling methods: the expectation method and the overdispersion method. Simulation studies show that the two methods are efficient for multiple QTL mapping (MQM) under the GLMM framework. The overdispersion method showed slight advantages over the expectation method in terms of smaller mean-squared errors of the estimated QTL effects. The two methods of GLMM were applied to MQM for the female fertility trait of wheat. Multiple QTL were detected to control the variation of the number of seeded spikelets.
Collapse
Affiliation(s)
- X Che
- Department of Statistics, University of California, Riverside, CA 92521, USA
| | | |
Collapse
|
36
|
Costa E Silva L, Wang S, Zeng ZB. Multiple trait multiple interval mapping of quantitative trait loci from inbred line crosses. BMC Genet 2012. [DOI: 10.1186/1471-2156] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
37
|
Zhan H, Xu S. Generalized linear mixed model for segregation distortion analysis. BMC Genet 2011; 12:97. [PMID: 22078575 PMCID: PMC3748016 DOI: 10.1186/1471-2156-12-97] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2011] [Accepted: 11/11/2011] [Indexed: 11/16/2022] Open
Abstract
Background Segregation distortion is a phenomenon that the observed genotypic frequencies of a locus fall outside the expected Mendelian segregation ratio. The main cause of segregation distortion is viability selection on linked marker loci. These viability selection loci can be mapped using genome-wide marker information. Results We developed a generalized linear mixed model (GLMM) under the liability model to jointly map all viability selection loci of the genome. Using a hierarchical generalized linear mixed model, we can handle the number of loci several times larger than the sample size. We used a dataset from an F2 mouse family derived from the cross of two inbred lines to test the model and detected a major segregation distortion locus contributing 75% of the variance of the underlying liability. Replicated simulation experiments confirm that the power of viability locus detection is high and the false positive rate is low. Conclusions Not only can the method be used to detect segregation distortion loci, but also used for mapping quantitative trait loci of disease traits using case only data in humans and selected populations in plants and animals.
Collapse
Affiliation(s)
- Haimao Zhan
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | | |
Collapse
|
38
|
Nelson JC. Linkage analysis in unconventional mating designs in line crosses. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 123:897-906. [PMID: 21681487 DOI: 10.1007/s00122-011-1635-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2011] [Accepted: 06/03/2011] [Indexed: 05/30/2023]
Abstract
Linkage estimation and genetic map construction with genotyped DNA markers in plants preferentially employ a few maximally informative early-generation or recombinant-inbred mating designs. Fitting their recombination models to unconventional designs adapted to cultivar development (series of backcrossing, selfing, haploid-doubling, random-intercrossing, and sib-mating steps) distorts single- and multipoint linkage estimates even with dense marker coverage. Two methods are provided for correct linkage estimation in unconventional designs: fitting a correct multigeneration model, or correcting the estimates produced by fitting a one-generation model with any conventional software. These methods also support calculation of multilocus genotype frequencies and QTL-genotype distributions and are available in software.
Collapse
Affiliation(s)
- James C Nelson
- Department of Plant Pathology, 4024 Throckmorton Plant Sciences Center, Kansas State University, Manhattan, KS 66506, USA.
| |
Collapse
|
39
|
Malosetti M, van Eeuwijk FA, Boer MP, Casas AM, Elía M, Moralejo M, Bhat PR, Ramsay L, Molina-Cano JL. Gene and QTL detection in a three-way barley cross under selection by a mixed model with kinship information using SNPs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 122:1605-16. [PMID: 21373796 PMCID: PMC3082036 DOI: 10.1007/s00122-011-1558-z] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2010] [Accepted: 02/16/2011] [Indexed: 05/18/2023]
Abstract
Quantitative trait locus (QTL) detection is commonly performed by analysis of designed segregating populations derived from two inbred parental lines, where absence of selection, mutation and genetic drift is assumed. Even for designed populations, selection cannot always be avoided, with as consequence varying correlation between genotypes instead of uniform correlation. Akin to linkage disequilibrium mapping, ignoring this type of genetic relatedness will increase the rate of false-positives. In this paper, we advocate using mixed models including genetic relatedness, or 'kinship' information for QTL detection in populations where selection forces operated. We demonstrate our case with a three-way barley cross, designed to segregate for dwarfing, vernalization and spike morphology genes, in which selection occurred. The population of 161 inbred lines was screened with 1,536 single nucleotide polymorphisms (SNPs), and used for gene and QTL detection. The coefficient of coancestry matrix was estimated based on the SNPs and imposed to structure the distribution of random genotypic effects. The model incorporating kinship, coancestry, information was consistently superior to the one without kinship (according to the Akaike information criterion). We show, for three traits, that ignoring the coancestry information results in an unrealistically high number of marker-trait associations, without providing clear conclusions about QTL locations. We used a number of widely recognized dwarfing and vernalization genes known to segregate in the studied population as landmarks or references to assess the agreement of the mapping results with a priori candidate gene expectations. Additional QTLs to the major genes were detected for all traits as well.
Collapse
Affiliation(s)
- Marcos Malosetti
- Biometris-Applied Statistics, Wageningen University, Wageningen, The Netherlands.
| | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Xu S, Hu Z. Mapping quantitative trait loci using the MCMC procedure in SAS. Heredity (Edinb) 2011; 106:357-69. [PMID: 20551982 PMCID: PMC3183881 DOI: 10.1038/hdy.2010.77] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Revised: 05/10/2010] [Accepted: 05/13/2010] [Indexed: 11/09/2022] Open
Abstract
The MCMC procedure in SAS (called PROC MCMC) is particularly designed for Bayesian analysis using the Markov chain Monte Carlo (MCMC) algorithm. The program is sufficiently general to handle very complicated statistical models and arbitrary prior distributions. This study introduces the SAS/MCMC procedure and demonstrates the application of the program to quantitative trait locus (QTL) mapping. A real life QTL mapping experiment in wheat female fertility trait was used as an example for the demonstration. The fertility trait phenotypes were described under three different models: (1) the Poisson model, (2) the Bernoulli model and (3) the zero-truncated Poisson model. One QTL was identified on the second chromosome. This QTL appears to control the switch of seed-producing ability of female plants but does not affect the number of seeds produced once the switch is turned on.
Collapse
Affiliation(s)
- S Xu
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA.
| | | |
Collapse
|
41
|
Nelson JC, McClung AM, Fjellstrom RG, Moldenhauer KAK, Boza E, Jodari F, Oard JH, Linscombe S, Scheffler BE, Yeater KM. Mapping QTL main and interaction influences on milling quality in elite US rice germplasm. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 122:291-309. [PMID: 20857082 DOI: 10.1007/s00122-010-1445-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2010] [Accepted: 08/30/2010] [Indexed: 05/10/2023]
Abstract
Rice (Oryza sativa L.) head-rice yield (HR) is a key export and domestic quality trait whose genetic control is poorly understood. With the goal of identifying genomic regions influencing HR, quantitative-trait-locus (QTL) mapping was carried out for quality-related traits in recombinant inbred lines (RILs) derived from crosses of common parent Cypress, a high-HR US japonica cultivar, with RT0034, a low-HR indica line (129 RILs) and LaGrue, a low-HR japonica cultivar (298 RILs), grown in two US locations in 2005-2007. Early heading increased HR in the Louisiana (LA) but not the Arkansas (AR) location. Fitting QTL-mapping models to separate QTL main and QTL × environment interaction (QEI) effects and identify epistatic interactions revealed six main-effect HR QTLs in the two crosses, at four of which Cypress contributed the increasing allele. Multi-QTL models accounted for 0.36 of genetic and 0.21 of genetic × environment interaction of HR in MY1, and corresponding proportions of 0.25 and 0.37 in MY2. The greater HR advantage of Cypress in LA than in AR corresponded to a genomewide pattern of opposition of HR-increasing QTL effects by AR-specific effects, suggesting a selection strategy for improving this cultivar for AR. Treating year-location combinations as independent environments resulted in underestimation of QEI effects, evidently owing to lower variation among years within location than between location. Identification of robust HR QTLs in elite long-grain germplasm is suggested to require more detailed attention to the interaction of plant and grain development parameters with environmental conditions than has been given to date.
Collapse
Affiliation(s)
- J C Nelson
- Department of Plant Pathology, 4024 Throckmorton Plant Sciences Center, Kansas State University, Manhattan, KS 66506, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Xu HM, Wei CS, Tang YT, Zhu ZH, Sima YF, Lou XY. A new mapping method for quantitative trait loci of silkworm. BMC Genet 2011; 12:19. [PMID: 21276233 PMCID: PMC3042969 DOI: 10.1186/1471-2156-12-19] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2010] [Accepted: 01/28/2011] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Silkworm is the basis of sericultural industry and the model organism in insect genetics study. Mapping quantitative trait loci (QTLs) underlying economically important traits of silkworm is of high significance for promoting the silkworm molecular breeding and advancing our knowledge on genetic architecture of the Lepidoptera. Yet, the currently used mapping methods are not well suitable for silkworm, because of ignoring the recombination difference in meiosis between two sexes. RESULTS A mixed linear model including QTL main effects, epistatic effects, and QTL × sex interaction effects was proposed for mapping QTLs in an F2 population of silkworm. The number and positions of QTLs were determined by F-test and model selection. The Markov chain Monte Carlo (MCMC) algorithm was employed to estimate and test genetic effects of QTLs and QTL × sex interaction effects. The effectiveness of the model and statistical method was validated by a series of simulations. The results indicate that when markers are distributed sparsely on chromosomes, our method will substantially improve estimation accuracy as compared to the normal chiasmate F2 model. We also found that a sample size of hundreds was sufficiently large to unbiasedly estimate all the four types of epistases (i.e., additive-additive, additive-dominance, dominance-additive, and dominance-dominance) when the paired QTLs reside on different chromosomes in silkworm. CONCLUSION The proposed method could accurately estimate not only the additive, dominance and digenic epistatic effects but also their interaction effects with sex, correcting the potential bias and precision loss in the current QTL mapping practice of silkworm and thus representing an important addition to the arsenal of QTL mapping tools.
Collapse
Affiliation(s)
- Hai-Ming Xu
- Institute of Bioinformatics, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310029, China
| | | | | | | | | | | |
Collapse
|
43
|
He XH, Qin H, Hu Z, Zhang T, Zhang YM. Mapping of epistatic quantitative trait loci in four-way crosses. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2011; 122:33-48. [PMID: 20827458 DOI: 10.1007/s00122-010-1420-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2009] [Accepted: 07/24/2010] [Indexed: 05/29/2023]
Abstract
Four-way crosses (4WC) involving four different inbred lines often appear in plant and animal commercial breeding programs. Direct mapping of quantitative trait loci (QTL) in these commercial populations is both economical and practical. However, the existing statistical methods for mapping QTL in a 4WC population are built on the single-QTL genetic model. This simple genetic model fails to take into account QTL interactions, which play an important role in the genetic architecture of complex traits. In this paper, therefore, we attempted to develop a statistical method to detect epistatic QTL in 4WC population. Conditional probabilities of QTL genotypes, computed by the multi-point single locus method, were used to sample the genotypes of all putative QTL in the entire genome. The sampled genotypes were used to construct the design matrix for QTL effects. All QTL effects, including main and epistatic effects, were simultaneously estimated by the penalized maximum likelihood method. The proposed method was confirmed by a series of Monte Carlo simulation studies and real data analysis of cotton. The new method will provide novel tools for the genetic dissection of complex traits, construction of QTL networks, and analysis of heterosis.
Collapse
Affiliation(s)
- Xiao-Hong He
- Section on Statistical Genomics, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | | | | | | | | |
Collapse
|
44
|
Guo B, Beavis WD. In silico genotyping of the maize nested association mapping population. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2011; 27:107-113. [PMID: 21289856 PMCID: PMC3015163 DOI: 10.1007/s11032-010-9503-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2010] [Accepted: 09/04/2010] [Indexed: 05/30/2023]
Abstract
Nested Association Mapping (NAM) has been proposed as a means to combine the power of linkage mapping with the resolution of association mapping. It is enabled through sequencing or array genotyping of parental inbred lines while using low-cost, low-density genotyping technologies for their segregating progenies. For purposes of data analyses of NAM populations, parental genotypes at a large number of Single Nucleotide Polymorphic (SNP) loci need to be projected to their segregating progeny. Herein we demonstrate how approximately 0.5 million SNPs that have been genotyped in 26 parental lines of the publicly available maize NAM population can be projected onto their segregating progeny using only 1,106 SNP loci that have been genotyped in both the parents and their 5,000 progeny. The challenge is to estimate both the genotype and genetic location of the parental SNP genotypes in segregating progeny. Both challenges were met by estimating their expected genotypic values conditional on observed flanking markers through the use of both physical and linkage maps. About 90%, of 500,000 genotyped SNPs from the maize HapMap project, were assigned linkage map positions using linear interpolation between the maize Accessioned Gold Path (AGP) and NAM linkage maps. Of these, almost 70% provided high probability estimates of genotypes in almost 5,000 recombinant inbred lines.
Collapse
Affiliation(s)
- Baohong Guo
- Department of Agronomy, Iowa State University, 1208 Agronomy Hall, Ames, IA 50011 USA
- Present Address: Syngenta Seeds, Inc, Slater, IA 50244 USA
| | - William D. Beavis
- Department of Agronomy, Iowa State University, 1208 Agronomy Hall, Ames, IA 50011 USA
| |
Collapse
|
45
|
Identification of genes and networks driving cardiovascular and metabolic phenotypes in a mouse F2 intercross. PLoS One 2010; 5:e14319. [PMID: 21179467 PMCID: PMC3001864 DOI: 10.1371/journal.pone.0014319] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2010] [Accepted: 11/03/2010] [Indexed: 12/21/2022] Open
Abstract
To identify the genes and pathways that underlie cardiovascular and metabolic phenotypes we performed an integrated analysis of a mouse C57BL/6JxA/J F2 (B6AF2) cross by relating genome-wide gene expression data from adipose, kidney, and liver tissues to physiological endpoints measured in the population. We have identified a large number of trait QTLs including loci driving variation in cardiac function on chromosomes 2 and 6 and a hotspot for adiposity, energy metabolism, and glucose traits on chromosome 8. Integration of adipose gene expression data identified a core set of genes that drive the chromosome 8 adiposity QTL. This chromosome 8 trans eQTL signature contains genes associated with mitochondrial function and oxidative phosphorylation and maps to a subnetwork with conserved function in humans that was previously implicated in human obesity. In addition, human eSNPs corresponding to orthologous genes from the signature show enrichment for association to type II diabetes in the DIAGRAM cohort, supporting the idea that the chromosome 8 locus perturbs a molecular network that in humans senses variations in DNA and in turn affects metabolic disease risk. We functionally validate predictions from this approach by demonstrating metabolic phenotypes in knockout mice for three genes from the trans eQTL signature, Akr1b8, Emr1, and Rgs2. In addition we show that the transcriptional signatures for knockout of two of these genes, Akr1b8 and Rgs2, map to the F2 network modules associated with the chromosome 8 trans eQTL signature and that these modules are in turn very significantly correlated with adiposity in the F2 population. Overall this study demonstrates how integrating gene expression data with QTL analysis in a network-based framework can aid in the elucidation of the molecular drivers of disease that can be translated from mice to humans.
Collapse
|
46
|
Zhang L, Wang S, Li H, Deng Q, Zheng A, Li S, Li P, Li Z, Wang J. Effects of missing marker and segregation distortion on QTL mapping in F2 populations. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2010; 121:1071-1082. [PMID: 20535442 DOI: 10.1007/s00122-010-1372-z] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2010] [Accepted: 05/21/2010] [Indexed: 05/27/2023]
Abstract
Missing marker and segregation distortion are commonly encountered in actual quantitative trait locus (QTL) mapping populations. Our objective in this study was to investigate the impact of the two factors on QTL mapping through computer simulations. Results indicate that detection power decreases with increasing levels of missing markers, and the false discovery rate increases. Missing markers have greater effects on smaller effect QTL and smaller size populations. The effect of missing markers can be quantified by a population with a reduced size similar to the marker missing rate. As for segregation distortion, if the distorted marker is not closely linked with any QTL, it will not have significant impact on QTL mapping; otherwise, the impact of the distortion will depend on the degree of dominance of QTL, frequencies of the three marker types, the linkage distance between the distorted marker and QTL, and the mapping population size. Sometimes, the distortion can result in a higher genetic variance than that of non-distortion, and therefore benefits the detection of linked QTL. A formula of the ratio of genetic variance explained by QTL under distortion and non-distortion was given in this study, so as to easily determine whether the segregation distortion marker (SDM) increases or decreases the QTL detection power. The effect of SDM decreases rapidly as its linkage relationship with QTL becomes looser. In general, distorted markers will not have a great effect on the position and effect estimations of QTL, and their effects can be ignored in large-size mapping populations.
Collapse
Affiliation(s)
- Luyan Zhang
- School of Mathematical Sciences, Beijing Normal University, Beijing, 100875, China
| | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Che X, Xu S. Significance test and genome selection in bayesian shrinkage analysis. INTERNATIONAL JOURNAL OF PLANT GENOMICS 2010; 2010:893206. [PMID: 20631902 PMCID: PMC2902048 DOI: 10.1155/2010/893206] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/16/2009] [Revised: 02/05/2010] [Accepted: 03/27/2010] [Indexed: 05/29/2023]
Abstract
Bayesian shrinkage analysis is the state-of-the-art method for whole genome analysis of quantitative traits. It can estimate the genetic effects for the entire genome using a dense marker map. The technique is now called genome selection. A nice property of the shrinkage analysis is that it can estimate effects of QTL as small as explaining 2% of the phenotypic variance in a typical sample size of 300-500 individuals. In most cases, QTL can be detected with simple visual inspection of the entire genome for the effect because the false positive rate is low. As a Bayesian method, no significance test is needed. However, it is still desirable to put some confidences on the estimated QTL effects. We proposed to use the permutation test to draw empirical thresholds to declare significance of QTL under a predetermined genome wide type I error. With the permutation test, Bayesian shrinkage analysis can be routinely used for QTL detection.
Collapse
Affiliation(s)
- Xiaohong Che
- Department of Statistics, University of California, Riverside, California 92521, USA
| | - Shizhong Xu
- Department of Botany and Plant Sciences, University of California, Riverside, California 92521, USA
| |
Collapse
|
48
|
Xu S, Hu Z. Generalized linear model for interval mapping of quantitative trait loci. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2010; 121:47-63. [PMID: 20180093 PMCID: PMC2871098 DOI: 10.1007/s00122-010-1290-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2009] [Accepted: 02/01/2010] [Indexed: 05/23/2023]
Abstract
We developed a generalized linear model of QTL mapping for discrete traits in line crossing experiments. Parameter estimation was achieved using two different algorithms, a mixture model-based EM (expectation-maximization) algorithm and a GEE (generalized estimating equation) algorithm under a heterogeneous residual variance model. The methods were developed using ordinal data, binary data, binomial data and Poisson data as examples. Applications of the methods to simulated as well as real data are presented. The two different algorithms were compared in the data analyses. In most situations, the two algorithms were indistinguishable, but when large QTL are located in large marker intervals, the mixture model-based EM algorithm can fail to converge to the correct solutions. Both algorithms were coded in C++ and interfaced with SAS as a user-defined SAS procedure called PROC QTL.
Collapse
Affiliation(s)
- Shizhong Xu
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA.
| | | |
Collapse
|
49
|
E Silva LDC, Zeng ZB. Current Progress on Statistical Methods for Mapping Quantitative Trait Loci from Inbred Line Crosses. J Biopharm Stat 2010; 20:454-81. [DOI: 10.1080/10543400903572845] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Luciano Da Costa E Silva
- a Department of Statistics, Bioinformatics Research Center , North Carolina State University , Raleigh, North Carolina, USA
| | - Zhao-Bang Zeng
- a Department of Statistics, Bioinformatics Research Center , North Carolina State University , Raleigh, North Carolina, USA
- b Department of Genetics , North Carolina State University , Raleigh, North Carolina, USA
| |
Collapse
|
50
|
Multilocus tetrasomic linkage analysis using hidden Markov chain model. Proc Natl Acad Sci U S A 2010; 107:4270-4. [PMID: 20142473 DOI: 10.1073/pnas.0908477107] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The availability of reliable genetic linkage maps is crucial for functional and evolutionary genomic analyses. Established theory and methods of genetic linkage analysis have made map construction a routine exercise in diploids. However, many evolutionarily, ecologically, and/or agronomically important species are autopolyploids, with autotetraploidy being a typical example. These species undergo much more complicated chromosomal segregation and recombination at meiosis than diploids. In addition, there is evidence of polyploidy-induced and highly dynamic changes in the structure of the genome. These polysomic characteristics indicate the inappropriateness of the theory and methods of linkage analysis in diploids for use in these species and a gap in the theory and methodology of tetraploid map construction. This paper presents a theoretical model and statistical framework for multilocus linkage analysis in autotetraploids for use with dominant and/or codominant DNA molecular markers. The theory and methods incorporate the essential features of allele segregation and recombination under tetrasomic inheritance and the major challenges in statistical modeling and marker data analysis. We validated the method and explored its statistical properties by intensive simulation study and demonstrated its utility by analysis of AFLP and SSR marker data from an outbred autotetraploid potato population.
Collapse
|