1
|
Kaier A, Beck S, Ingold M, García JMC, Reinert S, Sonnewald U, Sonnewald S. Identification of heat stress-related genomic regions by genome-wide association study in Solanum tuberosum. Genomics 2024; 116:110954. [PMID: 39477032 DOI: 10.1016/j.ygeno.2024.110954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 10/22/2024] [Accepted: 10/23/2024] [Indexed: 11/04/2024]
Abstract
The climate crisis impairs yield and quality of crucial crops like potatoes. We investigated the effects of heat stress on five morpho-physiological parameters in a diverse panel of 178 potato cultivars under glasshouse conditions. Overall, heat stress increased shoot elongation and green fresh weight, but reduced tuber yield, starch content and harvest index. Genomic information was obtained from 258 tetraploid and three diploid cultivars by a genotyping-by-sequencing approach using methylation-sensitive restriction enzymes. This resulted in an enrichment of sequences in gene-rich regions. Population structure analyses using genetic distances and hierarchical clustering revealed strong kinship but weak overall population structure cultivars. A genome-wide association study (GWAS) was conducted with a subset of 20 K stringently filtered SNPs to identify quantitative trait loci (QTL) linked to heat tolerance. We identified 67 QTL and established haploblock boundaries to narrow down the number of candidate genes. Additionally, GO-enrichment analyses provided insights into gene functions. Heritability and genomic prediction were conducted to assess the usability of the collected data for selecting breeding material. The detected QTL might be exploited in marker-assisted selection to develop heat-resilient potato cultivars.
Collapse
Affiliation(s)
- Alexander Kaier
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany
| | - Selina Beck
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany
| | - Markus Ingold
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany
| | - José María Corral García
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany
| | - Stephan Reinert
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany
| | - Uwe Sonnewald
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany
| | - Sophia Sonnewald
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Department Biology, Division of Biochemistry, 91058 Erlangen, Germany.
| |
Collapse
|
2
|
Endelman JB, Kante M, Lindqvist-Kreuze H, Kilian A, Shannon LM, Caraza-Harter MV, Vaillancourt B, Mailloux K, Hamilton JP, Buell CR. Targeted genotyping-by-sequencing of potato and data analysis with R/polyBreedR. THE PLANT GENOME 2024:e20484. [PMID: 38887158 DOI: 10.1002/tpg2.20484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2024] [Revised: 04/30/2024] [Accepted: 05/10/2024] [Indexed: 06/20/2024]
Abstract
Mid-density targeted genotyping-by-sequencing (GBS) combines trait-specific markers with thousands of genomic markers at an attractive price for linkage mapping and genomic selection. A 2.5K targeted GBS assay for potato (Solanum tuberosum L.) was developed using the DArTag technology and later expanded to 4K targets. Genomic markers were selected from the potato Infinium single nucleotide polymorphism (SNP) array to maximize genome coverage and polymorphism rates. The DArTag and SNP array platforms produced equivalent dendrograms in a test set of 298 tetraploid samples, and 83% of the common markers showed good quantitative agreement, with RMSE (root mean squared error) <0.5. DArTag is suited for genomic selection candidates in the clonal evaluation trial, coupled with imputation to a higher density platform for the training population. Using the software polyBreedR, an R package for the manipulation and analysis of polyploid marker data, the RMSE for imputation by linkage analysis was 0.15 in a small half-diallel population (N = 85), which was significantly lower than the RMSE of 0.42 with the random forest method. Regarding high-value traits, the DArTag markers for resistance to potato virus Y, golden cyst nematode, and potato wart appeared to track their targets successfully, as did multi-allelic markers for maturity and tuber shape. In summary, the potato DArTag assay is a transformative and publicly available technology for potato breeding and genetics.
Collapse
Affiliation(s)
- Jeffrey B Endelman
- Department of Plant & Agroecosystem Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Moctar Kante
- Genetics, Genomics and Crop Improvement, International Potato Center, Lima, Peru
| | | | - Andrzej Kilian
- Diversity Arrays Technology Pty Ltd., University of Canberra, Bruce, Australian Capital Territory, Australia
| | - Laura M Shannon
- Department of Horticultural Science, University of Minnesota, Saint Paul, Minnesota, USA
| | - Maria V Caraza-Harter
- Department of Plant & Agroecosystem Sciences, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Brieanne Vaillancourt
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, USA
| | - Kathrine Mailloux
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, USA
| | - John P Hamilton
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, USA
| | - C Robin Buell
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, USA
| |
Collapse
|
3
|
Bilton TP, Sharma SK, Schofield MR, Black MA, Jacobs JME, Bryan GJ, Dodds KG. Construction of relatedness matrices in autopolyploid populations using low-depth high-throughput sequencing data. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:64. [PMID: 38430392 PMCID: PMC10908621 DOI: 10.1007/s00122-024-04568-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 01/30/2024] [Indexed: 03/03/2024]
Abstract
KEY MESSAGE An improved estimator of genomic relatedness using low-depth high-throughput sequencing data for autopolyploids is developed. Its outputs strongly correlate with SNP array-based estimates and are available in the package GUSrelate. High-throughput sequencing (HTS) methods have reduced sequencing costs and resources compared to array-based tools, facilitating the investigation of many non-model polyploid species. One important quantity that can be computed from HTS data is the genetic relatedness between all individuals in a population. However, HTS data are often messy, with multiple sources of errors (i.e. sequencing errors or missing parental alleles) which, if not accounted for, can lead to bias in genomic relatedness estimates. We derive a new estimator for constructing a genomic relationship matrix (GRM) from HTS data for autopolyploid species that accounts for errors associated with low sequencing depths, implemented in the R package GUSrelate. Simulations revealed that GUSrelate performed similarly to existing GRM methods at high depth but reduced bias in self-relatedness estimates when the sequencing depth was low. Using a panel consisting of 351 tetraploid potato genotypes, we found that GUSrelate produced GRMs from genotyping-by-sequencing (GBS) data that were highly correlated with a GRM computed from SNP array data, and less biased than existing methods when benchmarking against the array-based GRM estimates. GUSrelate provides researchers with a tool to reliably construct GRMs from low-depth HTS data.
Collapse
Affiliation(s)
- Timothy P Bilton
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand.
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
| | - Sanjeev Kumar Sharma
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Matthew R Schofield
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand
| | - Michael A Black
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | | | - Glenn J Bryan
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Ken G Dodds
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| |
Collapse
|
4
|
Machado IP, DoVale JC, Sabadin F, Fritsche-Neto R. On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops. FRONTIERS IN PLANT SCIENCE 2023; 14:1164555. [PMID: 37332727 PMCID: PMC10272588 DOI: 10.3389/fpls.2023.1164555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 05/10/2023] [Indexed: 06/20/2023]
Abstract
The advances in genomics in recent years have increased the accuracy and efficiency of breeding programs for many crops. Nevertheless, the adoption of genomic enhancement for several other crops essential in developing countries is still limited, especially for those that do not have a reference genome. These crops are more often called orphans. This is the first report to show how the results provided by different platforms, including the use of a simulated genome, called the mock genome, can generate in population structure and genetic diversity studies, especially when the intention is to use this information to support the formation of heterotic groups, choice of testers, and genomic prediction of single crosses. For that, we used a method to assemble a reference genome to perform the single-nucleotide polymorphism (SNP) calling without needing an external genome. Thus, we compared the analysis results using the mock genome with the standard approaches (array and genotyping-by-sequencing (GBS)). The results showed that the GBS-Mock presented similar results to the standard methods of genetic diversity studies, division of heterotic groups, the definition of testers, and genomic prediction. These results showed that a mock genome constructed from the population's intrinsic polymorphisms to perform the SNP calling is an effective alternative for conducting genomic studies of this nature in orphan crops, especially those that do not have a reference genome.
Collapse
Affiliation(s)
| | - Júlio César DoVale
- Department of Crop Science, Federal University of Ceará, Fortaleza, Brazil
| | - Felipe Sabadin
- School of Plant and Environmental Sciences, Virginia Tech: Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Roberto Fritsche-Neto
- LSU AgCenter, Louisiana State University Agricultural Center, Baton Rouge, LA, United States
| |
Collapse
|
5
|
Batista LG, Mello VH, Souza AP, Margarido GRA. Genomic prediction with allele dosage information in highly polyploid species. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:723-739. [PMID: 34800132 DOI: 10.1007/s00122-021-03994-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 11/06/2021] [Indexed: 06/13/2023]
Abstract
Including allele, dosage can improve genomic selection in highly polyploid species under higher frequency of different heterozygous genotypic classes and high dominance degree levels. Several studies have shown how to leverage allele dosage information to improve the accuracy of genomic selection models in autotetraploid. In this study, we expanded the methodology used for genomic selection in autotetraploid to higher (and mixed) ploidy levels. We adapted the models to build covariance matrices of both additive and digenic dominance effects that are subsequently used in genomic selection models. We applied these models using estimates of ploidy and allele dosage to sugarcane and sweet potato datasets and validated our results by also applying the models in simulated data. For the simulated datasets, including allele dosage information led up to 140% higher mean predictive abilities in comparison to using diploidized markers. Including dominance effects were highly advantageous when using diploidized markers, leading to mean predictive abilities which were up to 115% higher in comparison to only including additive effects. When the frequency of heterozygous genotypes in the population was low, such as in the sugarcane and sweet potato datasets, there was little advantage in including allele dosage information in the models. Overall, we show that including allele dosage can improve genomic selection in highly polyploid species under higher frequency of different heterozygous genotypic classes and high dominance degree levels.
Collapse
Affiliation(s)
- Lorena G Batista
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | - Victor H Mello
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | - Anete P Souza
- Center of Molecular Biology and Genetic Engineering, University of Campinas, Campinas, SP, 13083-970, Brazil
| | - Gabriel R A Margarido
- Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil.
| |
Collapse
|
6
|
Zheng C, Amadeu RR, Munoz PR, Endelman JB. Haplotype reconstruction in connected tetraploid F1 populations. Genetics 2021; 219:6330625. [PMID: 34849879 DOI: 10.1093/genetics/iyab106] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 06/03/2021] [Indexed: 11/12/2022] Open
Abstract
In diploid species, many multiparental populations have been developed to increase genetic diversity and quantitative trait loci (QTL) mapping resolution. In these populations, haplotype reconstruction has been used as a standard practice to increase the power of QTL detection in comparison with the marker-based association analysis. However, such software tools for polyploid species are few and limited to a single biparental F1 population. In this study, a statistical framework for haplotype reconstruction has been developed and implemented in the software PolyOrigin for connected tetraploid F1 populations with shared parents, regardless of the number of parents or mating design. Given a genetic or physical map of markers, PolyOrigin first phases parental genotypes, then refines the input marker map, and finally reconstructs offspring haplotypes. PolyOrigin can utilize single nucleotide polymorphism (SNP) data coming from arrays or from sequence-based genotyping; in the latter case, bi-allelic read counts can be used (and are preferred) as input data to minimize the influence of genotype calling errors at low depth. With extensive simulation we show that PolyOrigin is robust to the errors in the input genotypic data and marker map. It works well for various population designs with ≥30 offspring per parent and for sequences with read depth as low as 10x. PolyOrigin was further evaluated using an autotetraploid potato dataset with a 3 × 3 half-diallel mating design. In conclusion, PolyOrigin opens up exciting new possibilities for haplotype analysis in tetraploid breeding populations.
Collapse
Affiliation(s)
- Chaozhi Zheng
- Biometris, Wageningen University and Research, Wageningen 6700AA, The Netherlands
| | - Rodrigo R Amadeu
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | - Patricio R Munoz
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | - Jeffrey B Endelman
- Department of Horticulture, University of Wisconsin, Madison, WI 53706, USA
| |
Collapse
|
7
|
Lindqvist-Kreuze H, De Boeck B, Unger P, Gemenet D, Li X, Pan Z, Sui Q, Qin J, Woldegjorgis G, Negash K, Seid I, Hirut B, Gastelo M, De Vega J, Bonierbale M. Global multi-environment resistance QTL for foliar late blight resistance in tetraploid potato with tropical adaptation. G3-GENES GENOMES GENETICS 2021; 11:6342414. [PMID: 34549785 PMCID: PMC8527470 DOI: 10.1093/g3journal/jkab251] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 06/29/2021] [Indexed: 11/18/2022]
Abstract
The identification of environmentally stable and globally predictable resistance to potato late blight is challenged by the clonal and polyploid nature of the crop and the rapid evolution of the pathogen. A diversity panel of tetraploid potato germplasm bred for multiple resistance and quality traits was genotyped by genotyping by sequencing (GBS) and evaluated for late blight resistance in three countries where the International Potato Center (CIP) has established breeding work. Health-indexed, in vitro plants of 380 clones and varieties were distributed from CIP headquarters and tuber seed was produced centrally in Peru, China, and Ethiopia. Phenotypes were recorded following field exposure to local isolates of Phytophthora infestans. QTL explaining resistance in four experiments conducted across the three countries were identified in chromosome IX, and environment-specific QTL were found in chromosomes III, V, and X. Different genetic models were evaluated for prediction ability to identify best performing germplasm in each and all environments. The best prediction ability (0.868) was identified with the genomic best linear unbiased predictors (GBLUPs) when using the diploid marker data and QTL-linked markers as fixed effects. Genotypes with high levels of resistance in all environments were identified from the B3, LBHT, and B3-LTVR populations. The results show that many of the advanced clones bred in Peru for high levels of late blight resistance maintain their resistance in Ethiopia and China, suggesting that the centralized selection strategy has been largely successful.
Collapse
Affiliation(s)
| | | | - Paula Unger
- International Potato Center, CIP, Lima 15024, Peru
| | | | - Xianping Li
- ndustrial Crops Research Institute, Yunnan Academy of Agricultural Sciences (YAAS), 2238 Beijing Road, Kunming, Yunnan 650205, P.R. China
| | - Zhechao Pan
- ndustrial Crops Research Institute, Yunnan Academy of Agricultural Sciences (YAAS), 2238 Beijing Road, Kunming, Yunnan 650205, P.R. China
| | - Qinjun Sui
- ndustrial Crops Research Institute, Yunnan Academy of Agricultural Sciences (YAAS), 2238 Beijing Road, Kunming, Yunnan 650205, P.R. China
| | | | - Gebremedhin Woldegjorgis
- Ethiopian Institute of Agricultural Research, (EIAR), Holetta Agricultural research Center. P.O. Box 31, West Showa Zone, Oromia Region, Ethiopia
| | - Kassaye Negash
- Ethiopian Institute of Agricultural Research, (EIAR), Holetta Agricultural research Center. P.O. Box 31, West Showa Zone, Oromia Region, Ethiopia
| | - Ibrahim Seid
- Ethiopian Institute of Agricultural Research, (EIAR), Holetta Agricultural research Center. P.O. Box 31, West Showa Zone, Oromia Region, Ethiopia
| | - Betaw Hirut
- CIP Ethiopia, c/o ILRI Ethiopia P.O. Box 5689, Addis Ababa, Ethiopia
| | | | - Jose De Vega
- Earlham Institute (EI), Norwich Research Park, Norwich NR4 7UZ, UK
| | | |
Collapse
|
8
|
Liao Y, Voorrips RE, Bourke PM, Tumino G, Arens P, Visser RGF, Smulders MJM, Maliepaard C. Using probabilistic genotypes in linkage analysis of polyploids. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:2443-2457. [PMID: 34032878 PMCID: PMC8277618 DOI: 10.1007/s00122-021-03834-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 04/10/2021] [Indexed: 05/21/2023]
Abstract
KEY MESSAGE In polyploids, linkage mapping is carried out using genotyping with discrete dosage scores. Here, we use probabilistic genotypes and we validate it for the construction of polyploid linkage maps. Marker genotypes are generally called as discrete values: homozygous versus heterozygous in the case of diploids, or an integer allele dosage in the case of polyploids. Software for linkage map construction and/or QTL analysis usually relies on such discrete genotypes. However, it may not always be possible, or desirable, to assign definite values to genotype observations in the presence of uncertainty in the genotype calling. Here, we present an approach that uses probabilistic marker dosages for linkage map construction in polyploids. We compare our method to an approach based on discrete dosages, using simulated SNP array and sequence reads data with varying levels of data quality. We validate our approach using experimental data from a potato (Solanum tuberosum L.) SNP array applied to an F1 mapping population. In comparison to the approach based on discrete dosages, we mapped an additional 562 markers. All but three of these were mapped to the expected chromosome and marker position. For the remaining three markers, no physical position was known. The use of dosage probabilities is of particular relevance for map construction in polyploids using sequencing data, as these often result in a higher level of uncertainty regarding allele dosage.
Collapse
Affiliation(s)
- Yanlin Liao
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Roeland E Voorrips
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Peter M Bourke
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Giorgio Tumino
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Paul Arens
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Richard G F Visser
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Marinus J M Smulders
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Chris Maliepaard
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands.
| |
Collapse
|
9
|
Simeão RM, Resende MDV, Alves RS, Pessoa-Filho M, Azevedo ALS, Jones CS, Pereira JF, Machado JC. Genomic Selection in Tropical Forage Grasses: Current Status and Future Applications. FRONTIERS IN PLANT SCIENCE 2021; 12:665195. [PMID: 33995461 PMCID: PMC8120112 DOI: 10.3389/fpls.2021.665195] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 04/06/2021] [Indexed: 05/06/2023]
Abstract
The world population is expected to be larger and wealthier over the next few decades and will require more animal products, such as milk and beef. Tropical regions have great potential to meet this growing global demand, where pasturelands play a major role in supporting increased animal production. Better forage is required in consonance with improved sustainability as the planted area should not increase and larger areas cultivated with one or a few forage species should be avoided. Although, conventional tropical forage breeding has successfully released well-adapted and high-yielding cultivars over the last few decades, genetic gains from these programs have been low in view of the growing food demand worldwide. To guarantee their future impact on livestock production, breeding programs should leverage genotyping, phenotyping, and envirotyping strategies to increase genetic gains. Genomic selection (GS) and genome-wide association studies play a primary role in this process, with the advantage of increasing genetic gain due to greater selection accuracy, reduced cycle time, and increased number of individuals that can be evaluated. This strategy provides solutions to bottlenecks faced by conventional breeding methods, including long breeding cycles and difficulties to evaluate complex traits. Initial results from implementing GS in tropical forage grasses (TFGs) are promising with notable improvements over phenotypic selection alone. However, the practical impact of GS in TFG breeding programs remains unclear. The development of appropriately sized training populations is essential for the evaluation and validation of selection markers based on estimated breeding values. Large panels of single-nucleotide polymorphism markers in different tropical forage species are required for multiple application targets at a reduced cost. In this context, this review highlights the current challenges, achievements, availability, and development of genomic resources and statistical methods for the implementation of GS in TFGs. Additionally, the prediction accuracies from recent experiments and the potential to harness diversity from genebanks are discussed. Although, GS in TFGs is still incipient, the advances in genomic tools and statistical models will speed up its implementation in the foreseeable future. All TFG breeding programs should be prepared for these changes.
Collapse
Affiliation(s)
| | | | - Rodrigo S. Alves
- Instituto Nacional de Ciência e Tecnologia do Café, Universidade Federal de Viçosa, Viçosa, Brazil
| | | | | | - Chris S. Jones
- International Livestock Research Institute, Nairobi, Kenya
| | | | | |
Collapse
|
10
|
Gerard D. Pairwise linkage disequilibrium estimation for polyploids. Mol Ecol Resour 2021; 21:1230-1242. [PMID: 33559321 DOI: 10.1111/1755-0998.13349] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 01/18/2021] [Accepted: 02/01/2021] [Indexed: 12/31/2022]
Abstract
Many tasks in statistical genetics involve pairwise estimation of linkage disequilibrium (LD). The study of LD in diploids is mature. However, in polyploids, the field lacks a comprehensive characterization of LD. Polyploids also exhibit greater levels of genotype uncertainty than diploids, yet no methods currently exist to estimate LD in polyploids in the presence of such genotype uncertainty. Furthermore, most LD estimation methods do not quantify the level of uncertainty in their LD estimates. Our study contains three major contributions. (i) We characterize haplotypic and composite measures of LD in polyploids. These composite measures of LD turn out to be functions of common statistical measures of association. (ii) We derive procedures to estimate haplotypic and composite LD in polyploids in the presence of genotype uncertainty. We do this by estimating LD directly from genotype likelihoods, which may be obtained from many genotyping platforms. (iii) We derive standard errors of all LD estimators that we discuss. We validate our methods on both real and simulated data. Our methods are implemented in the R package ldsep, available on the Comprehensive R Archive Network https://cran.r-project.org/package=ldsep.
Collapse
Affiliation(s)
- David Gerard
- Department of Mathematics and Statistics, American University, Washington, DC, USA
| |
Collapse
|