1
|
Fu Y, Yao M, Qiu P, Song M, Ni X, Niu E, Shi J, Wang T, Zhang Y, Yu H, Qian L. Identification of transcription factor BnHDG4-A08 as a novel candidate associated with the accumulation of oleic, linoleic, linolenic, and erucic acid in Brassica napus. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:243. [PMID: 39352575 DOI: 10.1007/s00122-024-04733-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Accepted: 08/24/2024] [Indexed: 10/03/2024]
Abstract
KEY MESSAGE We screened 47 significantly associated haplotype blocks for oleic, linoleic, linolenic, and erucic acid, with 17 blocks influencing multiple traits. A novel candidate of transcription factor BnHDG4 A08 influencing oleic, linoleic, linolenic, and erucic acid was identified, by a joint strategy of haplotype-based genome-wide association study, genomic resequencing, gene cloning, and co-expression network Fatty acid (FA) composition determines the quality and economic value of rapeseed oil (Brassica napus). However, the molecular network of FAs is unclear. In the current study, multi-strategies of haplotype-based genome-wide association study (GWAS), genomic resequencing, gene cloning, and co-expression network were joint to reveal novel genetic factors influencing FA accumulation in rapeseed. We identified 47 significantly associated haplotype blocks for oleic, linoleic, linolenic, and erucic acid, with 17 blocks influencing multiple traits, using a haplotype-based GWAS with phenotype data from 203 Chinese semi-winter accessions. A total of 61 rapeseed orthologs involved in acyl-lipid metabolism, carbohydrate metabolism, or photosynthesis were identified in these 17 blocks. Among these genes, BnHDG4-A08, encoding a class IV homeodomain leucine-zipper transcription factor, exhibited two single-nucleotide polymorphisms (SNPs) in the exon and intron, with significant associations with oleic, linoleic, linolenic, and erucic acid. Gene cloning further validated two SNPs in the exon of BnHDG4-A08 in a population with 75 accessions, leading to two amino acid changes (T372A and P366L) and significant variation of oleic, linoleic, linolenic, and erucic acid. A competitive allele-specific PCR (KASP) marker based on the SNPs was successfully developed and validated. Moreover, 98 genes exhibiting direct interconnections and high weight values with BnHDG4-A08 were identified through co-expression network analysis using transcriptome data from 13 accessions. Our study identified a novel FA candidate of transcription factor BnHDG4-A08 influencing oleic, linoleic, linolenic, and erucic acid. This gene provides a potential promising gene resource for the novel mechanistic understanding of transcription factors regulating FA accumulation.
Collapse
Affiliation(s)
- Ying Fu
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Min Yao
- College of Agronomy, Hunan Agricultural University, Changsha, China
| | - Ping Qiu
- College of Agronomy, Hunan Agricultural University, Changsha, China
| | - Maolin Song
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
- College of Advanced Agricultural Sciences, Zhejiang A&F University, Hangzhou, China
| | - Xiyuan Ni
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Erli Niu
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Jianghua Shi
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Tanliu Wang
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Yaofeng Zhang
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China
| | - Huasheng Yu
- Institute of Crop and Nuclear Technology Utilization, Zhejiang Academy of Agricultural Sciences, Hangzhou, China.
| | - Lunwen Qian
- College of Agronomy, Hunan Agricultural University, Changsha, China.
| |
Collapse
|
2
|
Sivabharathi RC, Rajagopalan VR, Suresh R, Sudha M, Karthikeyan G, Jayakanthan M, Raveendran M. Haplotype-based breeding: A new insight in crop improvement. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2024; 346:112129. [PMID: 38763472 DOI: 10.1016/j.plantsci.2024.112129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/09/2024] [Accepted: 05/15/2024] [Indexed: 05/21/2024]
Abstract
Haplotype-based breeding (HBB) is one of the cutting-edge technologies in the realm of crop improvement due to the increasing availability of Single Nucleotide Polymorphisms identified by Next Generation Sequencing technologies. The complexity of the data can be decreased with fewer statistical tests and a lower probability of spurious associations by combining thousands of SNPs into a few hundred haplotype blocks. The presence of strong genomic regions in breeding lines of most crop species facilitates the use of haplotypes to improve the efficiency of genomic and marker-assisted selection. Haplotype-based breeding as a Genomic Assisted Breeding (GAB) approach harnesses the genome sequence data to pinpoint the allelic variation used to hasten the breeding cycle and circumvent the challenges associated with linkage drag. This review article demonstrates ways to identify candidate genes, superior haplotype identification, haplo-pheno analysis, and haplotype-based marker-assisted selection. The crop improvement strategies that utilize superior haplotypes will hasten the breeding progress to safeguard global food security.
Collapse
Affiliation(s)
- R C Sivabharathi
- Department of Genetics and Plant breeding, CPBG, Tamil Nadu Agricultural University, Coimbatore 641003, India
| | - Veera Ranjani Rajagopalan
- Department of Plant Biotechnology, Centre for Plant Molecular Biology and Biotechnology, Tamil Nadu Agricultural University, Coimbatore, 641003, India
| | - R Suresh
- Department of Rice, CPBG, Tamil Nadu Agricultural University, Coimbatore 641003, India
| | - M Sudha
- Department of Plant Biotechnology, Centre for Plant Molecular Biology and Biotechnology, Tamil Nadu Agricultural University, Coimbatore, 641003, India.
| | - G Karthikeyan
- Department of Plant Pathology, CPPS, Tamil Nadu Agricultural University, Coimbatore 641003, India
| | - M Jayakanthan
- Department of Plant Molecular Biology and Bioinformatics, Centre for Plant Molecular Biology and Biotechnology, Tamil Nadu Agricultural University, Coimbatore 641003, India
| | - M Raveendran
- Directorate of research, Tamil Nadu Agricultural University, Coimbatore 641003, India.
| |
Collapse
|
3
|
Weber SE, Roscher-Ehrig L, Kox T, Abbadi A, Stahl A, Snowdon RJ. Genomic prediction in Brassica napus: evaluating the benefit of imputed whole-genome sequencing data. Genome 2024; 67:210-222. [PMID: 38708850 DOI: 10.1139/gen-2023-0126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2024]
Abstract
Advances in sequencing technology allow whole plant genomes to be sequenced with high quality. Combining genotypic and phenotypic data in genomic prediction helps breeders to select crossing partners in partially phenotyped populations. In plant breeding programs, the cost of sequencing entire breeding populations still exceeds available genotyping budgets. Hence, the method for genotyping is still mainly single nucleotide polymorphism (SNP) arrays; however, arrays are unable to assess the entire genome- and population-wide diversity. A compromise involves genotyping the entire population using an SNP array and a subset of the population with whole-genome sequencing. Both datasets can then be used to impute markers from whole-genome sequencing onto the entire population. Here, we evaluate whether imputation of whole-genome sequencing data enhances genomic predictions, using data from a nested association mapping population of rapeseed (Brassica napus). Employing two cross-validation schemes that mimic scenarios for the prediction of close and distant relatives, we show that imputed marker data do not significantly improve prediction accuracy, likely due to redundancy in relationship estimates and imputation errors. In simulation studies, only small improvements were observed, further corroborating the findings. We conclude that SNP arrays are already equipped with the information that is added by imputation through relationship and linkage disequilibrium.
Collapse
Affiliation(s)
- Sven E Weber
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Lennard Roscher-Ehrig
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | | | | | - Andreas Stahl
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants, Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| |
Collapse
|
4
|
Lin YC, Mayer M, Valle Torres D, Pook T, Hölker AC, Presterl T, Ouzunova M, Schön CC. Genomic prediction within and across maize landrace derived populations using haplotypes. FRONTIERS IN PLANT SCIENCE 2024; 15:1351466. [PMID: 38584949 PMCID: PMC10995330 DOI: 10.3389/fpls.2024.1351466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/05/2024] [Indexed: 04/09/2024]
Abstract
Genomic prediction (GP) using haplotypes is considered advantageous compared to GP solely reliant on single nucleotide polymorphisms (SNPs), owing to haplotypes' enhanced ability to capture ancestral information and their higher linkage disequilibrium with quantitative trait loci (QTL). Many empirical studies supported the advantages of haplotype-based GP over SNP-based approaches. Nevertheless, the performance of haplotype-based GP can vary significantly depending on multiple factors, including the traits being studied, the genetic structure of the population under investigation, and the particular method employed for haplotype construction. In this study, we compared haplotype and SNP based prediction accuracies in four populations derived from European maize landraces. Populations comprised either doubled haploid lines (DH) derived directly from landraces, or gamete capture lines (GC) derived from crosses of the landraces with an inbred line. For two different landraces, both types of populations were generated, genotyped with 600k SNPs and phenotyped as lines per se for five traits. Our study explores three prediction scenarios: (i) within each of the four populations, (ii) across DH and GC populations from the same landrace, and (iii) across landraces using either DH or GC populations. Three haplotype construction methods were evaluated: 1. fixed-window blocks (FixedHB), 2. LD-based blocks (HaploView), and 3. IBD-based blocks (HaploBlocker). In within population predictions, FixedHB and HaploView methods performed as well as or slightly better than SNPs for all traits. HaploBlocker improved accuracy for certain traits but exhibited inferior performance for others. In prediction across populations, the parameter setting from HaploBlocker which controls the construction of shared haplotypes between populations played a crucial role for obtaining optimal results. When predicting across landraces, accuracies were low for both, SNP and haplotype approaches, but for specific traits substantial improvement was observed with HaploBlocker. This study provides recommendations for optimal haplotype construction and identifies relevant parameters for constructing haplotypes in the context of genomic prediction.
Collapse
Affiliation(s)
- Yan-Cheng Lin
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Manfred Mayer
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- Bayer CropScience Deutschland GmbH, Borken, Germany
| | - Daniel Valle Torres
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- Sugar Beet Breeding, Strube Research GmbH & Co. KG, Söllingen, Germany
| | - Torsten Pook
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, Netherlands
| | - Armin C. Hölker
- Product Development Maize and Oil Crops, KWS SAAT SE & Co. KGaA, Einbeck, Germany
| | - Thomas Presterl
- Product Development Maize and Oil Crops, KWS SAAT SE & Co. KGaA, Einbeck, Germany
| | - Milena Ouzunova
- Product Development Maize and Oil Crops, KWS SAAT SE & Co. KGaA, Einbeck, Germany
| | - Chris-Carolin Schön
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| |
Collapse
|
5
|
de Oliveira LF, Brito LF, Marques DBD, da Silva DA, Lopes PS, Dos Santos CG, Johnson JS, Veroneze R. Investigating the impact of non-additive genetic effects in the estimation of variance components and genomic predictions for heat tolerance and performance traits in crossbred and purebred pig populations. BMC Genom Data 2023; 24:76. [PMID: 38093199 PMCID: PMC10717470 DOI: 10.1186/s12863-023-01174-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/13/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Non-additive genetic effects are often ignored in livestock genetic evaluations. However, fitting them in the models could improve the accuracy of genomic breeding values. Furthermore, non-additive genetic effects contribute to heterosis, which could be optimized through mating designs. Traits related to fitness and adaptation, such as heat tolerance, tend to be more influenced by non-additive genetic effects. In this context, the primary objectives of this study were to estimate variance components and assess the predictive performance of genomic prediction of breeding values based on alternative models and two independent datasets, including performance records from a purebred pig population and heat tolerance indicators recorded in crossbred lactating sows. RESULTS Including non-additive genetic effects when modelling performance traits in purebred pigs had no effect on the residual variance estimates for most of the traits, but lower additive genetic variances were observed, especially when additive-by-additive epistasis was included in the models. Furthermore, including non-additive genetic effects did not improve the prediction accuracy of genomic breeding values, but there was animal re-ranking across the models. For the heat tolerance indicators recorded in a crossbred population, most traits had small non-additive genetic variance with large standard error estimates. Nevertheless, panting score and hair density presented substantial additive-by-additive epistatic variance. Panting score had an epistatic variance estimate of 0.1379, which accounted for 82.22% of the total genetic variance. For hair density, the epistatic variance estimates ranged from 0.1745 to 0.1845, which represent 64.95-69.59% of the total genetic variance. CONCLUSIONS Including non-additive genetic effects in the models did not improve the accuracy of genomic breeding values for performance traits in purebred pigs, but there was substantial re-ranking of selection candidates depending on the model fitted. Except for panting score and hair density, low non-additive genetic variance estimates were observed for heat tolerance indicators in crossbred pigs.
Collapse
Affiliation(s)
- Letícia Fernanda de Oliveira
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil.
- Department of Animal Sciences, Purdue University, West Lafayette, IN, USA.
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, USA
| | | | | | - Paulo Sávio Lopes
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil
| | | | - Jay S Johnson
- USDA-ARS Livestock Behavior Research Unit, West Lafayette, IN, USA
| | - Renata Veroneze
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil
| |
Collapse
|
6
|
Singh V, Krause M, Sandhu D, Sekhon RS, Kaundal A. Salinity stress tolerance prediction for biomass-related traits in maize (Zea mays L.) using genome-wide markers. THE PLANT GENOME 2023; 16:e20385. [PMID: 37667417 DOI: 10.1002/tpg2.20385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 07/18/2023] [Accepted: 08/14/2023] [Indexed: 09/06/2023]
Abstract
Maize (Zea mays L.) is the third most important cereal crop after rice (Oryza sativa) and wheat (Triticum aestivum). Salinity stress significantly affects vegetative biomass and grain yield and, therefore, reduces the food and silage productivity of maize. Selecting salt-tolerant genotypes is a cumbersome and time-consuming process that requires meticulous phenotyping. To predict salt tolerance in maize, we estimated breeding values for four biomass-related traits, including shoot length, shoot weight, root length, and root weight under salt-stressed and controlled conditions. A five-fold cross-validation method was used to select the best model among genomic best linear unbiased prediction (GBLUP), ridge-regression BLUP (rrBLUP), extended GBLUP, Bayesian Lasso, Bayesian ridge regression, BayesA, BayesB, and BayesC. Examination of the effect of different marker densities on prediction accuracy revealed that a set of low-density single nucleotide polymorphisms obtained through filtering based on a combination of analysis of variance and linkage disequilibrium provided the best prediction accuracy for all the traits. The average prediction accuracy in cross-validations ranged from 0.46 to 0.77 across the four derived traits. The GBLUP, rrBLUP, and all Bayesian models except BayesB demonstrated comparable levels of prediction accuracy that were superior to the other modeling approaches. These findings provide a roadmap for the deployment and optimization of genomic selection in breeding for salt tolerance in maize.
Collapse
Affiliation(s)
- Vishal Singh
- Plants, Soils, and Climate, College of Agricultural and Applied Sciences, Utah State University, Logan, Utah, USA
- ICAR-Indian Institute of Maize Research, Ludhiana, Punjab, India
| | - Margaret Krause
- Plants, Soils, and Climate, College of Agricultural and Applied Sciences, Utah State University, Logan, Utah, USA
| | - Devinder Sandhu
- US Salinity Laboratory (USDA-ARS), Riverside, California, USA
| | - Rajandeep S Sekhon
- Department of Genetics and Biochemistry, Clemson University, Clemson, South Carolina, USA
| | - Amita Kaundal
- Plants, Soils, and Climate, College of Agricultural and Applied Sciences, Utah State University, Logan, Utah, USA
| |
Collapse
|
7
|
Weber SE, Frisch M, Snowdon RJ, Voss-Fels KP. Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets. FRONTIERS IN PLANT SCIENCE 2023; 14:1217589. [PMID: 37731980 PMCID: PMC10507710 DOI: 10.3389/fpls.2023.1217589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/21/2023] [Indexed: 09/22/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software "Haploview" and "HaploBlocker". The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no "best" method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Kai P. Voss-Fels
- Institute for Grapevine Breeding, Hochschule Geisenheim University, Geisenheim, Germany
| |
Collapse
|
8
|
Difabachew YF, Frisch M, Langstroff AL, Stahl A, Wittkop B, Snowdon RJ, Koch M, Kirchhoff M, Cselényi L, Wolf M, Förster J, Weber S, Okoye UJ, Zenke-Philippi C. Genomic prediction with haplotype blocks in wheat. FRONTIERS IN PLANT SCIENCE 2023; 14:1168547. [PMID: 37229104 PMCID: PMC10203549 DOI: 10.3389/fpls.2023.1168547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 04/17/2023] [Indexed: 05/27/2023]
Abstract
Haplotype blocks might carry additional information compared to single SNPs and have therefore been suggested for use as independent variables in genomic prediction. Studies in different species resulted in more accurate predictions than with single SNPs in some traits but not in others. In addition, it remains unclear how the blocks should be built to obtain the greatest prediction accuracies. Our objective was to compare the results of genomic prediction with different types of haplotype blocks to prediction with single SNPs in 11 traits in winter wheat. We built haplotype blocks from marker data from 361 winter wheat lines based on linkage disequilibrium, fixed SNP numbers, fixed lengths in cM and with the R package HaploBlocker. We used these blocks together with data from single-year field trials in a cross-validation study for predictions with RR-BLUP, an alternative method (RMLA) that allows for heterogeneous marker variances, and GBLUP performed with the software GVCHAP. The greatest prediction accuracies for resistance scores for B. graminis, P. triticina, and F. graminearum were obtained with LD-based haplotype blocks while blocks with fixed marker numbers and fixed lengths in cM resulted in the greatest prediction accuracies for plant height. Prediction accuracies of haplotype blocks built with HaploBlocker were greater than those of the other methods for protein concentration and resistances scores for S. tritici, B. graminis, and P. striiformis. We hypothesize that the trait-dependence is caused by properties of the haplotype blocks that have overlapping and contrasting effects on the prediction accuracy. While they might be able to capture local epistatic effects and to detect ancestral relationships better than single SNPs, prediction accuracy might be reduced by unfavorable characteristics of the design matrices in the models that are due to their multi-allelic nature.
Collapse
Affiliation(s)
| | - Matthias Frisch
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Anna Luise Langstroff
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | - Andreas Stahl
- Institute for Resistance Research and Stress Tolerance, Julius Kühn Institute, Quedlinburg, Germany
| | - Benjamin Wittkop
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | - Rod J. Snowdon
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | | | | | - László Cselényi
- Department of Cereal Breeding, W. von Borries-Eckendorf GmbH & Co. KG, Leopoldshöhe, Germany
| | - Markus Wolf
- German Seed Alliance GmbH, Holtsee, Germany
- Saaten-Union Biotec GmbH, Leopoldshöhe, Germany
| | | | - Sven Weber
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | - Uche Joshua Okoye
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Carola Zenke-Philippi
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| |
Collapse
|
9
|
Alemu A, Batista L, Singh PK, Ceplitis A, Chawade A. Haplotype-tagged SNPs improve genomic prediction accuracy for Fusarium head blight resistance and yield-related traits in wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:92. [PMID: 37009920 PMCID: PMC10068637 DOI: 10.1007/s00122-023-04352-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 03/21/2023] [Indexed: 06/19/2023]
Abstract
Linkage disequilibrium (LD)-based haplotyping with subsequent SNP tagging improved the genomic prediction accuracy up to 0.07 and 0.092 for Fusarium head blight resistance and spike width, respectively, across six different models. Genomic prediction is a powerful tool to enhance genetic gain in plant breeding. However, the method is accompanied by various complications leading to low prediction accuracy. One of the major challenges arises from the complex dimensionality of marker data. To overcome this issue, we applied two pre-selection methods for SNP markers viz. LD-based haplotype-tagging and GWAS-based trait-linked marker identification. Six different models were tested with preselected SNPs to predict the genomic estimated breeding values (GEBVs) of four traits measured in 419 winter wheat genotypes. Ten different sets of haplotype-tagged SNPs were selected by adjusting the level of LD thresholds. In addition, various sets of trait-linked SNPs were identified with different scenarios from the training-test combined and only from the training populations. The BRR and RR-BLUP models developed from haplotype-tagged SNPs had a higher prediction accuracy for FHB and SPW by 0.07 and 0.092, respectively, compared to the corresponding models developed without marker pre-selection. The highest prediction accuracy for SPW and FHB was achieved with tagged SNPs pruned at weak LD thresholds (r2 < 0.5), while stringent LD was required for spike length (SPL) and flag leaf area (FLA). Trait-linked SNPs identified only from training populations failed to improve the prediction accuracy of the four studied traits. Pre-selection of SNPs via LD-based haplotype-tagging could play a vital role in optimizing genomic selection and reducing genotyping costs. Furthermore, the method could pave the way for developing low-cost genotyping methods through customized genotyping platforms targeting key SNP markers tagged to essential haplotype blocks.
Collapse
Affiliation(s)
- Admas Alemu
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - Pawan K Singh
- International Maize and Wheat Improvement Center, Texcoco, Mexico
| | | | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| |
Collapse
|
10
|
Araujo AC, Carneiro PLS, Oliveira HR, Lewis RM, Brito LF. SNP- and haplotype-based single-step genomic predictions for body weight, wool, and reproductive traits in North American Rambouillet sheep. J Anim Breed Genet 2023; 140:216-234. [PMID: 36408677 PMCID: PMC10099590 DOI: 10.1111/jbg.12748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Accepted: 10/23/2022] [Indexed: 11/22/2022]
Abstract
Rambouillet sheep are commonly raised in extensive grazing systems in the US, mainly for wool and meat production. Genomic evaluations in US sheep breeds, including Rambouillet, are still incipient. Therefore, we aimed to evaluate the feasibility of performing genomic prediction of breeding values for various traits in Rambouillet sheep based on single nucleotide polymorphisms (SNP) or haplotypes (fitted as pseudo-SNP) under a single-step GBLUP approach. A total of 28,834 records for birth weight (BWT), 23,306 for postweaning weight (PWT), 5,832 for yearling weight (YWT), 9,880 for yearling fibre diameter (YFD), 11,872 for yearling greasy fleece weight (YGFW), and 15,984 for number of lambs born (NLB) were used in this study. Seven hundred forty-one individuals were genotyped using a moderate (50 K; n = 677) or high (600 K; n = 64) density SNP panel, in which 32 K SNP in common between the two SNP panels (after genotypic quality control) were used for further analyses. Single-step genomic predictions using SNP (H-BLUP) or haplotypes (HAP-BLUP) from blocks with different linkage disequilibrium (LD) thresholds (0.15, 0.35, 0.50, 0.65, and 0.80) were evaluated. We also considered different blending parameters when constructing the genomic relationship matrix used to predict the genomic-enhanced estimated breeding values (GEBV), with alpha equal to 0.95 or 0.50. The GEBV were compared to the estimated breeding values (EBV) obtained from traditional pedigree-based evaluations (A-BLUP). The mean theoretical accuracy ranged from 0.499 (A-BLUP for PWT) to 0.795 (HAP-BLUP using haplotypes from blocks with LD threshold of 0.35 and alpha equal to 0.95 for YFD). The prediction accuracies ranged from 0.143 (A-BLUP for PWT) to 0.330 (A-BLUP for YGFW) while the prediction bias ranged from -0.104 (H-BLUP for PWT) to 0.087 (HAP-BLUP using haplotypes from blocks with LD threshold of 0.15 and alpha equal to 0.95 for YGFW). The GEBV dispersion ranged from 0.428 (A-BLUP for PWT) to 1.035 (A-BLUP for YGFW). Similar results were observed for H-BLUP or HAP-BLUP, independently of the LD threshold to create the haplotypes, alpha value, or trait analysed. Using genomic information (fitting individual SNP or haplotypes) provided similar or higher prediction and theoretical accuracies and reduced the dispersion of the GEBV for body weight, wool, and reproductive traits in Rambouillet sheep. However, there were no clear improvements in the prediction bias when compared to pedigree-based predictions. The next step will be to enlarge the training populations for this breed to increase the benefits of genomic predictions.
Collapse
Affiliation(s)
- Andre C. Araujo
- Graduate Program in Animal SciencesState University of Southwestern BahiaItapetingaBahiaBrazil
- Department of Animal SciencesPurdue UniversityWest LafayetteIndianaUSA
| | | | | | - Ronald M. Lewis
- Department of Animal SciencesUniversity of Nebraska‐LincolnLincolnNebraskaUSA
| | - Luiz F. Brito
- Department of Animal SciencesPurdue UniversityWest LafayetteIndianaUSA
| |
Collapse
|
11
|
Ye H, Xu Z, Bello SF, Zhu Q, Kong S, Zheng M, Fang X, Jia X, Xu H, Zhang X, Nie Q. Haplotype analysis of genomic prediction by incorporating genomic pathway information based on high-density SNP marker in Chinese yellow-feathered chicken. Poult Sci 2023; 102:102549. [PMID: 36907129 PMCID: PMC10024239 DOI: 10.1016/j.psj.2023.102549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 01/16/2023] [Accepted: 01/27/2023] [Indexed: 02/09/2023] Open
Abstract
Genomic selection using single nucleotide polymorphism (SNP) markers is now intensively investigated in breeding and has been widely utilized for genetic improvement. Currently, several studies have used haplotype (consisting of multiallelic SNPs) for genomic prediction and revealed its performance advantage. In this study, we comprehensively evaluated the performance of haplotype models for genomic prediction in 15 traits, including 6 growth, 5 carcass, and 4 feeding traits in a Chinese yellow-feathered chicken population. We adopted 3 methods to define haplotypes from high-density SNP panels, and our strategy included combining Kyoto Encyclopedia of Genes and Genomes pathway information and considering linkage disequilibrium (LD) information. Our results showed an increase in prediction accuracy due to haplotypes ranging from -0.04∼27.16% in all traits, where the significant improvements were found in 12 traits. The estimates of haplotype epistasis heritability were strongly correlated with the accuracy increase by haplotype models. In addition, incorporating genomic annotation information could further increase the accuracy of the haplotype model, where the further increase in accuracy is significantly relative to the increase of relative haplotype epistasis heritability. The genomic prediction using LD information for constructing haplotypes has the best prediction performance among the 4 traits. These results uncovered that haplotype methods were beneficial for genomic prediction, and the accuracy could be further increased by incorporating genomic annotation information. Moreover, using LD information would potentially improve the performance of genomic prediction.
Collapse
Affiliation(s)
- Haoqiang Ye
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Zhenqiang Xu
- Wen's Nanfang Poultry Breeding Co. Ltd, Guangdong Province, Yunfu 527400, China
| | - Semiu Folaniyi Bello
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Qianghui Zhu
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China
| | - Shaofen Kong
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Ming Zheng
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Xiang Fang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Xinzheng Jia
- Guangdong Provincial Key Laboratory of Animal Molecular Design and Precise Breeding, Foshan University, Foshan, 528225 China
| | - Haiping Xu
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Xiquan Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China
| | - Qinghua Nie
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642 China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, 510642 China.
| |
Collapse
|
12
|
Elgart M, Lyons G, Romero-Brufau S, Kurniansyah N, Brody JA, Guo X, Lin HJ, Raffield L, Gao Y, Chen H, de Vries P, Lloyd-Jones DM, Lange LA, Peloso GM, Fornage M, Rotter JI, Rich SS, Morrison AC, Psaty BM, Levy D, Redline S, Sofer T. Non-linear machine learning models incorporating SNPs and PRS improve polygenic prediction in diverse human populations. Commun Biol 2022; 5:856. [PMID: 35995843 PMCID: PMC9395509 DOI: 10.1038/s42003-022-03812-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Accepted: 08/05/2022] [Indexed: 01/03/2023] Open
Abstract
Polygenic risk scores (PRS) are commonly used to quantify the inherited susceptibility for a trait, yet they fail to account for non-linear and interaction effects between single nucleotide polymorphisms (SNPs). We address this via a machine learning approach, validated in nine complex phenotypes in a multi-ancestry population. We use an ensemble method of SNP selection followed by gradient boosted trees (XGBoost) to allow for non-linearities and interaction effects. We compare our results to the standard, linear PRS model developed using PRSice, LDpred2, and lassosum2. Combining a PRS as a feature in an XGBoost model results in a relative increase in the percentage variance explained compared to the standard linear PRS model by 22% for height, 27% for HDL cholesterol, 43% for body mass index, 50% for sleep duration, 58% for systolic blood pressure, 64% for total cholesterol, 66% for triglycerides, 77% for LDL cholesterol, and 100% for diastolic blood pressure. Multi-ancestry trained models perform similarly to specific racial/ethnic group trained models and are consistently superior to the standard linear PRS models. This work demonstrates an effective method to account for non-linearities and interaction effects in genetics-based prediction models.
Collapse
Affiliation(s)
- Michael Elgart
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
| | - Genevieve Lyons
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Santiago Romero-Brufau
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Medicine, Mayo Clinic, Rochester, MN, USA
| | - Nuzulul Kurniansyah
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
| | - Jennifer A Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Henry J Lin
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Laura Raffield
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Yan Gao
- The Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA
| | - Han Chen
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Paul de Vries
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | | | - Leslie A Lange
- Department of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, CO, USA
| | - Gina M Peloso
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Myriam Fornage
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Jerome I Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Seattle, WA, USA
| | - Daniel Levy
- The Population Sciences Branch of the National Heart, Lung and Blood Institute, Bethesda, MD, USA
- The Framingham Heart Study, Framingham, MA, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Tamar Sofer
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| |
Collapse
|
13
|
Ye H, Zhang Z, Ren D, Cai X, Zhu Q, Ding X, Zhang H, Zhang Z, Li J. Genomic Prediction Using LD-Based Haplotypes in Combined Pig Populations. Front Genet 2022; 13:843300. [PMID: 35754827 PMCID: PMC9218795 DOI: 10.3389/fgene.2022.843300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2021] [Accepted: 05/02/2022] [Indexed: 11/13/2022] Open
Abstract
The size of reference population is an important factor affecting genomic prediction. Thus, combining different populations in genomic prediction is an attractive way to improve prediction ability. However, combining multireference population roughly cannot increase the prediction accuracy as well as expected in pig. This may be due to different linkage disequilibrium (LD) pattern differences between population. In this study, we used the imputed whole-genome sequencing (WGS) data to construct LD-based haplotypes for genomic prediction in combined population to explore the impact of different single-nucleotide polymorphism (SNP) densities, variant representation (SNPs or haplotype alleles), and reference population size on the prediction accuracy for reproduction traits. Our results showed that genomic best linear unbiased prediction (GBLUP) using the WGS data can improve prediction accuracy in multi-population but not within-population. Not only the genomic prediction accuracy of the haplotype method using 80 K chip data in multi-population but also GBLUP for the multi-population (3.4–5.9%) was higher than that within-population (1.2–4.3%). More importantly, we have found that using the haplotype method based on the WGS data in multi-population has better genomic prediction performance, and our results showed that building haploblock in this scenario based on low LD threshold (r2 = 0.2–0.3) produced an optimal set of variables for reproduction traits in Yorkshire pig population. Our results suggested that whether the use of the haplotype method based on the chip data or GBLUP (individual SNP method) based on the WGS data were beneficial for genomic prediction in multi-population, while simultaneously combining the haplotype method and WGS data was a better strategy for multi-population genomic evaluation.
Collapse
Affiliation(s)
- Haoqiang Ye
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zipeng Zhang
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Duanyang Ren
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiaodian Cai
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Qianghui Zhu
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hao Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zhe Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
14
|
Muvunyi BP, Zou W, Zhan J, He S, Ye G. Multi-Trait Genomic Prediction Models Enhance the Predictive Ability of Grain Trace Elements in Rice. Front Genet 2022; 13:883853. [PMID: 35812754 PMCID: PMC9257107 DOI: 10.3389/fgene.2022.883853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 05/05/2022] [Indexed: 11/13/2022] Open
Abstract
Multi-trait (MT) genomic prediction models enable breeders to save phenotyping resources and increase the prediction accuracy of unobserved target traits by exploiting available information from non-target or auxiliary traits. Our study evaluated different MT models using 250 rice accessions from Asian countries genotyped and phenotyped for grain content of zinc (Zn), iron (Fe), copper (Cu), manganese (Mn), and cadmium (Cd). The predictive performance of MT models compared to a traditional single trait (ST) model was assessed by 1) applying different cross-validation strategies (CV1, CV2, and CV3) inferring varied phenotyping patterns and budgets; 2) accounting for local epistatic effects along with the main additive effect in MT models; and 3) using a selective marker panel composed of trait-associated SNPs in MT models. MT models were not statistically significantly (p < 0.05) superior to ST model under CV1, where no phenotypic information was available for the accessions in the test set. After including phenotypes from auxiliary traits in both training and test sets (MT-CV2) or simply in the test set (MT-CV3), MT models significantly (p < 0.05) outperformed ST model for all the traits. The highest increases in the predictive ability of MT models relative to ST models were 11.1% (Mn), 11.5 (Cd), 33.3% (Fe), 95.2% (Cu) and 126% (Zn). Accounting for the local epistatic effects using a haplotype-based model further improved the predictive ability of MT models by 4.6% (Cu), 3.8% (Zn), and 3.5% (Cd) relative to MT models with only additive effects. The predictive ability of the haplotype-based model was not improved after optimizing the marker panel by only considering the markers associated with the traits. This study first assessed the local epistatic effects and marker optimization strategies in the MT genomic prediction framework and then illustrated the power of the MT model in predicting trace element traits in rice for the effective use of genetic resources to improve the nutritional quality of rice grain.
Collapse
Affiliation(s)
- Blaise Pascal Muvunyi
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Wenli Zou
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Junhui Zhan
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Sang He
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- *Correspondence: Sang He, ; Guoyou Ye,
| | - Guoyou Ye
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute in Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Rice Breeding Innovations Platform, International Rice Research Institute, Los Baños, Philippines
- *Correspondence: Sang He, ; Guoyou Ye,
| |
Collapse
|
15
|
Varona L, Legarra A, Toro MA, Vitezica ZG. Genomic Prediction Methods Accounting for Nonadditive Genetic Effects. Methods Mol Biol 2022; 2467:219-243. [PMID: 35451778 DOI: 10.1007/978-1-0716-2205-6_8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The use of genomic information for prediction of future phenotypes or breeding values for the candidates to selection has become a standard over the last decade. However, most procedures for genomic prediction only consider the additive (or substitution) effects associated with polymorphic markers. Nevertheless, the implementation of models that consider nonadditive genetic variation may be interesting because they (1) may increase the ability of prediction, (2) can be used to define mate allocation procedures in plant and animal breeding schemes, and (3) can be used to benefit from nonadditive genetic variation in crossbreeding or purebred breeding schemes. This study reviews the available methods for incorporating nonadditive effects into genomic prediction procedures and their potential applications in predicting future phenotypic performance, mate allocation, and crossbred and purebred selection. Finally, a brief outline of some future research lines is also proposed.
Collapse
Affiliation(s)
- Luis Varona
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain.
- Instituto Agroalimentario de Aragón (IA2), Zaragoza, Spain.
| | | | - Miguel A Toro
- Dpto. Producción Agraria, ETS Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, Madrid, Spain
| | | |
Collapse
|
16
|
Araujo AC, Carneiro PLS, Alvarenga AB, Oliveira HR, Miller SP, Retallick K, Brito LF. Haplotype-Based Single-Step GWAS for Yearling Temperament in American Angus Cattle. Genes (Basel) 2021; 13:17. [PMID: 35052358 PMCID: PMC8775055 DOI: 10.3390/genes13010017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 12/14/2021] [Accepted: 12/18/2021] [Indexed: 01/23/2023] Open
Abstract
Behavior is a complex trait and, therefore, understanding its genetic architecture is paramount for the development of effective breeding strategies. The objective of this study was to perform traditional and weighted single-step genome-wide association studies (ssGWAS and WssGWAS, respectively) for yearling temperament (YT) in North American Angus cattle using haplotypes. Approximately 266 K YT records and 70 K animals genotyped using a 50 K single nucleotide polymorphisms (SNP) panel were used. Linkage disequilibrium thresholds (LD) of 0.15, 0.50, and 0.80 were used to create the haploblocks, and the inclusion of non-LD-clustered SNPs (NCSNP) with the haplotypes in the genomic models was also evaluated. WssGWAS did not perform better than ssGWAS. Cattle YT was found to be a highly polygenic trait, with genes and quantitative trait loci (QTL) broadly distributed across the whole genome. Association studies using LD-based haplotypes should include NCSNPs and different LD thresholds to increase the likelihood of finding the relevant genomic regions affecting the trait of interest. The main candidate genes identified, i.e., ATXN10, ADAM10, VAX2, ATP6V1B1, CRISPLD1, CAPRIN1, FA2H, SPEF2, PLXNA1, and CACNA2D3, are involved in important biological processes and metabolic pathways related to behavioral traits, social interactions, and aggressiveness in cattle. Future studies should further investigate the role of these candidate genes.
Collapse
Affiliation(s)
- Andre C. Araujo
- Graduate Program in Animal Sciences, State University of Southwestern Bahia, Itapetinga 45700-000, Brazil;
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
| | - Paulo L. S. Carneiro
- Department of Biology, State University of Southwest Bahia, Jequié 45205-490, Brazil;
| | - Amanda B. Alvarenga
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
| | - Hinayah R. Oliveira
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G2W1, Canada
| | - Stephen P. Miller
- American Angus Association, Angus Genetics Inc., 3201 Frederick Ave, St. Joseph, MO 64506, USA; (S.P.M.); (K.R.)
| | - Kelli Retallick
- American Angus Association, Angus Genetics Inc., 3201 Frederick Ave, St. Joseph, MO 64506, USA; (S.P.M.); (K.R.)
| | - Luiz F. Brito
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
| |
Collapse
|
17
|
Bhat JA, Yu D, Bohra A, Ganie SA, Varshney RK. Features and applications of haplotypes in crop breeding. Commun Biol 2021; 4:1266. [PMID: 34737387 PMCID: PMC8568931 DOI: 10.1038/s42003-021-02782-y] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 10/09/2021] [Indexed: 12/17/2022] Open
Abstract
Climate change with altered pest-disease dynamics and rising abiotic stresses threatens resource-constrained agricultural production systems worldwide. Genomics-assisted breeding (GAB) approaches have greatly contributed to enhancing crop breeding efficiency and delivering better varieties. Fast-growing capacity and affordability of DNA sequencing has motivated large-scale germplasm sequencing projects, thus opening exciting avenues for mining haplotypes for breeding applications. This review article highlights ways to mine haplotypes and apply them for complex trait dissection and in GAB approaches including haplotype-GWAS, haplotype-based breeding, haplotype-assisted genomic selection. Improvement strategies that efficiently deploy superior haplotypes to hasten breeding progress will be key to safeguarding global food security.
Collapse
Affiliation(s)
- Javaid Akhter Bhat
- National Center for Soybean Improvement, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Deyue Yu
- National Center for Soybean Improvement, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Abhishek Bohra
- Crop Improvement Division, ICAR- Indian Institute of Pulses Research (ICAR- IIPR), Kanpur, India
| | - Showkat Ahmad Ganie
- Department of Biotechnology, Visva-Bharati, Santiniketan, 731235, WB, India.
| | - Rajeev K Varshney
- Center of Excellence in Genomics & Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, 502324, India.
- State Agricultural Biotechnology Centre, Centre for Crop & Food Research Innovation, Food Futures Institute, Murdoch University, Murdoch, WA, Australia.
| |
Collapse
|
18
|
Araujo AC, Carneiro PLS, Oliveira HR, Schenkel FS, Veroneze R, Lourenco DAL, Brito LF. A Comprehensive Comparison of Haplotype-Based Single-Step Genomic Predictions in Livestock Populations With Different Genetic Diversity Levels: A Simulation Study. Front Genet 2021; 12:729867. [PMID: 34721524 PMCID: PMC8551834 DOI: 10.3389/fgene.2021.729867] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 09/07/2021] [Indexed: 11/13/2022] Open
Abstract
The level of genetic diversity in a population is inversely proportional to the linkage disequilibrium (LD) between individual single nucleotide polymorphisms (SNPs) and quantitative trait loci (QTLs), leading to lower predictive ability of genomic breeding values (GEBVs) in high genetically diverse populations. Haplotype-based predictions could outperform individual SNP predictions by better capturing the LD between SNP and QTL. Therefore, we aimed to evaluate the accuracy and bias of individual-SNP- and haplotype-based genomic predictions under the single-step-genomic best linear unbiased prediction (ssGBLUP) approach in genetically diverse populations. We simulated purebred and composite sheep populations using literature parameters for moderate and low heritability traits. The haplotypes were created based on LD thresholds of 0.1, 0.3, and 0.6. Pseudo-SNPs from unique haplotype alleles were used to create the genomic relationship matrix ( G ) in the ssGBLUP analyses. Alternative scenarios were compared in which the pseudo-SNPs were combined with non-LD clustered SNPs, only pseudo-SNPs, or haplotypes fitted in a second G (two relationship matrices). The GEBV accuracies for the moderate heritability-trait scenarios fitting individual SNPs ranged from 0.41 to 0.55 and with haplotypes from 0.17 to 0.54 in the most (Ne ≅ 450) and less (Ne < 200) genetically diverse populations, respectively, and the bias fitting individual SNPs or haplotypes ranged between -0.14 and -0.08 and from -0.62 to -0.08, respectively. For the low heritability-trait scenarios, the GEBV accuracies fitting individual SNPs ranged from 0.24 to 0.32, and for fitting haplotypes, it ranged from 0.11 to 0.32 in the more (Ne ≅ 250) and less (Ne ≅ 100) genetically diverse populations, respectively, and the bias ranged between -0.36 and -0.32 and from -0.78 to -0.33 fitting individual SNPs or haplotypes, respectively. The lowest accuracies and largest biases were observed fitting only pseudo-SNPs from blocks constructed with an LD threshold of 0.3 (p < 0.05), whereas the best results were obtained using only SNPs or the combination of independent SNPs and pseudo-SNPs in one or two G matrices, in both heritability levels and all populations regardless of the level of genetic diversity. In summary, haplotype-based models did not improve the performance of genomic predictions in genetically diverse populations.
Collapse
Affiliation(s)
- Andre C Araujo
- Postgraduate Program in Animal Sciences, State University of Southwestern Bahia, Itapetinga, Brazil.,Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| | - Paulo L S Carneiro
- Department of Biology, State University of Southwestern Bahia, Jequié, Brazil
| | - Hinayah R Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States.,Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada
| | - Flavio S Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada
| | - Renata Veroneze
- Department of Animal Sciences, Federal University of Viçosa, Viçosa, Brazil
| | - Daniela A L Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
19
|
Bian C, Prakapenka D, Tan C, Yang R, Zhu D, Guo X, Liu D, Cai G, Li Y, Liang Z, Wu Z, Da Y, Hu X. Haplotype genomic prediction of phenotypic values based on chromosome distance and gene boundaries using low-coverage sequencing in Duroc pigs. Genet Sel Evol 2021; 53:78. [PMID: 34620094 PMCID: PMC8496108 DOI: 10.1186/s12711-021-00661-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 08/20/2021] [Indexed: 11/22/2022] Open
Abstract
Background Genomic selection using single nucleotide polymorphism (SNP) markers has been widely used for genetic improvement of livestock, but most current methods of genomic selection are based on SNP models. In this study, we investigated the prediction accuracies of haplotype models based on fixed chromosome distances and gene boundaries compared to those of SNP models for genomic prediction of phenotypic values. We also examined the reasons for the successes and failures of haplotype genomic prediction. Methods We analyzed a swine population of 3195 Duroc boars with records on eight traits: body judging score (BJS), teat number (TN), age (AGW), loin muscle area (LMA), loin muscle depth (LMD) and back fat thickness (BF) at 100 kg live weight, and average daily gain (ADG) and feed conversion rate (FCR) from 30 to100 kg live weight. Ten-fold validation was used to evaluate the prediction accuracy of each SNP model and each multi-allelic haplotype model based on 488,124 autosomal SNPs from low-coverage sequencing. Haplotype blocks were defined using fixed chromosome distances or gene boundaries. Results Compared to the best SNP model, the accuracy of predicting phenotypic values using a haplotype model was greater by 7.4% for BJS, 7.1% for AGW, 6.6% for ADG, 4.9% for FCR, 2.7% for LMA, 1.9% for LMD, 1.4% for BF, and 0.3% for TN. The use of gene-based haplotype blocks resulted in the best prediction accuracy for LMA, LMD, and TN. Compared to estimates of SNP additive heritability, estimates of haplotype epistasis heritability were strongly correlated with the increase in prediction accuracy by haplotype models. The increase in prediction accuracy was largest for BJS, AGW, ADG, and FCR, which also had the largest estimates of haplotype epistasis heritability, 24.4% for BJS, 14.3% for AGW, 14.5% for ADG, and 17.7% for FCR. SNP and haplotype heritability profiles across the genome identified several genes with large genetic contributions to phenotypes: NUDT3 for LMA, LMD and BF, VRTN for TN, COL5A2 for BJS, BSND for ADG, and CARTPT for FCR. Conclusions Haplotype prediction models improved the accuracy for genomic prediction of phenotypes in Duroc pigs. For some traits, the best prediction accuracy was obtained with haplotypes defined using gene regions, which provides evidence that functional genomic information can improve the accuracy of haplotype genomic prediction for certain traits. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00661-y.
Collapse
Affiliation(s)
- Cheng Bian
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, 55108, USA
| | - Cheng Tan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.,National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China
| | - Ruifei Yang
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Di Zhu
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xiaoli Guo
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Dewu Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.,National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China
| | - Yalan Li
- National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China
| | - Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, 55108, USA
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China. .,National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China.
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, 55108, USA.
| | - Xiaoxiang Hu
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
20
|
Jighly A, Hayden M, Daetwyler H. Integrating genomic selection with a genotype plus genotype x environment (GGE) model improves prediction accuracy and computational efficiency. PLANT, CELL & ENVIRONMENT 2021; 44:3459-3470. [PMID: 34231236 DOI: 10.1111/pce.14145] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 06/30/2021] [Indexed: 06/13/2023]
Abstract
Genotype-by-environment interaction (GEI) is one of the major factors affecting the prediction accuracy of genomic selection (GS) models. Standard models have low power to model complex GEI, and they fail to predict phenotypes in unobserved environments. Here, we developed a novel prediction model that account for GEI, named 3GS, that combines genotype plus genotype × environment (GGE) analysis with GS. The model calculates the principal components (PCs) of the environmental phenotypes using GGE analysis and predict the performance of these PCs using standard GS models before converting the GEBVs of these PCs (pcGEBVs) back to the original phenotypes. We demonstrated three advantages of the new model. First, 3GS showed significantly higher prediction accuracy primarily for deviated environments that have low to negative correlations to other environments. Second, 3GS can predict new genotypes in unobserved environments with high accuracy. Third, the computational complexity of 3GS increases linearly with increasing the number of environments and the population size, unlike the standard models that exhibit exponential increase, making it hundreds of times faster than the standard models for large data sets. 3GS can improve prediction accuracy with minimal resources in modern breeding programmes in which massive populations get multi-environment phenotypes with high-throughput techniques.
Collapse
Affiliation(s)
- Abdulqader Jighly
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia
| | - Matthew Hayden
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, Australia
| | - Hans Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, Australia
| |
Collapse
|
21
|
Ahmar S, Ballesta P, Ali M, Mora-Poblete F. Achievements and Challenges of Genomics-Assisted Breeding in Forest Trees: From Marker-Assisted Selection to Genome Editing. Int J Mol Sci 2021; 22:10583. [PMID: 34638922 PMCID: PMC8508745 DOI: 10.3390/ijms221910583] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 09/26/2021] [Accepted: 09/27/2021] [Indexed: 12/23/2022] Open
Abstract
Forest tree breeding efforts have focused mainly on improving traits of economic importance, selecting trees suited to new environments or generating trees that are more resilient to biotic and abiotic stressors. This review describes various methods of forest tree selection assisted by genomics and the main technological challenges and achievements in research at the genomic level. Due to the long rotation time of a forest plantation and the resulting long generation times necessary to complete a breeding cycle, the use of advanced techniques with traditional breeding have been necessary, allowing the use of more precise methods for determining the genetic architecture of traits of interest, such as genome-wide association studies (GWASs) and genomic selection (GS). In this sense, main factors that determine the accuracy of genomic prediction models are also addressed. In turn, the introduction of genome editing opens the door to new possibilities in forest trees and especially clustered regularly interspaced short palindromic repeats and CRISPR-associated protein 9 (CRISPR/Cas9). It is a highly efficient and effective genome editing technique that has been used to effectively implement targetable changes at specific places in the genome of a forest tree. In this sense, forest trees still lack a transformation method and an inefficient number of genotypes for CRISPR/Cas9. This challenge could be addressed with the use of the newly developing technique GRF-GIF with speed breeding.
Collapse
Affiliation(s)
- Sunny Ahmar
- Institute of Biological Sciences, University of Talca, 1 Poniente 1141, Talca 3460000, Chile;
| | - Paulina Ballesta
- The National Fund for Scientific and Technological Development, Av. del Agua 3895, Talca 3460000, Chile
| | - Mohsin Ali
- Department of Forestry and Range Management, University of Agriculture Faisalabad, Faisalabad 38000, Pakistan;
| | - Freddy Mora-Poblete
- Institute of Biological Sciences, University of Talca, 1 Poniente 1141, Talca 3460000, Chile;
| |
Collapse
|
22
|
Liu F, Jiang Y, Zhao Y, Schulthess AW, Reif JC. Haplotype-based genome-wide association increases the predictability of leaf rust (Puccinia triticina) resistance in wheat. JOURNAL OF EXPERIMENTAL BOTANY 2020; 71:6958-6968. [PMID: 32827041 DOI: 10.1093/jxb/eraa387] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Accepted: 08/17/2020] [Indexed: 05/12/2023]
Abstract
Resistance breeding is crucial for sustainable control of wheat leaf rust and single nucleotide polymorphism (SNP)-based genome-wide association studies (GWAS) are widely used to dissect leaf rust resistance. Unfortunately, GWAS based on SNPs often explained only a small proportion of the genetic variation. We compared SNP-based GWAS with a method based on functional haplotypes (FH) considering epistasis in a comprehensive hybrid wheat mapping population composed of 133 parents plus their 1574 hybrids and characterized with 626 245 high-quality SNPs. In total, 2408 and 1 139 828 significant associations were detected in the mapping population by using SNP-based and FH-based GWAS, respectively. These associations mapped to 25 and 69 candidate regions, correspondingly. SNP-based GWAS highlighted two already-known resistance genes, Lr22a and Lr34-B, while FH-based GWAS detected associations not only on these genes but also on two additional genes, Lr10 and Lr1. As revealed by a second hybrid wheat population for independent validation, the use of detected associations from SNP-based and FH-based GWAS reached predictabilities of 11.72% and 22.86%, respectively. Therefore, FH-based GWAS is not only more powerful for detecting associations, but also improves the accuracy of marker-assisted selection compared with the SNP-based approach.
Collapse
Affiliation(s)
- Fang Liu
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Germany
| | - Yong Jiang
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Germany
| | - Yusheng Zhao
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Germany
| | - Albert W Schulthess
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Germany
| | - Jochen C Reif
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Stadt Seeland, Germany
| |
Collapse
|
23
|
Liang Z, Tan C, Prakapenka D, Ma L, Da Y. Haplotype Analysis of Genomic Prediction Using Structural and Functional Genomic Information for Seven Human Phenotypes. Front Genet 2020; 11:588907. [PMID: 33324447 PMCID: PMC7726221 DOI: 10.3389/fgene.2020.588907] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 10/28/2020] [Indexed: 11/19/2022] Open
Abstract
Genomic prediction using multi-allelic haplotype models improved the prediction accuracy for all seven human phenotypes, the normality transformed high density lipoproteins, low density lipoproteins, total cholesterol, triglycerides, weight, and the original height and body mass index without normality transformation. Eight SNP sets with 40,941-380,705 SNPs were evaluated. The increase in prediction accuracy due to haplotypes was 1.86-8.12%. Haplotypes using fixed chromosome distances had the best prediction accuracy for four phenotypes, fixed number of SNPs for two phenotypes, and gene-based haplotypes for high density lipoproteins and height (tied for best). Haplotypes of coding genes were more accurate than haplotypes of all autosome genes that included both coding and noncoding genes for triglycerides and weight, and nearly the same as haplotypes of all autosome genes for the other phenotypes. Haplotypes of noncoding genes (mostly lncRNAs) only improved the prediction accuracy over the SNP models for high density lipoproteins, total cholesterol, and height. ChIP-seq haplotypes had better prediction accuracy than gene-based haplotypes for total cholesterol, body mass index and low density lipoproteins. The accuracy of ChIP-seq haplotypes was most striking for low density lipoproteins, where all four haplotype models with ChIP-seq haplotypes had similarly high prediction accuracy over the best prediction model with gene-based haplotypes. Haplotype epistasis was shown to be the reason for the increased accuracy due to haplotypes. Low density lipoproteins had the largest haplotype epistasis heritability that explained 14.70% of the phenotypic variance and was 31.27% of the SNP additive heritability, and the largest increase in prediction accuracy relative to the best SNP model (8.12%). Relative to the SNP additive heritability of the same regions, noncoding genes had the highest haplotype epistasis heritability, followed by coding genes and ChIP-seq for the seven phenotypes. SNP and haplotype heritability profiles showed that the integration of SNP and haplotype additive values compensated the weakness of haplotypes in estimating SNP heritabilities for four phenotypes, whereas models with haplotype additive values fully accounted for SNP additive values for three phenotypes. These results showed that haplotype analysis can be a method to utilize functional and structural genomic information to improve the accuracy of genomic prediction.
Collapse
Affiliation(s)
- Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, China
| | - Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, United States
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
24
|
Jiang Y, Reif JC. Efficient Algorithms for Calculating Epistatic Genomic Relationship Matrices. Genetics 2020; 216:651-669. [PMID: 32973077 PMCID: PMC7648578 DOI: 10.1534/genetics.120.303459] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 09/21/2020] [Indexed: 11/18/2022] Open
Abstract
The genomic relationship matrix plays a key role in the analysis of genetic diversity, genomic prediction, and genome-wide association studies. The epistatic genomic relationship matrix is a natural generalization of the classic genomic relationship matrix in the sense that it implicitly models the epistatic effects among all markers. Calculating the exact form of the epistatic relationship matrix requires high computational load, and is hence not feasible when the number of markers is large, or when high-degree of epistasis is in consideration. Currently, many studies use the Hadamard product of the classic genomic relationship matrix as an approximation. However, the quality of the approximation is difficult to investigate in the strict mathematical sense. In this study, we derived iterative formulas for the precise form of the epistatic genomic relationship matrix for arbitrary degree of epistasis including both additive and dominance interactions. The key to our theoretical results is the observation of an interesting link between the elements in the genomic relationship matrix and symmetric polynomials, which motivated the application of the corresponding mathematical theory. Based on the iterative formulas, efficient recursive algorithms were implemented. Compared with the approximation by the Hadamard product, our algorithms provided a complete solution to the problem of calculating the exact epistatic genomic relationship matrix. As an application, we showed that our new algorithms easily relieved the computational burden in a previous study on the approximation behavior of two limit models.
Collapse
Affiliation(s)
- Yong Jiang
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research, Gatersleben 06466, Germany
| | - Jochen C Reif
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research, Gatersleben 06466, Germany
| |
Collapse
|
25
|
Sallam AH, Conley E, Prakapenka D, Da Y, Anderson JA. Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat. G3 (BETHESDA, MD.) 2020; 10:2265-2273. [PMID: 32371453 PMCID: PMC7341132 DOI: 10.1534/g3.120.401165] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/29/2020] [Indexed: 02/01/2023]
Abstract
The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops.
Collapse
Affiliation(s)
| | - Emily Conley
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108
| | | | - Yang Da
- Department of Animal Science, and
| | - James A Anderson
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108
| |
Collapse
|
26
|
Xu L, Gao N, Wang Z, Xu L, Liu Y, Chen Y, Xu L, Gao X, Zhang L, Gao H, Zhu B, Li J. Incorporating Genome Annotation Into Genomic Prediction for Carcass Traits in Chinese Simmental Beef Cattle. Front Genet 2020; 11:481. [PMID: 32499816 PMCID: PMC7243208 DOI: 10.3389/fgene.2020.00481] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 04/17/2020] [Indexed: 01/08/2023] Open
Abstract
Various methods have been proposed for genomic prediction (GP) in livestock. These methods have mainly focused on statistical considerations and did not include genome annotation information. In this study, to improve the predictive performance of carcass traits in Chinese Simmental beef cattle, we incorporated the genome annotation information into GP. Single nucleotide polymorphisms (SNPs) were annotated to five genomic classes: intergenic, gene, exon, protein coding sequences, and 3'/5' untranslated region. Haploblocks were constructed for all markers and these five genomic classes by defining a biologically functional unit, and haplotype effects were modeled in both numerical dosage and categorical coding strategies. The first-order epistatic effects among SNPs and haplotypes were modeled using a categorical epistasis model. For all makers, the extension from the SNP-based model to a haplotype-based model improved the accuracy by 5.4-9.8% for carcass weight (CW), live weight (LW), and striploin (SI). For the five genomic classes using the haplotype-based prediction model, the incorporation of gene class information into the model improved the accuracies by an average of 1.4, 2.1, and 1.3% for CW, LW, and SI, respectively, compared with their corresponding results for all markers. Including the first-order epistatic effects into the prediction models improved the accuracies in some traits and genomic classes. Therefore, for traits with moderate-to-high heritability, incorporating genome annotation information of gene class into haplotype-based prediction models could be considered as a promising tool for GP in Chinese Simmental beef cattle, and modeling epistasis in prediction can further increase the accuracy to some degree.
Collapse
Affiliation(s)
- Ling Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ning Gao
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lei Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ying Liu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| |
Collapse
|
27
|
Yao M, Guan M, Zhang Z, Zhang Q, Cui Y, Chen H, Liu W, Jan HU, Voss-Fels KP, Werner CR, He X, Liu Z, Guan C, Snowdon RJ, Hua W, Qian L. GWAS and co-expression network combination uncovers multigenes with close linkage effects on the oleic acid content accumulation in Brassica napus. BMC Genomics 2020; 21:320. [PMID: 32326904 PMCID: PMC7181522 DOI: 10.1186/s12864-020-6711-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 03/31/2020] [Indexed: 11/19/2022] Open
Abstract
Background Strong artificial and natural selection causes the formation of highly conserved haplotypes that harbor agronomically important genes. GWAS combination with haplotype analysis has evolved as an effective method to dissect the genetic architecture of complex traits in crop species. Results We used the 60 K Brassica Infinium SNP array to perform a genome-wide analysis of haplotype blocks associated with oleic acid (C18:1) in rapeseed. Six haplotype regions were identified as significantly associated with oleic acid (C18:1) that mapped to chromosomes A02, A07, A08, C01, C02, and C03. Additionally, whole-genome sequencing of 50 rapeseed accessions revealed three genes (BnmtACP2-A02, BnABCI13-A02 and BnECI1-A02) in the A02 chromosome haplotype region and two genes (BnFAD8-C02 and BnSDP1-C02) in the C02 chromosome haplotype region that were closely linked to oleic acid content phenotypic variation. Moreover, the co-expression network analysis uncovered candidate genes from these two different haplotype regions with potential regulatory interrelationships with oleic acid content accumulation. Conclusions Our results suggest that several candidate genes are closely linked, which provides us with an opportunity to develop functional haplotype markers for the improvement of the oleic acid content in rapeseed.
Collapse
Affiliation(s)
- Min Yao
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Mei Guan
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Zhenqian Zhang
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Qiuping Zhang
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Yixin Cui
- College of Horticulture and Landscape Architecture, Southwest University, Chongqing, 400715, China
| | - Hao Chen
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Wei Liu
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Habib U Jan
- Precision Medicine Lab, Rehman Medical Institute (RMI), Phase 5, Hayatabad, Peshawar, 25000, Pakistan
| | - Kai P Voss-Fels
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Christian R Werner
- The Roslin Institute University of Edinburgh Easter Bush Research Centre Midlothian, Edinburgh, EH25 9RG, UK
| | - Xin He
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Zhongsong Liu
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Chunyun Guan
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Wei Hua
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China. .,Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Wuhan, 430062, China.
| | - Lunwen Qian
- Collaborative Innovation Center of Grain and Oil Crops in South China, Hunan Agricultural University, Changsha, 410128, China.
| |
Collapse
|
28
|
Prakapenka D, Wang C, Liang Z, Bian C, Tan C, Da Y. GVCHAP: A Computing Pipeline for Genomic Prediction and Variance Component Estimation Using Haplotypes and SNP Markers. Front Genet 2020; 11:282. [PMID: 32318093 PMCID: PMC7154123 DOI: 10.3389/fgene.2020.00282] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 03/09/2020] [Indexed: 01/05/2023] Open
Abstract
Haplotype prediction models open many possibilities to improve the accuracy of genomic selection but require more data processing and computing time than single-SNP prediction models. To facilitate haplotype analysis for genomic prediction and estimation using structural and functional genomic information, we developed a computing pipeline to implement haplotype analysis with capabilities for preparation of input data for haplotype analysis, genomic prediction and estimation using GVCHAP, and analysis of GVCHAP results. Data preparation includes utility programs for haplotype imputing; defining haplotype blocks by a fixed number of SNPs, a fixed distance in base pairs per block, or user defined block lengths based on structural or functional genomic information or a mixture of both types of information; and defining haplotype genotypes within each haplotype block. GVCHAP is the main program for genomic prediction and estimation, calculates GREML (genomic restricted maximum likelihood) estimates of variance components and heritabilities, and calculates GBLUP (genomic best linear unbiased prediction) for additive and dominance values of single SNPs as well as additive values of haplotypes with reliability estimates for training and validation populations. A two-step strategy and a method of multi-node processing are implemented to remove the computing bottleneck due to the creation of genomic relationship matrices for large samples. The analysis of GVCHAP results includes calculation of observed prediction accuracies from validation studies and preparation of input files for graphical visualization of heritability estimates of haplotype blocks as well as estimates of SNP effects and heritabilities. The entire pipeline provides an efficient and versatile computing tool for identifying the most accurate haplotype model among many candidate haplotype models utilizing structural and functional genomic information for genomic selection.
Collapse
Affiliation(s)
- Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Chunkao Wang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Cheng Bian
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States.,State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, China
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States.,National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, China
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
29
|
Jensen SE, Charles JR, Muleta K, Bradbury PJ, Casstevens T, Deshpande SP, Gore MA, Gupta R, Ilut DC, Johnson L, Lozano R, Miller Z, Ramu P, Rathore A, Romay MC, Upadhyaya HD, Varshney RK, Morris GP, Pressoir G, Buckler ES, Ramstein GP. A sorghum practical haplotype graph facilitates genome-wide imputation and cost-effective genomic prediction. THE PLANT GENOME 2020; 13:e20009. [PMID: 33016627 DOI: 10.1002/tpg2.20009] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2019] [Accepted: 01/04/2020] [Indexed: 05/22/2023]
Abstract
Successful management and utilization of increasingly large genomic datasets is essential for breeding programs to accelerate cultivar development. To help with this, we developed a Sorghum bicolor Practical Haplotype Graph (PHG) pangenome database that stores haplotypes and variant information. We developed two PHGs in sorghum that were used to identify genome-wide variants for 24 founders of the Chibas sorghum breeding program from 0.01x sequence coverage. The PHG called single nucleotide polymorphisms (SNPs) with 5.9% error at 0.01x coverage-only 3% higher than PHG error when calling SNPs from 8x coverage sequence. Additionally, 207 progenies from the Chibas genomic selection (GS) training population were sequenced and processed through the PHG. Missing genotypes were imputed from PHG parental haplotypes and used for genomic prediction. Mean prediction accuracies with PHG SNP calls range from .57-.73 and are similar to prediction accuracies obtained with genotyping-by-sequencing or targeted amplicon sequencing (rhAmpSeq) markers. This study demonstrates the use of a sorghum PHG to impute SNPs from low-coverage sequence data and shows that the PHG can unify genotype calls across multiple sequencing platforms. By reducing input sequence requirements, the PHG can decrease the cost of genotyping, make GS more feasible, and facilitate larger breeding populations. Our results demonstrate that the PHG is a useful research and breeding tool that maintains variant information from a diverse group of taxa, stores sequence data in a condensed but readily accessible format, unifies genotypes across genotyping platforms, and provides a cost-effective option for genomic selection.
Collapse
Affiliation(s)
- Sarah E Jensen
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Jean Rigaud Charles
- Chibas and Department of Agriculture and Environmental Sciences, Quisqueya University, Port-au-Prince, Haiti
| | - Kebede Muleta
- Department of Agronomy, Kansas State University, Manhattan, KS, 66506, USA
| | - Peter J Bradbury
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
- United States Department of Agriculture-Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | - Terry Casstevens
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Santosh P Deshpande
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502324, India
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Rajeev Gupta
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502324, India
| | - Daniel C Ilut
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Lynn Johnson
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Roberto Lozano
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Zachary Miller
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Punna Ramu
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Abhishek Rathore
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502324, India
| | - M Cinta Romay
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Hari D Upadhyaya
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502324, India
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502324, India
| | - Geoffrey P Morris
- Department of Agronomy, Kansas State University, Manhattan, KS, 66506, USA
| | - Gael Pressoir
- Chibas and Department of Agriculture and Environmental Sciences, Quisqueya University, Port-au-Prince, Haiti
| | - Edward S Buckler
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
- United States Department of Agriculture-Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | | |
Collapse
|
30
|
Lan S, Zheng C, Hauck K, McCausland M, Duguid SD, Booker HM, Cloutier S, You FM. Genomic Prediction Accuracy of Seven Breeding Selection Traits Improved by QTL Identification in Flax. Int J Mol Sci 2020; 21:ijms21051577. [PMID: 32106624 PMCID: PMC7084455 DOI: 10.3390/ijms21051577] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 02/23/2020] [Accepted: 02/23/2020] [Indexed: 01/21/2023] Open
Abstract
Molecular markers are one of the major factors affecting genomic prediction accuracy and the cost of genomic selection (GS). Previous studies have indicated that the use of quantitative trait loci (QTL) as markers in GS significantly increases prediction accuracy compared with genome-wide random single nucleotide polymorphism (SNP) markers. To optimize the selection of QTL markers in GS, a set of 260 lines from bi-parental populations with 17,277 genome-wide SNPs were used to evaluate the prediction accuracy for seed yield (YLD), days to maturity (DTM), iodine value (IOD), protein (PRO), oil (OIL), linoleic acid (LIO), and linolenic acid (LIN) contents. These seven traits were phenotyped over four years at two locations. Identification of quantitative trait nucleotides (QTNs) for the seven traits was performed using three types of statistical models for genome-wide association study: two SNP-based single-locus (SS), seven SNP-based multi-locus (SM), and one haplotype-block-based multi-locus (BM) models. The identified QTNs were then grouped into QTL based on haplotype blocks. For all seven traits, 133, 355, and 1208 unique QTL were identified by SS, SM, and BM, respectively. A total of 1420 unique QTL were obtained by SS+SM+BM, ranging from 254 (OIL, LIO) to 361 (YLD) for individual traits, whereas a total of 427 unique QTL were achieved by SS+SM, ranging from 56 (YLD) to 128 (LIO). SS models alone did not identify sufficient QTL for GS. The highest prediction accuracies were obtained using single-trait QTL identified by SS+SM+BM for OIL (0.929 ± 0.016), PRO (0.893 ± 0.023), YLD (0.892 ± 0.030), and DTM (0.730 ± 0.062), and by SS+SM for LIN (0.837 ± 0.053), LIO (0.835 ± 0.049), and IOD (0.835 ± 0.041). In terms of the number of QTL markers and prediction accuracy, SS+SM outperformed other models or combinations thereof. The use of all SNPs or QTL of all seven traits significantly reduced the prediction accuracy of traits. The results further validated that QTL outperformed high-density genome-wide random markers, and demonstrated that the combined use of single and multi-locus models can effectively identify a comprehensive set of QTL that improve prediction accuracy, but further studies on detection and removal of redundant or false-positive QTL to maximize prediction accuracy and minimize the number of QTL markers in GS are warranted.
Collapse
Affiliation(s)
- Samuel Lan
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada; (S.L.); (C.Z.); (K.H.); (M.M.)
- Department of Mathematics and Statistics, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| | - Chunfang Zheng
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada; (S.L.); (C.Z.); (K.H.); (M.M.)
| | - Kyle Hauck
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada; (S.L.); (C.Z.); (K.H.); (M.M.)
- Department of Mathematics and Statistics, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| | - Madison McCausland
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada; (S.L.); (C.Z.); (K.H.); (M.M.)
- Department of Plant Sciences, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | - Scott D. Duguid
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, Morden, MB R6M 1Y5, Canada;
| | - Helen M. Booker
- Crop Development Centre, University of Saskatchewan, Saskatoon, SK S7N 5A8, Canada;
| | - Sylvie Cloutier
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada; (S.L.); (C.Z.); (K.H.); (M.M.)
- Correspondence: (F.M.Y.); (S.C); Tel.: +1-613-759-1539 (F.M.Y.); +1-613-759-1744 (S.C.)
| | - Frank M. You
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada; (S.L.); (C.Z.); (K.H.); (M.M.)
- Correspondence: (F.M.Y.); (S.C); Tel.: +1-613-759-1539 (F.M.Y.); +1-613-759-1744 (S.C.)
| |
Collapse
|
31
|
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping. G3-GENES GENOMES GENETICS 2019; 9:4115-4126. [PMID: 31604824 PMCID: PMC6893203 DOI: 10.1534/g3.119.400451] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Genome-wide association studies (GWAS) have gained central importance for the identification of candidate loci underlying complex traits. Single nucleotide polymorphism (SNP) markers are mostly used as genetic variants for the analysis of genotype-phenotype associations in populations, but closely linked SNPs that are grouped into haplotypes are also exploited. The benefit of haplotype-based GWAS approaches vs. SNP-based approaches is still under debate because SNPs in high linkage disequilibrium provide redundant information. To overcome some constraints of the commonly-used haplotype-based GWAS in which only consecutive SNPs are considered for haplotype construction, we propose a new method called functional haplotype-based GWAS (FH GWAS). FH GWAS is featured by combining SNPs into haplotypes based on the additive and epistatic effects among SNPs. Such haplotypes were termed functional haplotypes (FH). As shown by simulation studies, the FH GWAS approach clearly outperformed the SNP-based approach unless the minor allele frequency of the SNPs making up the haplotypes is low and the linkage disequilibrium between them is high. Applying FH GWAS for the trait flowering time in a large Arabidopsis thaliana population with whole-genome sequencing data revealed its potential empirically. FH GWAS identified all candidate regions which were detected in SNP-based and two other haplotype-based GWAS approaches. In addition, a novel region on chromosome 4 was solely detected by FH GWAS. Thus both the results of our simulation and empirical studies demonstrate that FH GWAS is a promising method and superior to the SNP-based approach even if almost complete genotype information is available.
Collapse
|
32
|
Sehgal D, Dreisigacker S. Haplotypes-based genetic analysis: benefits and challenges. Vavilovskii Zhurnal Genet Selektsii 2019. [DOI: 10.18699/vj19.37-o] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The increasing availability of Single Nucleotide Polymorphisms (SNPs) discovered by Next Generation Sequencing will enable a range of new genetic analyses in crops, which was not possible before. Concomitantly, researchers will face the challenge of handling large data sets at the whole-genome level. By grouping thousands of SNPs into a few hundred haplotype blocks, complexity of the data can be reduced with fewer statistical tests and a lower probability of spurious associations. Owing to the strong genome structure present in breeding lines of most crops, the deployment of haplotypes could be a powerful complement to improve efficiency of marker-assisted and genomic selection. This review describes in brief the commonly used approaches to construct haplotype blocks and some examples in animals and crops are cited where haplotype-based dissection of traits were proven beneficial. Some important considerations and facts while working with haplotypes in crops are reviewed at the end.
Collapse
Affiliation(s)
- D. Sehgal
- International Center for Maize and Wheat Improvement (CIMMYT)
| | - S. Dreisigacker
- International Center for Maize and Wheat Improvement (CIMMYT)
| |
Collapse
|
33
|
Gedil M, Menkir A. An Integrated Molecular and Conventional Breeding Scheme for Enhancing Genetic Gain in Maize in Africa. FRONTIERS IN PLANT SCIENCE 2019; 10:1430. [PMID: 31781144 PMCID: PMC6851238 DOI: 10.3389/fpls.2019.01430] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Accepted: 10/15/2019] [Indexed: 05/22/2023]
Abstract
Maize production in West and Central Africa (WCA) is constrained by a wide range of interacting stresses that keep productivity below potential yields. Among the many problems afflicting maize production in WCA, drought, foliar diseases, and parasitic weeds are the most critical. Several decades of efforts devoted to the genetic improvement of maize have resulted in remarkable genetic gain, leading to increased yields of maize on farmers' fields. The revolution unfolding in the areas of genomics, bioinformatics, and phenomics is generating innovative tools, resources, and technologies for transforming crop breeding programs. It is envisaged that such tools will be integrated within maize breeding programs, thereby advancing these programs and addressing current and future challenges. Accordingly, the maize improvement program within International Institute of Tropical Agriculture (IITA) is undergoing a process of modernization through the introduction of innovative tools and new schemes that are expected to enhance genetic gains and impact on smallholder farmers in the region. Genomic tools enable genetic dissections of complex traits and promote an understanding of the physiological basis of key agronomic and nutritional quality traits. Marker-aided selection and genome-wide selection schemes are being implemented to accelerate genetic gain relating to yield, resilience, and nutritional quality. Therefore, strategies that effectively combine genotypic information with data from field phenotyping and laboratory-based analysis are currently being optimized. Molecular breeding, guided by methodically defined product profiles tailored to different agroecological zones and conditions of climate change, supported by state-of-the-art decision-making tools, is pivotal for the advancement of modern, genomics-aided maize improvement programs. Accelerated genetic gain, in turn, catalyzes a faster variety replacement rate. It is critical to forge and strengthen partnerships for enhancing the impacts of breeding products on farmers' livelihood. IITA has well-established channels for delivering its research products/technologies to partner organizations for further testing, multiplication, and dissemination across various countries within the subregion. Capacity building of national agricultural research system (NARS) will facilitate the smooth transfer of technologies and best practices from IITA and its partners.
Collapse
Affiliation(s)
- Melaku Gedil
- Bioscience Center and Maize Improvement Program, International Institute of Tropical Agriculture, Ibadan, Nigeria
| | - Abebe Menkir
- Maize Improvement Program, International Institute of Tropical Agriculture, Ibadan, Nigeria
| |
Collapse
|
34
|
He T, Hill CB, Angessa TT, Zhang XQ, Chen K, Moody D, Telfer P, Westcott S, Li C. Gene-set association and epistatic analyses reveal complex gene interaction networks affecting flowering time in a worldwide barley collection. JOURNAL OF EXPERIMENTAL BOTANY 2019; 70:5603-5616. [PMID: 31504706 PMCID: PMC6812734 DOI: 10.1093/jxb/erz332] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 08/13/2019] [Indexed: 05/10/2023]
Abstract
Single-marker genome-wide association studies (GWAS) have successfully detected associations between single nucleotide polymorphisms (SNPs) and agronomic traits such as flowering time and grain yield in barley. However, the analysis of individual SNPs can only account for a small proportion of genetic variation, and can only provide limited knowledge on gene network interactions. Gene-based GWAS approaches provide enormous opportunity both to combine genetic information and to examine interactions among genetic variants. Here, we revisited a previously published phenotypic and genotypic data set of 895 barley varieties grown in two years at four different field locations in Australia. We employed statistical models to examine gene-phenotype associations, as well as two-way epistasis analyses to increase the capability to find novel genes that have significant roles in controlling flowering time in barley. Genetic associations were tested between flowering time and corresponding genotypes of 174 putative flowering time-related genes. Gene-phenotype association analysis detected 113 genes associated with flowering time in barley, demonstrating the unprecedented power of gene-based analysis. Subsequent two-way epistasis analysis revealed 19 pairs of gene×gene interactions involved in controlling flowering time. Our study demonstrates that gene-based association approaches can provide higher capacity for future crop improvement to increase crop performance and adaptation to different environments.
Collapse
Affiliation(s)
- Tianhua He
- Western Barley Genetics Alliance, Western Australian State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, WA, Australia
| | - Camilla Beate Hill
- Western Barley Genetics Alliance, Western Australian State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, WA, Australia
| | - Tefera Tolera Angessa
- Western Barley Genetics Alliance, Western Australian State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, WA, Australia
| | - Xiao-Qi Zhang
- Western Barley Genetics Alliance, Western Australian State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, WA, Australia
| | - Kefei Chen
- SAGI-WEST, Faculty of Science and Engineering, Curtin University, Bentley, WA, Australia
| | | | - Paul Telfer
- Australian Grain Technologies Pty Ltd (AGT), SA, Australia
| | - Sharon Westcott
- Agriculture and Food, Department of Primary Industries and Regional Development, South Perth, WA, Australia
| | - Chengdao Li
- Western Barley Genetics Alliance, Western Australian State Agricultural Biotechnology Centre, College of Science, Health, Engineering and Education, Murdoch University, Murdoch, WA, Australia
- Agriculture and Food, Department of Primary Industries and Regional Development, South Perth, WA, Australia
- Hubei Collaborative Innovation Centre for Grain Industry, Yangtze University, Hubei Jingzhou, China
- Correspondence:
| |
Collapse
|
35
|
Cuevas J, Montesinos-López O, Juliana P, Guzmán C, Pérez-Rodríguez P, González-Bucio J, Burgueño J, Montesinos-López A, Crossa J. Deep Kernel for Genomic and Near Infrared Predictions in Multi-environment Breeding Trials. G3 (BETHESDA, MD.) 2019; 9:2913-2924. [PMID: 31289023 PMCID: PMC6723142 DOI: 10.1534/g3.119.400493] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Accepted: 07/04/2019] [Indexed: 01/15/2023]
Abstract
Kernel methods are flexible and easy to interpret and have been successfully used in genomic-enabled prediction of various plant species. Kernel methods used in genomic prediction comprise the linear genomic best linear unbiased predictor (GBLUP or GB) kernel, and the Gaussian kernel (GK). In general, these kernels have been used with two statistical models: single-environment and genomic × environment (GE) models. Recently near infrared spectroscopy (NIR) has been used as an inexpensive and non-destructive high-throughput phenotyping method for predicting unobserved line performance in plant breeding trials. In this study, we used a non-linear arc-cosine kernel (AK) that emulates deep learning artificial neural networks. We compared AK prediction accuracy with the prediction accuracy of GB and GK kernel methods in four genomic data sets, one of which also includes pedigree and NIR information. Results show that for all four data sets, AK and GK kernels achieved higher prediction accuracy than the linear GB kernel for the single-environment and GE multi-environment models. In addition, AK achieved similar or slightly higher prediction accuracy than the GK kernel. For all data sets, the GE model achieved higher prediction accuracy than the single-environment model. For the data set that includes pedigree, markers and NIR, results show that the NIR wavelength alone achieved lower prediction accuracy than the genomic information alone; however, the pedigree plus NIR information achieved only slightly lower prediction accuracy than the marker plus the NIR high-throughput data.
Collapse
Affiliation(s)
- Jaime Cuevas
- Universidad de Quintana Roo, Chetumal, Quintana Roo, 77019 México
| | | | - Philomin Juliana
- International Maize and Wheat Improvement Center (CIMMYT), Carretera Mexico- Veracruz Km. 45, El Batán, 56237, Texcoco, Edo. de Mexico, Mexico
| | - Carlos Guzmán
- International Maize and Wheat Improvement Center (CIMMYT), Carretera Mexico- Veracruz Km. 45, El Batán, 56237, Texcoco, Edo. de Mexico, Mexico
| | | | | | - Juan Burgueño
- International Maize and Wheat Improvement Center (CIMMYT), Carretera Mexico- Veracruz Km. 45, El Batán, 56237, Texcoco, Edo. de Mexico, Mexico
| | - Abelardo Montesinos-López
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías, (CUCEI), Universidad de Guadalajara, Guadalajara, Jalisco, 44430
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Carretera Mexico- Veracruz Km. 45, El Batán, 56237, Texcoco, Edo. de Mexico, Mexico
| |
Collapse
|
36
|
Pook T, Schlather M, de Los Campos G, Mayer M, Schoen CC, Simianer H. HaploBlocker: Creation of Subgroup-Specific Haplotype Blocks and Libraries. Genetics 2019; 212:1045-1061. [PMID: 31152070 PMCID: PMC6707469 DOI: 10.1534/genetics.119.302283] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 05/30/2019] [Indexed: 11/18/2022] Open
Abstract
The concept of haplotype blocks has been shown to be useful in genetics. Fields of application range from the detection of regions under positive selection to statistical methods that make use of dimension reduction. We propose a novel approach ("HaploBlocker") for defining and inferring haplotype blocks that focuses on linkage instead of the commonly used population-wide measures of linkage disequilibrium. We define a haplotype block as a sequence of genetic markers that has a predefined minimum frequency in the population, and only haplotypes with a similar sequence of markers are considered to carry that block, effectively screening a dataset for group-wise identity-by-descent. From these haplotype blocks, we construct a haplotype library that represents a large proportion of genetic variability with a limited number of blocks. Our method is implemented in the associated R-package HaploBlocker, and provides flexibility not only to optimize the structure of the obtained haplotype library for subsequent analyses, but also to handle datasets of different marker density and genetic diversity. By using haplotype blocks instead of single nucleotide polymorphisms (SNPs), local epistatic interactions can be naturally modeled, and the reduced number of parameters enables a wide variety of new methods for further genomic analyses such as genomic prediction and the detection of selection signatures. We illustrate our methodology with a dataset comprising 501 doubled haploid lines in a European maize landrace genotyped at 501,124 SNPs. With the suggested approach, we identified 2991 haplotype blocks with an average length of 2685 SNPs that together represent 94% of the dataset.
Collapse
Affiliation(s)
- Torsten Pook
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, 37075, Germany
- Center for Integrated Breeding Research, University of Goettingen, 37075, Germany
| | - Martin Schlather
- Center for Integrated Breeding Research, University of Goettingen, 37075, Germany
- Stochastics and Its Applications Group, University of Mannheim, 68159, Germany
| | - Gustavo de Los Campos
- Departments of Epidemiology and Biostatistics and Statistics and Probability, Institute for Quantitative Health Science and Engineering, Michigan State University, Michigan 48824
| | - Manfred Mayer
- Plant Breeding, Technical University of Munich School of Life Sciences Weihenstephan, 85354 Freising, Germany
| | - Chris Carolin Schoen
- Plant Breeding, Technical University of Munich School of Life Sciences Weihenstephan, 85354 Freising, Germany
| | - Henner Simianer
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, 37075, Germany
- Center for Integrated Breeding Research, University of Goettingen, 37075, Germany
| |
Collapse
|
37
|
Michel S, Löschenberger F, Ametz C, Pachler B, Sparry E, Bürstmayr H. Simultaneous selection for grain yield and protein content in genomics-assisted wheat breeding. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:1745-1760. [PMID: 30810763 PMCID: PMC6531418 DOI: 10.1007/s00122-019-03312-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 02/15/2019] [Indexed: 05/10/2023]
Abstract
KEY MESSAGE Large genetic improvement can be achieved by simultaneous genomic selection for grain yield and protein content when combining different breeding strategies in the form of selection indices. Genomic selection has been implemented in many national and international breeding programmes in recent years. Numerous studies have shown the potential of this new breeding tool; few have, however, taken the simultaneous selection for multiple traits into account that is though common practice in breeding programmes. The simultaneous improvement in grain yield and protein content is thereby a major challenge in wheat breeding due to a severe negative trade-off. Accordingly, the potential and limits of multi-trait selection for this particular trait complex utilizing the vast phenotypic and genomic data collected in an applied wheat breeding programme were investigated in this study. Two breeding strategies based on various genomic-selection indices were compared, which (1) aimed to select high-protein genotypes with acceptable yield potential and (2) develop high-yielding varieties, while maintaining protein content. The prediction accuracy of preliminary yield trials could be strongly improved when combining phenotypic and genomic information in a genomics-assisted selection approach, which surpassed both genomics-based and classical phenotypic selection methods both for single trait predictions and in genomic index selection across years. The employed genomic selection indices mitigated furthermore the negative trade-off between grain yield and protein content leading to a substantial selection response for protein yield, i.e. total seed nitrogen content, which suggested that it is feasible to develop varieties that combine a superior yield potential with comparably high protein content, thus utilizing available nitrogen resources more efficiently.
Collapse
Affiliation(s)
- Sebastian Michel
- Department of Agrobiotechnology, IFA-Tulln, University of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430, Tulln, Austria.
| | | | - Christian Ametz
- Saatzucht Donau GesmbH & CoKG, Saatzuchtstrasse 11, 2301, Probstdorf, Austria
| | - Bernadette Pachler
- Saatzucht Donau GesmbH & CoKG, Saatzuchtstrasse 11, 2301, Probstdorf, Austria
| | - Ellen Sparry
- C&M Seeds, 6180 5th Line, Palmerston, ON, N0G 2P0, Canada
| | - Hermann Bürstmayr
- Department of Agrobiotechnology, IFA-Tulln, University of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430, Tulln, Austria
| |
Collapse
|
38
|
A Low Resolution Epistasis Mapping Approach To Identify Chromosome Arm Interactions in Allohexaploid Wheat. G3-GENES GENOMES GENETICS 2019; 9:675-684. [PMID: 30455184 PMCID: PMC6404624 DOI: 10.1534/g3.118.200646] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Epistasis is an important contributor to genetic variance. In inbred populations, pairwise epistasis is present as additive by additive interactions. Testing for epistasis presents a multiple testing problem as the pairwise search space for modest numbers of markers is large. Single markers do not necessarily track functional units of interacting chromatin as well as haplotype based methods do. To harness the power of multiple markers while minimizing the number of tests conducted, we present a low resolution test for epistatic interactions across whole chromosome arms. Epistasis covariance matrices were constructed from the additive covariances of individual chromosome arms. These covariances were subsequently used to estimate an epistatic variance parameter while correcting for background additive and epistatic effects. We find significant epistasis for 2% of the interactions tested for four agronomic traits in a winter wheat breeding population. Interactions across homeologous chromosome arms were identified, but were less abundant than other chromosome arm pair interactions. The homeologous chromosome arm pair 4BL/4DL showed a strong negative relationship between additive and interaction effects that may be indicative of functional redundancy. Several chromosome arms appeared to act as hubs in an interaction network, suggesting that they may contain important regulatory factors. The differential patterns of epistasis across different traits demonstrate that detection of epistatic interactions is robust when correcting for background additive and epistatic effects in the population. The low resolution epistasis mapping method presented here identifies important epistatic interactions with a limited number of statistical tests at the cost of low precision.
Collapse
|
39
|
Rembe M, Zhao Y, Jiang Y, Reif JC. Reciprocal recurrent genomic selection: an attractive tool to leverage hybrid wheat breeding. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:687-698. [PMID: 30488192 DOI: 10.1007/s00122-018-3244-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Accepted: 11/16/2018] [Indexed: 06/09/2023]
Abstract
Using a two-part breeding strategy based on a population improvement and a product development component can leverage hybrid wheat breeding. Despite the technological advance of methods to facilitate hybrid breeding in self-pollinating crops, line breeding is still the dominating breeding strategy. This is likely due to a higher long-term selection gain in line compared to hybrid breeding. In this respect, recent studies on two-part strategies splitting the breeding program into a population improvement and a product development component could mark a trend reversal. Here, an overview of experimental and simulation-based studies exploring the possibilities to integrate genome-wide prediction into recurrent selection is given. Furthermore, possibilities to make use of recurrent selection for inter-population improvement are discussed. Current findings of simulation studies and quantitative genetic considerations suggest that long-term selection gain of hybrid breeding can be increased by implementing a two-part selection strategy based on reciprocal recurrent genomic selection. This would strengthen the competitiveness of hybrid versus line breeding facilitating to develop outstanding hybrid varieties also for self-pollinating plants such as wheat.
Collapse
Affiliation(s)
- Maximilian Rembe
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Germany
| | - Yusheng Zhao
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Germany
| | - Yong Jiang
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Germany
| | - Jochen C Reif
- Department of Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Germany.
| |
Collapse
|
40
|
Rasheed A, Xia X. From markers to genome-based breeding in wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:767-784. [PMID: 30673804 DOI: 10.1007/s00122-019-03286-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 01/16/2019] [Indexed: 05/22/2023]
Abstract
Recent technological advances in wheat genomics provide new opportunities to uncover genetic variation in traits of breeding interest and enable genome-based breeding to deliver wheat cultivars for the projected food requirements for 2050. There has been tremendous progress in development of whole-genome sequencing resources in wheat and its progenitor species during the last 5 years. High-throughput genotyping is now possible in wheat not only for routine gene introgression but also for high-density genome-wide genotyping. This is a major transition phase to enable genome-based breeding to achieve progressive genetic gains to parallel to projected wheat production demands. These advances have intrigued wheat researchers to practice less pursued analytical approaches which were not practiced due to the short history of genome sequence availability. Such approaches have been successful in gene discovery and breeding applications in other crops and animals for which genome sequences have been available for much longer. These strategies include, (i) environmental genome-wide association studies in wheat genetic resources stored in genbanks to identify genes for local adaptation by using agroclimatic traits as phenotypes, (ii) haplotype-based analyses to improve the statistical power and resolution of genomic selection and gene mapping experiments, (iii) new breeding strategies for genome-based prediction of heterosis patterns in wheat, and (iv) ultimate use of genomics information to develop more efficient and robust genome-wide genotyping platforms to precisely predict higher yield potential and stability with greater precision. Genome-based breeding has potential to achieve the ultimate objective of ensuring sustainable wheat production through developing high yielding, climate-resilient wheat cultivars with high nutritional quality.
Collapse
Affiliation(s)
- Awais Rasheed
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China
- International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, 12 Zhongguancun South Street, Beijing, 100081, China
- Department of Plant Sciences, Quaid-i-Azam University, Islamabad, 45320, Pakistan
| | - Xianchun Xia
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), 12 Zhongguancun South Street, Beijing, 100081, China.
| |
Collapse
|