1
|
Chegini A, Strandén I, Karaman E, Iso-Touru T, Pösö J, Aamand GP, Lidauer MH. Marker weighting improves single-step genomic prediction reliabilities of udder health traits in Nordic Red and Jersey dairy cattle populations. J Dairy Sci 2024:S0022-0302(24)01196-2. [PMID: 39369893 DOI: 10.3168/jds.2024-25374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 09/04/2024] [Indexed: 10/08/2024]
Abstract
The standard single-step genomic prediction model assumes that all SNP markers explain an equal amount of genetic variance, which, however, may not be true. This is because SNPs are located in or near different genes with different functions. Therefore, it seems logical to consider SNP marker-specific weights when predicting genomic breeding values. We hypothesized that allowing differences in the amount of genetic variance explained by each SNP marker will improve prediction reliability and response to selection. To investigate this hypothesis, we first developed multi-trait standard single-step genomic models based on the current multi-trait random regression evaluation models for udder health traits of the Nordic Red (RDC) and Jersey (JER) dairy cattle populations. The models included 4 clinical mastitis (CM) traits, 3 test-day somatic cell score (SCS) traits, and the conformation traits fore udder attachment and udder depth. In the second step, we investigated the effect of applying different SNP marker weighting scenarios in the single-step genomic prediction models, for which a single-step SNP best linear unbiased prediction model was applied. We investigated the prediction reliability of the different models by forward prediction, where the last 4 years of the data were removed to estimate breeding values for validation candidates. In addition, genetic trends of the pedigree-based estimated breeding values (PEBV) and genomic enhanced breeding values (GEBV) were examined. The data sets for RDC and JER included 6.9 and 1.2 million animals of which 5.6 and 0.9 million cows had records, respectively. The number of genotyped animals was 125,789 and 64,777 for RDC and JER, respectively. Cows had repeated SCS observations but only single observations for all other traits and breeding values for all traits were modeled by one covariance function. This required modeling 12 eigenvalue breeding value coefficients for each cow and developing SNP marker weights for the principal components rather than for the biological traits. We investigated 3 SNP marker weighting scenarios: 1) a nonlinear method similar to BayesA, 2) using the classical formula 2pqû2 that accounts for allele heterozygosity, and 3) applying a mean SNP weight calculated by 2pqû2 for every 20 adjacent SNP markers. Bias, dispersion, and prediction reliability were calculated using PEBV or GEBV from the evaluation based on the full data set on those using the reduced data set. We found that the recent favorable genetic trend in CM and SCS has been accelerated since the introduction of genomic selection. The study also shows that a significant increase in prediction reliability, i.e., 0.74 vs. 0.48 for RDC and 0.72 vs. 0.41 for JER cows for CM, can be achieved with a standard single-step genomic prediction model compared with a pedigree-based prediction model. Almost all scenarios with SNP marker weighting further improved the prediction reliability between 0.5% and 12.7%. The highest improvement was achieved by weighing the SNP markers based on the 2pqû2 formula.
Collapse
Affiliation(s)
- Arash Chegini
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland.
| | - Ismo Strandén
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Terhi Iso-Touru
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland
| | | | - Gert P Aamand
- Nordic Cattle Genetic Evaluation (NAV), Agro Food Park 15, 8200 Aarhus, Denmark
| | - Martin H Lidauer
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland
| |
Collapse
|
2
|
Villar-Hernández BDJ, Pérez-Rodríguez P, Vitale P, Gerard G, Montesinos-Lopez OA, Saint Pierre C, Crossa J, Dreisigacker S. Optimizing Genomic Parental Selection for Categorical and Continuous-Categorical Multi-Trait Mixtures. Genes (Basel) 2024; 15:995. [PMID: 39202356 PMCID: PMC11353433 DOI: 10.3390/genes15080995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2024] [Revised: 07/20/2024] [Accepted: 07/25/2024] [Indexed: 09/03/2024] Open
Abstract
This study presents a novel approach for the optimization of genomic parental selection in breeding programs involving categorical and continuous-categorical multi-trait mixtures (CMs and CCMMs). Utilizing the Bayesian decision theory (BDT) and latent trait models within a multivariate normal distribution framework, we address the complexities of selecting new parental lines across ordinal and continuous traits for breeding. Our methodology enhances precision and flexibility in genetic selection, validated through extensive simulations. This unified approach presents significant potential for the advancement of genetic improvements in diverse breeding contexts, underscoring the importance of integrating both categorical and continuous traits in genomic selection frameworks.
Collapse
Affiliation(s)
- Bartolo de Jesús Villar-Hernández
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco CP 52640, Estado de México, Mexico; (B.d.J.V.-H.); (P.V.); (G.G.); (C.S.P.)
| | | | - Paolo Vitale
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco CP 52640, Estado de México, Mexico; (B.d.J.V.-H.); (P.V.); (G.G.); (C.S.P.)
| | - Guillermo Gerard
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco CP 52640, Estado de México, Mexico; (B.d.J.V.-H.); (P.V.); (G.G.); (C.S.P.)
| | | | - Carolina Saint Pierre
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco CP 52640, Estado de México, Mexico; (B.d.J.V.-H.); (P.V.); (G.G.); (C.S.P.)
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco CP 52640, Estado de México, Mexico; (B.d.J.V.-H.); (P.V.); (G.G.); (C.S.P.)
- Colegio de Postgraduados, Montecillos CP 56230, Estado de México, Mexico;
- Louisiana State University, Baton Rouge, LA 70803, USA
- Distinguish Scientist Fellowship Program and Department of Statistics and Operations Research, King Saud University, Riyah 11459, Saudi Arabia
| | - Susanne Dreisigacker
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco CP 52640, Estado de México, Mexico; (B.d.J.V.-H.); (P.V.); (G.G.); (C.S.P.)
| |
Collapse
|
3
|
Sarakul M, Elzo MA, Koonawootrittriron S, Suwanasopee T, Jattawa D, Laodim T. A comparison of five sets of overlapping and non-overlapping sliding windows for semen production traits in the Thai multibreed dairy population. Anim Biosci 2024; 37:428-436. [PMID: 37946424 PMCID: PMC10915195 DOI: 10.5713/ab.23.0230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 09/03/2023] [Accepted: 10/02/2023] [Indexed: 11/12/2023] Open
Abstract
OBJECTIVE This study compared five distinct sets of biological pathways and associated genes related to semen volume (VOL), number of sperm (NS), and sperm motility (MOT) in the Thai multibreed dairy population. METHODS The phenotypic data included 13,533 VOL records, 12,773 NS records, and 12,660 MOT records from 131 bulls. The genotypic data consisted of 76,519 imputed and actual single nucleotide polymorphisms (SNPs) from 72 animals. The SNP additive genetic variances for VOL, NS, and MOT were estimated for SNP windows of one SNP (SW1), ten SNP (SW10), 30 SNP (SW30), 50 SNP (SW50), and 100 SNP (SW100) using a single-step genomic best linear unbiased prediction approach. The fixed effects in the model were contemporary group, ejaculate order, bull age, ambient temperature, and heterosis. The random effects accounted for animal additive genetic effects, permanent environment effects, and residual. The SNPs explaining at least 0.001% of the additive genetic variance in SW1, 0.01% in SW10, 0.03% in SW30, 0.05% in SW50, and 0.1% in SW100 were selected for gene identification through the NCBI database. The pathway analysis utilized genes associated with the identified SNP windows. RESULTS Comparison of overlapping and non-overlapping SNP windows revealed notable differences among the identified pathways and genes associated with the studied traits. Overlapping windows consistently yielded a larger number of shared biological pathways and genes than non-overlapping windows. In particular, overlapping SW30 and SW50 identified the largest number of shared pathways and genes in the Thai multibreed dairy population. CONCLUSION This study yielded valuable insights into the genetic architecture of VOL, NS, and MOT. It also highlighted the importance of assessing overlapping and non-overlapping SNP windows of various sizes for their effectiveness to identify shared pathways and genes influencing multiple traits.
Collapse
Affiliation(s)
- Mattaneeya Sarakul
- Department of Animal Science, Nakhon Phanom University, Nakhon Phanom, 48000,
Thailand
| | - Mauricio A. Elzo
- Department of Animal Sciences, University of Florida, Gainesville, FL 32611-0910,
USA
| | | | | | - Danai Jattawa
- Department of Animal Science, Kasetsart University, Bangkok 10900,
Thailand
| | - Thawee Laodim
- Department of Animal Science, Kasetsart University, Kamphaeng Saen Campus, Nakhon Pathom 73140,
Thailand
| |
Collapse
|
4
|
Misztal I, Lourenco D. Potential negative effects of genomic selection. J Anim Sci 2024; 102:skae155. [PMID: 38847068 PMCID: PMC11229339 DOI: 10.1093/jas/skae155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 06/05/2024] [Indexed: 07/10/2024] Open
Abstract
Initial findings on genomic selection (GS) indicated substantial improvement for major traits, such as performance, and even successful selection for antagonistic traits. However, recent unofficial reports indicate an increased frequency of deterioration of secondary traits. This phenomenon may arise due to the mismatch between the accelerated selection process and resource allocation. Traits explicitly or implicitly accounted for by a selection index move toward the desired direction, whereas neglected traits change according to the genetic correlations with selected traits. Historically, the first stage of commercial genetic selection focused on production traits. After long-term selection, production traits improved, whereas fitness traits deteriorated, although this deterioration was partially compensated for by constantly improving management. Adding these fitness traits to the breeding objective and the used selection index also helped offset their decline while promoting long-term gains. Subsequently, the trend in observed fitness traits was a combination of a negative response due to genetic antagonism, positive response from inclusion in the selection index, and a positive effect of improving management. Under GS, the genetic trends accelerate, especially for well-recorded higher heritability traits, magnifying the negatively correlated responses for fitness traits. Then, the observed trend for fitness traits can become negative, especially because management modifications do not accelerate under GS. Additional deterioration can occur due to the rapid turnover of GS, as heritabilities for production traits can decline and the genetic antagonism between production and fitness traits can intensify. If the genetic parameters are not updated, the selection index will be inaccurate, and the intended gains will not occur. While the deterioration can accelerate for unrecorded or sparsely recorded fitness traits, GS can lead to an improvement for widely recorded fitness traits. In the context of GS, it is crucial to look for unexpected changes in relevant traits and take rapid steps to prevent further declines, especially in secondary traits. Changes can be anticipated by investigating the temporal dynamics of genetic parameters, especially genetic correlations. However, new methods are needed to estimate genetic parameters for the last generation with large amounts of genomic data.
Collapse
Affiliation(s)
- Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
5
|
Berry DP, Spangler ML. Animal board invited review: Practical applications of genomic information in livestock. Animal 2023; 17:100996. [PMID: 37820404 DOI: 10.1016/j.animal.2023.100996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 09/08/2023] [Accepted: 09/11/2023] [Indexed: 10/13/2023] Open
Abstract
Access to high-dimensional genomic information in many livestock species is accelerating. This has been greatly aided not only by continual reductions in genotyping costs but also an expansion in the services available that leverage genomic information to create a greater return-on-investment. Genomic information on individual animals has many uses including (1) parentage verification and discovery, (2) traceability, (3) karyotyping, (4) sex determination, (5) reporting and monitoring of mutations conferring major effects or congenital defects, (6) better estimating inbreeding of individuals and coancestry among individuals, (7) mating advice, (8) determining breed composition, (9) enabling precision management, and (10) genomic evaluations; genomic evaluations exploit genome-wide genotype information to improve the accuracy of predicting an animal's (and by extension its progeny's) genetic merit. Genomic data also provide a huge resource for research, albeit the outcome from this research, if successful, should eventually be realised through one of the ten applications already mentioned. The process for generating a genotype all the way from sample procurement to identifying erroneous genotypes is described, as are the steps that should be considered when developing a bespoke genotyping panel for practical application.
Collapse
Affiliation(s)
- D P Berry
- Animal & Grassland Research and Innovation Centre, Teagasc, Moorepark, Fermoy, Cork, Ireland.
| | - M L Spangler
- Department of Animal Science, University of Nebraska-Lincoln, Lincoln, NE, United States
| |
Collapse
|
6
|
Sinha D, Maurya AK, Abdi G, Majeed M, Agarwal R, Mukherjee R, Ganguly S, Aziz R, Bhatia M, Majgaonkar A, Seal S, Das M, Banerjee S, Chowdhury S, Adeyemi SB, Chen JT. Integrated Genomic Selection for Accelerating Breeding Programs of Climate-Smart Cereals. Genes (Basel) 2023; 14:1484. [PMID: 37510388 PMCID: PMC10380062 DOI: 10.3390/genes14071484] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 07/14/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
Rapidly rising population and climate changes are two critical issues that require immediate action to achieve sustainable development goals. The rising population is posing increased demand for food, thereby pushing for an acceleration in agricultural production. Furthermore, increased anthropogenic activities have resulted in environmental pollution such as water pollution and soil degradation as well as alterations in the composition and concentration of environmental gases. These changes are affecting not only biodiversity loss but also affecting the physio-biochemical processes of crop plants, resulting in a stress-induced decline in crop yield. To overcome such problems and ensure the supply of food material, consistent efforts are being made to develop strategies and techniques to increase crop yield and to enhance tolerance toward climate-induced stress. Plant breeding evolved after domestication and initially remained dependent on phenotype-based selection for crop improvement. But it has grown through cytological and biochemical methods, and the newer contemporary methods are based on DNA-marker-based strategies that help in the selection of agronomically useful traits. These are now supported by high-end molecular biology tools like PCR, high-throughput genotyping and phenotyping, data from crop morpho-physiology, statistical tools, bioinformatics, and machine learning. After establishing its worth in animal breeding, genomic selection (GS), an improved variant of marker-assisted selection (MAS), has made its way into crop-breeding programs as a powerful selection tool. To develop novel breeding programs as well as innovative marker-based models for genetic evaluation, GS makes use of molecular genetic markers. GS can amend complex traits like yield as well as shorten the breeding period, making it advantageous over pedigree breeding and marker-assisted selection (MAS). It reduces the time and resources that are required for plant breeding while allowing for an increased genetic gain of complex attributes. It has been taken to new heights by integrating innovative and advanced technologies such as speed breeding, machine learning, and environmental/weather data to further harness the GS potential, an approach known as integrated genomic selection (IGS). This review highlights the IGS strategies, procedures, integrated approaches, and associated emerging issues, with a special emphasis on cereal crops. In this domain, efforts have been taken to highlight the potential of this cutting-edge innovation to develop climate-smart crops that can endure abiotic stresses with the motive of keeping production and quality at par with the global food demand.
Collapse
Affiliation(s)
- Dwaipayan Sinha
- Department of Botany, Government General Degree College, Mohanpur 721436, India
| | - Arun Kumar Maurya
- Department of Botany, Multanimal Modi College, Modinagar, Ghaziabad 201204, India
| | - Gholamreza Abdi
- Department of Biotechnology, Persian Gulf Research Institute, Persian Gulf University, Bushehr 75169, Iran
| | - Muhammad Majeed
- Department of Botany, University of Gujrat, Punjab 50700, Pakistan
| | - Rachna Agarwal
- Applied Genomics Section, Bhabha Atomic Research Centre, Mumbai 400085, India
| | - Rashmi Mukherjee
- Research Center for Natural and Applied Sciences, Department of Botany (UG & PG), Raja Narendralal Khan Women's College, Gope Palace, Midnapur 721102, India
| | - Sharmistha Ganguly
- Department of Dravyaguna, Institute of Post Graduate Ayurvedic Education and Research, Kolkata 700009, India
| | - Robina Aziz
- Department of Botany, Government, College Women University, Sialkot 51310, Pakistan
| | - Manika Bhatia
- TERI School of Advanced Studies, New Delhi 110070, India
| | - Aqsa Majgaonkar
- Department of Botany, St. Xavier's College (Autonomous), Mumbai 400001, India
| | - Sanchita Seal
- Department of Botany, Polba Mahavidyalaya, Polba 712148, India
| | - Moumita Das
- V. Sivaram Research Foundation, Bangalore 560040, India
| | - Swastika Banerjee
- Department of Botany, Kairali College of +3 Science, Champua, Keonjhar 758041, India
| | - Shahana Chowdhury
- Department of Biotechnology, Faculty of Engineering Sciences, German University Bangladesh, TNT Road, Telipara, Chandona Chowrasta, Gazipur 1702, Bangladesh
| | - Sherif Babatunde Adeyemi
- Ethnobotany/Phytomedicine Laboratory, Department of Plant Biology, Faculty of Life Sciences, University of Ilorin, Ilorin P.M.B 1515, Nigeria
| | - Jen-Tsung Chen
- Department of Life Sciences, National University of Kaohsiung, Kaohsiung 811, Taiwan
| |
Collapse
|
7
|
Vahedi SM, Salek Ardetani S, Brito LF, Karimi K, Pahlavan Afshari K, Banabazi MH. Expanding the application of haplotype-based genomic predictions to the wild: A case of antibody response against Teladorsagia circumcincta in Soay sheep. BMC Genomics 2023; 24:335. [PMID: 37330501 PMCID: PMC10276919 DOI: 10.1186/s12864-023-09407-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 05/24/2023] [Indexed: 06/19/2023] Open
Abstract
BACKGROUND Genomic prediction of breeding values (GP) has been adopted in evolutionary genomic studies to uncover microevolutionary processes of wild populations or improve captive breeding strategies. While recent evolutionary studies applied GP with individual single nucleotide polymorphism (SNP), haplotype-based GP could outperform individual SNP predictions through better capturing the linkage disequilibrium (LD) between the SNP and quantitative trait loci (QTL). This study aimed to evaluate the accuracy and bias of haplotype-based GP of immunoglobulin (Ig) A (IgA), IgE, and IgG against Teladorsagia circumcincta in lambs of an unmanaged sheep population (Soay breed) based on Genomic Best Linear Unbiased Prediction (GBLUP) and five Bayesian [BayesA, BayesB, BayesCπ, Bayesian Lasso (BayesL), and BayesR] methods. RESULTS The accuracy and bias of GPs using SNP, haplotypic pseudo-SNP from blocks with different LD thresholds (0.15, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1.00), or the combinations of pseudo-SNPs and non-LD clustered SNPs were obtained. Across methods and marker sets, higher ranges of genomic estimated breeding values (GEBV) accuracies were observed for IgA (0.20 to 0.49), followed by IgE (0.08 to 0.20) and IgG (0.05 to 0.14). Considering the methods evaluated, up to 8% gains in GP accuracy of IgG were achieved using pseudo-SNPs compared to SNPs. Up to 3% gain in GP accuracy for IgA was also obtained using the combinations of the pseudo-SNPs with non-clustered SNPs in comparison to fitting individual SNP. No improvement in GP accuracy of IgE was observed using haplotypic pseudo-SNPs or their combination with non-clustered SNPs compared to individual SNP. Bayesian methods outperformed GBLUP for all traits. Most scenarios yielded lower accuracies for all traits with an increased LD threshold. GP models using haplotypic pseudo-SNPs predicted less-biased GEBVs mainly for IgG. For this trait, lower bias was observed with higher LD thresholds, whereas no distinct trend was observed for other traits with changes in LD. CONCLUSIONS Haplotype information improves GP performance of anti-helminthic antibody traits of IgA and IgG compared to fitting individual SNP. The observed gains in the predictive performances indicate that haplotype-based methods could benefit GP of some traits in wild animal populations.
Collapse
Affiliation(s)
- Seyed Milad Vahedi
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, B2N5E3, Canada
| | | | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Karim Karimi
- Molecular Diagnostics Program, Verspeeten Clinical Genome Centre, London Health Sciences Centre, London, ON, N6A 5W9, Canada
| | - Kian Pahlavan Afshari
- Department of Animal Sciences, Islamic Azad University, Varamin, Varamin-Pishva Branch3381774895, Iran
| | - Mohammad Hossein Banabazi
- Department of Animal Breeding and Genetics (HGEN), Centre for Veterinary Medicine and Animal Science (VHC), Swedish University of Agricultural Sciences (SLU), 75007, Uppsala, Sweden.
- Department of Biotechnology, Animal Science Research Institute of IRAN (ASRI), Agricultural Research, Education & Extension Organization (AREEO), Karaj, 3146618361, Iran.
| |
Collapse
|
8
|
Alemu A, Batista L, Singh PK, Ceplitis A, Chawade A. Haplotype-tagged SNPs improve genomic prediction accuracy for Fusarium head blight resistance and yield-related traits in wheat. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:92. [PMID: 37009920 PMCID: PMC10068637 DOI: 10.1007/s00122-023-04352-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 03/21/2023] [Indexed: 06/19/2023]
Abstract
Linkage disequilibrium (LD)-based haplotyping with subsequent SNP tagging improved the genomic prediction accuracy up to 0.07 and 0.092 for Fusarium head blight resistance and spike width, respectively, across six different models. Genomic prediction is a powerful tool to enhance genetic gain in plant breeding. However, the method is accompanied by various complications leading to low prediction accuracy. One of the major challenges arises from the complex dimensionality of marker data. To overcome this issue, we applied two pre-selection methods for SNP markers viz. LD-based haplotype-tagging and GWAS-based trait-linked marker identification. Six different models were tested with preselected SNPs to predict the genomic estimated breeding values (GEBVs) of four traits measured in 419 winter wheat genotypes. Ten different sets of haplotype-tagged SNPs were selected by adjusting the level of LD thresholds. In addition, various sets of trait-linked SNPs were identified with different scenarios from the training-test combined and only from the training populations. The BRR and RR-BLUP models developed from haplotype-tagged SNPs had a higher prediction accuracy for FHB and SPW by 0.07 and 0.092, respectively, compared to the corresponding models developed without marker pre-selection. The highest prediction accuracy for SPW and FHB was achieved with tagged SNPs pruned at weak LD thresholds (r2 < 0.5), while stringent LD was required for spike length (SPL) and flag leaf area (FLA). Trait-linked SNPs identified only from training populations failed to improve the prediction accuracy of the four studied traits. Pre-selection of SNPs via LD-based haplotype-tagging could play a vital role in optimizing genomic selection and reducing genotyping costs. Furthermore, the method could pave the way for developing low-cost genotyping methods through customized genotyping platforms targeting key SNP markers tagged to essential haplotype blocks.
Collapse
Affiliation(s)
- Admas Alemu
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - Pawan K Singh
- International Maize and Wheat Improvement Center, Texcoco, Mexico
| | | | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| |
Collapse
|
9
|
Dash S, Singh A, Dixit SP, Kumar A, Behera R. Exploring haplotype block structure, runs of homozygosity, and effective population size among dairy cattle breeds of India. Trop Anim Health Prod 2023; 55:129. [PMID: 36952060 DOI: 10.1007/s11250-023-03534-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 03/06/2023] [Indexed: 03/24/2023]
Abstract
The present study aimed to explore haplotype structure, runs of homozygosity (ROH), effective population size and persistence of gametic phase among three indigenous dairy cattle breeds, viz., Sahiwal (n = 19), Tharparkar (n = 17), and Gir (n = 16) by using BovineHD single nucleotide polymorphism (SNP) genotyping assay. The filtered SNPs after quality control ranged from 44% in Sahiwal to 53% in Gir. The highest number of haplotype blocks was observed in Tharparkar (15,640) and the lowest in Sahiwal (8027) spanning 17.3% and 7.8% of genome, respectively. The average block length was found close to 26 kb which suggests that multiple recombination events fragmented the ancestral haplotypes into smaller sizes. Gir cattle had the largest number of runs of homozygosity (ROH) regions (1762) followed by Tharparkar (1528) and Sahiwal (1138). Without pedigree information, inbreeding coefficients estimated from ROH (FROH) revealed that Gir had the highest FROH (0.099) proposing more inbreeding rate in this population. Effective population size (Ne) decreased slowly over the last 60 generations and at 13 generations ago; Ne was estimated as 70 for all the three dairy breeds. The highest gametic phase correlation (r = 0.78) was observed for Sahiwal and Tharparkar breed pair suggesting formulation of multi-breed reference population for successful implementation of genomic selection among dairy breeds. The decline in effective population size among native Indian cattle breeds may help in formulating strategies for conservation and genetic improvement of native germplasm for future use.
Collapse
Affiliation(s)
- Soumya Dash
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India.
- School of Crop Health Policy Support Research, ICAR-National Institute of Biotic Stress Management, Raipur, 493225, Chhattisgarh, India.
| | - Avtar Singh
- Animal Genetics and Breeding Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - S P Dixit
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - Avnish Kumar
- Animal Genetics Division, ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - Rajalaxmi Behera
- Regional Centre, ICAR-Directorate of Poultry Research, Bhubaneswar, 751003, Odisha, India
| |
Collapse
|
10
|
Exploring the statistical nature of independent chromosome segments. Livest Sci 2023. [DOI: 10.1016/j.livsci.2023.105207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2023]
|
11
|
Araujo AC, Carneiro PLS, Oliveira HR, Lewis RM, Brito LF. SNP- and haplotype-based single-step genomic predictions for body weight, wool, and reproductive traits in North American Rambouillet sheep. J Anim Breed Genet 2023; 140:216-234. [PMID: 36408677 PMCID: PMC10099590 DOI: 10.1111/jbg.12748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Accepted: 10/23/2022] [Indexed: 11/22/2022]
Abstract
Rambouillet sheep are commonly raised in extensive grazing systems in the US, mainly for wool and meat production. Genomic evaluations in US sheep breeds, including Rambouillet, are still incipient. Therefore, we aimed to evaluate the feasibility of performing genomic prediction of breeding values for various traits in Rambouillet sheep based on single nucleotide polymorphisms (SNP) or haplotypes (fitted as pseudo-SNP) under a single-step GBLUP approach. A total of 28,834 records for birth weight (BWT), 23,306 for postweaning weight (PWT), 5,832 for yearling weight (YWT), 9,880 for yearling fibre diameter (YFD), 11,872 for yearling greasy fleece weight (YGFW), and 15,984 for number of lambs born (NLB) were used in this study. Seven hundred forty-one individuals were genotyped using a moderate (50 K; n = 677) or high (600 K; n = 64) density SNP panel, in which 32 K SNP in common between the two SNP panels (after genotypic quality control) were used for further analyses. Single-step genomic predictions using SNP (H-BLUP) or haplotypes (HAP-BLUP) from blocks with different linkage disequilibrium (LD) thresholds (0.15, 0.35, 0.50, 0.65, and 0.80) were evaluated. We also considered different blending parameters when constructing the genomic relationship matrix used to predict the genomic-enhanced estimated breeding values (GEBV), with alpha equal to 0.95 or 0.50. The GEBV were compared to the estimated breeding values (EBV) obtained from traditional pedigree-based evaluations (A-BLUP). The mean theoretical accuracy ranged from 0.499 (A-BLUP for PWT) to 0.795 (HAP-BLUP using haplotypes from blocks with LD threshold of 0.35 and alpha equal to 0.95 for YFD). The prediction accuracies ranged from 0.143 (A-BLUP for PWT) to 0.330 (A-BLUP for YGFW) while the prediction bias ranged from -0.104 (H-BLUP for PWT) to 0.087 (HAP-BLUP using haplotypes from blocks with LD threshold of 0.15 and alpha equal to 0.95 for YGFW). The GEBV dispersion ranged from 0.428 (A-BLUP for PWT) to 1.035 (A-BLUP for YGFW). Similar results were observed for H-BLUP or HAP-BLUP, independently of the LD threshold to create the haplotypes, alpha value, or trait analysed. Using genomic information (fitting individual SNP or haplotypes) provided similar or higher prediction and theoretical accuracies and reduced the dispersion of the GEBV for body weight, wool, and reproductive traits in Rambouillet sheep. However, there were no clear improvements in the prediction bias when compared to pedigree-based predictions. The next step will be to enlarge the training populations for this breed to increase the benefits of genomic predictions.
Collapse
Affiliation(s)
- Andre C. Araujo
- Graduate Program in Animal SciencesState University of Southwestern BahiaItapetingaBahiaBrazil
- Department of Animal SciencesPurdue UniversityWest LafayetteIndianaUSA
| | | | | | - Ronald M. Lewis
- Department of Animal SciencesUniversity of Nebraska‐LincolnLincolnNebraskaUSA
| | - Luiz F. Brito
- Department of Animal SciencesPurdue UniversityWest LafayetteIndianaUSA
| |
Collapse
|
12
|
Angarita Barajas BK, Cantet RJC, Steibel JP, Schrauf MF, Forneris NS. Heritability estimates and predictive ability for pig meat quality traits using identity-by-state and identity-by-descent relationships in an F 2 population. J Anim Breed Genet 2023; 140:13-27. [PMID: 36300585 DOI: 10.1111/jbg.12742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 10/05/2022] [Indexed: 12/13/2022]
Abstract
Genomic relationships can be computed with dense genome-wide genotypes through different methods, either based on identity-by-state (IBS) or identity-by-descent (IBD). The latter has been shown to increase the accuracy of both estimated relationships and predicted breeding values. However, it is not clear whether an IBD approach would achieve greater heritability ( h 2 ) and predictive ability ( r ̂ y , y ̂ ) than its IBS counterpart for data with low-depth pedigrees. Here, we compare both approaches in terms of the estimated of h 2 and r ̂ y , y ̂ , using data on meat quality and carcass traits recorded in experimental crossbred pigs, with a pedigree constrained to only three generations. Three animal models were fitted which differed on the relationship matrix: an IBS model ( G IBS ), an IBD (defined within the known pedigree) model ( G IBD ), and a pedigree model ( A 22 ). In 9 of 20 traits, the range of increase for the estimates of σ u 2 and h 2 was 1.2-2.9 times greater with G IBS and G IBD models than with A 22 . Whereas for all traits, both parameters were similar between genomic models. The r ̂ y , y ̂ of the genomic models was higher compared to A 22 . A scarce increment in r ̂ y , y ̂ was found with G IBS when compared to G IBD , most likely due to the former recovering sizeable relationships among founder F0 animals.
Collapse
Affiliation(s)
| | - Rodolfo J C Cantet
- Instituto de Investigaciones en Producción Animal (INPA-CONICET-UBA), Buenos Aires, Argentina.,Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Juan P Steibel
- Department of Animal Science, Michigan State University, East Lansing, Michigan, USA.,Department of Fisheries and Wildlife, Michigan State University, East Lansing, Michigan, USA
| | - Matias F Schrauf
- Departamento de Métodos Cuantitativos y Sistemas de Información, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina.,Animal Breeding & Genomics, Wageningen Livestock Research, Wageningen University & Research, Wageningen, The Netherlands
| | - Natalia S Forneris
- Instituto de Investigaciones en Producción Animal (INPA-CONICET-UBA), Buenos Aires, Argentina.,Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
13
|
Li H, Wang Z, Xu L, Li Q, Gao H, Ma H, Cai W, Chen Y, Gao X, Zhang L, Gao H, Zhu B, Xu L, Li J. Genomic prediction of carcass traits using different haplotype block partitioning methods in beef cattle. Evol Appl 2022; 15:2028-2042. [PMID: 36540636 PMCID: PMC9753827 DOI: 10.1111/eva.13491] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 09/18/2022] [Indexed: 09/22/2023] Open
Abstract
Genomic prediction (GP) based on haplotype alleles can capture quantitative trait loci (QTL) effects and increase predictive ability because the haplotypes are expected to be in linkage disequilibrium (LD) with QTL. In this study, we constructed haploblocks using LD-based and the fixed number of single nucleotide polymorphisms (fixed-SNP) methods with Illumina BovineHD chip in beef cattle. To evaluate the performance of different haplotype block partitioning methods, we constructed haploblocks based on LD thresholds (from r 2 > 0.2 to r 2 > 0.8) and the number of fixed-SNPs (5, 10, 20). The performance of predictive methods for three carcass traits including liveweight (LW), dressing percentage (DP), and longissimus dorsi muscle weight (LDMW) was evaluated using three approaches (GBLUP and BayesB model based on the SNP, GHBLUP, and BayesBH models based on the haploblock, and GHBLUP+GBLUP and BayesBH+BayesB models based on the combined haploblock and the nonblocked SNPs, which were located between blocks). In this study, we found the accuracies of LD-based and fixed-SNP haplotype Bayesian methods outperformed the Bayesian models (up to 8.54 ± 7.44% and 5.74 ± 2.95%, respectively). GHBLUP showed a high improvement (up to 11.29 ± 9.87%) compared with GBLUP. The Bayesian models have higher accuracies than BLUP models in most scenarios. The average computing time of the BayesBH+BayesB model can reduce by 29.3% compared with the BayesB model. The prediction accuracies using the LD-based haplotype method showed higher improvements than the fixed-SNP haplotype method. In addition, to avoid the influence of rare haplotypes generated from haplotype construction, we compared the performance of GP by filtering four types of minor haplotype allele frequency (MHAF) (0.01, 0.025, 0.05, and 0.1) under different conditions (LD levels were set at r 2 > 0.3, and the fixed number of SNPs was 5). We found the optimal MHAF threshold for LW was 0.01, and the optimal MHAF threshold for DP and LDMW was 0.025.
Collapse
Affiliation(s)
- Hongwei Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Lei Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Qian Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Han Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Haoran Ma
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Wentao Cai
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal SciencesChinese Academy of Agricultural SciencesBeijingChina
| |
Collapse
|
14
|
Narayana SG, de Jong E, Schenkel FS, Fonseca PA, Chud TC, Powel D, Wachoski-Dark G, Ronksley PE, Miglior F, Orsel K, Barkema HW. Underlying genetic architecture of resistance to mastitis in dairy cattle: A systematic review and gene prioritization analysis of genome-wide association studies. J Dairy Sci 2022; 106:323-351. [DOI: 10.3168/jds.2022-21923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 08/01/2022] [Indexed: 11/05/2022]
|
15
|
Ye H, Zhang Z, Ren D, Cai X, Zhu Q, Ding X, Zhang H, Zhang Z, Li J. Genomic Prediction Using LD-Based Haplotypes in Combined Pig Populations. Front Genet 2022; 13:843300. [PMID: 35754827 PMCID: PMC9218795 DOI: 10.3389/fgene.2022.843300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2021] [Accepted: 05/02/2022] [Indexed: 11/13/2022] Open
Abstract
The size of reference population is an important factor affecting genomic prediction. Thus, combining different populations in genomic prediction is an attractive way to improve prediction ability. However, combining multireference population roughly cannot increase the prediction accuracy as well as expected in pig. This may be due to different linkage disequilibrium (LD) pattern differences between population. In this study, we used the imputed whole-genome sequencing (WGS) data to construct LD-based haplotypes for genomic prediction in combined population to explore the impact of different single-nucleotide polymorphism (SNP) densities, variant representation (SNPs or haplotype alleles), and reference population size on the prediction accuracy for reproduction traits. Our results showed that genomic best linear unbiased prediction (GBLUP) using the WGS data can improve prediction accuracy in multi-population but not within-population. Not only the genomic prediction accuracy of the haplotype method using 80 K chip data in multi-population but also GBLUP for the multi-population (3.4–5.9%) was higher than that within-population (1.2–4.3%). More importantly, we have found that using the haplotype method based on the WGS data in multi-population has better genomic prediction performance, and our results showed that building haploblock in this scenario based on low LD threshold (r2 = 0.2–0.3) produced an optimal set of variables for reproduction traits in Yorkshire pig population. Our results suggested that whether the use of the haplotype method based on the chip data or GBLUP (individual SNP method) based on the WGS data were beneficial for genomic prediction in multi-population, while simultaneously combining the haplotype method and WGS data was a better strategy for multi-population genomic evaluation.
Collapse
Affiliation(s)
- Haoqiang Ye
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zipeng Zhang
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Duanyang Ren
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiaodian Cai
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Qianghui Zhu
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hao Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zhe Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
16
|
Seyum EG, Bille NH, Abtew WG, Rastas P, Arifianto D, Domonhédo H, Cochard B, Jacob F, Riou V, Pomiès V, Lopez D, Bell JM, Cros D. Genome properties of key oil palm (Elaeis guineensis Jacq.) breeding populations. J Appl Genet 2022; 63:633-650. [PMID: 35691996 DOI: 10.1007/s13353-022-00708-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 05/26/2022] [Accepted: 06/04/2022] [Indexed: 11/29/2022]
Abstract
A good knowledge of the genome properties of the populations makes it possible to optimize breeding methods, in particular genomic selection (GS). In oil palm (Elaeis guineensis Jacq), the world's main source of vegetable oil, this would provide insight into the promising GS results obtained so far. The present study considered two complex breeding populations, Deli and La Mé, with 943 individuals and 7324 single-nucleotide polymorphisms (SNPs) from genotyping-by-sequencing. Linkage disequilibrium (LD), haplotype sharing, effective size (Ne), and fixation index (Fst) were investigated. A genetic linkage map spanning 1778.52 cM and with a recombination rate of 2.85 cM/Mbp was constructed. The LD at r2=0.3, considered the minimum to get reliable GS results, spanned over 1.05 cM/0.22 Mbp in Deli and 0.9 cM/0.21 Mbp in La Mé. The significant degree of differentiation existing between Deli and La Mé was confirmed by the high Fst value (0.53), the pattern of correlation of SNP heterozygosity and allele frequency among populations, and the decrease of persistence of LD and of haplotype sharing among populations with increasing SNP distance. However, the level of resemblance between the two populations over short genomic distances (correlation of r values between populations >0.6 for SNPs separated by <0.5 cM/1 kbp and percentage of common haplotypes >40% for haplotypes <3600 bp/0.20 cM) likely explains the superiority of GS models ignoring the parental origin of marker alleles over models taking this information into account. The two populations had low Ne (<5). Population-specific genetic maps and reference genomes are recommended for future studies.
Collapse
Affiliation(s)
- Essubalew Getachew Seyum
- Department of Plant Biology and Physiology, Faculty of Sciences, University of Yaoundé I, Yaoundé, Cameroon
- CETIC (African Center of Excellence in Information and Communication Technologies), University of Yaoundé I, Yaoundé, Cameroon
- Department of Horticulture and Plant Sciences, Jimma University College of Agriculture and Veterinary Medicine, P.O. Box 307, Jimma, Ethiopia
| | - Ngalle Hermine Bille
- Department of Plant Biology and Physiology, Faculty of Sciences, University of Yaoundé I, Yaoundé, Cameroon
| | - Wosene Gebreselassie Abtew
- Department of Horticulture and Plant Sciences, Jimma University College of Agriculture and Veterinary Medicine, P.O. Box 307, Jimma, Ethiopia
| | - Pasi Rastas
- Institute of Biotechnology, Helsinki Institute of Life Science (HiLIFE), University of Helsinki, 00014, Helsinki, Finland
| | | | | | | | | | - Virginie Riou
- CIRAD (Centre de coopération Internationale en Recherche Agronomique pour le Développement), UMR AGAP Institut, F-34398, Montpellier, France
- UMR AGAP Institut, University of Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
| | - Virginie Pomiès
- CIRAD (Centre de coopération Internationale en Recherche Agronomique pour le Développement), UMR AGAP Institut, F-34398, Montpellier, France
- UMR AGAP Institut, University of Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
| | - David Lopez
- CIRAD (Centre de coopération Internationale en Recherche Agronomique pour le Développement), UMR AGAP Institut, F-34398, Montpellier, France
- UMR AGAP Institut, University of Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
| | - Joseph Martin Bell
- Department of Plant Biology and Physiology, Faculty of Sciences, University of Yaoundé I, Yaoundé, Cameroon
| | - David Cros
- CIRAD (Centre de coopération Internationale en Recherche Agronomique pour le Développement), UMR AGAP Institut, F-34398, Montpellier, France.
- UMR AGAP Institut, University of Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France.
| |
Collapse
|
17
|
Yan X, Zhang T, Liu L, Yu Y, Yang G, Han Y, Gong G, Wang F, Zhang L, Liu H, Li W, Yan X, Mao H, Li Y, Du C, Li J, Zhang Y, Wang R, Lv Q, Wang Z, Zhang J, Liu Z, Wang Z, Su R. Accuracy of Genomic Selection for Important Economic Traits of Cashmere and Meat Goats Assessed by Simulation Study. Front Vet Sci 2022; 9:770539. [PMID: 35372544 PMCID: PMC8966406 DOI: 10.3389/fvets.2022.770539] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 01/24/2022] [Indexed: 11/13/2022] Open
Abstract
Genomic selection in plants and animals has become a standard tool for breeding because of the advantages of high accuracy and short generation intervals. Implementation of this technology is hindered by the high cost of genotyping and other factors. The aim of this study was to determine an optional marker density panel and reference population size for using genomic selection of goats, with speculation on the number of QTLs that affect the important economic traits of goats. In addition, the effect of buck population size in the reference population on the accuracy of genomic estimated breeding value (GEBV) was discussed. Based on the previous genetic evaluation results of Inner Mongolia White Cashmere Goats, live body weight (LBW, h2 = 0.11) and fiber diameter (FD, h2 = 0.34) were chosen to perform genomic selection in this study. Reasonable genome parameters and generation transmission processes were set, and phenotypic and genotype data of the two traits were simulated. Then, different sizes of the reference population and validation population were selected from progeny. The GEBVs were obtained by six methods, including GBLUP (Genomic Best Linear Unbiased Prediction), ssGBLUP (Single Step Genomic Best Linear Unbiased Prediction), BayesA, BayesB, Bayesian ridge regression, and Bayesian LASSO. The correlation coefficient between the predicted and realized phenotypes from simulation was calculated and used as a measure of the accuracy of GEBV in each trait. The results showed that the medium marker density Panel (45 K) could be used for genomic selection in goats, which can ensure the accuracy of the GEBV. The reference population size of 1,500 can achieve greater genetic progress in genomic selection for fiber diameter and live body weight in goats by comparing with the population size below this level. The accuracy of the GEBV for live body weight and fiber diameter was better when the number of QTLs was 100 and 50, respectively. Additionally, the accuracy of GEBV was discovered to be good when the buck population size was up to 200. Meanwhile, the accuracy of the GEBV for medium heritability traits (FDs) was found to be higher than the accuracy of the GEBV for low heritability traits (LBWs). These findings will provide theoretical guidance for genomic selection in goats by using real data.
Collapse
Affiliation(s)
- Xiaochun Yan
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Tao Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- Inner Mongolia Bigvet Co., Ltd., Hohhot, China
| | - Lichun Liu
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot, China
| | - Yongsheng Yu
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Guang Yang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Yaqian Han
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Gao Gong
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Fenghong Wang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Lei Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Hongfu Liu
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Wenze Li
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Xiaomin Yan
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Haoyu Mao
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Yaming Li
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Chen Du
- Department of Obstetrics and Gynaecology, Inner Mongolia Medical University, Hohhot, China
| | - Jinquan Li
- Key Laboratory of Mutton Sheep Genetics and Breeding, Ministry of Agriculture, Hohhot, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction in Inner Mongolia Autonomous Region, Hohhot, China
- Engineering Research Centre for Goat Genetics and Breeding, Inner Mongolia Autonomous Region, Hohhot, China
| | - Yanjun Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Ruijun Wang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Qi Lv
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Zhixin Wang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Jiaxin Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Zhihong Liu
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Zhiying Wang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- *Correspondence: Zhiying Wang
| | - Rui Su
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- Rui Su
| |
Collapse
|
18
|
Wientjes YCJ, Bijma P, Calus MPL, Zwaan BJ, Vitezica ZG, van den Heuvel J. The long-term effects of genomic selection: 1. Response to selection, additive genetic variance, and genetic architecture. Genet Sel Evol 2022; 54:19. [PMID: 35255802 PMCID: PMC8900405 DOI: 10.1186/s12711-022-00709-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 02/10/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Genomic selection has revolutionized genetic improvement in animals and plants, but little is known about its long-term effects. Here, we investigated the long-term effects of genomic selection on response to selection, genetic variance, and the genetic architecture of traits using stochastic simulations. We defined the genetic architecture as the set of causal loci underlying each trait, their allele frequencies, and their statistical additive effects. We simulated a livestock population under 50 generations of phenotypic, pedigree, or genomic selection for a single trait, controlled by either only additive, additive and dominance, or additive, dominance, and epistatic effects. The simulated epistasis was based on yeast data.
Results
Short-term response was always greatest with genomic selection, while response after 50 generations was greater with phenotypic selection than with genomic selection when epistasis was present, and was always greater than with pedigree selection. This was mainly because loss of genetic variance and of segregating loci was much greater with genomic and pedigree selection than with phenotypic selection. Compared to pedigree selection, selection response was always greater with genomic selection. Pedigree and genomic selection lost a similar amount of genetic variance after 50 generations of selection, but genomic selection maintained more segregating loci, which on average had lower minor allele frequencies than with pedigree selection. Based on this result, genomic selection is expected to better maintain genetic gain after 50 generations than pedigree selection. The amount of change in the genetic architecture of traits was considerable across generations and was similar for genomic and pedigree selection, but slightly less for phenotypic selection. Presence of epistasis resulted in smaller changes in allele frequencies and less fixation of causal loci, but resulted in substantial changes in statistical additive effects across generations.
Conclusions
Our results show that genomic selection outperforms pedigree selection in terms of long-term genetic gain, but results in a similar reduction of genetic variance. The genetic architecture of traits changed considerably across generations, especially under selection and when non-additive effects were present. In conclusion, non-additive effects had a substantial impact on the accuracy of selection and long-term response to selection, especially when selection was accurate.
Collapse
|
19
|
Zhao J, Sauvage C, Bitton F, Causse M. Multiple haplotype-based analyses provide genetic and evolutionary insights into tomato fruit weight and composition. HORTICULTURE RESEARCH 2022; 9:uhab009. [PMID: 35039843 PMCID: PMC8771453 DOI: 10.1093/hr/uhab009] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 10/12/2021] [Accepted: 10/15/2021] [Indexed: 05/05/2023]
Abstract
Improving fruit quality traits such as metabolic composition remains a challenge for tomato breeders. To better understand the genetic architecture of these traits and decipher the demographic history of the loci controlling tomato quality traits, we applied an innovative approach using multiple haplotype-based analyses, aiming to test the potentials of haplotype based study in association and genomic prediction studies. We performed and compared haplotype vs SNP-based associations (hapQTL) with multi-locus mixed model (MLMM), focusing on tomato fruit weight and metabolite contents (i.e. sugars, organic acids and amino acids). Using a panel of 163 tomato accessions genotyped with 5995 SNPs, we detected a total of 784 haplotype blocks, with an average size of haplotype blocks ~58 kb. A total of 108 significant associations for 26 traits were detected thanks to Haplotype/SNP-based Bayes models. Haplotype-based Bayes model (97 associations) outperformed SNP-based Bayes model (50 associations) and MLMM (53 associations) in identifying marker-trait associations as well as in genomic prediction (especially for those traits with moderate to low heritability). To decipher the demographic history, we identified 24 positive selective sweeps using the integrated haplotype score (iHS). Most of the significant associations for tomato quality traits were located within selective sweeps (54.63% and 71.7% in hapQTL and MLMM models, respectively). Promising candidate genes were identified controlling tomato fruit weight and metabolite contents. We thus demonstrated the benefits of using haplotypes for evolutionary and genetic studies, providing novel insights into tomato quality improvement and breeding history.
Collapse
Affiliation(s)
- Jiantao Zhao
- INRA, UR1052, Centre de Recherche PACA, Génétique et Amélioration des Fruits et Légumes, Domaine Saint Maurice, 67 Allée des Chênes CS 60094 – 84140, Montfavet Cedex, France
- Boyce Thompson Institute for Plant Research, Cornell University, 533 Tower Road, Ithaca, NY 14853-1801, USA
| | - Christopher Sauvage
- INRA, UR1052, Centre de Recherche PACA, Génétique et Amélioration des Fruits et Légumes, Domaine Saint Maurice, 67 Allée des Chênes CS 60094 – 84140, Montfavet Cedex, France
- Syngenta SAS France, 1228 Chemin de l’Hobit, Saint Sauveur 31790, France
| | - Frédérique Bitton
- INRA, UR1052, Centre de Recherche PACA, Génétique et Amélioration des Fruits et Légumes, Domaine Saint Maurice, 67 Allée des Chênes CS 60094 – 84140, Montfavet Cedex, France
| | - Mathilde Causse
- INRA, UR1052, Centre de Recherche PACA, Génétique et Amélioration des Fruits et Légumes, Domaine Saint Maurice, 67 Allée des Chênes CS 60094 – 84140, Montfavet Cedex, France
| |
Collapse
|
20
|
Zhang M, Luo H, Xu L, Shi Y, Zhou J, Wang D, Zhang X, Huang X, Wang Y. Genomic Selection for Milk Production Traits in Xinjiang Brown Cattle. Animals (Basel) 2022; 12:ani12020136. [PMID: 35049759 PMCID: PMC8772551 DOI: 10.3390/ani12020136] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 12/22/2021] [Accepted: 12/30/2021] [Indexed: 11/16/2022] Open
Abstract
One-step genomic selection is a method for improving the reliability of the breeding value estimation. This study aimed to compare the reliability of pedigree-based best linear unbiased prediction (PBLUP) and single-step genomic best linear unbiased prediction (ssGBLUP), single-trait and multitrait models, and the restricted maximum likelihood (REML) and Bayesian methods. Data were collected from the production performance records of 2207 Xinjiang Brown cattle in Xinjiang from 1983 to 2018. A cross test was designed to calculate the genetic parameters and reliability of the breeding value of 305 daily milk yield (305 dMY), milk fat yield (MFY), milk protein yield (MPY), and somatic cell score (SCS) of Xinjiang Brown cattle. The heritability of 305 dMY, MFY, MPY, and SCS estimated using the REML and Bayesian multitrait models was approximately 0.39 (0.02), 0.40 (0.03), 0.49 (0.02), and 0.07 (0.02), respectively. The heritability and estimated breeding value (EBV) and the reliability of milk production traits of these cattle calculated based on PBLUP and ssGBLUP using the multitrait model REML and Bayesian methods were higher than those of the single-trait model REML method; the ssGBLUP method was significantly better than the PBLUP method. The reliability of the estimated breeding value can be improved from 0.9% to 3.6%, and the reliability of the genomic estimated breeding value (GEBV) for the genotyped population can reach 83%. Therefore, the genetic evaluation of the multitrait model is better than that of the single-trait model. Thus, genomic selection can be applied to small population varieties such as Xinjiang Brown cattle, in improving the reliability of the genomic estimated breeding value.
Collapse
Affiliation(s)
- Menghua Zhang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Hanpeng Luo
- Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture of China, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China;
| | - Lei Xu
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Yuangang Shi
- School of Agriculture, Ningxia University, Yinchuan 750021, China; (Y.S.); (J.Z.)
| | - Jinghang Zhou
- School of Agriculture, Ningxia University, Yinchuan 750021, China; (Y.S.); (J.Z.)
| | - Dan Wang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Xiaoxue Zhang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Xixia Huang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
- Correspondence: (X.H.); (Y.W.); Tel.: +86-1399-999-6861 (X.H.); +86-1580-159-5851 (Y.W.)
| | - Yachun Wang
- Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture of China, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China;
- Correspondence: (X.H.); (Y.W.); Tel.: +86-1399-999-6861 (X.H.); +86-1580-159-5851 (Y.W.)
| |
Collapse
|
21
|
Araujo AC, Carneiro PLS, Alvarenga AB, Oliveira HR, Miller SP, Retallick K, Brito LF. Haplotype-Based Single-Step GWAS for Yearling Temperament in American Angus Cattle. Genes (Basel) 2021; 13:17. [PMID: 35052358 PMCID: PMC8775055 DOI: 10.3390/genes13010017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 12/14/2021] [Accepted: 12/18/2021] [Indexed: 01/23/2023] Open
Abstract
Behavior is a complex trait and, therefore, understanding its genetic architecture is paramount for the development of effective breeding strategies. The objective of this study was to perform traditional and weighted single-step genome-wide association studies (ssGWAS and WssGWAS, respectively) for yearling temperament (YT) in North American Angus cattle using haplotypes. Approximately 266 K YT records and 70 K animals genotyped using a 50 K single nucleotide polymorphisms (SNP) panel were used. Linkage disequilibrium thresholds (LD) of 0.15, 0.50, and 0.80 were used to create the haploblocks, and the inclusion of non-LD-clustered SNPs (NCSNP) with the haplotypes in the genomic models was also evaluated. WssGWAS did not perform better than ssGWAS. Cattle YT was found to be a highly polygenic trait, with genes and quantitative trait loci (QTL) broadly distributed across the whole genome. Association studies using LD-based haplotypes should include NCSNPs and different LD thresholds to increase the likelihood of finding the relevant genomic regions affecting the trait of interest. The main candidate genes identified, i.e., ATXN10, ADAM10, VAX2, ATP6V1B1, CRISPLD1, CAPRIN1, FA2H, SPEF2, PLXNA1, and CACNA2D3, are involved in important biological processes and metabolic pathways related to behavioral traits, social interactions, and aggressiveness in cattle. Future studies should further investigate the role of these candidate genes.
Collapse
Affiliation(s)
- Andre C. Araujo
- Graduate Program in Animal Sciences, State University of Southwestern Bahia, Itapetinga 45700-000, Brazil;
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
| | - Paulo L. S. Carneiro
- Department of Biology, State University of Southwest Bahia, Jequié 45205-490, Brazil;
| | - Amanda B. Alvarenga
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
| | - Hinayah R. Oliveira
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G2W1, Canada
| | - Stephen P. Miller
- American Angus Association, Angus Genetics Inc., 3201 Frederick Ave, St. Joseph, MO 64506, USA; (S.P.M.); (K.R.)
| | - Kelli Retallick
- American Angus Association, Angus Genetics Inc., 3201 Frederick Ave, St. Joseph, MO 64506, USA; (S.P.M.); (K.R.)
| | - Luiz F. Brito
- Department of Animal Science, Purdue University, West Lafayette, IN 47907, USA; (A.B.A.); (H.R.O.)
| |
Collapse
|
22
|
Araujo AC, Carneiro PLS, Oliveira HR, Schenkel FS, Veroneze R, Lourenco DAL, Brito LF. A Comprehensive Comparison of Haplotype-Based Single-Step Genomic Predictions in Livestock Populations With Different Genetic Diversity Levels: A Simulation Study. Front Genet 2021; 12:729867. [PMID: 34721524 PMCID: PMC8551834 DOI: 10.3389/fgene.2021.729867] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 09/07/2021] [Indexed: 11/13/2022] Open
Abstract
The level of genetic diversity in a population is inversely proportional to the linkage disequilibrium (LD) between individual single nucleotide polymorphisms (SNPs) and quantitative trait loci (QTLs), leading to lower predictive ability of genomic breeding values (GEBVs) in high genetically diverse populations. Haplotype-based predictions could outperform individual SNP predictions by better capturing the LD between SNP and QTL. Therefore, we aimed to evaluate the accuracy and bias of individual-SNP- and haplotype-based genomic predictions under the single-step-genomic best linear unbiased prediction (ssGBLUP) approach in genetically diverse populations. We simulated purebred and composite sheep populations using literature parameters for moderate and low heritability traits. The haplotypes were created based on LD thresholds of 0.1, 0.3, and 0.6. Pseudo-SNPs from unique haplotype alleles were used to create the genomic relationship matrix ( G ) in the ssGBLUP analyses. Alternative scenarios were compared in which the pseudo-SNPs were combined with non-LD clustered SNPs, only pseudo-SNPs, or haplotypes fitted in a second G (two relationship matrices). The GEBV accuracies for the moderate heritability-trait scenarios fitting individual SNPs ranged from 0.41 to 0.55 and with haplotypes from 0.17 to 0.54 in the most (Ne ≅ 450) and less (Ne < 200) genetically diverse populations, respectively, and the bias fitting individual SNPs or haplotypes ranged between -0.14 and -0.08 and from -0.62 to -0.08, respectively. For the low heritability-trait scenarios, the GEBV accuracies fitting individual SNPs ranged from 0.24 to 0.32, and for fitting haplotypes, it ranged from 0.11 to 0.32 in the more (Ne ≅ 250) and less (Ne ≅ 100) genetically diverse populations, respectively, and the bias ranged between -0.36 and -0.32 and from -0.78 to -0.33 fitting individual SNPs or haplotypes, respectively. The lowest accuracies and largest biases were observed fitting only pseudo-SNPs from blocks constructed with an LD threshold of 0.3 (p < 0.05), whereas the best results were obtained using only SNPs or the combination of independent SNPs and pseudo-SNPs in one or two G matrices, in both heritability levels and all populations regardless of the level of genetic diversity. In summary, haplotype-based models did not improve the performance of genomic predictions in genetically diverse populations.
Collapse
Affiliation(s)
- Andre C Araujo
- Postgraduate Program in Animal Sciences, State University of Southwestern Bahia, Itapetinga, Brazil.,Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| | - Paulo L S Carneiro
- Department of Biology, State University of Southwestern Bahia, Jequié, Brazil
| | - Hinayah R Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States.,Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada
| | - Flavio S Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada
| | - Renata Veroneze
- Department of Animal Sciences, Federal University of Viçosa, Viçosa, Brazil
| | - Daniela A L Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
23
|
Yang R, Xu Z, Wang Q, Zhu D, Bian C, Ren J, Huang Z, Zhu X, Tian Z, Wang Y, Jiang Z, Zhao Y, Zhang D, Li N, Hu X. Genome‑wide association study and genomic prediction for growth traits in yellow-plumage chicken using genotyping-by-sequencing. Genet Sel Evol 2021; 53:82. [PMID: 34706641 PMCID: PMC8555081 DOI: 10.1186/s12711-021-00672-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 09/08/2021] [Indexed: 12/25/2022] Open
Abstract
Background Growth traits are of great importance for poultry breeding and production and have been the topic of extensive investigation, with many quantitative trait loci (QTL) detected. However, due to their complex genetic background, few causative genes have been confirmed and the underlying molecular mechanisms remain unclear, thus limiting our understanding of QTL and their potential use for the genetic improvement of poultry. Therefore, deciphering the genetic architecture is a promising avenue for optimising genomic prediction strategies and exploiting genomic information for commercial breeding. The objectives of this study were to: (1) conduct a genome-wide association study to identify key genetic factors and explore the polygenicity of chicken growth traits; (2) investigate the efficiency of genomic prediction in broilers; and (3) evaluate genomic predictions that harness genomic features. Results We identified five significant QTL, including one on chromosome 4 with major effects and four on chromosomes 1, 2, 17, and 27 with minor effects, accounting for 14.5 to 34.1% and 0.2 to 2.6% of the genomic additive genetic variance, respectively, and 23.3 to 46.7% and 0.6 to 4.5% of the observed predictive accuracy of breeding values, respectively. Further analysis showed that the QTL with minor effects collectively had a considerable influence, reflecting the polygenicity of the genetic background. The accuracy of genomic best linear unbiased predictions (BLUP) was improved by 22.0 to 70.3% compared to that of the conventional pedigree-based BLUP model. The genomic feature BLUP model further improved the observed prediction accuracy by 13.8 to 15.2% compared to the genomic BLUP model. Conclusions A major QTL and four minor QTL were identified for growth traits; the remaining variance was due to QTL effects that were too small to be detected. The genomic BLUP and genomic feature BLUP models yielded considerably higher prediction accuracy compared to the pedigree-based BLUP model. This study revealed the polygenicity of growth traits in yellow-plumage chickens and demonstrated that the predictive ability can be greatly improved by using genomic information and related features. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00672-9.
Collapse
Affiliation(s)
- Ruifei Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.,College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhenqiang Xu
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Qi Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Di Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Cheng Bian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Jiangli Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhuolin Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoning Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhixin Tian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yuzhe Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Ziqin Jiang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Yiqiang Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Dexiang Zhang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China.
| | - Ning Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoxiang Hu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.
| |
Collapse
|
24
|
Zhu S, Guo T, Yuan C, Liu J, Li J, Han M, Zhao H, Wu Y, Sun W, Wang X, Wang T, Liu J, Tiambo CK, Yue Y, Yang B. Evaluation of Bayesian alphabet and GBLUP based on different marker density for genomic prediction in Alpine Merino sheep. G3 (BETHESDA, MD.) 2021; 11:6310012. [PMID: 34849779 PMCID: PMC8527494 DOI: 10.1093/g3journal/jkab206] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 06/01/2021] [Indexed: 01/20/2023]
Abstract
The marker density, the heritability level of trait and the statistical models adopted are critical to the accuracy of genomic prediction (GP) or selection (GS). If the potential of GP is to be fully utilized to optimize the effect of breeding and selection, in addition to incorporating the above factors into simulated data for analysis, it is essential to incorporate these factors into real data for understanding their impact on GP accuracy, more clearly and intuitively. Herein, we studied the GP of six wool traits of sheep by two different models, including Bayesian Alphabet (BayesA, BayesB, BayesCπ, and Bayesian LASSO) and genomic best linear unbiased prediction (GBLUP). We adopted fivefold cross-validation to perform the accuracy evaluation based on the genotyping data of Alpine Merino sheep (n = 821). The main aim was to study the influence and interaction of different models and marker densities on GP accuracy. The GP accuracy of the six traits was found to be between 0.28 and 0.60, as demonstrated by the cross-validation results. We showed that the accuracy of GP could be improved by increasing the marker density, which is closely related to the model adopted and the heritability level of the trait. Moreover, based on two different marker densities, it was derived that the prediction effect of GBLUP model for traits with low heritability was better; while with the increase of heritability level, the advantage of Bayesian Alphabet would be more obvious, therefore, different models of GP are appropriate in different traits. These findings indicated the significance of applying appropriate models for GP which would assist in further exploring the optimization of GP.
Collapse
Affiliation(s)
- Shaohua Zhu
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Tingting Guo
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Chao Yuan
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Jianbin Liu
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Jianye Li
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Mei Han
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Hongchang Zhao
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Yi Wu
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Weibo Sun
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China.,Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Xijun Wang
- Gansu Provincial Sheep Breeding Technology Extension Station, Sunan 734400, China
| | - Tianxiang Wang
- Gansu Provincial Sheep Breeding Technology Extension Station, Sunan 734400, China
| | - Jigang Liu
- Gansu Provincial Sheep Breeding Technology Extension Station, Sunan 734400, China
| | - Christian Keambou Tiambo
- Centre for Tropical Livestock Genetics and Health (CTLGH), International Livestock Research Institute, Nairobi 00100, Kenya
| | - Yaojing Yue
- Sheep Breeding Engineering Technology Center, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Bohui Yang
- Animal Science Department, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| |
Collapse
|
25
|
Bian C, Prakapenka D, Tan C, Yang R, Zhu D, Guo X, Liu D, Cai G, Li Y, Liang Z, Wu Z, Da Y, Hu X. Haplotype genomic prediction of phenotypic values based on chromosome distance and gene boundaries using low-coverage sequencing in Duroc pigs. Genet Sel Evol 2021; 53:78. [PMID: 34620094 PMCID: PMC8496108 DOI: 10.1186/s12711-021-00661-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 08/20/2021] [Indexed: 11/22/2022] Open
Abstract
Background Genomic selection using single nucleotide polymorphism (SNP) markers has been widely used for genetic improvement of livestock, but most current methods of genomic selection are based on SNP models. In this study, we investigated the prediction accuracies of haplotype models based on fixed chromosome distances and gene boundaries compared to those of SNP models for genomic prediction of phenotypic values. We also examined the reasons for the successes and failures of haplotype genomic prediction. Methods We analyzed a swine population of 3195 Duroc boars with records on eight traits: body judging score (BJS), teat number (TN), age (AGW), loin muscle area (LMA), loin muscle depth (LMD) and back fat thickness (BF) at 100 kg live weight, and average daily gain (ADG) and feed conversion rate (FCR) from 30 to100 kg live weight. Ten-fold validation was used to evaluate the prediction accuracy of each SNP model and each multi-allelic haplotype model based on 488,124 autosomal SNPs from low-coverage sequencing. Haplotype blocks were defined using fixed chromosome distances or gene boundaries. Results Compared to the best SNP model, the accuracy of predicting phenotypic values using a haplotype model was greater by 7.4% for BJS, 7.1% for AGW, 6.6% for ADG, 4.9% for FCR, 2.7% for LMA, 1.9% for LMD, 1.4% for BF, and 0.3% for TN. The use of gene-based haplotype blocks resulted in the best prediction accuracy for LMA, LMD, and TN. Compared to estimates of SNP additive heritability, estimates of haplotype epistasis heritability were strongly correlated with the increase in prediction accuracy by haplotype models. The increase in prediction accuracy was largest for BJS, AGW, ADG, and FCR, which also had the largest estimates of haplotype epistasis heritability, 24.4% for BJS, 14.3% for AGW, 14.5% for ADG, and 17.7% for FCR. SNP and haplotype heritability profiles across the genome identified several genes with large genetic contributions to phenotypes: NUDT3 for LMA, LMD and BF, VRTN for TN, COL5A2 for BJS, BSND for ADG, and CARTPT for FCR. Conclusions Haplotype prediction models improved the accuracy for genomic prediction of phenotypes in Duroc pigs. For some traits, the best prediction accuracy was obtained with haplotypes defined using gene regions, which provides evidence that functional genomic information can improve the accuracy of haplotype genomic prediction for certain traits. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00661-y.
Collapse
Affiliation(s)
- Cheng Bian
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, 55108, USA
| | - Cheng Tan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.,National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China
| | - Ruifei Yang
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Di Zhu
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xiaoli Guo
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Dewu Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.,National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China
| | - Yalan Li
- National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China
| | - Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, 55108, USA
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China. .,National Engineering Research Center for Breeding Swine Industry, Wens Foodstuff Group Co., Ltd., Yunfu, 527400, China.
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, 55108, USA.
| | - Xiaoxiang Hu
- State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
26
|
Ahmar S, Ballesta P, Ali M, Mora-Poblete F. Achievements and Challenges of Genomics-Assisted Breeding in Forest Trees: From Marker-Assisted Selection to Genome Editing. Int J Mol Sci 2021; 22:10583. [PMID: 34638922 PMCID: PMC8508745 DOI: 10.3390/ijms221910583] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 09/26/2021] [Accepted: 09/27/2021] [Indexed: 12/23/2022] Open
Abstract
Forest tree breeding efforts have focused mainly on improving traits of economic importance, selecting trees suited to new environments or generating trees that are more resilient to biotic and abiotic stressors. This review describes various methods of forest tree selection assisted by genomics and the main technological challenges and achievements in research at the genomic level. Due to the long rotation time of a forest plantation and the resulting long generation times necessary to complete a breeding cycle, the use of advanced techniques with traditional breeding have been necessary, allowing the use of more precise methods for determining the genetic architecture of traits of interest, such as genome-wide association studies (GWASs) and genomic selection (GS). In this sense, main factors that determine the accuracy of genomic prediction models are also addressed. In turn, the introduction of genome editing opens the door to new possibilities in forest trees and especially clustered regularly interspaced short palindromic repeats and CRISPR-associated protein 9 (CRISPR/Cas9). It is a highly efficient and effective genome editing technique that has been used to effectively implement targetable changes at specific places in the genome of a forest tree. In this sense, forest trees still lack a transformation method and an inefficient number of genotypes for CRISPR/Cas9. This challenge could be addressed with the use of the newly developing technique GRF-GIF with speed breeding.
Collapse
Affiliation(s)
- Sunny Ahmar
- Institute of Biological Sciences, University of Talca, 1 Poniente 1141, Talca 3460000, Chile;
| | - Paulina Ballesta
- The National Fund for Scientific and Technological Development, Av. del Agua 3895, Talca 3460000, Chile
| | - Mohsin Ali
- Department of Forestry and Range Management, University of Agriculture Faisalabad, Faisalabad 38000, Pakistan;
| | - Freddy Mora-Poblete
- Institute of Biological Sciences, University of Talca, 1 Poniente 1141, Talca 3460000, Chile;
| |
Collapse
|
27
|
Li H, Zhu B, Xu L, Wang Z, Xu L, Zhou P, Gao H, Guo P, Chen Y, Gao X, Zhang L, Gao H, Cai W, Xu L, Li J. Genomic Prediction Using LD-Based Haplotypes Inferred From High-Density Chip and Imputed Sequence Variants in Chinese Simmental Beef Cattle. Front Genet 2021; 12:665382. [PMID: 34394182 PMCID: PMC8358323 DOI: 10.3389/fgene.2021.665382] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 06/30/2021] [Indexed: 01/05/2023] Open
Abstract
A haplotype is defined as a combination of alleles at adjacent loci belonging to the same chromosome that can be transmitted as a unit. In this study, we used both the Illumina BovineHD chip (HD chip) and imputed whole-genome sequence (WGS) data to explore haploblocks and assess haplotype effects, and the haploblocks were defined based on the different LD thresholds. The accuracies of genomic prediction (GP) for dressing percentage (DP), meat percentage (MP), and rib eye roll weight (RERW) based on haplotype were investigated and compared for both data sets in Chinese Simmental beef cattle. The accuracies of GP using the entire imputed WGS data were lower than those using the HD chip data in all cases. For DP and MP, the accuracy of GP using haploblock approaches outperformed the individual single nucleotide polymorphism (SNP) approach (GBLUP_In_Block) at specific LD levels. Hotelling’s test confirmed that GP using LD-based haplotypes from WGS data can significantly increase the accuracies of GP for RERW, compared with the individual SNP approach (∼1.4 and 1.9% for GHBLUP and GHBLUP+GBLUP, respectively). We found that the accuracies using haploblock approach varied with different LD thresholds. The LD thresholds (r2 ≥ 0.5) were optimal for most scenarios. Our results suggested that LD-based haploblock approach can improve accuracy of genomic prediction for carcass traits using both HD chip and imputed WGS data under the optimal LD thresholds in Chinese Simmental beef cattle.
Collapse
Affiliation(s)
- Hongwei Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.,National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Ling Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lei Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Peinuo Zhou
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Han Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Peng Guo
- College of Computer and Information Engineering, Tianjin Agricultural University, Tianjin, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.,National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Wentao Cai
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.,National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| |
Collapse
|
28
|
Dos Santos A, Rodrigues EV, Laviola BG, Teodoro LPR, Teodoro PE, Bhering LL. Increasing selection gain and accuracy of harvest prediction models in Jatropha through genome-wide selection. Sci Rep 2021; 11:13583. [PMID: 34193953 PMCID: PMC8245479 DOI: 10.1038/s41598-021-93022-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 06/17/2021] [Indexed: 11/09/2022] Open
Abstract
Genome-wide selection (GWS) has been becoming an essential tool in the genetic breeding of long-life species, as it increases the gain per time unit. This study had a hypothesis that GWS is a tool that can decrease the breeding cycle in Jatropha. Our objective was to compare GWS with phenotypic selection in terms of accuracy and efficiency over three harvests. Models were developed throughout the harvests to evaluate their applicability in predicting genetic values in later harvests. For this purpose, 386 individuals of the breeding population obtained from crossings between 42 parents were evaluated. The population was evaluated in random block design, with six replicates over three harvests. The genetic effects of markers were predicted in the population using 811 SNP's markers with call rate = 95% and minor allele frequency (MAF) > 4%. GWS enables gains of 108 to 346% over the phenotypic selection, with a 50% reduction in the selection cycle. This technique has potential for the Jatropha breeding since it allows the accurate obtaining of GEBV and higher efficiency compared to the phenotypic selection by reducing the time necessary to complete the selection cycle. In order to apply GWS in the first harvests, a large number of individuals in the breeding population are needed. In the case of few individuals in the population, it is recommended to perform a larger number of harvests.
Collapse
Affiliation(s)
| | - Erina Vitório Rodrigues
- Life and Earth Sciences, Universidade de Brasília - Campus Planaltina, Brasília, Distrito Federal, Brazil
| | - Bruno Galvêas Laviola
- Genetics and Biotechnology Laboratory, Embrapa Agroenergia, Brasília, Distrito Federal, Brazil
| | | | - Paulo Eduardo Teodoro
- Department of Agronomy, Universidade Federal Do Mato Grosso Do Sul, Chapadão Do Sul, Mato Grosso Do Sul, Brazil.
| | - Leonardo Lopes Bhering
- Department of General Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
| |
Collapse
|
29
|
Validation of the Prediction Accuracy for 13 Traits in Chinese Simmental Beef Cattle Using a Preselected Low-Density SNP Panel. Animals (Basel) 2021; 11:ani11071890. [PMID: 34202066 PMCID: PMC8300368 DOI: 10.3390/ani11071890] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 06/08/2021] [Accepted: 06/15/2021] [Indexed: 11/17/2022] Open
Abstract
Simple Summary To reduce the breeding costs and promote the application of genomic selection (GS) in Chinese Simmental beef cattle, we developed a customized low-density single-nucleotide polymorphism (SNP) panel consisting of 30,684 SNPs. When comparing the predictive performance of the low-density SNP panel to that of the BovineHD Beadchip for 13 traits, we found that this ~30 K panel achieved moderate to high prediction accuracies for most traits, while reducing the prediction accuracies of six traits by 0.04–0.09 and decreasing the prediction accuracy of one trait by 0.2. For the remaining six traits, the usage of the low-density SNP panel was associated with a slight increase in prediction accuracy. Our studies suggested that the low-density SNP panel (~30 K) is a feasible and promising tool for cost-effective genomic prediction in Chinese Simmental beef cattle, which may provide breeding organizations with a cheaper option and greater returns on investment. Abstract Chinese Simmental beef cattle play a key role in the Chinese beef industry due to their great adaptability and marketability. To achieve efficient genetic gain at a low breeding cost, it is crucial to develop a customized cost-effective low-density SNP panel for this cattle population. Thirteen growth, carcass, and meat quality traits and a BovineHD Beadchip genotyping of 1346 individuals were used to select trait-associated variants and variants contributing to great genetic variance. In addition, highly informative SNPs with high MAF in each 500 kb sliding window and in each genic region were also included separately. A low-density SNP panel consisting of 30,684 SNPs was developed, with an imputation accuracy of 97.4% when imputed to the 770 K level. Among 13 traits, the average prediction accuracy levels evaluated by genomic best linear unbiased prediction (GBLUP) and BayesA/B/Cπ were 0.22–0.47 and 0.18–0.60 for the ~30 K array and BovineHD Beadchip, respectively. Generally, the predictive performance of the ~30 K array was trait-dependent, with reduced prediction accuracies for seven traits. While differences in terms of prediction accuracy were observed among the 13 traits, the low-density SNP panel achieved moderate to high accuracies for most of the traits and even improved the accuracies for some traits.
Collapse
|
30
|
O'Connor KM, Hayes BJ, Hardner CM, Alam M, Henry RJ, Topp BL. Genomic selection and genetic gain for nut yield in an Australian macadamia breeding population. BMC Genomics 2021; 22:370. [PMID: 34016055 PMCID: PMC8139092 DOI: 10.1186/s12864-021-07694-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 05/10/2021] [Indexed: 02/06/2023] Open
Abstract
Background Improving yield prediction and selection efficiency is critical for tree breeding. This is vital for macadamia trees with the time from crossing to production of new cultivars being almost a quarter of a century. Genomic selection (GS) is a useful tool in plant breeding, particularly with perennial trees, contributing to an increased rate of genetic gain and reducing the length of the breeding cycle. We investigated the potential of using GS methods to increase genetic gain and accelerate selection efficiency in the Australian macadamia breeding program with comparison to traditional breeding methods. This study evaluated the prediction accuracy of GS in a macadamia breeding population of 295 full-sib progeny from 32 families (29 parents, reciprocals combined), along with a subset of parents. Historical yield data for tree ages 5 to 8years were used in the study, along with a set of 4113 SNP markers. The traits of focus were average nut yield from tree ages 5 to 8years and yield stability, measured as the standard deviation of yield over these 4 years. GBLUP GS models were used to obtain genomic estimated breeding values for each genotype, with a five-fold cross-validation method and two techniques: prediction across related populations and prediction across unrelated populations. Results Narrow-sense heritability of yield and yield stability was low (h2=0.30 and 0.04, respectively). Prediction accuracy for yield was 0.57 for predictions across related populations and 0.14 when predicted across unrelated populations. Accuracy of prediction of yield stability was high (r=0.79) for predictions across related populations. Predicted genetic gain of yield using GS in related populations was 474g/year, more than double that of traditional breeding methods (226g/year), due to the halving of generation length from 8 to 4years. Conclusions The results of this study indicate that the incorporation of GS for yield into the Australian macadamia breeding program may accelerate genetic gain due to reduction in generation length, though the cost of genotyping appears to be a constraint at present. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07694-z.
Collapse
Affiliation(s)
- Katie M O'Connor
- Queensland Department of Agriculture and Fisheries, Maroochy Research Facility, 47 Mayers Road, Nambour, QLD, 4560, Australia. .,Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Maroochy Research Facility, 47 Mayers Road, Nambour, QLD, 4560, Australia.
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4072, Australia
| | - Craig M Hardner
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4072, Australia
| | - Mobashwer Alam
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Maroochy Research Facility, 47 Mayers Road, Nambour, QLD, 4560, Australia
| | - Robert J Henry
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4072, Australia
| | - Bruce L Topp
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Maroochy Research Facility, 47 Mayers Road, Nambour, QLD, 4560, Australia
| |
Collapse
|
31
|
Schmidtmann C, Schönherz A, Guldbrandtsen B, Marjanovic J, Calus M, Hinrichs D, Thaller G. Assessing the genetic background and genomic relatedness of red cattle populations originating from Northern Europe. Genet Sel Evol 2021; 53:23. [PMID: 33676402 PMCID: PMC7936461 DOI: 10.1186/s12711-021-00613-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 02/08/2021] [Indexed: 12/20/2022] Open
Abstract
Background Local cattle breeds need special attention, as they are valuable reservoirs of genetic diversity. Appropriate breeding decisions and adequate genomic management of numerically smaller populations are required for their conservation. At this point, the analysis of dense genome-wide marker arrays provides encompassing insights into the genomic constitution of livestock populations. We have analyzed the genetic characterization of ten cattle breeds originating from Germany, The Netherlands and Denmark belonging to the group of red dairy breeds in Northern Europe. The results are intended to provide initial evidence on whether joint genomic breeding strategies of these populations will be successful. Results Traditional Danish Red and Groningen White-Headed were the most genetically differentiated breeds and their populations showed the highest levels of inbreeding. In contrast, close genetic relationships and shared ancestry were observed for the populations of German Red and White Dual-Purpose, Dutch Meuse-Rhine-Yssel, and Dutch Deep Red breeds, reflecting their common histories. A considerable amount of gene flow from Red Holstein to German Angler and to German Red and White Dual-Purpose was revealed, which is consistent with frequent crossbreeding to improve productivity of these local breeds. In Red Holstein, marked genomic signatures of selection were reported on chromosome 18, suggesting directed selection for important breeding goal traits. Furthermore, tests for signatures of selection between Red Holstein, Red and White Dual-Purpose, and Meuse-Rhine-Yssel uncovered signals for all investigated pairs of populations. The corresponding genomic regions, which were putatively under different selection pressures, harboured various genes which are associated with traits such as milk and beef production, mastitis and female fertility. Conclusions This study provides comprehensive knowledge on the genetic constitution and genomic connectedness of divergent red cattle populations in Northern Europe. The results will help to design and optimize breeding strategies. A joint genomic evaluation including some of the breeds studied here seems feasible.
Collapse
Affiliation(s)
- Christin Schmidtmann
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, 24098, Kiel, Germany.
| | - Anna Schönherz
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.,Department of Animal Science, Aarhus University, 8830, Tjele, Denmark
| | - Bernt Guldbrandtsen
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.,Department of Animal Sciences, Department of Animal Breeding and Husbandry, University of Bonn, 53115, Bonn, Germany
| | - Jovana Marjanovic
- Animal Breeding and Genomics, Wageningen University and Research, 6700AH, Wageningen, The Netherlands
| | - Mario Calus
- Animal Breeding and Genomics, Wageningen University and Research, 6700AH, Wageningen, The Netherlands
| | - Dirk Hinrichs
- Department of Animal Breeding, University of Kassel, 37213, Witzenhausen, Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, 24098, Kiel, Germany
| |
Collapse
|
32
|
Minamikawa MF, Kunihisa M, Noshita K, Moriya S, Abe K, Hayashi T, Katayose Y, Matsumoto T, Nishitani C, Terakami S, Yamamoto T, Iwata H. Tracing founder haplotypes of Japanese apple varieties: application in genomic prediction and genome-wide association study. HORTICULTURE RESEARCH 2021; 8:49. [PMID: 33642580 PMCID: PMC7917097 DOI: 10.1038/s41438-021-00485-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 12/28/2020] [Accepted: 01/03/2021] [Indexed: 05/21/2023]
Abstract
Haplotypes provide useful information for genomics-based approaches, genomic prediction, and genome-wide association study. As a small number of superior founders have contributed largely to the breeding history of fruit trees, the information of founder haplotypes may be relevant for performing the genomics-based approaches in these plants. In this study, we proposed a method to estimate 14 haplotypes from 7 founders and automatically trace the haplotypes forward to apple parental (185 varieties) and breeding (659 F1 individuals from 16 full-sib families) populations based on 11,786 single-nucleotide polymorphisms, by combining multiple algorithms. Overall, 92% of the single-nucleotide polymorphisms information in the parental and breeding populations was characterized by the 14 founder haplotypes. The use of founder haplotype information improved the accuracy of genomic prediction in 7 traits and the resolution of genome-wide association study in 13 out of 27 fruit quality traits analyzed in this study. We also visualized the significant propagation of the founder haplotype with the largest genetic effect in genome-wide association study over the pedigree tree of the parental population. These results suggest that the information of founder haplotypes can be useful for not only genetic improvement of fruit quality traits in apples but also for understanding the selection history of founder haplotypes in the breeding program of Japanese apple varieties.
Collapse
Affiliation(s)
- Mai F Minamikawa
- Laboratory of Biometry and Bioinformatics, Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo, Tokyo, 113-8657, Japan
| | - Miyuki Kunihisa
- Institute of Fruit Tree and Tea Science, National Agriculture and Food Research Organization (NARO), 2-1 Fujimoto, Tsukuba, Ibaraki, 305-8605, Japan
| | - Koji Noshita
- Laboratory of Biometry and Bioinformatics, Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo, Tokyo, 113-8657, Japan
| | - Shigeki Moriya
- Division of Apple Research, Institute of Fruit Tree and Tea Science, NARO, 92-24 Shimokuriyagawa Nabeyashiki, Morioka, Iwate, 020-0123, Japan
| | - Kazuyuki Abe
- Division of Apple Research, Institute of Fruit Tree and Tea Science, NARO, 92-24 Shimokuriyagawa Nabeyashiki, Morioka, Iwate, 020-0123, Japan
| | - Takeshi Hayashi
- Institute of Crop Science, NARO, 2-1-2 Kannondai, Tsukuba, Ibaraki, 305-8518, Japan
| | - Yuichi Katayose
- Institute of Crop Science, NARO, 2-1-2 Kannondai, Tsukuba, Ibaraki, 305-8518, Japan
| | - Toshimi Matsumoto
- Institute of Crop Science, NARO, 2-1-2 Kannondai, Tsukuba, Ibaraki, 305-8518, Japan
- Institute of Agrobiological Sciences, NARO, 1-2 Owashi, Tsukuba, Ibaraki, 305-8634, Japan
| | - Chikako Nishitani
- Institute of Fruit Tree and Tea Science, National Agriculture and Food Research Organization (NARO), 2-1 Fujimoto, Tsukuba, Ibaraki, 305-8605, Japan
| | - Shingo Terakami
- Institute of Fruit Tree and Tea Science, National Agriculture and Food Research Organization (NARO), 2-1 Fujimoto, Tsukuba, Ibaraki, 305-8605, Japan
| | - Toshiya Yamamoto
- Institute of Fruit Tree and Tea Science, National Agriculture and Food Research Organization (NARO), 2-1 Fujimoto, Tsukuba, Ibaraki, 305-8605, Japan
| | - Hiroyoshi Iwata
- Laboratory of Biometry and Bioinformatics, Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo, Tokyo, 113-8657, Japan.
| |
Collapse
|
33
|
Osorio LF, Gezan SA, Verma S, Whitaker VM. Independent Validation of Genomic Prediction in Strawberry Over Multiple Cycles. Front Genet 2021; 11:596258. [PMID: 33552121 PMCID: PMC7862747 DOI: 10.3389/fgene.2020.596258] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 12/31/2020] [Indexed: 01/05/2023] Open
Abstract
The University of Florida strawberry (Fragaria × ananassa) breeding program has implemented genomic prediction (GP) as a tool for choosing outstanding parents for crosses over the last five seasons. This has allowed the use of some parents 1 year earlier than with traditional methods, thus reducing the duration of the breeding cycle. However, as the number of breeding cycles increases over time, greater knowledge is needed on how multiple cycles can be used in the practical implementation of GP in strawberry breeding. Advanced selections and cultivars totaling 1,558 unique individuals were tested in field trials for yield and fruit quality traits over five consecutive years and genotyped for 9,908 SNP markers. Prediction of breeding values was carried out using Bayes B models. Independent validation was carried out using separate trials/years as training (TRN) and testing (TST) populations. Single-trial predictive abilities for five polygenic traits averaged 0.35, which was reduced to 0.24 when individuals common across trials were excluded, emphasizing the importance of relatedness among training and testing populations. Training populations including up to four previous breeding cycles increased predictive abilities, likely due to increases in both training population size and relatedness. Predictive ability was also strongly influenced by heritability, but less so by changes in linkage disequilibrium and effective population size. Genotype by year interactions were minimal. A strategy for practical implementation of GP in strawberry breeding is outlined that uses multiple cycles to predict parental performance and accounts for traits not included in GP models when constructing crosses. Given the importance of relatedness to the success of GP in strawberry, future work could focus on the optimization of relatedness in the design of TRN and TST populations to increase predictive ability in the short-term without compromising long-term genetic gains.
Collapse
Affiliation(s)
- Luis F Osorio
- Gulf Coast Research and Education Center, University of Florida, Wimauma, FL, United States
| | - Salvador A Gezan
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL, United States
| | - Sujeet Verma
- Gulf Coast Research and Education Center, University of Florida, Wimauma, FL, United States
| | - Vance M Whitaker
- Gulf Coast Research and Education Center, University of Florida, Wimauma, FL, United States
| |
Collapse
|
34
|
Genome wide association study and genomic prediction for stover quality traits in tropical maize (Zea mays L.). Sci Rep 2021; 11:686. [PMID: 33436870 PMCID: PMC7804097 DOI: 10.1038/s41598-020-80118-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Accepted: 12/15/2020] [Indexed: 11/09/2022] Open
Abstract
Maize is rapidly replacing traditionally cultivated dual purpose crops of South Asia, primarily due to the better economic remuneration. This has created an impetus for improving maize for both grain productivity and stover traits. Molecular techniques can largely assist breeders in determining approaches for effectively integrating stover trait improvement in their existing breeding pipeline. In the current study we identified a suite of potential genomic regions associated to the two major stover quality traits-in-vitro organic matter digestibility (IVOMD) and metabolizable energy (ME) through genome wide association study. However, considering the fact that the loci identified for these complex traits all had smaller effects and accounted only a small portion of phenotypic variation, the effectiveness of following a genomic selection approach for these traits was evaluated. The testing set consists of breeding lines recently developed within the program and the training set consists of a panel of lines from the working germplasm comprising the founder lines of the newly developed breeding lines and also an unrelated diversity set. The prediction accuracy as determined by the Pearson's correlation coefficient between observed and predicted values of these breeding lines were high even at lower marker density (200 random SNPs), when the training and testing set were related. However, the accuracies were dismal, when there was no relationship between the training and the testing set.
Collapse
|
35
|
Adoption and Optimization of Genomic Selection To Sustain Breeding for Apricot Fruit Quality. G3-GENES GENOMES GENETICS 2020; 10:4513-4529. [PMID: 33067307 PMCID: PMC7718743 DOI: 10.1534/g3.120.401452] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Genomic selection (GS) is a breeding approach which exploits genome-wide information and whose unprecedented success has shaped several animal and plant breeding schemes through delivering their genetic progress. This is the first study assessing the potential of GS in apricot (Prunus armeniaca) to enhance postharvest fruit quality attributes. Genomic predictions were based on a F1 pseudo-testcross population, comprising 153 individuals with contrasting fruit quality traits. They were phenotyped for physical and biochemical fruit metrics in contrasting climatic conditions over two years. Prediction accuracy (PA) varied from 0.31 for glucose content with the Bayesian LASSO (BL) to 0.78 for ethylene production with RR-BLUP, which yielded the most accurate predictions in comparison to Bayesian models and only 10% out of 61,030 SNPs were sufficient to reach accurate predictions. Useful insights were provided on the genetic architecture of apricot fruit quality whose integration in prediction models improved their performance, notably for traits governed by major QTL. Furthermore, multivariate modeling yielded promising outcomes in terms of PA within training partitions partially phenotyped for target traits. This provides a useful framework for the implementation of indirect selection based on easy-to-measure traits. Thus, we highlighted the main levers to take into account for the implementation of GS for fruit quality in apricot, but also to improve the genetic gain in perennial species.
Collapse
|
36
|
Liang Z, Tan C, Prakapenka D, Ma L, Da Y. Haplotype Analysis of Genomic Prediction Using Structural and Functional Genomic Information for Seven Human Phenotypes. Front Genet 2020; 11:588907. [PMID: 33324447 PMCID: PMC7726221 DOI: 10.3389/fgene.2020.588907] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 10/28/2020] [Indexed: 11/19/2022] Open
Abstract
Genomic prediction using multi-allelic haplotype models improved the prediction accuracy for all seven human phenotypes, the normality transformed high density lipoproteins, low density lipoproteins, total cholesterol, triglycerides, weight, and the original height and body mass index without normality transformation. Eight SNP sets with 40,941-380,705 SNPs were evaluated. The increase in prediction accuracy due to haplotypes was 1.86-8.12%. Haplotypes using fixed chromosome distances had the best prediction accuracy for four phenotypes, fixed number of SNPs for two phenotypes, and gene-based haplotypes for high density lipoproteins and height (tied for best). Haplotypes of coding genes were more accurate than haplotypes of all autosome genes that included both coding and noncoding genes for triglycerides and weight, and nearly the same as haplotypes of all autosome genes for the other phenotypes. Haplotypes of noncoding genes (mostly lncRNAs) only improved the prediction accuracy over the SNP models for high density lipoproteins, total cholesterol, and height. ChIP-seq haplotypes had better prediction accuracy than gene-based haplotypes for total cholesterol, body mass index and low density lipoproteins. The accuracy of ChIP-seq haplotypes was most striking for low density lipoproteins, where all four haplotype models with ChIP-seq haplotypes had similarly high prediction accuracy over the best prediction model with gene-based haplotypes. Haplotype epistasis was shown to be the reason for the increased accuracy due to haplotypes. Low density lipoproteins had the largest haplotype epistasis heritability that explained 14.70% of the phenotypic variance and was 31.27% of the SNP additive heritability, and the largest increase in prediction accuracy relative to the best SNP model (8.12%). Relative to the SNP additive heritability of the same regions, noncoding genes had the highest haplotype epistasis heritability, followed by coding genes and ChIP-seq for the seven phenotypes. SNP and haplotype heritability profiles showed that the integration of SNP and haplotype additive values compensated the weakness of haplotypes in estimating SNP heritabilities for four phenotypes, whereas models with haplotype additive values fully accounted for SNP additive values for three phenotypes. These results showed that haplotype analysis can be a method to utilize functional and structural genomic information to improve the accuracy of genomic prediction.
Collapse
Affiliation(s)
- Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, China
| | - Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, United States
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
37
|
Comparison of long-term effects of genomic selection index and genomic selection using different Bayesian methods. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104207] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
38
|
Kim KD, Kang Y, Kim C. Application of Genomic Big Data in Plant Breeding:Past, Present, and Future. PLANTS (BASEL, SWITZERLAND) 2020; 9:E1454. [PMID: 33126607 PMCID: PMC7694055 DOI: 10.3390/plants9111454] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 10/26/2020] [Accepted: 10/26/2020] [Indexed: 01/11/2023]
Abstract
Plant breeding has a long history of developing new varieties that have ensured the food security of the human population. During this long journey together with humanity, plant breeders have successfully integrated the latest innovations in science and technologies to accelerate the increase in crop production and quality. For the past two decades, since the completion of human genome sequencing, genomic tools and sequencing technologies have advanced remarkably, and adopting these innovations has enabled us to cost down and/or speed up the plant breeding process. Currently, with the growing mass of genomic data and digitalized biological data, interdisciplinary approaches using new technologies could lead to a new paradigm of plant breeding. In this review, we summarize the overall history and advances of plant breeding, which have been aided by plant genomic research. We highlight the key advances in the field of plant genomics that have impacted plant breeding over the past decades and introduce the current status of innovative approaches such as genomic selection, which could overcome limitations of conventional breeding and enhance the rate of genetic gain.
Collapse
Affiliation(s)
- Kyung Do Kim
- Department of Bioscience and Bioinformatics, Myongji University, Yongin 17058, Korea;
| | - Yuna Kang
- Department of Crop Science, Chungnam National University, Daejeon 34134, Korea;
| | - Changsoo Kim
- Department of Crop Science, Chungnam National University, Daejeon 34134, Korea;
- Department of Smart Agriculture Systems, Chungnam National University, Daejeon 34134, Korea
| |
Collapse
|
39
|
Teissier M, Larroque H, Brito LF, Rupp R, Schenkel FS, Robert-Granié C. Genomic predictions based on haplotypes fitted as pseudo-SNP for milk production and udder type traits and SCS in French dairy goats. J Dairy Sci 2020; 103:11559-11573. [PMID: 33041034 DOI: 10.3168/jds.2020-18662] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/27/2020] [Indexed: 12/18/2022]
Abstract
The development of statistical methods aiming to improve the accuracy of genomic predictions is of utmost value for dairy goat breeding programs. In this context, the use of haplotypes, instead of individual SNP, could improve the accuracy of genomic predictions by better capturing the effect of causal variants, instead of relying solely on linkage disequilibrium with individual SNP. Haplotypes can be included in genomic evaluation models in various ways, such as fitting them as pseudo-SNP (i.e., haplotypes converted into biallelic SNP format). This can be easily incorporated in the software already available for single-step genomic predictions (ssGBLUP). Therefore, the aim of this study was to compare the predictive performances of ssGBLUP and weighted ssGBLUP (WssGBLUP) based on individual SNP or on haplotypes fitted as pseudo-SNP. Performance was compared in terms of accuracy, bias, and weights for SNP versus pseudo-SNP. Genomic predictions were performed on 5 milk production traits, 5 udder type traits, and somatic cell score (SCS). The training population was formed by 307 Alpine and 247 Saanen progeny-tested bucks, genotyped using the Illumina Goat SNP50 BeadChip (Illumina, San Diego, CA). The validation population included 205 Alpine and 146 Saanen young bucks. The accuracy of genomic predictions was evaluated in the validation population as the Pearson correlation between genomic estimated breeding values (GEBV), predicted based on various methods, and daughter deviation (DD) based on the official genetic evaluation of January 2016. Haplotype-based models were shown to improve the performance of genomic predictions for some traits. Gains in accuracy of up to +19% (0.310 to 0.368 for fat yield) in Alpine and up to +3% (0.361 to 0.373 for udder shape) in Saanen were observed with ssGBLUP. The ssGBLUP accuracies averaged across all traits and methods were equal to 0.467 (SNP) versus 0.471 (pseudo-SNP) in Alpine and 0.528 (SNP) versus 0.523 (pseudo-SNP) in Saanen. With WssGBLUP, gains in accuracy of up to 24% (0.298 to 0.370 for fat yield) in Alpine and 14% (0.431 to 0.490 for SCS) in Saanen were observed with WssGBLUP. Accuracies of WssGBLUP averaged across all traits and methods were equal to 0.455 (SNP and pseudo-SNP) in Alpine and 0.542 (SNP) versus 0.528 (pseudo-SNP) in Saanen. The average (±SD) slope of the regression of DD on GEBV for the validation animals, across all breeds, traits and scenarios, were equal to 0.82 ± 0.20 (SNP) and 0.83 ± 0.18 (pseudo-SNP) for ssGBLUP and 0.67 ± 0.16 (SNP) and 0.65 ± 0.16 (pseudo-SNP) for WssGBLUP, which suggest that haplotype-based models and ssGBLUPSNP were similarly biased. However, WssGBLUP was more biased than ssGBLUP, and its gains in accuracies were limited to milk production traits. Despite the fact that genomic predictions based on haplotypes require additional steps and time, the observed gains in GEBV predictive performance indicate that haplotype-based methods could be recommended for some traits.
Collapse
Affiliation(s)
- Marc Teissier
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France.
| | - Hélène Larroque
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | - Luiz F Brito
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, Ontario, N1G 2W1, Canada; Department of Animal Sciences, Purdue University, West Lafayette, IN 47907
| | - Rachel Rupp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | - Flavio S Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | | |
Collapse
|
40
|
Akbarpour T, Ghavi Hossein-Zadeh N, Shadparvar AA. Marker genotyping error effects on genomic predictions under different genetic architectures. Mol Genet Genomics 2020; 296:79-89. [PMID: 32995954 DOI: 10.1007/s00438-020-01728-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 09/19/2020] [Indexed: 11/26/2022]
Abstract
This study aimed to determine the effect of different rates of marker genotyping error on the accuracy of genomic prediction that was examined under distinct marker and quantitative trait loci (QTL) densities and different heritability estimates using a stochastic simulation approach. For each scenario of simulation, a reference population with phenotypic and genotypic records and a validation population with only genotypic records were considered. Marker effects were estimated in the reference population, and then their genotypic records were used to predict genomic breeding values in the validation population. The prediction accuracy was calculated as the correlation between estimated and true breeding values. The prediction bias was examined by computing the regression of true genomic breeding value on estimated genomic breeding value. The accuracy of the genomic evaluation was the highest in a scenario with no marker genotyping error and varied from 0.731 to 0.934. The accuracy of the genomic evaluation was the lowest in a scenario with marker genotyping error equal to 20% and changed from 0.517 to 0.762. The unbiased regression coefficients of true genomic breeding value on estimated genomic breeding value were obtained in the reference and validation populations when the rate of marker genotyping error was equal to zero. The results showed that marker genotyping error can reduce the accuracy of genomic evaluations. Moreover, marker genotyping error can provide biased estimates of genomic breeding values. Therefore, for obtaining accurate results it is recommended to minimize the marker genotyping errors to zero in genomic evaluation programs.
Collapse
Affiliation(s)
- Tahere Akbarpour
- Department of Animal Science, Faculty of Agricultural Sciences, University of Guilan, 41635-1314, Rasht, Iran
| | - Navid Ghavi Hossein-Zadeh
- Department of Animal Science, Faculty of Agricultural Sciences, University of Guilan, 41635-1314, Rasht, Iran.
| | - Abdol Ahad Shadparvar
- Department of Animal Science, Faculty of Agricultural Sciences, University of Guilan, 41635-1314, Rasht, Iran
| |
Collapse
|
41
|
Oyelami FO, Zhao Q, Xu Z, Zhang Z, Sun H, Zhang Z, Ma P, Wang Q, Pan Y. Haplotype Block Analysis Reveals Candidate Genes and QTLs for Meat Quality and Disease Resistance in Chinese Jiangquhai Pig Breed. Front Genet 2020; 11:752. [PMID: 33101353 PMCID: PMC7498712 DOI: 10.3389/fgene.2020.00752] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 06/23/2020] [Indexed: 11/19/2022] Open
Abstract
The Jiangquhai (JQ) pig breed is one of the most widely recognized pig populations in China due to its unique and dominant characteristics. In this study, we examined the extent of Linkage disequilibrium (LD) and haplotype block structure of the JQ pig breed, and scanned the blocks for possible genes underlying important QTLs that could either be responsible for some adaptive features in these pigs or might have undergone some selection pressure. We compared some of our results with other Chinese and Western pig breeds. The results show that the JQ breed had the highest total block length (349.73 Mb ≈ 15% of its genome), and the coverage rate of blocks in most of its chromosomes was larger than those of other breeds except for Sus scrofa chromosome 4 (SSC4), SSC6, SSC7, SSC8, SSC10, SSC12, SSC13, SSC14, SSC17, SSC18, and SSCX. Moreover, the JQ breed had more SNPs that were clustered into haplotype blocks than the other breeds examined in this study. Our shared and unique haplotype block analysis revealed that the Hongdenglong (HD) breed had the lowest percentage of shared haplotype blocks while the Shanzhu (SZ) breed had the highest. We found that the JQ breed had an average r2 > 0.2 at SNPs distances 10–20 kb and concluded that about 120,000–240,000 SNPs would be needed for a successful GWAS in the breed. Finally, we detected a total of 88 genes harbored by selected haplotype blocks in the JQ breed, of which only 4 were significantly enriched (p-value ≤ 0.05). These genes were significantly enriched in 2 GO terms (p-value < 0.01), and 2 KEGG pathways (p-value < 0.02). Most of these enriched genes were related to health. Also, most of the overlapping QTLs detected in the haplotype blocks were related to meat and carcass quality, as well as health, with a few of them relating to reproduction and production. These results provide insights into the genetic architecture of some adaptive and meat quality traits observed in the JQ pig breed and also revealed the pattern of LD in the genome of the pig. Our result provides significant guidance for improving the statistical power of GWAS and optimizing the conservation strategy for this JQ pig breed.
Collapse
Affiliation(s)
- Favour Oluwapelumi Oyelami
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Qingbo Zhao
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Zhong Xu
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Zhe Zhang
- Department of Animal Breeding and Reproduction, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Hao Sun
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Zhenyang Zhang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Peipei Ma
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Qishan Wang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China.,Department of Animal Breeding and Reproduction, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Yuchun Pan
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China.,Department of Animal Breeding and Reproduction, College of Animal Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
42
|
Sallam AH, Conley E, Prakapenka D, Da Y, Anderson JA. Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat. G3 (BETHESDA, MD.) 2020; 10:2265-2273. [PMID: 32371453 PMCID: PMC7341132 DOI: 10.1534/g3.120.401165] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/29/2020] [Indexed: 02/01/2023]
Abstract
The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops.
Collapse
Affiliation(s)
| | - Emily Conley
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108
| | | | - Yang Da
- Department of Animal Science, and
| | - James A Anderson
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108
| |
Collapse
|
43
|
Xu L, Gao N, Wang Z, Xu L, Liu Y, Chen Y, Xu L, Gao X, Zhang L, Gao H, Zhu B, Li J. Incorporating Genome Annotation Into Genomic Prediction for Carcass Traits in Chinese Simmental Beef Cattle. Front Genet 2020; 11:481. [PMID: 32499816 PMCID: PMC7243208 DOI: 10.3389/fgene.2020.00481] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 04/17/2020] [Indexed: 01/08/2023] Open
Abstract
Various methods have been proposed for genomic prediction (GP) in livestock. These methods have mainly focused on statistical considerations and did not include genome annotation information. In this study, to improve the predictive performance of carcass traits in Chinese Simmental beef cattle, we incorporated the genome annotation information into GP. Single nucleotide polymorphisms (SNPs) were annotated to five genomic classes: intergenic, gene, exon, protein coding sequences, and 3'/5' untranslated region. Haploblocks were constructed for all markers and these five genomic classes by defining a biologically functional unit, and haplotype effects were modeled in both numerical dosage and categorical coding strategies. The first-order epistatic effects among SNPs and haplotypes were modeled using a categorical epistasis model. For all makers, the extension from the SNP-based model to a haplotype-based model improved the accuracy by 5.4-9.8% for carcass weight (CW), live weight (LW), and striploin (SI). For the five genomic classes using the haplotype-based prediction model, the incorporation of gene class information into the model improved the accuracies by an average of 1.4, 2.1, and 1.3% for CW, LW, and SI, respectively, compared with their corresponding results for all markers. Including the first-order epistatic effects into the prediction models improved the accuracies in some traits and genomic classes. Therefore, for traits with moderate-to-high heritability, incorporating genome annotation information of gene class into haplotype-based prediction models could be considered as a promising tool for GP in Chinese Simmental beef cattle, and modeling epistasis in prediction can further increase the accuracy to some degree.
Collapse
Affiliation(s)
- Ling Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ning Gao
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lei Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ying Liu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| |
Collapse
|
44
|
Haplotype-Based Genome-Wide Association Study and Identification of Candidate Genes Associated with Carcass Traits in Hanwoo Cattle. Genes (Basel) 2020; 11:genes11050551. [PMID: 32423003 PMCID: PMC7290854 DOI: 10.3390/genes11050551] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 04/30/2020] [Accepted: 05/05/2020] [Indexed: 12/20/2022] Open
Abstract
Hanwoo, is the most popular native beef cattle in South Korea. Due to its extensive popularity, research is ongoing to enhance its carcass quality and marbling traits. In this study we conducted a haplotype-based genome-wide association study (GWAS) by constructing haplotype blocks by three methods: number of single nucleotide polymorphisms (SNPs) in a haplotype block (nsnp), length of genomic region in kb (Len) and linkage disequilibrium (LD). Significant haplotype blocks and genes associated with them were identified for carcass traits such as BFT (back fat thickness), EMA (eye Muscle area), CWT (carcass weight) and MS (marbling score). Gene-set enrichment analysis and functional annotation of genes in the significantly-associated loci revealed candidate genes, including PLCB1 and PLCB4 present on BTA13, coding for phospholipases, which might be important candidates for increasing fat deposition due to their role in lipid metabolism and adipogenesis. CEL (carboxyl ester lipase), a bile-salt activated lipase, responsible for lipid catabolic process was also identified within the significantly-associated haplotype block on BTA11. The results were validated in a different Hanwoo population. The genes and pathways identified in this study may serve as good candidates for improving carcass traits in Hanwoo cattle.
Collapse
|
45
|
VanRaden PM. Symposium review: How to implement genomic selection. J Dairy Sci 2020; 103:5291-5301. [PMID: 32331884 DOI: 10.3168/jds.2019-17684] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 01/03/2020] [Indexed: 12/16/2022]
Abstract
Genomic selection was adopted very quickly in the 10 yr after first implementation, and breeders continue to find new uses for genomic testing. Breeding values with higher reliability earlier in life are estimated by combining DNA genotypes for many thousands of loci using existing identification, pedigree, and phenotype databases for millions of animals. Quality control for both new and previous data is greatly improved by comparing genomic and pedigree relationships to correct parent-progeny conflicts and discover many additional ancestors. Many quantitative trait loci and gene tests have been added to previous assays that used only evenly spaced, highly polymorphic markers. Imputation now combines genotypes from many assays of differing marker densities. Prediction models have gradually advanced from normal or Bayesian distributions within trait and breed to single-step, multitrait, or other more complex models, such as multibreed models that may be needed for crossbred prediction. Genomic selection was initially applied to males to predict progeny performance but is now widely applied to females or even embryos to predict their own later performance. The initial focus on additive merit has expanded to include mating programs, genomic inbreeding, and recessive alleles. Many producers now use DNA testing to decide which heifers should be inseminated with elite dairy, beef, or sex-sorted semen, which should be embryo donors or recipients, or which should be sold or kept for breeding. Because some of these decisions are expensive to delay, predictions are now provided weekly instead of every few months. Predictions from international genomic databases are often more accurate and cost-effective than those from within-country databases that were previously designed for progeny testing unless local breeds, conditions, or traits differ greatly from the larger database. Selection indexes include many new traits, often with lower heritability or requiring large initial investments to obtain phenotypes, which provide further incentive to cooperate internationally. The genomic prediction methods developed for dairy cattle are now applied widely to many animal, human, and plant populations and could be applied to many more.
Collapse
Affiliation(s)
- P M VanRaden
- Animal Genomics and Improvement Laboratory, USDA, Agricultural Research Service, Beltsville, MD 20705-2350.
| |
Collapse
|
46
|
Prakapenka D, Wang C, Liang Z, Bian C, Tan C, Da Y. GVCHAP: A Computing Pipeline for Genomic Prediction and Variance Component Estimation Using Haplotypes and SNP Markers. Front Genet 2020; 11:282. [PMID: 32318093 PMCID: PMC7154123 DOI: 10.3389/fgene.2020.00282] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 03/09/2020] [Indexed: 01/05/2023] Open
Abstract
Haplotype prediction models open many possibilities to improve the accuracy of genomic selection but require more data processing and computing time than single-SNP prediction models. To facilitate haplotype analysis for genomic prediction and estimation using structural and functional genomic information, we developed a computing pipeline to implement haplotype analysis with capabilities for preparation of input data for haplotype analysis, genomic prediction and estimation using GVCHAP, and analysis of GVCHAP results. Data preparation includes utility programs for haplotype imputing; defining haplotype blocks by a fixed number of SNPs, a fixed distance in base pairs per block, or user defined block lengths based on structural or functional genomic information or a mixture of both types of information; and defining haplotype genotypes within each haplotype block. GVCHAP is the main program for genomic prediction and estimation, calculates GREML (genomic restricted maximum likelihood) estimates of variance components and heritabilities, and calculates GBLUP (genomic best linear unbiased prediction) for additive and dominance values of single SNPs as well as additive values of haplotypes with reliability estimates for training and validation populations. A two-step strategy and a method of multi-node processing are implemented to remove the computing bottleneck due to the creation of genomic relationship matrices for large samples. The analysis of GVCHAP results includes calculation of observed prediction accuracies from validation studies and preparation of input files for graphical visualization of heritability estimates of haplotype blocks as well as estimates of SNP effects and heritabilities. The entire pipeline provides an efficient and versatile computing tool for identifying the most accurate haplotype model among many candidate haplotype models utilizing structural and functional genomic information for genomic selection.
Collapse
Affiliation(s)
- Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Chunkao Wang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Zuoxiang Liang
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| | - Cheng Bian
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States.,State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, China
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States.,National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, China
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, MN, United States
| |
Collapse
|
47
|
Won S, Park JE, Son JH, Lee SH, Park BH, Park M, Park WC, Chai HH, Kim H, Lee J, Lim D. Genomic Prediction Accuracy Using Haplotypes Defined by Size and Hierarchical Clustering Based on Linkage Disequilibrium. Front Genet 2020; 11:134. [PMID: 32211021 PMCID: PMC7067973 DOI: 10.3389/fgene.2020.00134] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 02/04/2020] [Indexed: 11/13/2022] Open
Abstract
Genomic prediction is an effective way to estimate the genomic breeding values from genetic information based on statistical methods such as best linear unbiased prediction (BLUP). The used of haplotype, clusters of linked single nucleotide polymorphism (SNP) as markers instead of individual SNPs can improve the accuracy of genomic prediction. Since the probability of a quantitative trait loci to be in strong linkage disequilibrium (LD) with a cluster of markers is higher compared to an individual marker. To make haplotypes efficient in genomic prediction, finding optimal ways to define haplotypes is essential. In this study, 770K or 50K SNP chip data was collected from Hanwoo (Korean cattle) population consisted of 3,498 cattle. Using SNP chip data, haplotype was defined in three different ways based on 1) the number of SNPs included, 2) length of haplotypes (bp), and 3) agglomerative hierarchical clustering based on LD. To compare the methods in parallel, haplotypes defined by all methods were set to have comparable sizes; 5, 10, 20 or 50 SNPs on average per haplotype. A linear mixed model using haplotype to calculated the covariance matrix was applied for testing the prediction accuracy of each haplotype size. Also, conventional SNP-based linear mixed model was tested to evaluate the performance of the haplotype sets on genomic prediction. Carcass weight (CWT), eye muscle area (EMA) and backfat thickness (BFT) were used as the phenotypes. This study reveals that using haplotypes generally showed increased accuracy compared to conventional SNP-based model for CWT and EMA, but found to be small or no increase in accuracy for BFT. LD clustering-based haplotypes specifically the five SNPs size showed the highest prediction accuracy for CWT and EMA. Meanwhile, the highest accuracy was obtained when length-based haplotypes with five SNPs were used for BFT. The maximum gain in accuracy was 1.3% from cross-validation and 4.6% from forward validation for EMA, suggesting that genomic prediction accuracy can be increased by using haplotypes. However, the improvement from using haplotypes may depend on the trait of interest. In addition, when the number of alleles generated by each haplotype defining methods was compared, clustering by LD generated the least number of alleles, thereby reducing computational costs. Therefore, finding optimal ways to define haplotypes and using the haplotype alleles as markers can improve the accuracy of genomic prediction.
Collapse
Affiliation(s)
- Sohyoung Won
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
| | - Jong-Eun Park
- National Institute of Animal Science, RDA, Wanju, South Korea
| | - Ju-Hwan Son
- National Institute of Animal Science, RDA, Wanju, South Korea
| | - Seung-Hwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| | - Byeong Ho Park
- National Institute of Animal Science, RDA, Wanju, South Korea
| | - Mina Park
- National Institute of Animal Science, RDA, Wanju, South Korea
| | - Won-Chul Park
- National Institute of Animal Science, RDA, Wanju, South Korea
| | - Han-Ha Chai
- National Institute of Animal Science, RDA, Wanju, South Korea
| | - Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea.,Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, South Korea.,eGnome, Inc, Seoul, South Korea
| | - Jungjae Lee
- Jung P&C Institute, Inc., Yongin-si, South Korea
| | - Dajeong Lim
- National Institute of Animal Science, RDA, Wanju, South Korea
| |
Collapse
|
48
|
Baseggio M, Murray M, Magallanes-Lundback M, Kaczmar N, Chamness J, Buckler ES, Smith ME, DellaPenna D, Tracy WF, Gore MA. Natural variation for carotenoids in fresh kernels is controlled by uncommon variants in sweet corn. THE PLANT GENOME 2020; 13:e20008. [PMID: 33016632 DOI: 10.1002/tpg2.20008] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Accepted: 10/30/2019] [Indexed: 06/11/2023]
Abstract
Sweet corn (Zea mays L.) is highly consumed in the United States, but does not make major contributions to the daily intake of carotenoids (provitamin A carotenoids, lutein and zeaxanthin) that would help in the prevention of health complications. A genome-wide association study of seven kernel carotenoids and twelve derivative traits was conducted in a sweet corn inbred line association panel ranging from light to dark yellow in endosperm color to elucidate the genetic basis of carotenoid levels in fresh kernels. In agreement with earlier studies of maize kernels at maturity, we detected an association of β-carotene hydroxylase (crtRB1) with β-carotene concentration and lycopene epsilon cyclase (lcyE) with the ratio of flux between the α- and β-carotene branches in the carotenoid biosynthetic pathway. Additionally, we found that 5% or less of the evaluated inbred lines possessing the shrunken2 (sh2) endosperm mutation had the most favorable lycE allele or crtRB1 haplotype for elevating β-branch carotenoids (β-carotene and zeaxanthin) or β-carotene, respectively. Genomic prediction models with genome-wide markers obtained moderately high predictive abilities for the carotenoid traits, especially lutein, and outperformed models with less markers that targeted candidate genes implicated in the synthesis, retention, and/or genetic control of kernel carotenoids. Taken together, our results constitute an important step toward increasing carotenoids in fresh sweet corn kernels.
Collapse
Affiliation(s)
- Matheus Baseggio
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Matthew Murray
- Dep. of Agronomy, Univ. of Wisconsin-Madison, Madison, WI, 53706, USA
| | | | - Nicholas Kaczmar
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - James Chamness
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Edward S Buckler
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
- Institute for Genomic Diversity, Cornell Univ., Ithaca, NY, 14853, USA
- US Department of Agriculture-Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | - Margaret E Smith
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Dean DellaPenna
- Dep. of Biochemistry and Molecular Biology, Michigan State Univ., East Lansing, MI, 48824, USA
| | - William F Tracy
- Dep. of Agronomy, Univ. of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| |
Collapse
|
49
|
Feitosa FLB, Pereira ASC, Amorim ST, Peripolli E, Silva RMDO, Braz CU, Ferrinho AM, Schenkel FS, Brito LF, Espigolan R, de Albuquerque LG, Baldi F. Comparison between haplotype-based and individual snp-based genomic predictions for beef fatty acid profile in Nelore cattle. J Anim Breed Genet 2019; 137:468-476. [PMID: 31867831 DOI: 10.1111/jbg.12463] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 11/18/2019] [Accepted: 11/24/2019] [Indexed: 11/28/2022]
Abstract
The aim of this study was to evaluate the genomic predictions using the single-step genomic best linear unbiased predictor (ssGBLUP) method based on SNPs and haplotype markers associated with beef fatty acids (FAs) profile in Nelore cattle. The data set contained records from 963 Nelore bulls finished in feedlot (±90 days) and slaughtered with approximately 24 months of age. Meat samples from the Longissimus dorsi muscle were taken for FAs profile measurement. FAs were quantified by gas chromatography using a SP-2560 capillary column. Animals were genotyped with the high-density SNP panel (BovineHD BeadChip assay) containing 777,962 markers. SNPs with a minor allele frequency and a call rate lower than 0.05 and 0.90, respectively, monomorphic, located on sex chromosomes, and with unknown position were removed from the data set. After genomic quality control, a total of 469,981 SNPs and 892 samples were available for subsequent analyses. Missing genotypes were imputed and phased using the FImpute software. Haplotype blocks were defined based on linkage disequilibrium using the Haploview software. The model to estimate variance components and genetic parameters and to predict the genomic values included the random genetic additive effects, fixed effects of the contemporary group and the age at slaughter as a linear covariate. Accuracies using the haplotype-based approach ranged from 0.07 to 0.31, and those SNP-based ranged from 0.06 to 0.33. Regression coefficients ranged from 0.07 to 0.74 and from 0.08 to 1.45 using the haplotype- and SNP-based approaches, respectively. Despite the low to moderate accuracies for the genomic values, it is possible to obtain genetic progress trough selection using genomic information based either on SNPs or haplotype markers. The SNP-based approach allows less biased genomic evaluations, and it is more feasible when taking into account the computational and operational cost underlying the haplotypes inference.
Collapse
Affiliation(s)
- Fabieli Loise Braga Feitosa
- Faculdade de Ciências Agrárias e Veterinárias, Departamento de Zootecnia, Universidade Estadual Paulista Júlio de Mesquita Filho, UNESP, Jaboticabal, Brazil
| | - Angélica Simone Cravo Pereira
- Faculdade de Zootecnia e Engenharia de Alimentos, Departamento de Nutrição e Produção Animal, Universidade de São Paulo, Pirassununga, Brazil
| | - Sabrina Thaise Amorim
- Faculdade de Ciências Agrárias e Veterinárias, Departamento de Zootecnia, Universidade Estadual Paulista Júlio de Mesquita Filho, UNESP, Jaboticabal, Brazil
| | - Elisa Peripolli
- Faculdade de Ciências Agrárias e Veterinárias, Departamento de Zootecnia, Universidade Estadual Paulista Júlio de Mesquita Filho, UNESP, Jaboticabal, Brazil
| | | | - Camila Urbano Braz
- Faculdade de Ciências Agrárias e Veterinárias, Departamento de Zootecnia, Universidade Estadual Paulista Júlio de Mesquita Filho, UNESP, Jaboticabal, Brazil
| | - Adrielle Matias Ferrinho
- Faculdade de Zootecnia e Engenharia de Alimentos, Departamento de Nutrição e Produção Animal, Universidade de São Paulo, Pirassununga, Brazil
| | | | | | - Rafael Espigolan
- Faculdade de Zootecnia e Engenharia de Alimentos, Departamento de Medicina Veterinária, Universidade de São Paulo, Pirassununga, Brazil
| | - Lucia Galvão de Albuquerque
- Faculdade de Ciências Agrárias e Veterinárias, Departamento de Zootecnia, Universidade Estadual Paulista Júlio de Mesquita Filho, UNESP, Jaboticabal, Brazil
| | - Fernando Baldi
- Faculdade de Ciências Agrárias e Veterinárias, Departamento de Zootecnia, Universidade Estadual Paulista Júlio de Mesquita Filho, UNESP, Jaboticabal, Brazil
| |
Collapse
|
50
|
Theoretical Evaluation of Multi-Breed Genomic Prediction in Chinese Indigenous Cattle. Animals (Basel) 2019; 9:ani9100789. [PMID: 31614691 PMCID: PMC6827096 DOI: 10.3390/ani9100789] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 09/24/2019] [Accepted: 10/02/2019] [Indexed: 12/19/2022] Open
Abstract
Simple Summary In order to evaluate the potential application of genomic selection (GS) for Chinese indigenous cattle, we assessed the influence of combining multiple populations on the reliability of genomic predictions for 10 indigenous breeds of Chinese cattle using simulated data. We found the predictive accuracies to be low when the reference and validation populations were sampled from different breeds. When using multiple breeds for the reference population, the predictive accuracies were higher if the reference was comprised of breeds with close relationships. In addition, the accuracy increased in all scenarios when the heritability increased, and the genetic architecture of the QTL can affect genomic prediction. Our study suggested that the application of meta-populations can increase accuracy in scenarios with a reduced size of reference populations. Abstract Genomic selection (GS) has been widely considered as a valuable strategy for enhancing the rate of genetic gain in farm animals. However, the construction of a large reference population is a big challenge for small populations like indigenous cattle. In order to evaluate the potential application of GS for Chinese indigenous cattle, we assessed the influence of combining multiple populations on the reliability of genomic predictions for 10 indigenous breeds of Chinese cattle using simulated data. Also, we examined the effect of different genetic architecture on prediction accuracy. In this study, we simulated a set of genotype data by a resampling approach which can reflect the realistic linkage disequilibrium pattern for multiple populations. We found within-breed evaluations yielded the highest accuracies ranged from 0.64 to 0.68 for four different simulated genetic architectures. For scenarios using multiple breeds as reference, the predictive accuracies were higher when the reference was comprised of breeds with a close relationship, while the accuracies were low when prediction were carried out among breeds. In addition, the accuracy increased in all scenarios with the heritability increased. Our results suggested that using meta-population as reference can increase accuracy of genomic predictions for small populations. Moreover, multi-breed genomic selection was feasible for Chinese indigenous populations with genetic relationships.
Collapse
|