1
|
Wang Y, Zhu B, Wang J, Zhang L, Xu L, Chen Y, Wang Z, Gao H, Li J, Gao X. Evaluation of genomic mating approach based on genetic algorithms for long-term selection in Huaxi cattle. BMC Genomics 2024; 25:1140. [PMID: 39587475 PMCID: PMC11590262 DOI: 10.1186/s12864-024-11057-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Accepted: 11/15/2024] [Indexed: 11/27/2024] Open
Abstract
BACKGROUND Genomic mating (GM) can effectively control the growth rate of inbreeding in population and achieve long-term sustainable genetic progress. However, the design of GM method and assessment of its effects during long-term selection have not been fully explored in beef cattle breeding. RESULTS In this study, we constructed a simulated population based on the real genotypes of Huaxi cattle, where five generations of simulated breeding were carried out using the genomic optimal contribution selection (GOCS), genetic algorithms strategy and three traditional mating strategies. During the breeding process, genetic parameters including average genomic estimated breeding value (GEBV), genetic gain values ( Δ G ), the rate of inbreeding values ( Δ F ) were calculated and compared across generations. Our results showed that the GM method could significantly improve the genetic gain while effectively controlling the inbreeding accumulation within the population. When using the GM method, there was an increase in genetic gain for Huaxi cattle ranging from 1.1% to 25.6% compared to traditional mating strategy, with inbreeding decreasing in the range of 5.8% to 36.2%. Validation using the real dataset from Huaxi cattle further confirmed our findings from the simulated study, offspring populations using the GM strategy exhibited a 7.3% increase in genetic gain compared to positive assortative mating. CONCLUSIONS These findings suggest that the GM method shows potential for achieving sustainable genetic gain and could be utilized during long-term selection in beef cattle breeding.
Collapse
Affiliation(s)
- Yuanqing Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, 100193, China
- Northern Agriculture and Livestock Husbandry Technology Innovation Center, Hohhot, 010010, China
| | - Jing Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| |
Collapse
|
2
|
Cui Z, Schumacher FR. Small-group originating model: Optimized individual-level GWAS simulation featured by SLiM and using open-access data. Comput Biol Chem 2024; 112:108147. [PMID: 39033733 DOI: 10.1016/j.compbiolchem.2024.108147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 05/22/2024] [Accepted: 07/08/2024] [Indexed: 07/23/2024]
Abstract
The development of analytical methods for Genome-wide Association Studies (GWAS) has outpaced the evolution of simulation techniques and pipelines. This disparity underscores the importance of innovative simulation methods that can keep pace with the rapidly increasing scale of GWAS. The median sample size of GWAS over the past ten years has exceeded 50,000 individuals, a trend that emphasizes the need for simulation tools capable of generating data on a similar or larger scale. This paper introduces a novel method, the small-group originating (SGO) model, utilizing the SLiM software for simulating individual-level GWAS data. Our standardized protocol facilitates the generation of tens of thousands of pseudo-individuals with millions of variants from small (30-90) open-access datasets. SGO stands out, especially when compared to the widely-used resampling method in HapGen, showcasing superior simulation efficiency for large sample sizes (> 13,000) of unrelated individuals. This capability is particularly relevant given the current trajectory towards larger GWAS, necessitating tools that can simulate datasets reflective of this growth. Additionally, SGO provides customization options and can model dynamic life cycles and mating across generations, positioning it as a highly promising alternative for GWAS simulations. In a case study, sensitivity analyses of chromosome-level principal component analysis and kinship coefficient estimation were conducted. The results highlighted the poor robustness of chromosome-level quality control (QC) indexes and the uneven distribution of population structure across chromosomes and ancestries, advocating for the caution against relying solely on chromosome-level QC statistics. With its flexible and efficient approach to generating pseudo GWAS data, our standardized SGO protocol emerges as a crucial asset for method development, power analysis, and benchmarking in GWAS research. It is especially vital in the context of accommodating the demands for large-scale simulations, aligning with the current and future scale of GWAS.
Collapse
Affiliation(s)
- Zuxi Cui
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Fredrick R Schumacher
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA; Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA.
| |
Collapse
|
3
|
Wientjes YCJ, Bijma P, van den Heuvel J, Zwaan BJ, Vitezica ZG, Calus MPL. The long-term effects of genomic selection: 2. Changes in allele frequencies of causal loci and new mutations. Genetics 2023; 225:iyad141. [PMID: 37506255 PMCID: PMC10471209 DOI: 10.1093/genetics/iyad141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 05/17/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
Genetic selection has been applied for many generations in animal, plant, and experimental populations. Selection changes the allelic architecture of traits to create genetic gain. It remains unknown whether the changes in allelic architecture are different for the recently introduced technique of genomic selection compared to traditional selection methods and whether they depend on the genetic architectures of traits. Here, we investigate the allele frequency changes of old and new causal loci under 50 generations of phenotypic, pedigree, and genomic selection, for a trait controlled by either additive, additive and dominance, or additive, dominance, and epistatic effects. Genomic selection resulted in slightly larger and faster changes in allele frequencies of causal loci than pedigree selection. For each locus, allele frequency change per generation was not only influenced by its statistical additive effect but also to a large extent by the linkage phase with other loci and its allele frequency. Selection fixed a large number of loci, and 5 times more unfavorable alleles became fixed with genomic and pedigree selection than with phenotypic selection. For pedigree selection, this was mainly a result of increased genetic drift, while genetic hitchhiking had a larger effect on genomic selection. When epistasis was present, the average allele frequency change was smaller (∼15% lower), and a lower number of loci became fixed for all selection methods. We conclude that for long-term genetic improvement using genomic selection, it is important to consider hitchhiking and to limit the loss of favorable alleles.
Collapse
Affiliation(s)
- Yvonne C J Wientjes
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | - Piter Bijma
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | - Joost van den Heuvel
- Laboratory of Genetics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | - Bas J Zwaan
- Laboratory of Genetics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | | | - Mario P L Calus
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| |
Collapse
|
4
|
Zheng X, Wang T, Niu Q, Wu J, Zhao Z, Gao H, Li J, Xu L. Evaluation of Linear Programming and Optimal Contribution Selection Approaches for Long-Term Selection on Beef Cattle Breeding. BIOLOGY 2023; 12:1157. [PMID: 37759557 PMCID: PMC10525978 DOI: 10.3390/biology12091157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 08/15/2023] [Accepted: 08/17/2023] [Indexed: 09/29/2023]
Abstract
The optimized selection method can maximize the genetic gain in offspring under the premise of controlling the inbreeding level of the population. At present, genetic gain has been largely improved by using genomic selection in multiple farm animals. However, the design of the optimal selection method and assessment of its effects during long-term selection in beef cattle breeding are yet to be fully explored. In this study, a simulated beef cattle population was constructed, and 15 generations of simulated breeding were carried out using the linear programming breeding strategy (LP) and optimal contribution selection strategy (OCS), respectively. The truncation selection strategy (TS-I and TS-II) was used as the control. During the breeding process, genetic parameters including genetic gain, average kinship coefficient, QTL effect variance, and average observed heterozygosity were calculated and compared across generations. Our results showed that the LP method can significantly improve the genetic gain in the population, especially the genetic performance of the traits with high heritability and the traits with high weight in the breeding process, but the inbreeding level of the population is higher under LP strategy. Although the genetic gain in the population under the OCS strategy is lower than the TS-II strategy, this method can effectively control the inbreeding level of the population. Our findings also suggest that the LP and OCS method can be used as an effective means to improve genetic gain, while the OCS method is a more ideal method to obtain sustainable genetic gain during long-term selection.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Junya Li
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (X.Z.); (T.W.); (Q.N.); (J.W.); (Z.Z.); (H.G.)
| | - Lingyang Xu
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (X.Z.); (T.W.); (Q.N.); (J.W.); (Z.Z.); (H.G.)
| |
Collapse
|
5
|
Hu X, Carver BF, El-Kassaby YA, Zhu L, Chen C. Weighted kernels improve multi-environment genomic prediction. Heredity (Edinb) 2023; 130:82-91. [PMID: 36522412 PMCID: PMC9905581 DOI: 10.1038/s41437-022-00582-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Revised: 11/27/2022] [Accepted: 11/28/2022] [Indexed: 12/23/2022] Open
Abstract
Crucial to variety improvement programs is the reliable and accurate prediction of genotype's performance across environments. However, due to the impactful presence of genotype by environment (G×E) interaction that dictates how changes in expression and function of genes influence target traits in different environments, prediction performance of genomic selection (GS) using single-environment models often falls short. Furthermore, despite the successes of genome-wide association studies (GWAS), the genetic insights derived from genome-to-phenome mapping have not yet been incorporated in predictive analytics, making GS models that use Gaussian kernel primarily an estimator of genomic similarity, instead of the underlying genetics characteristics of the populations. Here, we developed a GS framework that, in addition to capturing the overall genomic relationship, can capitalize on the signal of genetic associations of the phenotypic variation as well as the genetic characteristics of the populations. The capacity of predicting the performance of populations across environments was demonstrated by an overall gain in predictability up to 31% for the winter wheat DH population. Compared to Gaussian kernels, we showed that our multi-environment weighted kernels could better leverage the significance of genetic associations and yielded a marked improvement of 4-33% in prediction accuracy for half-sib families. Furthermore, the flexibility incorporated in our Bayesian implementation provides the generalizable capacity required for predicting multiple highly genetic heterogeneous populations across environments, allowing reliable GS for genetic improvement programs that have no access to genetically uniform material.
Collapse
Affiliation(s)
- Xiaowei Hu
- grid.65519.3e0000 0001 0721 7331Department of Statistics, Oklahoma State University, Stillwater, OK USA ,grid.27755.320000 0000 9136 933XPresent Address: Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA
| | - Brett F. Carver
- grid.65519.3e0000 0001 0721 7331Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK USA
| | - Yousry A. El-Kassaby
- grid.17091.3e0000 0001 2288 9830Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC Canada
| | - Lan Zhu
- grid.65519.3e0000 0001 0721 7331Department of Statistics, Oklahoma State University, Stillwater, OK USA
| | - Charles Chen
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, USA.
| |
Collapse
|
6
|
Caballero A, Fernández A, Villanueva B, Toro MA. A comparison of marker-based estimators of inbreeding and inbreeding depression. Genet Sel Evol 2022; 54:82. [PMID: 36575379 PMCID: PMC9793638 DOI: 10.1186/s12711-022-00772-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 12/14/2022] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND The availability of genome-wide marker data allows estimation of inbreeding coefficients (F, the probability of identity-by-descent, IBD) and, in turn, estimation of the rate of inbreeding depression (ΔID). We investigated, by computer simulations, the accuracy of the most popular estimators of inbreeding based on molecular markers when computing F and ΔID in populations under random mating, equalization of parental contributions, and artificially selected populations. We assessed estimators described by Li and Horvitz (FLH1 and FLH2), VanRaden (FVR1 and FVR2), Yang and colleagues (FYA1 and FYA2), marker homozygosity (FHOM), runs of homozygosity (FROH) and estimates based on pedigree (FPED) in comparison with estimates obtained from IBD measures (FIBD). RESULTS If the allele frequencies of a base population taken as a reference for the computation of inbreeding are known, all estimators based on marker allele frequencies are highly correlated with FIBD and provide accurate estimates of the mean ΔID. If base population allele frequencies are unknown and current frequencies are used in the estimations, the largest correlation with FIBD is generally obtained by FLH1 and the best estimator of ΔID is FYA2. The estimators FVR2 and FLH2 have the poorest performance in most scenarios. The assumption that base population allele frequencies are equal to 0.5 results in very biased estimates of the average inbreeding coefficient but they are highly correlated with FIBD and give relatively good estimates of ΔID. Estimates obtained directly from marker homozygosity (FHOM) substantially overestimated ΔID. Estimates based on runs of homozygosity (FROH) provide accurate estimates of inbreeding and ΔID. Finally, estimates based on pedigree (FPED) show a lower correlation with FIBD than molecular estimators but provide rather accurate estimates of ΔID. An analysis of data from a pig population supports the main findings of the simulations. CONCLUSIONS When base population allele frequencies are known, all marker-allele frequency-based estimators of inbreeding coefficients generally show a high correlation with FIBD and provide good estimates of ΔID. When base population allele frequencies are unknown, FLH1 is the marker frequency-based estimator that is most correlated with FIBD, and FYA2 provides the most accurate estimates of ΔID. Estimates from FROH are also very precise in most scenarios. The estimators FVR2 and FLH2 have the poorest performances.
Collapse
Affiliation(s)
- Armando Caballero
- grid.6312.60000 0001 2097 6738Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, 36310 Vigo, Spain
| | - Almudena Fernández
- Departamento de Mejora Genética Animal, INIA-CSIC, Ctra. de La Coruña, Km 7.5, 28040 Madrid, Spain
| | - Beatriz Villanueva
- Departamento de Mejora Genética Animal, INIA-CSIC, Ctra. de La Coruña, Km 7.5, 28040 Madrid, Spain
| | - Miguel A. Toro
- grid.5690.a0000 0001 2151 2978Departamento de Producción Agraria, ETSI Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, 28040 Madrid, Spain
| |
Collapse
|
7
|
Oldenbroek JK, Windig JJ. Opportunities of Genomics for the Use of Semen Cryo-Conserved in Gene Banks. Front Genet 2022; 13:907411. [PMID: 35938018 PMCID: PMC9350965 DOI: 10.3389/fgene.2022.907411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open
Abstract
Shortly after the introduction of cryo-conserved semen in the main farm animal species, gene banks were founded. Safeguarding farm animal genetic diversity for future use was and is the main objective. A sampling of sires was based on their pedigree and phenotypic information. Nowadays, DNA information from cryo-conserved sires and from animals in the living populations has become available. The combination of their DNA information can be used to realize three opportunities: 1) to make the gene bank a more complete archive of genetic diversity, 2) to determine the history of the genetic diversity from the living populations, and 3) to improve the performance and genetic diversity of living populations. These three opportunities for the use of gene bank sires in the genomic era are outlined in this study, and relevant recent literature is summarized to illustrate the great value of a gene bank as an archive of genetic diversity.
Collapse
|
8
|
Wientjes YCJ, Bijma P, Calus MPL, Zwaan BJ, Vitezica ZG, van den Heuvel J. The long-term effects of genomic selection: 1. Response to selection, additive genetic variance, and genetic architecture. Genet Sel Evol 2022; 54:19. [PMID: 35255802 PMCID: PMC8900405 DOI: 10.1186/s12711-022-00709-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 02/10/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Genomic selection has revolutionized genetic improvement in animals and plants, but little is known about its long-term effects. Here, we investigated the long-term effects of genomic selection on response to selection, genetic variance, and the genetic architecture of traits using stochastic simulations. We defined the genetic architecture as the set of causal loci underlying each trait, their allele frequencies, and their statistical additive effects. We simulated a livestock population under 50 generations of phenotypic, pedigree, or genomic selection for a single trait, controlled by either only additive, additive and dominance, or additive, dominance, and epistatic effects. The simulated epistasis was based on yeast data.
Results
Short-term response was always greatest with genomic selection, while response after 50 generations was greater with phenotypic selection than with genomic selection when epistasis was present, and was always greater than with pedigree selection. This was mainly because loss of genetic variance and of segregating loci was much greater with genomic and pedigree selection than with phenotypic selection. Compared to pedigree selection, selection response was always greater with genomic selection. Pedigree and genomic selection lost a similar amount of genetic variance after 50 generations of selection, but genomic selection maintained more segregating loci, which on average had lower minor allele frequencies than with pedigree selection. Based on this result, genomic selection is expected to better maintain genetic gain after 50 generations than pedigree selection. The amount of change in the genetic architecture of traits was considerable across generations and was similar for genomic and pedigree selection, but slightly less for phenotypic selection. Presence of epistasis resulted in smaller changes in allele frequencies and less fixation of causal loci, but resulted in substantial changes in statistical additive effects across generations.
Conclusions
Our results show that genomic selection outperforms pedigree selection in terms of long-term genetic gain, but results in a similar reduction of genetic variance. The genetic architecture of traits changed considerably across generations, especially under selection and when non-additive effects were present. In conclusion, non-additive effects had a substantial impact on the accuracy of selection and long-term response to selection, especially when selection was accurate.
Collapse
|
9
|
Villanueva B, Fernández A, Saura M, Caballero A, Fernández J, Morales-González E, Toro MA, Pong-Wong R. The value of genomic relationship matrices to estimate levels of inbreeding. Genet Sel Evol 2021; 53:42. [PMID: 33933002 PMCID: PMC8088726 DOI: 10.1186/s12711-021-00635-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Accepted: 04/19/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Genomic relationship matrices are used to obtain genomic inbreeding coefficients. However, there are several methodologies to compute these matrices and there is still an unresolved debate on which one provides the best estimate of inbreeding. In this study, we investigated measures of inbreeding obtained from five genomic matrices, including the Nejati-Javaremi allelic relationship matrix (FNEJ), the Li and Horvitz matrix based on excess of homozygosity (FL&H), and the VanRaden (methods 1, FVR1, and 2, FVR2) and Yang (FYAN) genomic relationship matrices. We derived expectations for each inbreeding coefficient, assuming a single locus model, and used these expectations to explain the patterns of the coefficients that were computed from thousands of single nucleotide polymorphism genotypes in a population of Iberian pigs. RESULTS Except for FNEJ, the evaluated measures of inbreeding do not match with the original definitions of inbreeding coefficient of Wright (correlation) or Malécot (probability). When inbreeding coefficients are interpreted as indicators of variability (heterozygosity) that was gained or lost relative to a base population, both FNEJ and FL&H led to sensible results but this was not the case for FVR1, FVR2 and FYAN. When variability has increased relative to the base, FVR1, FVR2 and FYAN can indicate that it decreased. In fact, based on FYAN, variability is not expected to increase. When variability has decreased, FVR1 and FVR2 can indicate that it has increased. Finally, these three coefficients can indicate that more variability than that present in the base population can be lost, which is also unreasonable. The patterns for these coefficients observed in the pig population were very different, following the derived expectations. As a consequence, the rate of inbreeding depression estimated based on these inbreeding coefficients differed not only in magnitude but also in sign. CONCLUSIONS Genomic inbreeding coefficients obtained from the diagonal elements of genomic matrices can lead to inconsistent results in terms of gain and loss of genetic variability and inbreeding depression estimates, and thus to misleading interpretations. Although these matrices have proven to be very efficient in increasing the accuracy of genomic predictions, they do not always provide a useful measure of inbreeding.
Collapse
Affiliation(s)
- Beatriz Villanueva
- Departamento de Mejora Genética Animal, INIA, Ctra. de La Coruña, km 7.5, 28040 Madrid, Spain
| | - Almudena Fernández
- Departamento de Mejora Genética Animal, INIA, Ctra. de La Coruña, km 7.5, 28040 Madrid, Spain
| | - María Saura
- Departamento de Mejora Genética Animal, INIA, Ctra. de La Coruña, km 7.5, 28040 Madrid, Spain
| | - Armando Caballero
- Centro de Investigación Mariña, Universidade de Vigo, Departamento de Bioquímica, Genética E Inmunología, Campus de Vigo, 36310 Vigo, Spain
| | - Jesús Fernández
- Departamento de Mejora Genética Animal, INIA, Ctra. de La Coruña, km 7.5, 28040 Madrid, Spain
| | | | - Miguel A. Toro
- Departamento de Producción Agraria, ETSI Agrónomos, Universidad Politécnica de Madrid, 28040 Madrid, Spain
| | - Ricardo Pong-Wong
- Genetics and Genomics, The Roslin Institute and the R(D)SVS, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG UK
| |
Collapse
|
10
|
Dementieva NV, Mitrofanova OV, Dysin AP, Kudinov AA, Stanishevskaya OI, Larkina TA, Plemyashov KV, Griffin DK, Romanov MN, Smaragdov MG. Assessing the effects of rare alleles and linkage disequilibrium on estimates of genetic diversity in the chicken populations. Animal 2021; 15:100171. [PMID: 33563558 DOI: 10.1016/j.animal.2021.100171] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 12/16/2020] [Accepted: 12/18/2020] [Indexed: 12/15/2022] Open
Abstract
Phenotypic diversity in poultry has been mainly driven by artificial selection and genetic drift. These led to the adaptation to the environment and the development of specific phenotypic traits of chickens in response to their economic use. This study evaluated genetic diversity within and between Russian breeds and populations using Illumina Chicken 60K SNP iSelect BeadChip by analysing genetic differences between populations with Hudson's fixation index (FST statistic) and heterozygosity. We estimated the effect of rare alleles and linkage disequilibrium (LD) on these measurements. To assess the effect of LD on the genetic diversity population, we carried out the LD-based pruning (LD<0.5 and LD<0.1) for seven chicken populations combined (I) or separately (II). LD pruning was specific for different dataset groups. Because of the noticeably large sample size in the Russian White RG population, pruning was substantial for Dataset I, and FST values were only positive when LD<0.1 pruning was applied. For Dataset II, the LD pruning results were confirmed by examining heterozygosity and alleles' frequency distribution. LD between single nucleotide polymorphisms was consistent across the seven chicken populations, except the Russian White RG population with the smallest r2 values and the largest effective population size. Our findings suggest to study variability in each population LD pruning has to be carried separately not after merging to avoid bias in estimates.
Collapse
Affiliation(s)
- N V Dementieva
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - O V Mitrofanova
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - A P Dysin
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - A A Kudinov
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - O I Stanishevskaya
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - T A Larkina
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - K V Plemyashov
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| | - D K Griffin
- School of Biosciences, University of Kent, Canterbury, Kent, UK
| | - M N Romanov
- School of Biosciences, University of Kent, Canterbury, Kent, UK.
| | - M G Smaragdov
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the L. K. Ernst Federal Science Centre for Animal Husbandry, Pushkin, St Petersburg, Russia
| |
Collapse
|
11
|
Bérodier M, Berg P, Meuwissen T, Boichard D, Brochard M, Ducrocq V. Improved dairy cattle mating plans at herd level using genomic information. Animal 2020; 15:100016. [PMID: 33516018 DOI: 10.1016/j.animal.2020.100016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 07/17/2020] [Accepted: 07/28/2020] [Indexed: 11/19/2022] Open
Abstract
From 2012 to 2018, 223 180 Montbéliarde females were genotyped in France and the number of newly genotyped females increased at a rate of about 33% each year. With female genotyping information, farmers have access to the genomic estimated breeding values of the females in their herd and to their carrier status for genetic defects or major genes segregating in the breed. This information, combined with genomic coancestry, can be used when planning matings in order to maximize the expected on-farm profit of future female offspring. We compared different mating allocation approaches for their capacity to maximize the expected genetic gain while limiting expected progeny inbreeding and the probability to conceive an offspring homozygous for a lethal recessive allele. Three mate allocation strategies (random mating (RAND), sequential mating (gSEQ€) and linear programing mating (gLP€)) were compared on 160 actual Montbéliarde herds using male and female genomic information. Then, we assessed the benefit of using female genomic information by comparing matings planned using only female pedigree information with the equivalent strategy using genomic information. We measured the benefit of adding genomic expected inbreeding and risk of conception of an offspring homozygous for a lethal recessive allele to Net merit in mating plans. The influence of three constraints was tested: by relaxing the constraint on availability of a particular semen type (sexed or conventional) for bulls, by adding an upper limit of 8.5% coancestry between mate pairs or by using a more stringent maximum use of a bull in a herd (5% vs 10%). The use of genomic information instead of pedigree information improved the mate allocation method in terms of progeny expected genetic merit, genetic diversity and risk to conceive an offspring homozygous for a lethal recessive allele. Optimizing mate allocation using linear programming and constraining coancestry to a maximum of 8.5% per mate pair reduced the average coancestry with a small impact on expected Net Merit. In summary, for male and female selection pathways, using genomic information is more efficient than using pedigree information to maximize genetic gain while constraining the expected inbreeding of the progeny and the risk to conceive an offspring homozygous for a lethal recessive allele. This study also underlines the key role of semen type (sexed vs conventional) and the associated constraints on the mate allocation algorithm to maximize genetic gain while maintaining genetic diversity and limiting the risk to conceive an offspring homozygous for a lethal recessive allele.
Collapse
Affiliation(s)
- M Bérodier
- UMR GABI, AgroParisTech, INRAE, Université Paris-Saclay, 78350 Jouy-en-Josas, France; MO3, 01250, Ceyzériat, France.
| | - P Berg
- Norwegian University of Life Sciences, PB 5002, N-1432 Ås, Norway
| | - T Meuwissen
- Norwegian University of Life Sciences, PB 5002, N-1432 Ås, Norway
| | - D Boichard
- UMR GABI, AgroParisTech, INRAE, Université Paris-Saclay, 78350 Jouy-en-Josas, France
| | - M Brochard
- MO3, 01250, Ceyzériat, France; Umotest, 01250, Ceyzériat, France
| | - V Ducrocq
- UMR GABI, AgroParisTech, INRAE, Université Paris-Saclay, 78350 Jouy-en-Josas, France
| |
Collapse
|
12
|
Intensified Use of Reproductive Technologies and Reduced Dimensions of Breeding Schemes Put Genetic Diversity at Risk in Dairy Cattle Breeds. Animals (Basel) 2020; 10:ani10101903. [PMID: 33080801 PMCID: PMC7650664 DOI: 10.3390/ani10101903] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 10/14/2020] [Accepted: 10/15/2020] [Indexed: 11/16/2022] Open
Abstract
In the management of dairy cattle breeds, two recent trends have arisen that pose potential threats to genetic diversity: the use of reproductive technologies (RT) and a reduction in the number of bulls in breeding schemes. The expected outcome of these changes, in terms of both genetic gain and genetic diversity, is not trivial to predict. Here, we simulated 15 breeding schemes similar to those carried out in large French dairy cattle breeds; breeding schemes differed with respect to their dimensions, the intensity of RT use, and the type of RT involved. We found that intensive use of RT resulted in improved genetic gain, but deteriorated genetic diversity. Specifically, a reduction in the interval between generations through the use of ovum pick-up and in vitro fertilization (OPU-IVF) resulted in a large increase in the inbreeding rate both per year and per generation, suggesting that OPU-IVF could have severe adverse effects on genetic diversity. To achieve a given level of genetic gain, the scenarios that best maintained genetic diversity were those with a higher number of sires/bulls and a medium intensity of RT use or those with a higher number of female donors to compensate for the increased intensity of RT.
Collapse
|
13
|
Meuwissen THE, Sonesson AK, Gebregiwergis G, Woolliams JA. Management of Genetic Diversity in the Era of Genomics. Front Genet 2020; 11:880. [PMID: 32903415 PMCID: PMC7438563 DOI: 10.3389/fgene.2020.00880] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 07/17/2020] [Indexed: 12/27/2022] Open
Abstract
Management of genetic diversity aims to (i) maintain heterozygosity, which ameliorates inbreeding depression and loss of genetic variation at loci that may become of importance in the future; and (ii) avoid genetic drift, which prevents deleterious recessives (e.g., rare disease alleles) from drifting to high frequency, and prevents random drift of (functional) traits. In the genomics era, genomics data allow for many alternative measures of inbreeding and genomic relationships. Genomic relationships/inbreeding can be classified into (i) homozygosity/heterozygosity based (e.g., molecular kinship matrix); (ii) genetic drift-based, i.e., changes of allele frequencies; or (iii) IBD-based, i.e., SNPs are used in linkage analyses to identify IBD segments. Here, alternative measures of inbreeding/relationship were used to manage genetic diversity in genomic optimal contribution (GOC) selection schemes. Contrary to classic inbreeding theory, it was found that drift and homozygosity-based inbreeding could differ substantially in GOC schemes unless diversity management was based upon IBD. When using a homozygosity-based measure of relationship, the inbreeding management resulted in allele frequency changes toward 0.5 giving a low rate of increase in homozygosity for the panel used for management, but not for unmanaged neutral loci, at the expense of a high genetic drift. When genomic relationship matrices were based on drift, following VanRaden and as in GCTA, drift was low at the expense of a high rate of increase in homozygosity. The use of IBD-based relationship matrices for inbreeding management limited both drift and the homozygosity-based rate of inbreeding to their target values. Genetic improvement per percent of inbreeding was highest when GOC used IBD-based relationships irrespective of the inbreeding measure used. Genomic relationships based on runs of homozygosity resulted in very high initial improvement per percent of inbreeding, but also in substantial discrepancies between drift and homozygosity-based rates of inbreeding, and resulted in a drift that exceeded its target value. The discrepancy between drift and homozygosity-based rates of inbreeding was caused by a covariance between initial allele frequency and the subsequent change in frequency, which becomes stronger when using data from whole genome sequence.
Collapse
Affiliation(s)
- Theo H E Meuwissen
- Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, Ås, Norway
| | | | | | - John A Woolliams
- The Roslin Institute and R(D)SVS, The University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
14
|
Smaragdov MG, Kudinov AA. Assessing the power of principal components and wright's fixation index analyzes applied to reveal the genome-wide genetic differences between herds of Holstein cows. BMC Genet 2020; 21:47. [PMID: 32345235 PMCID: PMC7189535 DOI: 10.1186/s12863-020-00848-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Accepted: 03/27/2020] [Indexed: 11/30/2022] Open
Abstract
Background Due to the advent of SNP array technology, a genome-wide analysis of genetic differences between populations and breeds has become possible at a previously unattainable level. The Wright’s fixation index (Fst) and the principal component analysis (PCA) are widely used methods in animal genetics studies. In paper we compared the power of these methods, their complementing each other and which of them is the most powerful. Results Comparative analysis of the power Principal Components Analysis (PCA) and Fst were carried out to reveal genetic differences between herds of Holsteinized cows. Totally, 803 BovineSNP50 genotypes of cows from 13 herds were used in current study. Obtained Fst values were in the range of 0.002–0.012 (mean 0.0049) while for rare SNPs with MAF 0.0001–0.005 they were even smaller in the range of 0.001–0.01 (mean 0.0027). Genetic relatedness of the cows in the herds was the cause of such small Fst values. The contribution of rare alleles with MAF 0.0001–0.01 to the Fst values was much less than common alleles and this effect depends on linkage disequilibrium (LD). Despite of substantial change in the MAF spectrum and the number of SNPs we observed small effect size of LD - based pruning on Fst data. PCA analysis confirmed the mutual admixture and small genetic difference between herds. Moreover, PCA analysis of the herds based on the visualization the results of a single eigenvector cannot be used to significantly differentiate herds. Only summed eigenvectors should be used to realize full power of PCA to differentiate small between herds genetic difference. Finally, we presented evidences that the significance of Fst data far exceeds the significance of PCA data when these methods are used to reveal genetic differences between herds. Conclusions LD - based pruning had a small effect on findings of Fst and PCA analyzes. Therefore, for weakly structured populations the LD - based pruning is not effective. In addition, our results show that the significance of genetic differences between herds obtained by Fst analysis exceeds the values of PCA. Proposed, to differentiate herds or low structured populations we recommend primarily using the Fst approach and only then PCA.
Collapse
Affiliation(s)
- M G Smaragdov
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the l.K. Ernst Federal Science Center for Animal Husbandry, St. Petersburg, Pushkin, Russia. .,, St. Petersburg, Russian Federation.
| | - A A Kudinov
- Russian Research Institute of Farm Animal Genetics and Breeding - Branch of the l.K. Ernst Federal Science Center for Animal Husbandry, St. Petersburg, Pushkin, Russia.,Department of Agricultural Science, University of Helsinki, FI-00014, Helsinki, Finland
| |
Collapse
|
15
|
Pyhäjärvi T, Kujala ST, Savolainen O. 275 years of forestry meets genomics in Pinus sylvestris. Evol Appl 2020; 13:11-30. [PMID: 31988655 PMCID: PMC6966708 DOI: 10.1111/eva.12809] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 04/05/2019] [Accepted: 04/24/2019] [Indexed: 12/12/2022] Open
Abstract
Pinus sylvestris has a long history of basic and applied research that is relevant for both forestry and evolutionary studies. Its patterns of adaptive variation and role in forest economic and ecological systems have been studied extensively for nearly 275 years, detailed demography for a 100 years and mating system more than 50 years. However, its reference genome sequence is not yet available and genomic studies have been lagging compared to, for example, Pinus taeda and Picea abies, two other economically important conifers. Despite the lack of reference genome, many modern genomic methods are applicable for a more detailed look at its biological characteristics. For example, RNA-seq has revealed a complex transcriptional landscape and targeted DNA sequencing displays an excess of rare variants and geographically homogenously distributed molecular genetic diversity. Current DNA and RNA resources can be used as a reference for gene expression studies, SNP discovery, and further targeted sequencing. In the future, specific consequences of the large genome size, such as functional effects of regulatory open chromatin regions and transposable elements, should be investigated more carefully. For forest breeding and long-term management purposes, genomic data can help in assessing the genetic basis of inbreeding depression and the application of genomic tools for genomic prediction and relatedness estimates. Given the challenges of breeding (long generation time, no easy vegetative propagation) and the economic importance, application of genomic tools has a potential to have a considerable impact. Here, we explore how genomic characteristics of P. sylvestris, such as rare alleles and the low extent of linkage disequilibrium, impact the applicability and power of the tools.
Collapse
Affiliation(s)
- Tanja Pyhäjärvi
- Department of Ecology and GeneticsUniversity of OuluOuluFinland
- Biocenter OuluUniversity of OuluOuluFinland
| | | | - Outi Savolainen
- Department of Ecology and GeneticsUniversity of OuluOuluFinland
- Biocenter OuluUniversity of OuluOuluFinland
| |
Collapse
|
16
|
Doublet AC, Croiseau P, Fritz S, Michenet A, Hozé C, Danchin-Burge C, Laloë D, Restoux G. The impact of genomic selection on genetic diversity and genetic gain in three French dairy cattle breeds. Genet Sel Evol 2019; 51:52. [PMID: 31547802 PMCID: PMC6757367 DOI: 10.1186/s12711-019-0495-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 09/11/2019] [Indexed: 11/23/2022] Open
Abstract
Background In France, implementation of genomic evaluations in dairy cattle breeds started in 2009 and this has modified the breeding schemes drastically. In this context, the goal of our study was to understand the impact of genomic selection on the genetic diversity of bulls from three French dairy cattle breeds born between 2005 and 2015 (Montbéliarde, Normande and Holstein) and the factors that are involved. Methods We compared annual genetic gains, inbreeding rates based on runs of homozygosity (ROH) and pedigree data, and mean ROH length within breeds, before and after the implementation of genomic selection. Results Genomic selection induced an increase in mean annual genetic gains of 50, 71 and 33% for Montbéliarde, Normande and Holstein bulls, respectively, and in parallel, the generation intervals were reduced by a factor of 1.7, 1.9 and 2, respectively. We found no significant change in inbreeding rate for the two national breeds, Montbéliarde and Normande, and a significant increase in inbreeding rate for the Holstein international breed, which is now as high as 0.55% per year based on ROH and 0.49% per year based on pedigree data (equivalent to a rate of 1.36 and 1.39% per generation, respectively). The mean ROH length was longer for bulls from the Holstein breed than for those from the other two breeds. Conclusions With the implementation of genomic selection, the annual genetic gain increased for bulls from the three major French dairy cattle breeds. At the same time, the annual loss of genetic diversity increased for Holstein bulls, possibly because of the massive use of a few elite bulls in this breed, but not for Montbéliarde and Normande bulls. The increase in mean ROH length in Holstein may reflect the occurrence of recent inbreeding. New strategies in breeding schemes, such as female donor stations and embryo transfer, and recent implementation of genomic evaluations in small regional breeds should be studied carefully in order to ensure the sustainability of breeding schemes in the future.
Collapse
Affiliation(s)
- Anna-Charlotte Doublet
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France. .,ALLICE, Paris, France.
| | - Pascal Croiseau
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Sébastien Fritz
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.,ALLICE, Paris, France
| | - Alexis Michenet
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.,ALLICE, Paris, France
| | - Chris Hozé
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.,ALLICE, Paris, France
| | | | - Denis Laloë
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Gwendal Restoux
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| |
Collapse
|
17
|
Gervais L, Perrier C, Bernard M, Merlet J, Pemberton JM, Pujol B, Quéméré E. RAD-sequencing for estimating genomic relatedness matrix-based heritability in the wild: A case study in roe deer. Mol Ecol Resour 2019; 19:1205-1217. [PMID: 31058463 DOI: 10.1111/1755-0998.13031] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Revised: 04/19/2019] [Accepted: 04/23/2019] [Indexed: 01/02/2023]
Abstract
Estimating the evolutionary potential of quantitative traits and reliably predicting responses to selection in wild populations are important challenges in evolutionary biology. The genomic revolution has opened up opportunities for measuring relatedness among individuals with precision, enabling pedigree-free estimation of trait heritabilities in wild populations. However, until now, most quantitative genetic studies based on a genomic relatedness matrix (GRM) have focused on long-term monitored populations for which traditional pedigrees were also available, and have often had access to knowledge of genome sequence and variability. Here, we investigated the potential of RAD-sequencing for estimating heritability in a free-ranging roe deer (Capreolous capreolus) population for which no prior genomic resources were available. We propose a step-by-step analytical framework to optimize the quality and quantity of the genomic data and explore the impact of the single nucleotide polymorphism (SNP) calling and filtering processes on the GRM structure and GRM-based heritability estimates. As expected, our results show that sequence coverage strongly affects the number of recovered loci, the genotyping error rate and the amount of missing data. Ultimately, this had little effect on heritability estimates and their standard errors, provided that the GRM was built from a minimum number of loci (above 7,000). Genomic relatedness matrix-based heritability estimates thus appear robust to a moderate level of genotyping errors in the SNP data set. We also showed that quality filters, such as the removal of low-frequency variants, affect the relatedness structure of the GRM, generating lower h2 estimates. Our work illustrates the huge potential of RAD-sequencing for estimating GRM-based heritability in virtually any natural population.
Collapse
Affiliation(s)
- Laura Gervais
- CEFS, INRA, Université de Toulouse, Castanet-Tolosan, Cedex, France.,Laboratoire Évolution & Diversité Biologique (EDB UMR 5174), CNRS, IRD, UPS, Université Fédérale de Toulouse Midi-Pyrénées, Toulouse, France
| | | | - Maria Bernard
- SIGENAE, INRA, Jouy-en-Josas, France.,GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France
| | - Joël Merlet
- CEFS, INRA, Université de Toulouse, Castanet-Tolosan, Cedex, France
| | - Josephine M Pemberton
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Benoit Pujol
- Laboratoire Évolution & Diversité Biologique (EDB UMR 5174), CNRS, IRD, UPS, Université Fédérale de Toulouse Midi-Pyrénées, Toulouse, France.,PSL Université Paris: EPHE-UPVD-CNRS, Université de Perpignan, Perpignan, France
| | - Erwan Quéméré
- CEFS, INRA, Université de Toulouse, Castanet-Tolosan, Cedex, France
| |
Collapse
|
18
|
Pégard M, Rogier O, Bérard A, Faivre-Rampant P, Paslier MCL, Bastien C, Jorge V, Sánchez L. Sequence imputation from low density single nucleotide polymorphism panel in a black poplar breeding population. BMC Genomics 2019; 20:302. [PMID: 30999856 PMCID: PMC6471894 DOI: 10.1186/s12864-019-5660-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Accepted: 03/29/2019] [Indexed: 12/30/2022] Open
Abstract
Background Genomic selection accuracy increases with the use of high SNP (single nucleotide polymorphism) coverage. However, such gains in coverage come at high costs, preventing their prompt operational implementation by breeders. Low density panels imputed to higher densities offer a cheaper alternative during the first stages of genomic resources development. Our study is the first to explore the imputation in a tree species: black poplar. About 1000 pure-breed Populus nigra trees from a breeding population were selected and genotyped with a 12K custom Infinium Bead-Chip. Forty-three of those individuals corresponding to nodal trees in the pedigree were fully sequenced (reference), while the remaining majority (target) was imputed from 8K to 1.4 million SNPs using FImpute. Each SNP and individual was evaluated for imputation errors by leave-one-out cross validation in the training sample of 43 sequenced trees. Some summary statistics such as Hardy-Weinberg Equilibrium exact test p-value, quality of sequencing, depth of sequencing per site and per individual, minor allele frequency, marker density ratio or SNP information redundancy were calculated. Principal component and Boruta analyses were used on all these parameters to rank the factors affecting the quality of imputation. Additionally, we characterize the impact of the relatedness between reference population and target population. Results During the imputation process, we used 7540 SNPs from the chip to impute 1,438,827 SNPs from sequences. At the individual level, imputation accuracy was high with a proportion of SNPs correctly imputed between 0.84 and 0.99. The variation in accuracies was mostly due to differences in relatedness between individuals. At a SNP level, the imputation quality depended on genotyped SNP density and on the original minor allele frequency. The imputation did not appear to result in an increase of linkage disequilibrium. The genotype densification not only brought a better distribution of markers all along the genome, but also we did not detect any substantial bias in annotation categories. Conclusions This study shows that it is possible to impute low-density marker panels to whole genome sequence with good accuracy under certain conditions that could be common to many breeding populations. Electronic supplementary material The online version of this article (10.1186/s12864-019-5660-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marie Pégard
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Odile Rogier
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Aurélie Bérard
- Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France
| | - Patricia Faivre-Rampant
- Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France
| | - Marie-Christine Le Paslier
- Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France
| | - Catherine Bastien
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Véronique Jorge
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France
| | - Leopoldo Sánchez
- BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France.
| |
Collapse
|
19
|
Doekes HP, Veerkamp RF, Bijma P, Hiemstra SJ, Windig J. Value of the Dutch Holstein Friesian germplasm collection to increase genetic variability and improve genetic merit. J Dairy Sci 2018; 101:10022-10033. [PMID: 30219429 DOI: 10.3168/jds.2018-15217] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 08/01/2018] [Indexed: 11/19/2022]
Abstract
National gene bank collections for Holstein Friesian (HF) dairy cattle were set up in the 1990s. In this study, we assessed the value of bulls from the Dutch HF germplasm collection, also known as cryobank bulls, to increase genetic variability and improve genetic merit in the current bull population (bulls born in 2010-2015). Genetic variability was defined as 1 minus the mean genomic similarity (SIMSNP) or as 1 minus the mean pedigree-based kinship (fPED). Genetic merit was defined as the mean estimated breeding value for the total merit index or for 1 of 3 subindices (yield, fertility, and udder health). Using optimal contribution selection, we minimized relatedness (maximized variability) or maximized genetic merit at restricted levels of relatedness. We compared breeding schemes with only bulls from 2010 to 2015 with schemes in which cryobank bulls were also included. When we minimized relatedness, inclusion of genotyped cryobank bulls decreased mean SIMSNP by 0.7% and inclusion of both genotyped and nongenotyped cryobank bulls decreased mean fPED by 2.6% (in absolute terms). When we maximized merit at restricted levels of relatedness, inclusion of cryobank bulls provided additional merit at any level of mean SIMSNP or mean fPED except for the total merit index at high levels of mean SIMSNP. Additional merit from cryobank bulls depended on (1) the relative emphasis on genetic variability and (2) the selection criterion. Additional merit was higher when more emphasis was put on genetic variability. For fertility, for example, it was 1.74 SD at a mean SIMSNP restriction of 64.5% and 0.37 SD at a mean SIMSNP restriction of 67.5%. Additional merit was low to nonexistent for the total merit index and higher for the subindices, especially for fertility. At a mean SIMSNP of 64.5%, for example, it was 0.60 SD for the total merit index and 1.74 SD for fertility. In conclusion, Dutch HF cryobank bulls can be used to increase genetic variability and improve genetic merit in the current population, although their value is very limited when selecting for the current total merit index. Anticipating changes in the breeding goal in the future, the germplasm collection is a valuable resource for commercial breeding populations.
Collapse
Affiliation(s)
- H P Doekes
- Animal Breeding and Genomics, 6700 AH, Wageningen, the Netherlands; Centre for Genetic Resources the Netherlands, Wageningen University and Research, PO Box 338, 6700 AH, Wageningen, the Netherlands.
| | - R F Veerkamp
- Animal Breeding and Genomics, 6700 AH, Wageningen, the Netherlands
| | - P Bijma
- Animal Breeding and Genomics, 6700 AH, Wageningen, the Netherlands
| | - S J Hiemstra
- Centre for Genetic Resources the Netherlands, Wageningen University and Research, PO Box 338, 6700 AH, Wageningen, the Netherlands
| | - J Windig
- Animal Breeding and Genomics, 6700 AH, Wageningen, the Netherlands; Centre for Genetic Resources the Netherlands, Wageningen University and Research, PO Box 338, 6700 AH, Wageningen, the Netherlands
| |
Collapse
|
20
|
Doekes HP, Veerkamp RF, Bijma P, Hiemstra SJ, Windig JJ. Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015. Genet Sel Evol 2018; 50:15. [PMID: 29642838 PMCID: PMC5896142 DOI: 10.1186/s12711-018-0385-y] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Accepted: 03/27/2018] [Indexed: 12/19/2022] Open
Abstract
Background In recent decades, Holstein–Friesian (HF) selection schemes have undergone profound changes, including the introduction of optimal contribution selection (OCS; around 2000), a major shift in breeding goal composition (around 2000) and the implementation of genomic selection (GS; around 2010). These changes are expected to have influenced genetic diversity trends. Our aim was to evaluate genome-wide and region-specific diversity in HF artificial insemination (AI) bulls in the Dutch-Flemish breeding program from 1986 to 2015. Methods Pedigree and genotype data (~ 75.5 k) of 6280 AI-bulls were used to estimate rates of genome-wide inbreeding and kinship and corresponding effective population sizes. Region-specific inbreeding trends were evaluated using regions of homozygosity (ROH). Changes in observed allele frequencies were compared to those expected under pure drift to identify putative regions under selection. We also investigated the direction of changes in allele frequency over time. Results Effective population size estimates for the 1986–2015 period ranged from 69 to 102. Two major breakpoints were observed in genome-wide inbreeding and kinship trends. Around 2000, inbreeding and kinship levels temporarily dropped. From 2010 onwards, they steeply increased, with pedigree-based, ROH-based and marker-based inbreeding rates as high as 1.8, 2.1 and 2.8% per generation, respectively. Accumulation of inbreeding varied substantially across the genome. A considerable fraction of markers showed changes in allele frequency that were greater than expected under pure drift. Putative selected regions harboured many quantitative trait loci (QTL) associated to a wide range of traits. In consecutive 5-year periods, allele frequencies changed more often in the same direction than in opposite directions, except when comparing the 1996–2000 and 2001–2005 periods. Conclusions Genome-wide and region-specific diversity trends reflect major changes in the Dutch-Flemish HF breeding program. Introduction of OCS and the shift in breeding goal were followed by a drop in inbreeding and kinship and a shift in the direction of changes in allele frequency. After introduction of GS, rates of inbreeding and kinship increased substantially while allele frequencies continued to change in the same direction as before GS. These results provide insight in the effect of breeding practices on genomic diversity and emphasize the need for efficient management of genetic diversity in GS schemes. Electronic supplementary material The online version of this article (10.1186/s12711-018-0385-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Harmen P Doekes
- Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands. .,Centre for Genetic Resources the Netherlands, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Roel F Veerkamp
- Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Piter Bijma
- Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Sipke J Hiemstra
- Centre for Genetic Resources the Netherlands, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Jack J Windig
- Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.,Centre for Genetic Resources the Netherlands, Wageningen University & Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
21
|
Which Individuals To Choose To Update the Reference Population? Minimizing the Loss of Genetic Diversity in Animal Genomic Selection Programs. G3-GENES GENOMES GENETICS 2018; 8:113-121. [PMID: 29133511 PMCID: PMC5765340 DOI: 10.1534/g3.117.1117] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Genomic selection (GS) is commonly used in livestock and increasingly in plant breeding. Relying on phenotypes and genotypes of a reference population, GS allows performance prediction for young individuals having only genotypes. This is expected to achieve fast high genetic gain but with a potential loss of genetic diversity. Existing methods to conserve genetic diversity depend mostly on the choice of the breeding individuals. In this study, we propose a modification of the reference population composition to mitigate diversity loss. Since the high cost of phenotyping is the limiting factor for GS, our findings are of major economic interest. This study aims to answer the following questions: how would decisions on the reference population affect the breeding population, and how to best select individuals to update the reference population and balance maximizing genetic gain and minimizing loss of genetic diversity? We investigated three updating strategies for the reference population: random, truncation, and optimal contribution (OC) strategies. OC maximizes genetic merit for a fixed loss of genetic diversity. A French Montbéliarde dairy cattle population with 50K SNP chip genotypes and simulations over 10 generations were used to compare these different strategies using milk production as the trait of interest. Candidates were selected to update the reference population. Prediction bias and both genetic merit and diversity were measured. Changes in the reference population composition slightly affected the breeding population. Optimal contribution strategy appeared to be an acceptable compromise to maintain both genetic gain and diversity in the reference and the breeding populations.
Collapse
|
22
|
Bouwman AC, Hayes BJ, Calus MPL. Estimated allele substitution effects underlying genomic evaluation models depend on the scaling of allele counts. Genet Sel Evol 2017; 49:79. [PMID: 29084514 PMCID: PMC5662034 DOI: 10.1186/s12711-017-0355-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Accepted: 10/20/2017] [Indexed: 12/03/2022] Open
Abstract
Background Genomic evaluation is used to predict direct genomic values (DGV) for selection candidates in breeding programs, but also to estimate allele substitution effects (ASE) of single nucleotide polymorphisms (SNPs). Scaling of allele counts influences the estimated ASE, because scaling of allele counts results in less shrinkage towards the mean for low minor allele frequency (MAF) variants. Scaling may become relevant for estimating ASE as more low MAF variants will be used in genomic evaluations. We show the impact of scaling on estimates of ASE using real data and a theoretical framework, and in terms of power, model fit and predictive performance. Results In a dairy cattle dataset with 630 K SNP genotypes, the correlation between DGV for stature from a random regression model using centered allele counts (RRc) and centered and scaled allele counts (RRcs) was 0.9988, whereas the overall correlation between ASE using RRc and RRcs was 0.27. The main difference in ASE between both methods was found for SNPs with a MAF lower than 0.01. Both the ratio (ASE from RRcs/ASE from RRc) and the regression coefficient (regression of ASE from RRcs on ASE from RRc) were much higher than 1 for low MAF SNPs. Derived equations showed that scenarios with a high heritability, a large number of individuals and a small number of variants have lower ratios between ASE from RRc and RRcs. We also investigated the optimal scaling parameter [from − 1 (RRcs) to 0 (RRc) in steps of 0.1] in the bovine stature dataset. We found that the log-likelihood was maximized with a scaling parameter of − 0.8, while the mean squared error of prediction was minimized with a scaling parameter of − 1, i.e., RRcs. Conclusions Large differences in estimated ASE were observed for low MAF SNPs when allele counts were scaled or not scaled because there is less shrinkage towards the mean for scaled allele counts. We derived a theoretical framework that shows that the difference in ASE due to shrinkage is heavily influenced by the power of the data. Increasing the power results in smaller differences in ASE whether allele counts are scaled or not.
Collapse
Affiliation(s)
- Aniek C Bouwman
- Animal Breeding and Genomics Centre, Wageningen Livestock Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, University of Queensland, Brisbane, QLD, Australia.,Department of Economic Development, Jobs, Transport and Resources, Government of Victoria, 5 Ring Rd., Bundoora, VIC, 3083, Australia
| | - Mario P L Calus
- Animal Breeding and Genomics Centre, Wageningen Livestock Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
23
|
Ni G, Cavero D, Fangmann A, Erbe M, Simianer H. Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture. Genet Sel Evol 2017; 49:8. [PMID: 28093063 PMCID: PMC5238523 DOI: 10.1186/s12711-016-0277-y] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 12/05/2016] [Indexed: 11/10/2022] Open
Abstract
Background With the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is now feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations. Our objective was to compare prediction ability with high-density (HD) array data and WGS data in a commercial brown layer line with genomic best linear unbiased prediction (GBLUP) models using various approaches to weight single nucleotide polymorphisms (SNPs). Methods A total of 892 chickens from a commercial brown layer line were genotyped with 336 K segregating SNPs (array data) that included 157 K genic SNPs (i.e. SNPs in or around a gene). For these individuals, genome-wide sequence information was imputed based on data from re-sequencing runs of 25 individuals, leading to 5.2 million (M) imputed SNPs (WGS data), including 2.6 M genic SNPs. De-regressed proofs (DRP) for eggshell strength, feed intake and laying rate were used as quasi-phenotypic data in genomic prediction analyses. Four weighting factors for building a trait-specific genomic relationship matrix were investigated: identical weights, −(log10P) from genome-wide association study results, squares of SNP effects from random regression BLUP, and variable selection based weights (known as BLUP|GA). Predictive ability was measured as the correlation between DRP and direct genomic breeding values in five replications of a fivefold cross-validation. Results Averaged over the three traits, the highest predictive ability (0.366 ± 0.075) was obtained when only genic SNPs from WGS data were used. Predictive abilities with genic SNPs and all SNPs from HD array data were 0.361 ± 0.072 and 0.353 ± 0.074, respectively. Prediction with −(log10P) or squares of SNP effects as weighting factors for building a genomic relationship matrix or BLUP|GA did not increase accuracy, compared to that with identical weights, regardless of the SNP set used. Conclusions Our results show that little or no benefit was gained when using all imputed WGS data to perform genomic prediction compared to using HD array data regardless of the weighting factors tested. However, using only genic SNPs from WGS data had a positive effect on prediction ability. Electronic supplementary material The online version of this article (doi:10.1186/s12711-016-0277-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Guiyan Ni
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany.
| | | | - Anna Fangmann
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany
| | - Malena Erbe
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany.,Institute for Animal Breeding, Bavarian State Research Centre for Agriculture, Grub, Germany
| | - Henner Simianer
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany
| |
Collapse
|
24
|
Garbe JR, Prakapenka D, Tan C, Da Y. Genomic Inbreeding and Relatedness in Wild Panda Populations. PLoS One 2016; 11:e0160496. [PMID: 27494031 PMCID: PMC4975500 DOI: 10.1371/journal.pone.0160496] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2016] [Accepted: 07/20/2016] [Indexed: 11/18/2022] Open
Abstract
Inbreeding and relatedness in wild panda populations are important parameters for panda conservation. Habitat loss and fragmentation are expected to increase inbreeding but the actual inbreeding levels in natural panda habitats were unknown. Using 150,025 SNPs and 14,926 SNPs selected from published whole-genome sequences, we estimated genomic inbreeding coefficients and relatedness of 49 pandas including 34 wild pandas sampled from six habitats. Qinling and Liangshan pandas had the highest levels of inbreeding and relatedness measured by genomic inbreeding and coancestry coefficients, whereas the inbreeding levels in Qionglai and Minshan were 28–45% of those in Qinling and Liangshan. Genomic coancestry coefficients between pandas from different habitats showed that panda populations from the four largest habitats, Minshan, Qionglai, Qinling and Liangshan, were genetically unrelated. Pandas between these four habitats on average shared 66.0–69.1% common alleles and 45.6–48.6% common genotypes, whereas pandas within each habitat shared 71.8–77.0% common alleles and 51.7–60.4% common genotypes. Pandas in the smaller populations of Qinling and Liangshan were more similarly to each other than pandas in the larger populations of Qionglai and Minshan according to three genomic similarity measures. Panda genetic differentiation between these habitats was positively related to their geographical distances. Most pandas separated by 200 kilometers or more shared no common ancestral alleles. The results provided a genomic quantification of the actual levels of inbreeding and relatedness among pandas in their natural habitats, provided genomic confirmation of the relationship between genetic diversity and geographical distances, and provided genomic evidence to the urgency of habitat protection.
Collapse
Affiliation(s)
- John R. Garbe
- Minnesota Supercomputer Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dzianis Prakapenka
- Department of Animal Science, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Cheng Tan
- Department of Animal Science, University of Minnesota, Saint Paul, Minnesota, United States of America
- State Key Laboratory for Agrobiotechnology, China Agricultural University, Beijing, China
| | - Yang Da
- Department of Animal Science, University of Minnesota, Saint Paul, Minnesota, United States of America
- * E-mail:
| |
Collapse
|
25
|
Eynard SE, Windig JJ, Hiemstra SJ, Calus MPL. Whole-genome sequence data uncover loss of genetic diversity due to selection. Genet Sel Evol 2016; 48:33. [PMID: 27080121 PMCID: PMC4831198 DOI: 10.1186/s12711-016-0210-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 03/23/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Whole-genome sequence (WGS) data give access to more complete structural genetic information of individuals, including rare variants, not fully covered by single nucleotide polymorphism chips. We used WGS to investigate the amount of genetic diversity remaining after selection using optimal contribution (OC), considering different methods to estimate the relationships used in OC. OC was applied to minimise average relatedness of the selection candidates and thus miminise the loss of genetic diversity in a conservation strategy, e.g. for establishment of gene bank collections. Furthermore, OC was used to maximise average genetic merit of the selection candidates at a given level of relatedness, similar to a genetic improvement strategy. In this study, we used data from 277 bulls from the 1000 bull genomes project. We measured genetic diversity as the number of variants still segregating after selection using WGS data, and compared strategies that targeted conservation of rare (minor allele frequency <5 %) versus common variants. RESULTS When OC without restriction on the number of selected individuals was applied, loss of variants was minimal and most individuals were selected, which is often unfeasible in practice. When 20 individuals were selected, the number of segregating rare variants was reduced by 29 % for the conservation strategy, and by 34 % for the genetic improvement strategy. The overall number of segregating variants was reduced by 30 % when OC was restricted to selecting five individuals, for both conservation and genetic improvement strategies. For common variants, this loss was about 15 %, while it was much higher, 72 %, for rare variants. Fewer rare variants were conserved with the genetic improvement strategy compared to the conservation strategy. CONCLUSIONS The use of WGS for genetic diversity quantification revealed that selection results in considerable losses of genetic diversity for rare variants. Using WGS instead of SNP chip data to estimate relationships slightly reduced the loss of rare variants, while using 50 K SNP chip data was sufficient to conserve common variants. The loss of rare variants could be mitigated by a few percent (up to 8 %) depending on which method is chosen to estimate relationships from WGS data.
Collapse
Affiliation(s)
- Sonia E Eynard
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands. .,GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France. .,Centre for Genetic Resources, the Netherlands, Wageningen UR, P.O. Box 338, 3700 AH, Wageningen, The Netherlands.
| | - Jack J Windig
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.,Centre for Genetic Resources, the Netherlands, Wageningen UR, P.O. Box 338, 3700 AH, Wageningen, The Netherlands
| | - Sipke J Hiemstra
- Centre for Genetic Resources, the Netherlands, Wageningen UR, P.O. Box 338, 3700 AH, Wageningen, The Netherlands
| | - Mario P L Calus
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
26
|
Rupp R, Mucha S, Larroque H, McEwan J, Conington J. Genomic application in sheep and goat breeding. Anim Front 2016. [DOI: 10.2527/af.2016-0006] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
|
27
|
Uemoto Y, Sasaki S, Kojima T, Sugimoto Y, Watanabe T. Impact of QTL minor allele frequency on genomic evaluation using real genotype data and simulated phenotypes in Japanese Black cattle. BMC Genet 2015; 16:134. [PMID: 26586567 PMCID: PMC4653875 DOI: 10.1186/s12863-015-0287-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Accepted: 10/27/2015] [Indexed: 11/14/2022] Open
Abstract
Background Genetic variance that is not captured by single nucleotide polymorphisms (SNPs) is due to imperfect linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTLs), and the extent of LD between SNPs and QTLs depends on different minor allele frequencies (MAF) between them. To evaluate the impact of MAF of QTLs on genomic evaluation, we performed a simulation study using real cattle genotype data. Methods In total, 1368 Japanese Black cattle and 592,034 SNPs (Illumina BovineHD BeadChip) were used. We simulated phenotypes using real genotypes under different scenarios, varying the MAF categories, QTL heritability, number of QTLs, and distribution of QTL effect. After generating true breeding values and phenotypes, QTL heritability was estimated and the prediction accuracy of genomic estimated breeding value (GEBV) was assessed under different SNP densities, prediction models, and population size by a reference-test validation design. Results The extent of LD between SNPs and QTLs in this population was higher in the QTLs with high MAF than in those with low MAF. The effect of MAF of QTLs depended on the genetic architecture, evaluation strategy, and population size in genomic evaluation. In genetic architecture, genomic evaluation was affected by the MAF of QTLs combined with the QTL heritability and the distribution of QTL effect. The number of QTL was not affected on genomic evaluation if the number of QTL was more than 50. In the evaluation strategy, we showed that different SNP densities and prediction models affect the heritability estimation and genomic prediction and that this depends on the MAF of QTLs. In addition, accurate QTL heritability and GEBV were obtained using denser SNP information and the prediction model accounted for the SNPs with low and high MAFs. In population size, a large sample size is needed to increase the accuracy of GEBV. Conclusion The MAF of QTL had an impact on heritability estimation and prediction accuracy. Most genetic variance can be captured using denser SNPs and the prediction model accounted for MAF, but a large sample size is needed to increase the accuracy of GEBV under all QTL MAF categories. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0287-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yoshinobu Uemoto
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| | - Shinji Sasaki
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| | - Takatoshi Kojima
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| | - Yoshikazu Sugimoto
- Shirakawa Institute of Animal Genetics, Japan Livestock Technology Association, Nishigo, Fukushima, 961-8511, Japan.
| | - Toshio Watanabe
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| |
Collapse
|