1
|
Sahana G, Cai Z, Sanchez MP, Bouwman AC, Boichard D. Invited review: Good practices in genome-wide association studies to identify candidate sequence variants in dairy cattle. J Dairy Sci 2023:S0022-0302(23)00357-0. [PMID: 37349208 DOI: 10.3168/jds.2022-22694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 02/01/2023] [Indexed: 06/24/2023]
Abstract
Genotype data from dairy cattle selection programs have greatly facilitated GWAS to identify variants related to economic traits. Results can enhance the accuracy of genomic prediction, analyze more complex models that go beyond additive effects, elucidate the genetic architecture of a trait, and finally, decipher the underlying biology of traits. The entire process, comprising data generation, quality control, statistical analyses, interpretation of association results, and linking results to biology should be designed and executed to minimize the generation of false-positive and false-negative associations and misleading links to biological processes. This review aims to provide general guidelines for data analysis that address data quality control, association tests, adjustment for population stratification, and significance evaluation to improve the reliability of conclusions. We also provide guidance on post-GWAS strategy and the interpretation of results. These guidelines are tailored to dairy cattle, which are characterized by long-range linkage disequilibrium, large half-sib families, and routinely collected phenotypes, requiring different approaches than those applied in human GWAS. We discuss common limitations and challenges that have been overlooked in the analysis and interpretation of GWAS to identify candidate sequence variants in dairy cattle.
Collapse
Affiliation(s)
- G Sahana
- Aarhus University, Center for Quantitative Genetic and Genomics, 8830 Tjele, Denmark.
| | - Z Cai
- Aarhus University, Center for Quantitative Genetic and Genomics, 8830 Tjele, Denmark
| | - M P Sanchez
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350 Jouy-en-Josas, France
| | - A C Bouwman
- Wageningen University & Research, Animal Breeding and Genomics, 6700 AH Wageningen, the Netherlands
| | - D Boichard
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350 Jouy-en-Josas, France
| |
Collapse
|
2
|
Heidaritabar M, Bink MCAM, Dervishi E, Charagu P, Huisman A, Plastow GS. Genome-wide association studies for additive and dominance effects for body composition traits in commercial crossbred Piétrain pigs. J Anim Breed Genet 2023. [PMID: 36883263 DOI: 10.1111/jbg.12768] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 02/18/2023] [Indexed: 03/09/2023]
Abstract
Fat depth (FD) and muscle depth (MD) are economically important traits and used to estimate carcass lean content (LMP), which is one of the main breeding objectives in pig breeding programmes. We assessed the genetic architectures of body composition traits for additive and dominance effects in commercial crossbred Piétrain pigs using both 50 K array and sequence genotypes. We first performed a genome-wide association study (GWAS) using single-marker association analysis with a false discovery rate of 0.1. Then, we estimated the additive and dominance effects of the most significant variant in the quantitative trait loci (QTL) regions. It was investigated whether the use of whole-genome sequence (WGS) will improve the QTL detection (both additive and dominance) with a higher power compared with lower density SNP arrays. Our results showed that more QTL regions were detected by WGS compared with 50 K array (n = 54 vs. n = 17). Of the novel associated regions associated with FD and LMP and detected by WGS, the most pronounced peak was on SSC13, situated at ~116-118, 121-127 and 129-134 Mbp. Additionally, we found that only additive effects contributed to the genetic architecture of the analysed traits and no significant dominance effects were found for the tested SNPs at QTL regions, regardless of panel density. The associated SNPs are located in or near several relevant candidate genes. Of these genes, GABRR2, GALR1, RNGTT, CDH20 and MC4R have been previously reported as being associated with fat deposition traits. However, the genes on SSC1 (ZNF292, ORC3, CNR1, SRSF12, MDN1, TSHZ1, RELCH and RNF152) and SSC18 (TTC26 and KIAA1549) have not been reported previously to our best knowledge. Our current findings provide insights into the genomic regions influencing composition traits in Piétrain pigs.
Collapse
Affiliation(s)
- Marzieh Heidaritabar
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, Alberta, Canada
| | - Marco C A M Bink
- Hendrix Genetics Research, Technology & Services B.V., Boxmeer, the Netherlands
| | - Elda Dervishi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, Alberta, Canada
| | - Patrick Charagu
- Hendrix Genetics, Swine Business Unit, Regina, Saskatchewan, Canada
| | - Abe Huisman
- Hendrix Genetics Research, Technology & Services B.V., Boxmeer, the Netherlands
| | - Graham S Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
3
|
Häfliger IM, Spengeler M, Seefried FR, Drögemüller C. Four novel candidate causal variants for deficient homozygous haplotypes in Holstein cattle. Sci Rep 2022; 12:5435. [PMID: 35361830 PMCID: PMC8971413 DOI: 10.1038/s41598-022-09403-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 03/07/2022] [Indexed: 12/23/2022] Open
Abstract
Mendelian variants can determine both insemination success and neonatal survival and thus influence fertility and rearing success of cattle. We present 24 deficient homozygous haplotype regions in the Holstein population of Switzerland and provide an overview of the previously identified haplotypes in the global Holstein breed. This study encompasses massive genotyping, whole-genome sequencing (WGS) and phenotype association analyses. We performed haplotype screenings on almost 53 thousand genotyped animals including 114 k SNP data with two different approaches. We revealed significant haplotype associations to several survival, birth and fertility traits. Within haplotype regions, we mined WGS data of hundreds of bovine genomes for candidate causal variants, which were subsequently evaluated by using a custom genotyping array in several thousand breeding animals. With this approach, we confirmed the known deleterious SMC2:p.Phe1135Ser missense variant associated with Holstein haplotype (HH) 3. For two previously reported deficient homozygous haplotypes that show negative associations to female fertility traits, we propose candidate causative loss-of-function variants: the HH13-related KIR2DS1:p.Gln159* nonsense variant and the HH21-related NOTCH3:p.Cys44del deletion. In addition, we propose the RIOX1:p.Ala133_Glu142del deletion as well as the PCDH15:p.Leu867Val missense variant to explain the unexpected low number of homozygous haplotype carriers for HH25 and HH35, respectively. In conclusion, we demonstrate that with mining massive SNP data in combination with WGS data, we can map several haplotype regions and unravel novel recessive protein-changing variants segregating at frequencies of 1 to 5%. Our findings both confirm previously identified loci and expand the spectrum of undesired alleles impairing reproduction success in Holstein cattle, the world's most important dairy breed.
Collapse
Affiliation(s)
- Irene M Häfliger
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bremgartenstrasse 109a, 3001, Bern, Switzerland.
| | | | | | - Cord Drögemüller
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bremgartenstrasse 109a, 3001, Bern, Switzerland
| |
Collapse
|
4
|
Including dominance effects in the prediction model through locus-specific weights on heterozygous genotypes can greatly improve genomic predictive abilities. Heredity (Edinb) 2022; 128:154-158. [PMID: 35132207 PMCID: PMC8897419 DOI: 10.1038/s41437-022-00504-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 01/18/2022] [Accepted: 01/19/2022] [Indexed: 11/29/2022] Open
Abstract
The dominance effect is considered to be a key factor affecting complex traits. However, previous studies have shown that the improvement of the model, including the dominance effect, is usually less than 1%. This study proposes a novel genomic prediction method called CADM, which combines additive and dominance genetic effects through locus-specific weights on heterozygous genotypes. To the best of our knowledge, this is the first study of weighting dominance effects for genomic prediction. This method was applied to the analysis of chicken (511 birds) and pig (3534 animals) datasets. A 5-fold cross-validation method was used to evaluate the genomic predictive ability. The CADM model was compared with typical models considering additive and dominance genetic effects (ADM) and the model considering only additive genetic effects (AM). Based on the chicken data, using the CADM model, the genomic predictive abilities were improved for all three traits (body weight at 12th week, eviscerating percentage, and breast muscle percentage), and the average improvement in prediction accuracy was 27.1% compared with the AM model, while the ADM model was not better than the AM model. Based on the pig data, the CADM model increased the genomic predictive ability for all the three pig traits (trait names are masked, here designated as T1, T2, and T3), with an average increase of 26.3%, and the ADM model did not improve, or even slightly decreased, compared with the AM model. The results indicate that dominant genetic variation is one of the important sources of phenotypic variation, and the novel prediction model significantly improves the accuracy of genomic prediction.
Collapse
|
5
|
Akbarzadeh M, Dehkordi SR, Roudbar MA, Sargolzaei M, Guity K, Sedaghati-Khayat B, Riahi P, Azizi F, Daneshpour MS. GWAS findings improved genomic prediction accuracy of lipid profile traits: Tehran Cardiometabolic Genetic Study. Sci Rep 2021; 11:5780. [PMID: 33707626 PMCID: PMC7952573 DOI: 10.1038/s41598-021-85203-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 02/26/2021] [Indexed: 12/15/2022] Open
Abstract
In recent decades, ongoing GWAS findings discovered novel therapeutic modifications such as whole-genome risk prediction in particular. Here, we proposed a method based on integrating the traditional genomic best linear unbiased prediction (gBLUP) approach with GWAS information to boost genetic prediction accuracy and gene-based heritability estimation. This study was conducted in the framework of the Tehran Cardio-metabolic Genetic study (TCGS) containing 14,827 individuals and 649,932 SNP markers. Five SNP subsets were selected based on GWAS results: top 1%, 5%, 10%, 50% significant SNPs, and reported associated SNPs in previous studies. Furthermore, we randomly selected subsets as large as every five subsets. Prediction accuracy has been investigated on lipid profile traits with a tenfold and 10-repeat cross-validation algorithm by the gBLUP method. Our results revealed that genetic prediction based on selected subsets of SNPs obtained from the dataset outperformed the subsets from previously reported SNPs. Selected SNPs' subsets acquired a more precise prediction than whole SNPs and much higher than randomly selected SNPs. Also, common SNPs with the most captured prediction accuracy in the selected sets caught the highest gene-based heritability. However, it is better to be mindful of the fact that a small number of SNPs obtained from GWAS results could capture a highly notable proportion of variance and prediction accuracy.
Collapse
Affiliation(s)
- Mahdi Akbarzadeh
- Cellular and Molecular Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, POBox: 19195-4763, Tehran, Iran
| | - Saeid Rasekhi Dehkordi
- Cellular and Molecular Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, POBox: 19195-4763, Tehran, Iran
| | - Mahmoud Amiri Roudbar
- Department of Animal Science, Safiabad-Dezful Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education & Extension Organization (AREEO), Dezful, Iran
| | - Mehdi Sargolzaei
- Department of Pathobiology, Ontario Veterinary College, University of Guelph, Guelph, Canada
- Select Sires Inc., Plain City, USA
| | - Kamran Guity
- Cellular and Molecular Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, POBox: 19195-4763, Tehran, Iran
| | - Bahareh Sedaghati-Khayat
- Cellular and Molecular Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, POBox: 19195-4763, Tehran, Iran
| | - Parisa Riahi
- Cellular and Molecular Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, POBox: 19195-4763, Tehran, Iran
| | - Fereidoun Azizi
- Endocrine Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Maryam S Daneshpour
- Cellular and Molecular Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, POBox: 19195-4763, Tehran, Iran.
| |
Collapse
|
6
|
Doekes HP, Bijma P, Veerkamp RF, de Jong G, Wientjes YCJ, Windig JJ. Inbreeding depression across the genome of Dutch Holstein Friesian dairy cattle. Genet Sel Evol 2020; 52:64. [PMID: 33115403 PMCID: PMC7594306 DOI: 10.1186/s12711-020-00583-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 10/09/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Inbreeding depression refers to the decrease in mean performance due to inbreeding. Inbreeding depression is caused by an increase in homozygosity and reduced expression of (on average) favourable dominance effects. Dominance effects and allele frequencies differ across loci, and consequently inbreeding depression is expected to differ along the genome. In this study, we investigated differences in inbreeding depression across the genome of Dutch Holstein Friesian cattle, by estimating dominance effects and effects of regions of homozygosity (ROH). METHODS Genotype (75 k) and phenotype data of 38,792 cows were used. For nine yield, fertility and udder health traits, GREML models were run to estimate genome-wide inbreeding depression and estimate additive, dominance and ROH variance components. For this purpose, we introduced a ROH-based relationship matrix. Additive, dominance and ROH effects per SNP were obtained through back-solving. In addition, a single SNP GWAS was performed to identify significant additive, dominance or ROH associations. RESULTS Genome-wide inbreeding depression was observed for all yield, fertility and udder health traits. For example, a 1% increase in genome-wide homozygosity was associated with a decrease in 305-d milk yield of approximately 99 kg. For yield traits only, including dominance and ROH effects in the GREML model resulted in a better fit (P < 0.05) than a model with only additive effects. After correcting for the effect of genome-wide homozygosity, dominance and ROH variance explained less than 1% of the phenotypic variance for all traits. Furthermore, dominance and ROH effects were distributed evenly along the genome. The most notable region with a favourable dominance effect for yield traits was on chromosome 5, but overall few regions with large favourable dominance effects and significant dominance associations were detected. No significant ROH-associations were found. CONCLUSIONS Inbreeding depression was distributed quite equally along the genome and was well captured by genome-wide homozygosity. These findings suggest that, based on 75 k SNP data, there is little benefit of accounting for region-specific inbreeding depression in selection schemes.
Collapse
Affiliation(s)
- Harmen P Doekes
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands. .,Centre for Genetic Resources the Netherlands, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Piter Bijma
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Roel F Veerkamp
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Gerben de Jong
- Cooperation CRV, Wassenaarweg 20, 6843 NW, Arnhem, The Netherlands
| | - Yvonne C J Wientjes
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Jack J Windig
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.,Centre for Genetic Resources the Netherlands, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
7
|
Gutiérrez-Reinoso MA, Aponte PM, Cabezas J, Rodriguez-Alvarez L, Garcia-Herreros M. Genomic Evaluation of Primiparous High-Producing Dairy Cows: Inbreeding Effects on Genotypic and Phenotypic Production-Reproductive Traits. Animals (Basel) 2020; 10:ani10091704. [PMID: 32967074 PMCID: PMC7552765 DOI: 10.3390/ani10091704] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 09/15/2020] [Accepted: 09/16/2020] [Indexed: 12/13/2022] Open
Abstract
Simple Summary Improving the genomic prediction methodologies in high-producing dairy cattle is a key factor for the selection of suitable individuals to ensure better productivity. However, the most advanced prediction tools based on genotyping show ~75% reliability. Nowadays, the incorporation of new indices to genomic prediction methods, such as the Inbreeding Index (II), can significantly facilitate the selection of reliable production and reproductive traits for progeny selection. Thus, the objective of this study was to determine the impact of II (low: LI and high: HI), based on genomic analysis, and its effect on production and reproductive phenotypic traits in high-producing primiparous dairy cows. Individuals with II between ≥2.5 and ≤5.0 have shown up to a two-fold increase in negative correlations comparing LI versus HI genomic production and reproductive parameters, severely affecting important traits such as Milk Production at 305 d, Protein Production at 305 d, Fertility Index, and Daughter Pregnancy Rate. Therefore, high-producing dairy cows face an increased risk of negative II-derived effects in their selection programs, particularly at II ≥ 2.5. Abstract The main objective of this study was to analyze the effects of the inbreeding degree in high-producing primiparous dairy cows genotypically and phenotypically evaluated and its impacts on production and reproductive parameters. Eighty Holstein–Friesian primiparous cows (age: ~26 months; ~450 kg body weight) were previously genomically analyzed to determine the Inbreeding Index (II) and were divided into two groups: low inbreeding group (LI: <2.5; n = 40) and high inbreeding group (HI: ≥2.5 and ≤5.0; n = 40). Genomic determinations of production and reproductive parameters (14 in total), together with analyses of production (12) and reproductive (11) phenotypic parameters (23 in total) were carried out. Statistically significant differences were obtained between groups concerning the genomic parameters of Milk Production at 305 d and Protein Production at 305 d and the reproductive parameter Daughter Calving Ease, the first two being higher in cows of the HI group and the third lower in the LI group (p < 0.05). For the production phenotypic parameters, statistically significant differences were observed between both groups in the Total Fat, Total Protein, and Urea parameters, the first two being higher in the LI group (p < 0.05). Also, significant differences were observed in several reproductive phenotypic parameters, such as Number of Services per Conception, Calving to Conception Interval, Days Open Post Service, and Current Inter-Partum Period, all of which negatively influenced the HI group (p < 0.05). In addition, correlation analyses were performed between production and reproductive genomic parameters separately and in each consanguinity group. The results showed multiple positive and negative correlations between the production and reproductive parameters independently of the group analyzed, being these correlations more remarkable for the reproductive parameters in the LI group and the production parameters in the HI group (p < 0.05). In conclusion, the degree of inbreeding significantly influenced the results, affecting different genomic and phenotypic production and reproductive parameters in high-producing primiparous cows. The determination of the II in first-calf heifers is crucial to evaluate the negative effects associated with homozygosity avoiding an increase in inbreeding depression on production and reproductive traits.
Collapse
Affiliation(s)
- Miguel A. Gutiérrez-Reinoso
- Departamento de Ciencia Animal, Laboratorio de Biotecnología Animal, Facultad de Ciencias Veterinarias, Universidad de Concepción (UdeC), Chillán 3780000, Chile; (M.A.G.-R.); (J.C.)
- Facultad de Ciencias Agropecuarias y Recursos Naturales, Carrera de Medicina Veterinaria, Universidad Técnica de Cotopaxi (UTC), Latacunga 050150, Ecuador
| | - Pedro Manuel Aponte
- Colegio de Ciencias Biológicas y Ambientales (COCIBA), Universidad San Francisco de Quito (USFQ), Quito 170157, Ecuador;
- Instituto de Investigaciones en Biomedicina “One-health”, Universidad San Francisco de Quito (USFQ), Campus Cumbayá, Quito 170157, Ecuador
| | - Joel Cabezas
- Departamento de Ciencia Animal, Laboratorio de Biotecnología Animal, Facultad de Ciencias Veterinarias, Universidad de Concepción (UdeC), Chillán 3780000, Chile; (M.A.G.-R.); (J.C.)
| | - Lleretny Rodriguez-Alvarez
- Departamento de Ciencia Animal, Laboratorio de Biotecnología Animal, Facultad de Ciencias Veterinarias, Universidad de Concepción (UdeC), Chillán 3780000, Chile; (M.A.G.-R.); (J.C.)
- Correspondence: (L.R.-A.); (M.G.-H.); Tel.: +56-42-220-8835 (L.R.-A.); Fax: +351-24-3767 (ext. 330) (M.G.-H.)
| | - Manuel Garcia-Herreros
- Instituto Nacional de Investigação Agrária e Veterinária (INIAV), 2005-048 Santarém, Portugal
- Correspondence: (L.R.-A.); (M.G.-H.); Tel.: +56-42-220-8835 (L.R.-A.); Fax: +351-24-3767 (ext. 330) (M.G.-H.)
| |
Collapse
|