1
|
Wei C, Chang C, Zhang W, Ren D, Cai X, Zhou T, Shi S, Wu X, Si J, Yuan X, Li J, Zhang Z. Preselecting Variants from Large-Scale Genome-Wide Association Study Meta-Analyses Increases the Genomic Prediction Accuracy of Growth and Carcass Traits in Large White Pigs. Animals (Basel) 2023; 13:3746. [PMID: 38136785 PMCID: PMC10740834 DOI: 10.3390/ani13243746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 11/30/2023] [Accepted: 11/30/2023] [Indexed: 12/24/2023] Open
Abstract
Preselected variants associated with the trait of interest from genome-wide association studies (GWASs) are available to improve genomic prediction in pigs. The objectives of this study were to use preselected variants from a large GWAS meta-analysis to assess the impact of single-nucleotide polymorphism (SNP) preselection strategies on genome prediction of growth and carcass traits in pigs. We genotyped 1018 Large White pigs using medium (50k) SNP arrays and then imputed SNPs to sequence level by utilizing a reference panel of 1602 whole-genome sequencing samples. We tested the effects of different proportions of selected top SNPs across different SNP preselection strategies on genomic prediction. Finally, we compared the prediction accuracies by employing genomic best linear unbiased prediction (GBLUP), genomic feature BLUP and three weighted GBLUP models. SNP preselection strategies showed an average improvement in accuracy ranging from 0.3 to 2% in comparison to the SNP chip data. The accuracy of genomic prediction exhibited a pattern of initial increase followed by decrease, or continuous decrease across various SNP preselection strategies, as the proportion of selected top SNPs increased. The highest level of prediction accuracy was observed when utilizing 1 or 5% of top SNPs. Compared with the GBLUP model, the utilization of estimated marker effects from a GWAS meta-analysis as SNP weights in the BLUP|GA model improved the accuracy of genomic prediction in different SNP preselection strategies. The new SNP preselection strategies gained from this study bring opportunities for genomic prediction in limited-size populations in pigs.
Collapse
Affiliation(s)
- Chen Wei
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Chengjie Chang
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Wenjing Zhang
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Duanyang Ren
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Xiaodian Cai
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Tianru Zhou
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Shaolei Shi
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Xibo Wu
- Guangxi State Farms Yongxin Animal Husbandry Group Co., Ltd., Nanning 530022, China; (X.W.); (J.S.)
| | - Jinglei Si
- Guangxi State Farms Yongxin Animal Husbandry Group Co., Ltd., Nanning 530022, China; (X.W.); (J.S.)
| | - Xiaolong Yuan
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Jiaqi Li
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| | - Zhe Zhang
- National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
| |
Collapse
|
2
|
Taranto F, Esposito S, De Vita P. Genomics for Yield and Yield Components in Durum Wheat. PLANTS (BASEL, SWITZERLAND) 2023; 12:2571. [PMID: 37447132 DOI: 10.3390/plants12132571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 06/29/2023] [Accepted: 07/04/2023] [Indexed: 07/15/2023]
Abstract
In recent years, many efforts have been conducted to dissect the genetic basis of yield and yield components in durum wheat thanks to linkage mapping and genome-wide association studies. In this review, starting from the analysis of the genetic bases that regulate the expression of yield for developing new durum wheat varieties, we have highlighted how, currently, the reductionist approach, i.e., dissecting the yield into its individual components, does not seem capable of ensuring significant yield increases due to diminishing resources, land loss, and ongoing climate change. However, despite the identification of genes and/or chromosomal regions, controlling the grain yield in durum wheat is still a challenge, mainly due to the polyploidy level of this species. In the review, we underline that the next-generation sequencing (NGS) technologies coupled with improved wheat genome assembly and high-throughput genotyping platforms, as well as genome editing technology, will revolutionize plant breeding by providing a great opportunity to capture genetic variation that can be used in breeding programs. To date, genomic selection provides a valuable tool for modeling optimal allelic combinations across the whole genome that maximize the phenotypic potential of an individual under a given environment.
Collapse
Affiliation(s)
- Francesca Taranto
- Institute of Biosciences and Bioresources (CNR-IBBR), 70126 Bari, Italy
| | - Salvatore Esposito
- Research Centre for Cereal and Industrial Crops (CREA-CI), CREA-Council for Agricultural Research and Economics, 71122 Foggia, Italy
| | - Pasquale De Vita
- Research Centre for Cereal and Industrial Crops (CREA-CI), CREA-Council for Agricultural Research and Economics, 71122 Foggia, Italy
| |
Collapse
|
3
|
Zhuang Z, Wu J, Qiu Y, Ruan D, Ding R, Xu C, Zhou S, Zhang Y, Liu Y, Ma F, Yang J, Sun Y, Zheng E, Yang M, Cai G, Yang J, Wu Z. Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs. J Anim Sci Biotechnol 2023; 14:67. [PMID: 37161604 PMCID: PMC10170792 DOI: 10.1186/s40104-023-00863-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Pork quality can directly affect customer purchase tendency and meat quality traits have become valuable in modern pork production. However, genetic improvement has been slow due to high phenotyping costs. In this study, whole genome sequence (WGS) data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction (GBLUP) for meat quality in large-scale crossbred commercial pigs. RESULTS We produced WGS data (18,695,907 SNPs and 2,106,902 INDELs exceed quality control) from 1,469 sequenced Duroc × (Landrace × Yorkshire) pigs and developed a reference panel for meat quality including meat color score, marbling score, L* (lightness), a* (redness), and b* (yellowness) of genomic prediction. The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population. Using different marker density panels derived from WGS data, accuracy differed substantially among meat quality traits, varied from 0.08 to 0.47. Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39% to 75%. We optimized the marker density and found medium- and high-density marker panels are beneficial for the estimation of heritability for meat quality. Moreover, we conducted genotype imputation from 50K chip to WGS level in the same population and found average concordance rate to exceed 95% and r2 = 0.81. CONCLUSIONS Overall, estimation of heritability for meat quality traits can benefit from the use of WGS data. This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction.
Collapse
Affiliation(s)
- Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Fucai Ma
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jifei Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ying Sun
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ming Yang
- College of Animal Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
- Yunfu Subcenter of Guangdong Laboratory for Lingnan Modern Agriculture, Yunfu, 527400, China.
| |
Collapse
|
4
|
Tahir MS, Porto-Neto LR, Reverter-Gomez T, Olasege BS, Sajid MR, Wockner KB, Tan AWL, Fortes MRS. Utility of multi-omics data to inform genomic prediction of heifer fertility traits. J Anim Sci 2022; 100:skac340. [PMID: 36239447 PMCID: PMC9733504 DOI: 10.1093/jas/skac340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 10/12/2022] [Indexed: 12/15/2022] Open
Abstract
Biologically informed single nucleotide polymorphisms (SNPs) impact genomic prediction accuracy of the target traits. Our previous genomics, proteomics, and transcriptomics work identified candidate genes related to puberty and fertility in Brahman heifers. We aimed to test this biological information for capturing heritability and predicting heifer fertility traits in another breed i.e., Tropical Composite. The SNP from the identified genes including 10 kilobases (kb) region on either side were selected as biologically informed SNP set. The SNP from the rest of the Bos taurus genes including 10-kb region on either side were selected as biologically uninformed SNP set. Bovine high-density (HD) complete SNP set (628,323 SNP) was used as a control. Two populations-Tropical Composites (N = 1331) and Brahman (N = 2310)-had records for three traits: pregnancy after first mating season (PREG1, binary), first conception score (FCS, score 1 to 3), and rebreeding score (REB, score 1 to 3.5). Using the best linear unbiased prediction method, effectiveness of each SNP set to predict the traits was tested in two scenarios: a 5-fold cross-validation within Tropical Composites using biological information from Brahman studies, and application of prediction equations from one breed to the other. The accuracy of prediction was calculated as the correlation between genomic estimated breeding values and adjusted phenotypes. Results show that biologically informed SNP set estimated heritabilities not significantly better than the control HD complete SNP set in Tropical Composites; however, it captured all the observed genetic variance in PREG1 and FCS when modeled together with the biologically uninformed SNP set. In 5-fold cross-validation within Tropical Composites, the biologically informed SNP set performed marginally better (statistically insignificant) in terms of prediction accuracies (PREG1: 0.20, FCS: 0.13, and REB: 0.12) as compared to HD complete SNP set (PREG1: 0.17, FCS: 0.10, and REB: 0.11), and biologically uninformed SNP set (PREG1: 0.16, FCS: 0.10, and REB: 0.11). Across-breed use of prediction equations still remained a challenge: accuracies by all SNP sets dropped to around zero for all traits. The performance of biologically informed SNP was not significantly better than other sets in Tropical Composites. However, results indicate that biological information obtained from Brahman was successful to predict the fertility traits in Tropical Composite population.
Collapse
Affiliation(s)
- Muhammad S Tahir
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Laercio R Porto-Neto
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Toni Reverter-Gomez
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Babatunde S Olasege
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Mirza R Sajid
- Department of Statistics, University of Gujrat, 50700 Punjab, Pakistan
| | - Kimberley B Wockner
- Queensland Department of Agriculture and Fisheries, Brisbane 4072, QLD, Australia
| | - Andre W L Tan
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Marina R S Fortes
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| |
Collapse
|
5
|
Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. GENETICS SELECTION EVOLUTION 2022; 54:65. [PMID: 36153511 PMCID: PMC9509613 DOI: 10.1186/s12711-022-00756-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022]
Abstract
Background Early simulations indicated that whole-genome sequence data (WGS) could improve the accuracy of genomic predictions within and across breeds. However, empirical results have been ambiguous so far. Large datasets that capture most of the genomic diversity in a population must be assembled so that allele substitution effects are estimated with high accuracy. The objectives of this study were to use a large pig dataset from seven intensely selected lines to assess the benefits of using WGS for genomic prediction compared to using commercial marker arrays and to identify scenarios in which WGS provides the largest advantage. Methods We sequenced 6931 individuals from seven commercial pig lines with different numerical sizes. Genotypes of 32.8 million variants were imputed for 396,100 individuals (17,224 to 104,661 per line). We used BayesR to perform genomic prediction for eight complex traits. Genomic predictions were performed using either data from a standard marker array or variants preselected from WGS based on association tests. Results The accuracies of genomic predictions based on preselected WGS variants were not robust across traits and lines and the improvements in prediction accuracy that we achieved so far with WGS compared to standard marker arrays were generally small. The most favourable results for WGS were obtained when the largest training sets were available and standard marker arrays were augmented with preselected variants with statistically significant associations to the trait. With this method and training sets of around 80k individuals, the accuracy of within-line genomic predictions was on average improved by 0.025. With multi-line training sets, improvements of 0.04 compared to marker arrays could be expected. Conclusions Our results showed that WGS has limited potential to improve the accuracy of genomic predictions compared to marker arrays in intensely selected pig lines. Thus, although we expect that larger improvements in accuracy from the use of WGS are possible with a combination of larger training sets and optimised pipelines for generating and analysing such datasets, the use of WGS in the current implementations of genomic prediction should be carefully evaluated against the cost of large-scale WGS data on a case-by-case basis. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00756-0.
Collapse
|
6
|
Ding R, Zhuang Z, Qiu Y, Wang X, Wu J, Zhou S, Ruan D, Xu C, Hong L, Gu T, Zheng E, Cai G, Huang W, Wu Z, Yang J. A composite strategy of genome-wide association study and copy number variation analysis for carcass traits in a Duroc pig population. BMC Genomics 2022; 23:590. [PMID: 35964005 PMCID: PMC9375371 DOI: 10.1186/s12864-022-08804-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 07/31/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Carcass traits are important in pig breeding programs for improving pork production. Understanding the genetic variants underlies complex phenotypes can help explain trait variation in pigs. In this study, we integrated a weighted single-step genome-wide association study (wssGWAS) and copy number variation (CNV) analyses to map genetic variations and genes associated with loin muscle area (LMA), loin muscle depth (LMD) and lean meat percentage (LMP) in Duroc pigs. RESULTS Firstly, we performed a genome-wide analysis for CNV detection using GeneSeek Porcine SNP50 Bead chip data of 3770 pigs. A total of 11,100 CNVs were detected, which were aggregated by overlapping 695 CNV regions (CNVRs). Next, we investigated CNVs of pigs from the same population by whole-genome resequencing. A genome-wide analysis of 21 pigs revealed 23,856 CNVRs that were further divided into three categories (851 gain, 22,279 loss, and 726 mixed), which covered 190.8 Mb (~ 8.42%) of the pig autosomal genome. Further, the identified CNVRs were used to determine an overall validation rate of 68.5% for the CNV detection accuracy of chip data. CNVR association analyses identified one CNVR associated with LMA, one with LMD and eight with LMP after applying stringent Bonferroni correction. The wssGWAS identified eight, six and five regions explaining more than 1% of the additive genetic variance for LMA, LMD and LMP, respectively. The CNVR analyses and wssGWAS identified five common regions, of which three regions were associated with LMA and two with LMP. Four genes (DOK7, ARAP1, ELMO2 and SLC13A3) were highlighted as promising candidates according to their function. CONCLUSIONS We determined an overall validation rate for the CNV detection accuracy of low-density chip data and constructed a genomic CNV map for Duroc pigs using resequencing, thereby proving a value genetic variation resource for pig genome research. Furthermore, our study utilized a composite genetic strategy for complex traits in pigs, which will contribute to the study for elucidating the genetic architecture that may be influenced and regulated by multiple forms of variations.
Collapse
Affiliation(s)
- Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Guangdong Wens Breeding Swine Technology Co., Ltd, Guangdong, 527439, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Xingwang Wang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Linjun Hong
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Ting Gu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China.,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China
| | - Wen Huang
- Department of Animal Science, Michigan State University, East Lansing, MI, USA
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China. .,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China.
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, 510642, P.R. China. .,Lingnan Guangdong Laboratory of Modern Agriculture, Guangzhou, 510642, China.
| |
Collapse
|
7
|
Gao Y, Li J, Cai G, Wang Y, Yang W, Li Y, Zhao X, Li R, Gao Y, Tuo W, Baldwin RL, Li CJ, Fang L, Liu GE. Single-cell transcriptomic and chromatin accessibility analyses of dairy cattle peripheral blood mononuclear cells and their responses to lipopolysaccharide. BMC Genomics 2022; 23:338. [PMID: 35501711 PMCID: PMC9063233 DOI: 10.1186/s12864-022-08562-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 04/13/2022] [Indexed: 11/24/2022] Open
Abstract
Background Gram-negative bacteria are important pathogens in cattle, causing severe infectious diseases, including mastitis. Lipopolysaccharides (LPS) are components of the outer membrane of Gram-negative bacteria and crucial mediators of chronic inflammation in cattle. LPS modulations of bovine immune responses have been studied before. However, the single-cell transcriptomic and chromatin accessibility analyses of bovine peripheral blood mononuclear cells (PBMCs) and their responses to LPS stimulation were never reported. Results We performed single-cell RNA sequencing (scRNA-seq) and single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq) in bovine PBMCs before and after LPS treatment and demonstrated that seven major cell types, which included CD4 T cells, CD8 T cells, and B cells, monocytes, natural killer cells, innate lymphoid cells, and dendritic cells. Bioinformatic analyses indicated that LPS could increase PBMC cell cycle progression, cellular differentiation, and chromatin accessibility. Gene analyses further showed significant changes in differential expression, transcription factor binding site, gene ontology, and regulatory interactions during the PBMC responses to LPS. Consistent with the findings of previous studies, LPS induced activation of monocytes and dendritic cells, likely through their upregulated TLR4 receptor. NF-κB was observed to be activated by LPS and an increased transcription of an array of pro-inflammatory cytokines, in agreement that NF-κB is an LPS-responsive regulator of innate immune responses. In addition, by integrating LPS-induced differentially expressed genes (DEGs) with large-scale GWAS of 45 complex traits in Holstein, we detected trait-relevant cell types. We found that selected DEGs were significantly associated with immune-relevant health, milk production, and body conformation traits. Conclusion This study provided the first scRNAseq and scATAC-seq data for cattle PBMCs and their responses to the LPS stimulation to the best of our knowledge. These results should also serve as valuable resources for the future study of the bovine immune system and open the door for discoveries about immune cell roles in complex traits like mastitis at single-cell resolution. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08562-0.
Collapse
Affiliation(s)
- Yahui Gao
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China.,Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD, 20705, USA
| | - Jianbin Li
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China.
| | - Gaozhan Cai
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China.,Shandong Ox Livestock Breeding Co., Ltd, Jinan, 250100, China
| | - Yujiao Wang
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China
| | - Wenjing Yang
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Yanqin Li
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China
| | - Xiuxin Zhao
- Shandong Ox Livestock Breeding Co., Ltd, Jinan, 250100, China
| | - Rongling Li
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China
| | - Yundong Gao
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, No.202, Gongyebei Road, Jinan, 250100, China
| | - Wenbin Tuo
- Animal Parasitic Diseases Laboratory, BARC, USDA-ARS, Beltsville, MD, 20705, USA
| | - Ransom L Baldwin
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD, 20705, USA
| | - Cong-Jun Li
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD, 20705, USA.
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK.
| | - George E Liu
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD, 20705, USA.
| |
Collapse
|
8
|
Shen F, Bianco L, Wu B, Tian Z, Wang Y, Wu T, Xu X, Han Z, Velasco R, Fontana P, Zhang X. A bulked segregant analysis tool for out-crossing species (BSATOS) and QTL-based genomics-assisted prediction of complex traits in apple. J Adv Res 2022; 42:149-162. [PMID: 36513410 PMCID: PMC9788957 DOI: 10.1016/j.jare.2022.03.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 03/06/2022] [Accepted: 03/22/2022] [Indexed: 12/27/2022] Open
Abstract
INTRODUCTION Genomic heterozygosity, self-incompatibility, and rich-in somatic mutations hinder the molecular breeding efficiency of outcrossing plants. OBJECTIVES We attempted to develop an efficient integrated strategy to identify quantitative trait loci (QTLs) and trait-associated genes, to develop gene markers, and to construct genomics-assisted prediction (GAP) modes. METHODS A novel protocol, bulked segregant analysis tool for out-crossing species (BSATOS), is presented here, which is characterized by taking full advantage of all segregation patterns (including AB × AB markers) and haplotype information. To verify the effectiveness of the protocol in dealing with the complex traits of outbreeding species, three apple cross populations with 9,654 individuals were adopted. RESULTS By using BSATOS, 90, 60, and 77 significant QTLs were identified successfully and candidate genes were predicted for apple fruit weight (FW), fruit ripening date (FRD), and fruit soluble solid content (SSC), respectively. The gene-based markers were developed and genotyped for 1,396 individuals in a training population, including 145 Malus accessions and 1,251 F1 plants of the three full-sib families. GAP models were trained using marker genotype effect estimates of the training population. The prediction accuracy was 0.7658, 0.6455, and 0.3758 for FW, FRD, and SSC, respectively. CONCLUSION The BSATOS and GAP models provided a convenient and efficient methodology for candidate gene mining and molecular breeding in out-crossing plant species. The BSATOS pipeline can be freely downloaded from: https://github.com/maypoleflyn/BSATOS.
Collapse
Affiliation(s)
- Fei Shen
- College of Horticulture, China Agricultural University, Beijing 100193, China,Research and Innovation Center, Edmund Mach Foundation, 38010 S. Michele all’Adige, Italy,Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
| | - Luca Bianco
- Research and Innovation Center, Edmund Mach Foundation, 38010 S. Michele all’Adige, Italy
| | - Bei Wu
- College of Horticulture, China Agricultural University, Beijing 100193, China
| | - Zhendong Tian
- College of Horticulture, China Agricultural University, Beijing 100193, China
| | - Yi Wang
- College of Horticulture, China Agricultural University, Beijing 100193, China
| | - Ting Wu
- College of Horticulture, China Agricultural University, Beijing 100193, China
| | - Xuefeng Xu
- College of Horticulture, China Agricultural University, Beijing 100193, China
| | - Zhenhai Han
- College of Horticulture, China Agricultural University, Beijing 100193, China,Corresponding authors.
| | - Riccardo Velasco
- Research Centre for Viticulture and Enology, CREA, Conegliano, Italy
| | - Paolo Fontana
- Research and Innovation Center, Edmund Mach Foundation, 38010 S. Michele all’Adige, Italy,Corresponding authors.
| | - Xinzhong Zhang
- College of Horticulture, China Agricultural University, Beijing 100193, China,Corresponding authors.
| |
Collapse
|
9
|
Functional annotation of regulatory elements in cattle genome reveals the roles of extracellular interaction and dynamic change of chromatin states in rumen development during weaning. Genomics 2022; 114:110296. [PMID: 35143887 DOI: 10.1016/j.ygeno.2022.110296] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 12/20/2021] [Accepted: 02/01/2022] [Indexed: 12/24/2022]
Abstract
We profiled landscapes of bovine regulatory elements and explored dynamic changes of chromatin states in rumen development during weaning. The regulatory elements (15 chromatin states) and their coordinated activities in cattle were defined through genome-wide profiling of four histone modifications, CTCF-binding, DNA accessibility, DNA methylation, and transcriptome in rumen epithelial tissues. Each chromatin state presented specific enrichment for sequence ontology, methylation, trait-associated variants, transcription, gene expression-associated variants, selection signatures, and evolutionarily conserved elements. During weaning, weak enhancers and flanking active transcriptional start sites (TSS) were the most dynamic chromatin states and occurred in tandem with significant variations in gene expression and DNA methylation, significantly associated with stature, production, and reproduction economic traits. By comparing with in vitro cultured epithelial cells and in vivo rumen tissues, we showed the commonness and uniqueness of these results, especially the roles of cell interactions and mitochondrial activities in tissue development.
Collapse
|
10
|
Martini JWR, Gao N, Crossa J. Incorporating Omics Data in Genomic Prediction. Methods Mol Biol 2022; 2467:341-357. [PMID: 35451782 DOI: 10.1007/978-1-0716-2205-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In this chapter, we discuss the motivation for integrating other types of omics data into genomic prediction methods. We give an overview of literature investigating the performance of omics-enhanced predictions, and highlight potential pitfalls when applying these methods in breeding. We emphasize that the statistical methods available for genomic data can be transferred to the general omics case. However, when using a framework of omic relationship matrices, the standardization of the variables may be more relevant than it is for a genomic relationship matrix based on single-nucleotide polymorphisms.
Collapse
Affiliation(s)
- Johannes W R Martini
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico.
| | - Ning Gao
- School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico
| |
Collapse
|
11
|
Romé H, Chu TT, Marois D, Huang C, Madsen P, Jensen J. Accounting for genetic architecture for body weight improves accuracy of predicting breeding values in a commercial line of broilers. J Anim Breed Genet 2021; 138:528-540. [PMID: 33774870 PMCID: PMC8451786 DOI: 10.1111/jbg.12546] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 01/29/2021] [Accepted: 02/28/2021] [Indexed: 12/21/2022]
Abstract
BLUP (best linear unbiased prediction) is the standard for predicting breeding values, where different assumptions can be made on variance-covariance structure, which may influence predictive ability. Herein, we compare accuracy of prediction of four derived-BLUP models: (a) a pedigree relationship matrix (PBLUP), (b) a genomic relationship matrix (GBLUP), (c) a weighted genomic relationship matrix (WGBLUP) and (d) a relationship matrix based on genomic features that consisted of only a subset of SNP selected on a priori information (GFBLUP). We phenotyped a commercial population of broilers for body weight (BW) in five successive weeks and genotyped them using a 50k SNP array. We compared predictive ability of univariate models using conservative cross-validation method, where each full-sib group was divided into two folds. Results from cross-validation showed, with WGBLUP model, a gain in accuracy from 2% to 7% compared with GBLUP model. Splitting the additive genetic matrix into two matrices, based on significance level of SNP (Gf : estimated with only set of SNP selected on significance level, Gr : estimated with the remaining SNP), led to a gain in accuracy from 1% to 70%, depending on the proportion of SNP used to define Gf . Thus, information from GWAS in models improves predictive ability of breeding values for BW in broilers. Increasing the power of detection of SNP effects, by acquiring more data or improving methods for GWAS, will help improve predictive ability.
Collapse
Affiliation(s)
- Hélène Romé
- Center for Quantitative Genetics and GenomicsAarhus UniversityTjeleDenmark
| | - Thinh T. Chu
- Center for Quantitative Genetics and GenomicsAarhus UniversityTjeleDenmark
- Faculty of Animal ScienceVietnam National University of AgricultureGia LamVietnam
| | | | | | - Per Madsen
- Center for Quantitative Genetics and GenomicsAarhus UniversityTjeleDenmark
| | - Just Jensen
- Center for Quantitative Genetics and GenomicsAarhus UniversityTjeleDenmark
| |
Collapse
|
12
|
Gebreyesus G, Lund MS, Sahana G, Su G. Reliabilities of Genomic Prediction for Young Stock Survival Traits Using 54K SNP Chip Augmented With Additional Single-Nucleotide Polymorphisms Selected From Imputed Whole-Genome Sequencing Data. Front Genet 2021; 12:667300. [PMID: 34349779 PMCID: PMC8326759 DOI: 10.3389/fgene.2021.667300] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022] Open
Abstract
This study investigated effects of integrating single-nucleotide polymorphisms (SNPs) selected based on previous genome-wide association studies (GWASs), from imputed whole-genome sequencing (WGS) data, in the conventional 54K chip on genomic prediction reliability of young stock survival (YSS) traits in dairy cattle. The WGS SNPs included two groups of SNP sets that were selected based on GWAS in the Danish Holstein for YSS index (YSS_SNPs, n = 98) and SNPs chosen as peaks of quantitative trait loci for the traits of Nordic total merit index in Denmark–Finland–Sweden dairy cattle populations (DFS_SNPs, n = 1,541). Additionally, the study also investigated the possibility of improving genomic prediction reliability for survival traits by modeling the SNPs within recessive lethal haplotypes (LET_SNP, n = 130) detected from the 54K chip in the Nordic Holstein. De-regressed proofs (DRPs) were obtained from 6,558 Danish Holstein bulls genotyped with either 54K chip or customized LD chip that includes SNPs in the standard LD chip and some of the selected WGS SNPs. The chip data were subsequently imputed to 54K SNP together with the selected WGS SNPs. Genomic best linear unbiased prediction (GBLUP) models were implemented to predict breeding values through either pooling the 54K and selected WGS SNPs together as one genetic component (a one-component model) or considering 54K SNPs and selected WGS SNPs as two separate genetic components (a two-component model). Across all the traits, inclusion of each of the selected WGS SNP sets led to negligible improvements in prediction accuracies (0.17 percentage points on average) compared to prediction using only 54K. Similarly, marginal improvement in prediction reliability was obtained when all the selected WGS SNPs were included (0.22 percentage points). No further improvement in prediction reliability was observed when considering random regression on genotype code of recessive lethal alleles in the model including both groups of the WGS SNPs. Additionally, there was no difference in prediction reliability from integrating the selected WGS SNP sets through the two-component model compared to the one-component GBLUP.
Collapse
Affiliation(s)
- Grum Gebreyesus
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| |
Collapse
|
13
|
Tahir MS, Porto-Neto LR, Gondro C, Shittu OB, Wockner K, Tan AWL, Smith HR, Gouveia GC, Kour J, Fortes MRS. Meta-Analysis of Heifer Traits Identified Reproductive Pathways in Bos indicus Cattle. Genes (Basel) 2021; 12:768. [PMID: 34069992 PMCID: PMC8157873 DOI: 10.3390/genes12050768] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 05/12/2021] [Accepted: 05/13/2021] [Indexed: 12/12/2022] Open
Abstract
Fertility traits measured early in life define the reproductive potential of heifers. Knowledge of genetics and biology can help devise genomic selection methods to improve heifer fertility. In this study, we used ~2400 Brahman cattle to perform GWAS and multi-trait meta-analysis to determine genomic regions associated with heifer fertility. Heifer traits measured were pregnancy at first mating opportunity (PREG1, a binary trait), first conception score (FCS, score 1 to 3) and rebreeding score (REB, score 1 to 3.5). The heritability estimates were 0.17 (0.03) for PREG1, 0.11 (0.05) for FCS and 0.28 (0.05) for REB. The three traits were highly genetically correlated (0.75-0.83) as expected. Meta-analysis was performed using SNP effects estimated for each of the three traits, adjusted for standard error. We identified 1359 significant SNPs (p-value < 9.9 × 10-6 at FDR < 0.0001) in the multi-trait meta-analysis. Genomic regions of 0.5 Mb around each significant SNP from the meta-analysis were annotated to create a list of 2560 positional candidate genes. The most significant SNP was in the vicinity of a genomic region on chromosome 8, encompassing the genes SLC44A1, FSD1L, FKTN, TAL2 and TMEM38B. The genomic region in humans that contains homologs of these genes is associated with age at puberty in girls. Top significant SNPs pointed to additional fertility-related genes, again within a 0.5 Mb region, including ESR2, ITPR1, GNG2, RGS9BP, ANKRD27, TDRD12, GRM1, MTHFD1, PTGDR and NTNG1. Functional pathway enrichment analysis resulted in many positional candidate genes relating to known fertility pathways, including GnRH signaling, estrogen signaling, progesterone mediated oocyte maturation, cAMP signaling, calcium signaling, glutamatergic signaling, focal adhesion, PI3K-AKT signaling and ovarian steroidogenesis pathway. The comparison of results from this study with previous transcriptomics and proteomics studies on puberty of the same cattle breed (Brahman) but in a different population identified 392 genes in common from which some genes-BRAF, GABRA2, GABR1B, GAD1, FSHR, CNGA3, PDE10A, SNAP25, ESR2, GRIA2, ORAI1, EGFR, CHRNA5, VDAC2, ACVR2B, ORAI3, CYP11A1, GRIN2A, ATP2B3, CAMK2A, PLA2G, CAMK2D and MAPK3-are also part of the above-mentioned pathways. The biological functions of the positional candidate genes and their annotation to known pathways allowed integrating the results into a bigger picture of molecular mechanisms related to puberty in the hypothalamus-pituitary-ovarian axis. A reasonable number of genes, common between previous puberty studies and this study on early reproductive traits, corroborates the proposed molecular mechanisms. This study identified the polymorphism associated with early reproductive traits, and candidate genes that provided a visualization of the proposed mechanisms, coordinating the hypothalamic, pituitary, and ovarian functions for reproductive performance in Brahman cattle.
Collapse
Affiliation(s)
- Muhammad S. Tahir
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| | - Laercio R. Porto-Neto
- Commonwealth Scientific and Industrial Research Organization, Brisbane, QLD 4072, Australia;
| | - Cedric Gondro
- Department of Animal Science, Michigan State University, East Lansing, MI 48824, USA;
| | - Olasege B. Shittu
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| | - Kimberley Wockner
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| | - Andre W. L. Tan
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| | - Hugo R. Smith
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| | - Gabriela C. Gouveia
- Animal Science Department, Veterinary School, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil;
| | - Jagish Kour
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| | - Marina R. S. Fortes
- School of Chemistry and Molecular Bioscience, The University of Queensland Australia, Brisbane, QLD 4072, Australia; (M.S.T.); (O.B.S.); (K.W.); (A.W.L.T.); (H.R.S.); (J.K.)
| |
Collapse
|
14
|
Gao Y, Fang L, Baldwin RL, Connor EE, Cole JB, Van Tassell CP, Ma L, Li CJ, Liu GE. Single-cell transcriptomic analyses of dairy cattle ruminal epithelial cells during weaning. Genomics 2021; 113:2045-2055. [PMID: 33933592 DOI: 10.1016/j.ygeno.2021.04.039] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 03/20/2021] [Accepted: 04/27/2021] [Indexed: 11/30/2022]
Abstract
Using the 10× Genomics Chromium Controller, we obtained scRNA-seq data of 5064 and 1372 individual cells from two Holstein calf ruminal epithelial tissues before and after weaning, respectively. We detected six distinct cell clusters, designated their cell types, and reported their marker genes. We then examined these clusters' underlining cell types and relationships by performing cell cycle, pseudotime trajectory, regulatory network, weighted gene co-expression network and gene ontology analyses. By integrating these cell marker genes with Holstein GWAS signals, we found they were enriched for animal production and body conformation traits. Finally, we confirmed their cell identities by comparing them with human and mouse stomach epithelial cells. This study presents an initial effort to implement single-cell transcriptomic analysis in cattle, and demonstrates ruminal tissue epithelial cell types and their developments during weaning, opening the door for new discoveries about tissue/cell type roles in complex traits at single-cell resolution.
Collapse
Affiliation(s)
- Yahui Gao
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD 20705, USA; Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA.
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, United Kingdom.
| | - Ransom L Baldwin
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD 20705, USA.
| | - Erin E Connor
- Department of Animal and Food Sciences, University of Delaware, Newark, DE 19716, USA.
| | - John B Cole
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD 20705, USA.
| | - Curtis P Van Tassell
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD 20705, USA.
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA.
| | - Cong-Jun Li
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD 20705, USA.
| | - George E Liu
- Animal Genomics and Improvement Laboratory, BARC, USDA-ARS, Beltsville, MD 20705, USA.
| |
Collapse
|
15
|
Farooq M, van Dijk ADJ, Nijveen H, Aarts MGM, Kruijer W, Nguyen TP, Mansoor S, de Ridder D. Prior Biological Knowledge Improves Genomic Prediction of Growth-Related Traits in Arabidopsis thaliana. Front Genet 2021; 11:609117. [PMID: 33552126 PMCID: PMC7855462 DOI: 10.3389/fgene.2020.609117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 12/21/2020] [Indexed: 01/11/2023] Open
Abstract
Prediction of growth-related complex traits is highly important for crop breeding. Photosynthesis efficiency and biomass are direct indicators of overall plant performance and therefore even minor improvements in these traits can result in significant breeding gains. Crop breeding for complex traits has been revolutionized by technological developments in genomics and phenomics. Capitalizing on the growing availability of genomics data, genome-wide marker-based prediction models allow for efficient selection of the best parents for the next generation without the need for phenotypic information. Until now such models mostly predict the phenotype directly from the genotype and fail to make use of relevant biological knowledge. It is an open question to what extent the use of such biological knowledge is beneficial for improving genomic prediction accuracy and reliability. In this study, we explored the use of publicly available biological information for genomic prediction of photosynthetic light use efficiency (Φ PSII ) and projected leaf area (PLA) in Arabidopsis thaliana. To explore the use of various types of knowledge, we mapped genomic polymorphisms to Gene Ontology (GO) terms and transcriptomics-based gene clusters, and applied these in a Genomic Feature Best Linear Unbiased Predictor (GFBLUP) model, which is an extension to the traditional Genomic BLUP (GBLUP) benchmark. Our results suggest that incorporation of prior biological knowledge can improve genomic prediction accuracy for both Φ PSII and PLA. The improvement achieved depends on the trait, type of knowledge and trait heritability. Moreover, transcriptomics offers complementary evidence to the Gene Ontology for improvement when used to define functional groups of genes. In conclusion, prior knowledge about trait-specific groups of genes can be directly translated into improved genomic prediction.
Collapse
Affiliation(s)
- Muhammad Farooq
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
- Molecular Virology and Gene Silencing Lab, Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Punjab, Pakistan
| | - Aalt D. J. van Dijk
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
- Biometris, Wageningen University, Wageningen, Netherlands
| | - Harm Nijveen
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
| | - Mark G. M. Aarts
- Laboratory of Genetics, Wageningen University, Wageningen, Netherlands
| | - Willem Kruijer
- Biometris, Wageningen University, Wageningen, Netherlands
| | - Thu-Phuong Nguyen
- Laboratory of Genetics, Wageningen University, Wageningen, Netherlands
| | - Shahid Mansoor
- Molecular Virology and Gene Silencing Lab, Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Punjab, Pakistan
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
| |
Collapse
|
16
|
Botelho ME, Lopes MS, Mathur PK, Knol EF, Guimarães SEF, Marques DBD, Lopes PS, Silva FF, Veroneze R. Applying an association weight matrix in weighted genomic prediction of boar taint compounds. J Anim Breed Genet 2020; 138:442-453. [PMID: 33285013 DOI: 10.1111/jbg.12528] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 10/13/2020] [Accepted: 11/14/2020] [Indexed: 12/14/2022]
Abstract
Biological information regarding markers and gene association may be used to attribute different weights for single nucleotide polymorphism (SNP) in genome-wide selection. Therefore, we aimed to evaluate the predictive ability and the bias of genomic prediction using models that allow SNP weighting in the genomic relationship matrix (G) building, with and without incorporating biological information to obtain the weights. Firstly, we performed a genome-wide association studies (GWAS) in data set containing single- (SL) or a multi-line (ML) pig population for androstenone, skatole and indole levels. Secondly, 1%, 2%, 5%, 10%, 30% and 50% of the markers explaining the highest proportions of the genetic variance for each trait were selected to build gene networks through the association weight matrix (AWM) approach. The number of edges in the network was computed and used to derive weights for G (AWM-WssGBLUP). The single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) were used as standard scenarios. All scenarios presented predictive abilities different from zero; however, the great overlap in their confidences interval suggests no differences among scenarios. Most of scenarios of based on AWM provide overestimations for skatole in both SL and ML populations. On the other hand, the skatole and indole prediction were no biased in the ssGBLUP (S1) in both SL and ML populations. Most of scenarios based on AWM provide no biased predictions for indole in both SL and ML populations. In summary, using biological information through AWM matrix and gene networks to derive weights for genomic prediction resulted in no increase in predictive ability for boar taint compounds. In addition, this approach increased the number of analyses steps. Thus, we can conclude that ssGBLUP is most appropriate for the analysis of boar taint compounds in comparison with the weighted strategies used in the present work.
Collapse
Affiliation(s)
- Margareth E Botelho
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Marcos S Lopes
- Topigs Norsvin, Curitiba, Brazil.,Topigs Norsvin Research Center, Beuningen, the Netherlands
| | | | - Egbert F Knol
- Topigs Norsvin Research Center, Beuningen, the Netherlands
| | | | - Daniele B D Marques
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Paulo S Lopes
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Fabyano F Silva
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Renata Veroneze
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| |
Collapse
|
17
|
Ye S, Li J, Zhang Z. Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2020; 11:109. [PMID: 33292577 PMCID: PMC7708144 DOI: 10.1186/s40104-020-00515-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 09/22/2020] [Indexed: 12/02/2022] Open
Abstract
Background Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. Results Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. Conclusions Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China.
| |
Collapse
|
18
|
Genomic Prediction Informed by Biological Processes Expands Our Understanding of the Genetic Architecture Underlying Free Amino Acid Traits in Dry Arabidopsis Seeds. G3-GENES GENOMES GENETICS 2020; 10:4227-4239. [PMID: 32978264 PMCID: PMC7642941 DOI: 10.1534/g3.120.401240] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Plant growth, development, and nutritional quality depends upon amino acid homeostasis, especially in seeds. However, our understanding of the underlying genetics influencing amino acid content and composition remains limited, with only a few candidate genes and quantitative trait loci identified to date. Improved knowledge of the genetics and biological processes that determine amino acid levels will enable researchers to use this information for plant breeding and biological discovery. Toward this goal, we used genomic prediction to identify biological processes that are associated with, and therefore potentially influence, free amino acid (FAA) composition in seeds of the model plant Arabidopsis thaliana. Markers were split into categories based on metabolic pathway annotations and fit using a genomic partitioning model to evaluate the influence of each pathway on heritability explained, model fit, and predictive ability. Selected pathways included processes known to influence FAA composition, albeit to an unknown degree, and spanned four categories: amino acid, core, specialized, and protein metabolism. Using this approach, we identified associations for pathways containing known variants for FAA traits, in addition to finding new trait-pathway associations. Markers related to amino acid metabolism, which are directly involved in FAA regulation, improved predictive ability for branched chain amino acids and histidine. The use of genomic partitioning also revealed patterns across biochemical families, in which serine-derived FAAs were associated with protein related annotations and aromatic FAAs were associated with specialized metabolic pathways. Taken together, these findings provide evidence that genomic partitioning is a viable strategy to uncover the relative contributions of biological processes to FAA traits in seeds, offering a promising framework to guide hypothesis testing and narrow the search space for candidate genes.
Collapse
|
19
|
Ren D, An L, Li B, Qiao L, Liu W. Efficient weighting methods for genomic best linear-unbiased prediction (BLUP) adapted to the genetic architectures of quantitative traits. Heredity (Edinb) 2020; 126:320-334. [PMID: 32980863 DOI: 10.1038/s41437-020-00372-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Revised: 09/12/2020] [Accepted: 09/13/2020] [Indexed: 11/09/2022] Open
Abstract
Genomic best linear-unbiased prediction (GBLUP) assumes equal variance for all marker effects, which is suitable for traits that conform to the infinitesimal model. For traits controlled by major genes, Bayesian methods with shrinkage priors or genome-wide association study (GWAS) methods can be used to identify causal variants effectively. The information from Bayesian/GWAS methods can be used to construct the weighted genomic relationship matrix (G). However, it remains unclear which methods perform best for traits varying in genetic architecture. Therefore, we developed several methods to optimize the performance of weighted GBLUP and compare them with other available methods using simulated and real data sets. First, two types of methods (marker effects with local shrinkage or normal prior) were used to obtain test statistics and estimates for each marker effect. Second, three weighted G matrices were constructed based on the marker information from the first step: (1) the genomic-feature-weighted G, (2) the estimated marker-variance-weighted G, and (3) the absolute value of the estimated marker-effect-weighted G. Following the above process, six different weighted GBLUP methods (local shrinkage/normal-prior GF/EV/AEWGBLUP) were proposed for genomic prediction. Analyses with both simulated and real data demonstrated that these options offer flexibility for optimizing the weighted GBLUP for traits with a broad spectrum of genetic architectures. The advantage of weighting methods over GBLUP in terms of accuracy was trait dependant, ranging from 14.8% to marginal for simulated traits and from 44% to marginal for real traits. Local-shrinkage prior EVWGBLUP is superior for traits mainly controlled by loci of a large effect. Normal-prior AEWGBLUP performs well for traits mainly controlled by loci of moderate effect. For traits controlled by some loci with large effects (explain 25-50% genetic variance) and a range of loci with small effects, GFWGBLUP has advantages. In conclusion, the optimal weighted GBLUP method for genomic selection should take both the genetic architecture and number of QTLs of traits into consideration carefully.
Collapse
Affiliation(s)
- Duanyang Ren
- College of Animal Science and Veterinary Medicine, Shanxi Agricultural University, Taigu, China
| | - Lixia An
- College of Information, Shanxi Agricultural University, Taigu, China
| | - Baojun Li
- College of Animal Science and Veterinary Medicine, Shanxi Agricultural University, Taigu, China
| | - Liying Qiao
- College of Animal Science and Veterinary Medicine, Shanxi Agricultural University, Taigu, China
| | - Wenzhong Liu
- College of Animal Science and Veterinary Medicine, Shanxi Agricultural University, Taigu, China.
| |
Collapse
|
20
|
Arriagada O, Marcotuli I, Gadaleta A, Schwember AR. Molecular Mapping and Genomics of Grain Yield in Durum Wheat: A Review. Int J Mol Sci 2020; 21:ijms21197021. [PMID: 32987666 PMCID: PMC7582296 DOI: 10.3390/ijms21197021] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 09/14/2020] [Accepted: 09/17/2020] [Indexed: 02/07/2023] Open
Abstract
Durum wheat is the most relevant cereal for the whole of Mediterranean agriculture, due to its intrinsic adaptation to dryland and semi-arid environments and to its strong historical cultivation tradition. It is not only relevant for the primary production sector, but also for the food industry chains associated with it. In Mediterranean environments, wheat is mostly grown under rainfed conditions and the crop is frequently exposed to environmental stresses, with high temperatures and water scarcity especially during the grain filling period. For these reasons, and due to recurrent disease epidemics, Mediterranean wheat productivity often remains under potential levels. Many studies, using both linkage analysis (LA) and a genome-wide association study (GWAS), have identified the genomic regions controlling the grain yield and the associated markers that can be used for marker-assisted selection (MAS) programs. Here, we have summarized all the current studies identifying quantitative trait loci (QTLs) and/or candidate genes involved in the main traits linked to grain yield: kernel weight, number of kernels per spike and number of spikes per unit area.
Collapse
Affiliation(s)
- Osvin Arriagada
- Departamento de Ciencias Vegetales, Facultad de Agronomía e Ingeniería Forestal, Pontificia Universidad Católica de Chile, 306-22 Santiago, Chile;
| | - Ilaria Marcotuli
- Department of Agricultural and Environmental Science, University of Bari Aldo Moro, 70121 Bari, Italy; (I.M.); (A.G.)
| | - Agata Gadaleta
- Department of Agricultural and Environmental Science, University of Bari Aldo Moro, 70121 Bari, Italy; (I.M.); (A.G.)
| | - Andrés R. Schwember
- Departamento de Ciencias Vegetales, Facultad de Agronomía e Ingeniería Forestal, Pontificia Universidad Católica de Chile, 306-22 Santiago, Chile;
- Correspondence: ; Tel.: +56-223544123
| |
Collapse
|
21
|
Liu S, Yu Y, Zhang S, Cole JB, Tenesa A, Wang T, McDaneld TG, Ma L, Liu GE, Fang L. Epigenomics and genotype-phenotype association analyses reveal conserved genetic architecture of complex traits in cattle and human. BMC Biol 2020; 18:80. [PMID: 32620158 PMCID: PMC7334855 DOI: 10.1186/s12915-020-00792-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 05/12/2020] [Indexed: 02/01/2023] Open
Abstract
Background Lack of comprehensive functional annotations across a wide range of tissues and cell types severely hinders the biological interpretations of phenotypic variation, adaptive evolution, and domestication in livestock. Here we used a combination of comparative epigenomics, genome-wide association study (GWAS), and selection signature analysis, to shed light on potential adaptive evolution in cattle. Results We cross-mapped 8 histone marks of 1300 samples from human to cattle, covering 178 unique tissues/cell types. By uniformly analyzing 723 RNA-seq and 40 whole genome bisulfite sequencing (WGBS) datasets in cattle, we validated that cross-mapped histone marks captured tissue-specific expression and methylation, reflecting tissue-relevant biology. Through integrating cross-mapped tissue-specific histone marks with large-scale GWAS and selection signature results, we for the first time detected relevant tissues and cell types for 45 economically important traits and artificial selection in cattle. For instance, immune tissues are significantly associated with health and reproduction traits, multiple tissues for milk production and body conformation traits (reflecting their highly polygenic architecture), and thyroid for the different selection between beef and dairy cattle. Similarly, we detected relevant tissues for 58 complex traits and diseases in humans and observed that immune and fertility traits in humans significantly correlated with those in cattle in terms of relevant tissues, which facilitated the identification of causal genes for such traits. For instance, PIK3CG, a gene highly specifically expressed in mononuclear cells, was significantly associated with both age-at-menopause in human and daughter-still-birth in cattle. ICAM, a T cell-specific gene, was significantly associated with both allergic diseases in human and metritis in cattle. Conclusion Collectively, our results highlighted that comparative epigenomics in conjunction with GWAS and selection signature analyses could provide biological insights into the phenotypic variation and adaptive evolution. Cattle may serve as a model for human complex traits, by providing additional information beyond laboratory model organisms, particularly when more novel phenotypes become available in the near future.
Collapse
Affiliation(s)
- Shuli Liu
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, BARC-East, Beltsville, MD, 20705, USA.,College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Ying Yu
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - John B Cole
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, BARC-East, Beltsville, MD, 20705, USA
| | - Albert Tenesa
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK.,The Roslin Institute, University of Edinburgh, Edinburgh, EH25 9RG, UK
| | - Ting Wang
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Tara G McDaneld
- US Meat Animal Research Center, Agricultural Research Service, USDA, Clay Center, NE, 68933, USA
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, BARC-East, Beltsville, MD, 20705, USA.
| | - Lingzhao Fang
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, BARC-East, Beltsville, MD, 20705, USA. .,MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK. .,Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.
| |
Collapse
|
22
|
Yin L, Zhang H, Zhou X, Yuan X, Zhao S, Li X, Liu X. KAML: improving genomic prediction accuracy of complex traits using machine learning determined parameters. Genome Biol 2020; 21:146. [PMID: 32552725 PMCID: PMC7386246 DOI: 10.1186/s13059-020-02052-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 05/21/2020] [Indexed: 02/06/2023] Open
Abstract
Advances in high-throughput sequencing technologies have reduced the cost of genotyping dramatically and led to genomic prediction being widely used in animal and plant breeding, and increasingly in human genetics. Inspired by the efficient computing of linear mixed model and the accurate prediction of Bayesian methods, we propose a machine learning-based method incorporating cross-validation, multiple regression, grid search, and bisection algorithms named KAML that aims to combine the advantages of prediction accuracy with computing efficiency. KAML exhibits higher prediction accuracy than existing methods, and it is available at https://github.com/YinLiLin/KAML.
Collapse
Affiliation(s)
- Lilin Yin
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China.,Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China
| | - Haohao Zhang
- School of Computer Science and Technology, Wuhan University of Technology, Wuhan, 430070, China
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Xiaohui Yuan
- School of Computer Science and Technology, Wuhan University of Technology, Wuhan, 430070, China
| | - Shuhong Zhao
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China.,Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China
| | - Xinyun Li
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China. .,Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China.
| | - Xiaolei Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China. .,Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China.
| |
Collapse
|
23
|
Fang L, Cai W, Liu S, Canela-Xandri O, Gao Y, Jiang J, Rawlik K, Li B, Schroeder SG, Rosen BD, Li CJ, Sonstegard TS, Alexander LJ, Van Tassell CP, VanRaden PM, Cole JB, Yu Y, Zhang S, Tenesa A, Ma L, Liu GE. Comprehensive analyses of 723 transcriptomes enhance genetic and biological interpretations for complex traits in cattle. Genome Res 2020; 30:790-801. [PMID: 32424068 PMCID: PMC7263193 DOI: 10.1101/gr.250704.119] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 05/01/2020] [Indexed: 12/13/2022]
Abstract
By uniformly analyzing 723 RNA-seq data from 91 tissues and cell types, we built a comprehensive gene atlas and studied tissue specificity of genes in cattle. We demonstrated that tissue-specific genes significantly reflected the tissue-relevant biology, showing distinct promoter methylation and evolution patterns (e.g., brain-specific genes evolve slowest, whereas testis-specific genes evolve fastest). Through integrative analyses of those tissue-specific genes with large-scale genome-wide association studies, we detected relevant tissues/cell types and candidate genes for 45 economically important traits in cattle, including blood/immune system (e.g., CCDC88C) for male fertility, brain (e.g., TRIM46 and RAB6A) for milk production, and multiple growth-related tissues (e.g., FGF6 and CCND2) for body conformation. We validated these findings by using epigenomic data across major somatic tissues and sperm. Collectively, our findings provided novel insights into the genetic and biological mechanisms underlying complex traits in cattle, and our transcriptome atlas can serve as a primary source for biological interpretation, functional validation, studies of adaptive evolution, and genomic improvement in livestock.
Collapse
Affiliation(s)
- Lingzhao Fang
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, United Kingdom
- Medical Research Council Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Wentao Cai
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Shuli Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Oriol Canela-Xandri
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, United Kingdom
- Medical Research Council Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Jicai Jiang
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Konrad Rawlik
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, United Kingdom
| | - Bingjie Li
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Steven G Schroeder
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Cong-Jun Li
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | | | - Leeson J Alexander
- Fort Keogh Livestock and Range Research Laboratory, Agricultural Research Service, USDA, Miles City, Montana 59301, USA
| | - Curtis P Van Tassell
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Paul M VanRaden
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - John B Cole
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Ying Yu
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Albert Tenesa
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, United Kingdom
- Medical Research Council Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh EH4 2XU, United Kingdom
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| |
Collapse
|
24
|
Raymond B, Wientjes YCJ, Bouwman AC, Schrooten C, Veerkamp RF. A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices. Genet Sel Evol 2020; 52:21. [PMID: 32345213 PMCID: PMC7189707 DOI: 10.1186/s12711-020-00540-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 04/14/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see text]) GP model (MPSG) in which all SNPs are weighted equally. Our objective was to underpin theoretically the advantages and limits of the MPMG model over the MPSG model, by deriving and validating a deterministic prediction equation for its accuracy. METHODS Using selection index theory, we derived an equation to predict the accuracy of estimated total genomic values of selection candidates from population [Formula: see text] ([Formula: see text]), when individuals from two populations, [Formula: see text] and [Formula: see text], are combined in the training population and two [Formula: see text], made respectively from pre-selected and remaining SNPs, are fitted simultaneously in MPMG. We used simulations to validate the prediction equation in scenarios that differed in the level of genetic correlation between populations, heritability, and proportion of genetic variance explained by the pre-selected SNPs. Empirical accuracy of the MPMG model in each scenario was calculated and compared to the predicted accuracy from the equation. RESULTS In general, the derived prediction equation resulted in accurate predictions of [Formula: see text] for the scenarios evaluated. Using the prediction equation, we showed that an important advantage of the MPMG model over the MPSG model is its ability to benefit from the small number of independent chromosome segments ([Formula: see text]) due to the pre-selected SNPs, both within and across populations, whereas for the MPSG model, there is only a single value for [Formula: see text], calculated based on all SNPs, which is very large. However, this advantage is dependent on the pre-selected SNPs that explain some proportion of the total genetic variance for the trait. CONCLUSIONS We developed an equation that gives insight into why, and under which conditions the MPMG outperforms the MPSG model for GP. The equation can be used as a deterministic tool to assess the potential benefit of combining information from different populations, e.g., different breeds or lines for GP in livestock or plants, or different groups of people based on their ethnic background for prediction of disease risk scores.
Collapse
Affiliation(s)
- Biaty Raymond
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands. .,Biometris, Wageningen University and Research, 6700AA, Wageningen, The Netherlands.
| | - Yvonne C J Wientjes
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Aniek C Bouwman
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | | | - Roel F Veerkamp
- Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
25
|
Rohde PD, Fourie Sørensen I, Sørensen P. qgg: an R package for large-scale quantitative genetic analyses. Bioinformatics 2019; 36:2614-2615. [DOI: 10.1093/bioinformatics/btz955] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Revised: 12/16/2019] [Accepted: 12/23/2019] [Indexed: 01/03/2023] Open
Abstract
Abstract
Summary
Here, we present the R package qgg, which provides an environment for large-scale genetic analyses of quantitative traits and diseases. The qgg package provides an infrastructure for efficient processing of large-scale genetic data and functions for estimating genetic parameters, and performing single and multiple marker association analyses and genomic-based predictions of phenotypes.
Availability and implementation
The qgg package is freely available. For the latest updates, user guides and example scripts, consult the main page http://psoerensen.github.io/qgg. The current release is available from CRAN (https://CRAN.R-project.org/package=qgg) for all major operating systems.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Palle Duun Rohde
- Department of Molecular Biology and Genetics, Aarhus University, Tjele, Denmark
| | | | - Peter Sørensen
- Department of Molecular Biology and Genetics, Aarhus University, Tjele, Denmark
| |
Collapse
|
26
|
Song H, Ye S, Jiang Y, Zhang Z, Zhang Q, Ding X. Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs. Genet Sel Evol 2019; 51:58. [PMID: 31638889 PMCID: PMC6805481 DOI: 10.1186/s12711-019-0500-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 10/07/2019] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP)-chip genotype data using combined populations with different genetic backgrounds or from different breeds have not shown a clear advantage over using within-population or within-breed predictions. The increasing availability of whole-genome sequencing (WGS) data provides new opportunities for combined population genomic prediction. Our objective was to investigate the accuracy of genomic prediction using imputation-based WGS data from combined populations in pigs. Using 80K SNP panel genotypes, WGS genotypes, or genotypes on WGS variants that were pruned based on linkage disequilibrium (LD), three methods [genomic best linear unbiased prediction (GBLUP), single-step (ss)GBLUP, and genomic feature (GF)BLUP] were implemented with different prior information to identify the best method to improve the accuracy of genomic prediction for combined populations in pigs. RESULTS In total, 2089 and 2043 individuals with production and reproduction phenotypes, respectively, from three Yorkshire populations with different genetic backgrounds were genotyped with the PorcineSNP80 panel. Imputation accuracy from 80K to WGS variants reached 92%. The results showed that use of the WGS data compared to the 80K SNP panel did not increase the accuracy of genomic prediction in a single population, but using WGS data with LD pruning and GFBLUP with prior information did yield higher accuracy than the 80K SNP panel. For the 80K SNP panel genotypes, using the combined population resulted in a slight improvement, no change, or even a slight decrease in accuracy in comparison with the single population for GBLUP and ssGBLUP, while accuracy increased by 1 to 2.4% when using WGS data. Notably, the GFBLUP method did not perform well for both the combined population and the single populations. CONCLUSIONS The use of WGS data was beneficial for combined population genomic prediction. Simply increasing the number of SNPs to the WGS level did not increase accuracy for a single population, while using pruned WGS data based on LD and GFBLUP with prior information could yield higher accuracy than the 80K SNP panel.
Collapse
Affiliation(s)
- Hailiang Song
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Yifan Jiang
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
27
|
Rohde PD, Jensen IR, Sarup PM, Ørsted M, Demontis D, Sørensen P, Kristensen TN. Genetic Signatures of Drug Response Variability in Drosophila melanogaster. Genetics 2019; 213:633-650. [PMID: 31455722 PMCID: PMC6781897 DOI: 10.1534/genetics.119.302381] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 08/26/2019] [Indexed: 12/27/2022] Open
Abstract
Knowledge of the genetic basis underlying variation in response to environmental exposures or treatments is important in many research areas. For example, knowing the set of causal genetic variants for drug responses could revolutionize personalized medicine. We used Drosophila melanogaster to investigate the genetic signature underlying behavioral variability in response to methylphenidate (MPH), a drug used in the treatment of attention-deficit/hyperactivity disorder. We exposed a wild-type D. melanogaster population to MPH and a control treatment, and observed an increase in locomotor activity in MPH-exposed individuals. Whole-genome transcriptomic analyses revealed that the behavioral response to MPH was associated with abundant gene expression alterations. To confirm these patterns in a different genetic background and to further advance knowledge on the genetic signature of drug response variability, we used a system of inbred lines, the Drosophila Genetic Reference Panel (DGRP). Based on the DGRP, we showed that the behavioral response to MPH was strongly genotype-dependent. Using an integrative genomic approach, we incorporated known gene interactions into the genomic analyses of the DGRP, and identified putative candidate genes for variability in drug response. We successfully validated 71% of the investigated candidate genes by gene expression knockdown. Furthermore, we showed that MPH has cross-generational behavioral and transcriptomic effects. Our findings establish a foundation for understanding the genetic mechanisms driving genotype-specific responses to medical treatment, and highlight the opportunities that integrative genomic approaches have in optimizing medical treatment of complex diseases.
Collapse
Affiliation(s)
- Palle Duun Rohde
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8000 Aarhus C, Denmark
- Center for Integrative Sequencing, Aarhus University, 8000, Denmark
| | - Iben Ravnborg Jensen
- Section for Biology and Environmental Science, Department of Chemistry and Bioscience, Aalborg University, 9220, Denmark
| | - Pernille Merete Sarup
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Michael Ørsted
- Section for Biology and Environmental Science, Department of Chemistry and Bioscience, Aalborg University, 9220, Denmark
| | - Ditte Demontis
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8000 Aarhus C, Denmark
- Center for Integrative Sequencing, Aarhus University, 8000, Denmark
- Department of Biomedicine, Aarhus University, 8000, Denmark
| | - Peter Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Torsten Nygaard Kristensen
- Section for Biology and Environmental Science, Department of Chemistry and Bioscience, Aalborg University, 9220, Denmark
- Section for Genetics, Ecology and Evolution, Department of Bioscience, Aarhus University, 8000, Denmark
| |
Collapse
|
28
|
Drag MH, Kogelman LJA, Maribo H, Meinert L, Thomsen PD, Kadarmideen HN. Characterization of eQTLs associated with androstenone by RNA sequencing in porcine testis. Physiol Genomics 2019; 51:488-499. [PMID: 31373884 DOI: 10.1152/physiolgenomics.00125.2018] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Characterization of genetic variants affecting genome-wide gene expression levels (expression quantitative trait loci or eQTLs) in pig testes may improve our understanding of genetic architecture of boar taint (an animal welfare trait) and helps in genome-assisted or genomic selection programs. The aims of this study were to identify eQTLs associated with androstenone, to find candidate eQTLs for low androstenone, and to validate the top eQTL by reverse transcriptase quantitative PCR (RT-qPCR). Gene expression profiles were obtained by RNA sequencing in testis from Danish cross-bred pigs and genotype data by 80K single nucleotide polymorphism panel. A total of 262 eQTLs [false discovery rate (FDR) < 0.05] were identified by using two software packages: Matrix eQTL and Krux eQTL. Of these, 149 cis-acting eQTLs were significantly associated with androstenone concentrations and gene expression (FDR < 0.05). The eQTLs were associated with several genes of boar taint relevance including CYP1A2, CYB5D1, and SPHK2. One eQTL gene, AMPH, was differentially expressed (FDR < 0.05) and affected by chicory. Five candidate eQTLs associated with low androstenone concentrations were discovered, including the top eQTL associated with CYP1A2. RT-qPCR confirmed target gene expression to be significantly (P < 0.05) different based on eQTL genotypes. Furthermore, eQTLs were enriched as QTLs for 15 boar taint related traits from the PigQTLdb. This is the first study to report eQTLs in testes of commercial crossbred pigs used in pork production and to reveal genetic architecture of boar taint. Potential applications include development of a DNA test and in advanced genomic selection models for boar taint.
Collapse
Affiliation(s)
- Markus H Drag
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg, Denmark
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Lisette J A Kogelman
- Department of Neurology, Danish Headache Center, Rigshospitalet Glostrup, Faculty of Health and Medical Sciences, University of Copenhagen, Glostrup, Denmark
| | - Hanne Maribo
- SEGES, Danish Pig Research Center, Copenhagen, Denmark
| | - Lene Meinert
- Danish Meat Research Institute (DMRI), Danish Technological Institute, Taastrup, Denmark
| | - Preben D Thomsen
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg, Denmark
| | - Haja N Kadarmideen
- Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Frederiksberg, Denmark
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs. Lyngby, Denmark
| |
Collapse
|
29
|
Kristensen PS, Jensen J, Andersen JR, Guzmán C, Orabi J, Jahoor A. Genomic Prediction and Genome-Wide Association Studies of Flour Yield and Alveograph Quality Traits Using Advanced Winter Wheat Breeding Material. Genes (Basel) 2019; 10:E669. [PMID: 31480460 PMCID: PMC6770321 DOI: 10.3390/genes10090669] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 08/26/2019] [Accepted: 08/29/2019] [Indexed: 12/02/2022] Open
Abstract
Use of genetic markers and genomic prediction might improve genetic gain for quality traits in wheat breeding programs. Here, flour yield and Alveograph quality traits were inspected in 635 F6 winter wheat breeding lines from two breeding cycles. Genome-wide association studies revealed single nucleotide polymorphisms (SNPs) on chromosome 5D significantly associated with flour yield, Alveograph P (dough tenacity), and Alveograph W (dough strength). Additionally, SNPs on chromosome 1D were associated with Alveograph P and W, SNPs on chromosome 1B were associated with Alveograph P, and SNPs on chromosome 4A were associated with Alveograph L (dough extensibility). Predictive abilities based on genomic best linear unbiased prediction (GBLUP) models ranged from 0.50 for flour yield to 0.79 for Alveograph W based on a leave-one-out cross-validation strategy. Predictive abilities were negatively affected by smaller training set sizes, lower genetic relationship between lines in training and validation sets, and by genotype-environment (G×E) interactions. Bayesian Power Lasso models and genomic feature models resulted in similar or slightly improved predictions compared to GBLUP models. SNPs with the largest effects can be used for screening large numbers of lines in early generations in breeding programs to select lines that potentially have good quality traits. In later generations, genomic predictions might be used for a more accurate selection of high quality wheat lines.
Collapse
Affiliation(s)
| | - Just Jensen
- Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | | | - Carlos Guzmán
- Departamento de Genética, Escuela Técnica Superior de Ingeniería Agronómica y de Montes, Edificio Gregor Mendel, Campus de Rabanales, Universidad de Córdoba, CeiA3, 14071 Córdoba, Spain
| | | | - Ahmed Jahoor
- Nordic Seed A/S, 8300 Odder, Denmark
- Department of Plant Breeding, The Swedish University of Agricultural Sciences, 23053 Alnarp, Sweden
| |
Collapse
|
30
|
Fang L, Liu S, Liu M, Kang X, Lin S, Li B, Connor EE, Baldwin RL, Tenesa A, Ma L, Liu GE, Li CJ. Functional annotation of the cattle genome through systematic discovery and characterization of chromatin states and butyrate-induced variations. BMC Biol 2019; 17:68. [PMID: 31419979 PMCID: PMC6698049 DOI: 10.1186/s12915-019-0687-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 08/05/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The functional annotation of genomes, including chromatin accessibility and modifications, is important for understanding and effectively utilizing the increased amount of genome sequences reported. However, while such annotation has been well explored in a diverse set of tissues and cell types in human and model organisms, relatively little data are available for livestock genomes, hindering our understanding of complex trait variation, domestication, and adaptive evolution. Here, we present the first complete global landscape of regulatory elements in cattle and explore the dynamics of chromatin states in rumen epithelial cells induced by the rumen developmental regulator-butyrate. RESULTS We established the first global map of regulatory elements (15 chromatin states) and defined their coordinated activities in cattle, through genome-wide profiling for six histone modifications, RNA polymerase II, CTCF-binding sites, DNA accessibility, DNA methylation, and transcriptome in rumen epithelial primary cells (REPC), rumen tissues, and Madin-Darby bovine kidney epithelial cells (MDBK). We demonstrated that each chromatin state exhibited specific enrichment for sequence ontology, transcription, methylation, trait-associated variants, gene expression-associated variants, selection signatures, and evolutionarily conserved elements, implying distinct biological functions. After butyrate treatments, we observed that the weak enhancers and flanking active transcriptional start sites (TSS) were the most dynamic chromatin states, occurred concomitantly with significant alterations in gene expression and DNA methylation, which was significantly associated with heifer conception rate and stature economic traits. CONCLUSION Our results demonstrate the crucial role of functional genome annotation for understanding genome regulation, complex trait variation, and adaptive evolution in livestock. Using butyrate to induce the dynamics of the epigenomic landscape, we were able to establish the correlation among nutritional elements, chromatin states, gene activities, and phenotypic outcomes.
Collapse
Affiliation(s)
- Lingzhao Fang
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742 USA
| | - Shuli Liu
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Mei Liu
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
- College of Animal Science and Technology, Shaanxi Key Laboratory of Agricultural Molecular Biology, Northwest A&F University, Yangling, 712100 Shaanxi China
| | - Xiaolong Kang
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
- College of Agriculture, Ningxia University, Yinchuan, 750021 China
| | - Shudai Lin
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding and Key Lab of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science of South China Agricultural University, Guangzhou, 510642 China
| | - Bingjie Li
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
| | - Erin E. Connor
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
| | - Ransom L. Baldwin
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
| | - Albert Tenesa
- The Roslin Institute, University of Edinburgh, Edinburgh, EH4 2XU UK
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742 USA
| | - George E. Liu
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
| | - Cong-jun Li
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705 USA
| |
Collapse
|
31
|
Hoff JL, Decker JE, Schnabel RD, Seabury CM, Neibergs HL, Taylor JF. QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection. BMC Genomics 2019; 20:555. [PMID: 31277567 PMCID: PMC6612181 DOI: 10.1186/s12864-019-5941-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 06/26/2019] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND National genetic evaluations for disease resistance do not exist, precluding the genetic improvement of cattle for these traits. We imputed BovineHD genotypes to whole genome sequence for 2703 Holsteins that were cases or controls for Bovine Respiratory Disease and sampled from either California or New Mexico to construct and compare genomic prediction models. The sequence variation reference dataset comprised variants called for 1578 animals from Run 5 of the 1000 Bull Genomes Project, including 450 Holsteins and 29 animals sequenced from this study population. Genotypes for 9,282,726 variants with minor allele frequencies ≥5% were imputed and used to obtain genomic predictions in GEMMA using a Bayesian Sparse Linear Mixed Model. RESULTS Variation explained by markers increased from 13.6% using BovineHD data to 14.4% using imputed whole genome sequence data and the resolution of genomic regions detected as harbouring QTL substantially increased. Explained variation in the analysis of the combined California and New Mexico data was less than when data for each state were separately analysed and the estimated genetic correlation between risk of Bovine Respiratory Disease in California and New Mexico Holsteins was - 0.36. Consequently, genomic predictions trained using the data from one state did not accurately predict disease risk in the other state. To determine if a prediction model could be developed with utility in both states, we selected variants within genomic regions harbouring: 1) genes involved in the normal immune response to infection by pathogens responsible for Bovine Respiratory Disease detected by RNA-Seq analysis, and/or 2) QTL identified in the association analysis of the imputed sequence variants. The model based on QTL selected variants is biased but when trained in one state generated BRD risk predictions with positive accuracies in the other state. CONCLUSIONS We demonstrate the utility of sequence-based and biology-driven model development for genomic selection. Disease phenotypes cannot be routinely recorded in most livestock species and the observed phenotypes may vary in their genomic architecture due to variation in the pathogen composition across environments. Elucidation of trait biology and genetic architecture may guide the development of prediction models with utility across breeds and environments.
Collapse
Affiliation(s)
- Jesse L Hoff
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jared E Decker
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA.,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA
| | - Robert D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA.,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA
| | - Christopher M Seabury
- Department of Veterinary Pathobiology, Texas A&M University, College Station, TX, 77843, USA
| | - Holly L Neibergs
- Department of Animal Sciences, Washington State University, Pullman, WA, 99163, USA
| | - Jeremy F Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
32
|
Impact of the rumen microbiome on milk fatty acid composition of Holstein cattle. Genet Sel Evol 2019; 51:23. [PMID: 31142263 PMCID: PMC6542034 DOI: 10.1186/s12711-019-0464-8] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 05/14/2019] [Indexed: 12/15/2022] Open
Abstract
Background Fatty acids (FA) in bovine milk derive through body mobilization, de novo synthesis or from the feed via the blood stream. To be able to digest feedstuff, the cow depends on its rumen microbiome. The relative abundance of the microbes has been shown to differ between cows. To date, there is little information on the impact of the microbiome on the formation of specific milk FA. Therefore, in this study, our aim was to investigate the impact of the rumen bacterial microbiome on milk FA composition. Furthermore, we evaluated the predictive value of the rumen microbiome and the host genetics on the composition of individual FA in milk. Results Our results show that the proportion of variance explained by the rumen bacteria composition (termed microbiability or \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$h_{B}^{2}$$\end{document}hB2) was generally smaller than that of the genetic component (heritability), and that rumen bacteria influenced most C15:0, C17:0, C18:2 n-6, C18:3 n-3 and CLA cis-9, trans-11 with estimated \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$h_{B}^{2}$$\end{document}hB2 ranging from 0.26 to 0.42. For C6:0, C8:0, C10:0, C12:0, C16:0, C16:1 cis-9 and C18:1 cis-9, the variance explained by the rumen bacteria component was close to 0. In general, both the rumen microbiome and the host genetics had little value for predicting FA phenotype. Compared to genetic information only, adding rumen bacteria information resulted in a significant improvement of the predictive value for C15:0 from 0.22 to 0.38 (P = 9.50e−07) and C18:3 n-3 from 0 to 0.29 (P = 8.81e−18). Conclusions The rumen microbiome has a pronounced influence on the content of odd chain FA and polyunsaturated C18 FA, and to a lesser extent, on the content of the short- and medium-chain FA in the milk of Holstein cattle. The accuracy of prediction of FA phenotypes in milk based on information from either the animal’s genotypes or rumen bacteria composition was very low. Electronic supplementary material The online version of this article (10.1186/s12711-019-0464-8) contains supplementary material, which is available to authorized users.
Collapse
|
33
|
Fang L, Zhou Y, Liu S, Jiang J, Bickhart DM, Null DJ, Li B, Schroeder SG, Rosen BD, Cole JB, Van Tassell CP, Ma L, Liu GE. Integrating Signals from Sperm Methylome Analysis and Genome-Wide Association Study for a Better Understanding of Male Fertility in Cattle. EPIGENOMES 2019; 3:epigenomes3020010. [PMID: 34968233 PMCID: PMC8594688 DOI: 10.3390/epigenomes3020010] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 05/03/2019] [Accepted: 05/11/2019] [Indexed: 01/18/2023] Open
Abstract
Decreased male fertility is a big concern in both human society and the livestock industry. Sperm DNA methylation is commonly believed to be associated with male fertility. However, due to the lack of accurate male fertility records (i.e., limited mating times), few studies have investigated the comprehensive impacts of sperm DNA methylation on male fertility in mammals. In this study, we generated 10 sperm DNA methylomes and performed a preliminary correlation analysis between signals from sperm DNA methylation and signals from large-scale (n = 27,214) genome-wide association studies (GWAS) of 35 complex traits (including 12 male fertility-related traits). We detected genomic regions, which experienced DNA methylation alterations in sperm and were associated with aging and extreme fertility phenotypes (e.g., sire-conception rate or SCR). In dynamic hypomethylated regions (HMRs) and partially methylated domains (PMDs), we found genes (e.g., HOX gene clusters and microRNAs) that were involved in the embryonic development. We demonstrated that genomic regions, which gained rather than lost methylations during aging, and in animals with low SCR were significantly and selectively enriched for GWAS signals of male fertility traits. Our study discovered 16 genes as the potential candidate markers for male fertility, including SAMD5 and PDE5A. Collectively, this initial effort supported a hypothesis that sperm DNA methylation may contribute to male fertility in cattle and revealed the usefulness of functional annotations in enhancing biological interpretation and genomic prediction for complex traits and diseases.
Collapse
Affiliation(s)
- Lingzhao Fang
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA
| | - Yang Zhou
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Education Ministry of China, Huazhong Agricultural University, Wuhan 430070, China
| | - Shuli Liu
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Jicai Jiang
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA
| | - Derek M. Bickhart
- Dairy Forage Research Center, Agricultural Research Service, USDA, Madison, WI 53718, USA
| | - Daniel J. Null
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Bingjie Li
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Steven G. Schroeder
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Benjamin D. Rosen
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - John B. Cole
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Curtis P. Van Tassell
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA
- Correspondence: (L.M.); (G.E.L.); Tel.: +1-301-405-1389 (L.M.); +1-301-504-9843 (G.E.L.); Fax: +1-301-504-8414 (G.E.L.)
| | - George E. Liu
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
- Correspondence: (L.M.); (G.E.L.); Tel.: +1-301-405-1389 (L.M.); +1-301-504-9843 (G.E.L.); Fax: +1-301-504-8414 (G.E.L.)
| |
Collapse
|
34
|
Gebreyesus G, Bovenhuis H, Lund MS, Poulsen NA, Sun D, Buitenhuis B. Reliability of genomic prediction for milk fatty acid composition by using a multi-population reference and incorporating GWAS results. Genet Sel Evol 2019; 51:16. [PMID: 31029078 PMCID: PMC6487064 DOI: 10.1186/s12711-019-0460-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 04/10/2019] [Indexed: 01/01/2023] Open
Abstract
Background Large-scale phenotyping for detailed milk fatty acid (FA) composition is difficult due to expensive and time-consuming analytical techniques. Reliability of genomic prediction is often low for traits that are expensive/difficult to measure and for breeds with a small reference population size. An effective method to increase reference population size could be to combine datasets from different populations. Prediction models might also benefit from incorporation of information on the biological underpinnings of quantitative traits. Genome-wide association studies (GWAS) show that genomic regions on Bos taurus chromosomes (BTA) 14, 19 and 26 underlie substantial proportions of the genetic variation in milk FA traits. Genomic prediction models that incorporate such results could enable improved prediction accuracy in spite of limited reference population sizes. In this study, we combine gas chromatography quantified FA samples from the Chinese, Danish and Dutch Holstein populations and implement a genomic feature best linear unbiased prediction (GFBLUP) model that incorporates variants on BTA14, 19 and 26 as genomic features for which random genetic effects are estimated separately. Prediction reliabilities were compared to those estimated with traditional GBLUP models. Results Predictions using a multi-population reference and a traditional GBLUP model resulted in average gains in prediction reliability of 10% points in the Dutch, 8% points in the Danish and 1% point in the Chinese populations compared to predictions based on population-specific references. Compared to the traditional GBLUP, implementation of the GFBLUP model with a multi-population reference led to further increases in prediction reliability of up to 38% points in the Dutch, 23% points in the Danish and 13% points in the Chinese populations. Prediction reliabilities from the GFBLUP model were moderate to high across the FA traits analyzed. Conclusions Our study shows that it is possible to predict genetic merits for milk FA traits with reasonable accuracy by combining related populations of a breed and using models that incorporate GWAS results. Our findings indicate that international collaborations that facilitate access to multi-population datasets could be highly beneficial to the implementation of genomic selection for detailed milk composition traits. Electronic supplementary material The online version of this article (10.1186/s12711-019-0460-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Grum Gebreyesus
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark. .,Animal Breeding and Genomics Centre, Wageningen University, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Henk Bovenhuis
- Animal Breeding and Genomics Centre, Wageningen University, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Mogens S Lund
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| | - Nina A Poulsen
- Department of Food Science, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| | - Dongxiao Sun
- Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture of China, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Bart Buitenhuis
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| |
Collapse
|
35
|
Fang L, Zhou Y, Liu S, Jiang J, Bickhart DM, Null DJ, Li B, Schroeder SG, Rosen BD, Cole JB, Van Tassell CP, Ma L, Liu GE. Comparative analyses of sperm DNA methylomes among human, mouse and cattle provide insights into epigenomic evolution and complex traits. Epigenetics 2019; 14:260-276. [PMID: 30810461 PMCID: PMC6557555 DOI: 10.1080/15592294.2019.1582217] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Sperm DNA methylation is crucial for fertility and viability of offspring but epigenome evolution in mammals is largely understudied. By comparing sperm DNA methylomes and large-scale genome-wide association study (GWAS) signals between human and cattle, we aimed to examine the DNA methylome evolution and its associations with complex phenotypes in mammals. Our analysis revealed that genes with conserved non-methylated promoters (e.g., ANKS1A and WNT7A) among human and cattle were involved in common system and embryo development, and enriched for GWAS signals of body conformation traits in both species, while genes with conserved hypermethylated promoters (e.g., TCAP and CD80) were engaged in immune responses and highlighted by immune-related traits. On the other hand, genes with human-specific hypomethylated promoters (e.g., FOXP2 and HYDIN) were engaged in neuron system development and enriched for GWAS signals of brain-related traits, while genes with cattle-specific hypomethylated promoters (e.g., LDHB and DGAT2) mainly participated in lipid storage and metabolism. We validated our findings using sperm-retained nucleosome, preimplantation transcriptome, and adult tissue transcriptome data, as well as sequence evolutionary features, including motif binding sites, mutation rates, recombination rates and evolution signatures. In conclusion, our results demonstrate important roles of epigenome evolution in shaping the genetic architecture underlying complex phenotypes, hence enhance signal prioritization in GWAS and provide valuable information for human neurological disorders and livestock genetic improvement.
Collapse
Affiliation(s)
- Lingzhao Fang
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA.,b Department of Animal and Avian Sciences , University of Maryland , College Park , MD , USA
| | - Yang Zhou
- c Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Education Ministry of China , Huazhong Agricultural University , Wuhan , Hubei , China
| | - Shuli Liu
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA.,d Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology , China Agricultural University , Beijing , China
| | - Jicai Jiang
- b Department of Animal and Avian Sciences , University of Maryland , College Park , MD , USA
| | - Derek M Bickhart
- e Dairy Forage Research Center , Agricultural Research Service, USDA , Madison , WI , USA
| | - Daniel J Null
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Bingjie Li
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Steven G Schroeder
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Benjamin D Rosen
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - John B Cole
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Curtis P Van Tassell
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| | - Li Ma
- b Department of Animal and Avian Sciences , University of Maryland , College Park , MD , USA
| | - George E Liu
- a Animal Genomics and Improvement Laboratory, BARC , Agricultural Research Service, USDA , Beltsville , MD , USA
| |
Collapse
|
36
|
Fang L, Jiang J, Li B, Zhou Y, Freebern E, Vanraden PM, Cole JB, Liu GE, Ma L. Genetic and epigenetic architecture of paternal origin contribute to gestation length in cattle. Commun Biol 2019; 2:100. [PMID: 30886909 PMCID: PMC6418173 DOI: 10.1038/s42003-019-0341-6] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Accepted: 02/06/2019] [Indexed: 12/19/2022] Open
Abstract
The length of gestation can affect offspring health and performance. Both maternal and fetal effects contribute to gestation length; however, paternal contributions to gestation length remain elusive. Using genome-wide association study (GWAS) in 27,214 Holstein bulls with millions of gestation records, here we identify nine paternal genomic loci associated with cattle gestation length. We demonstrate that these GWAS signals are enriched in pathways relevant to embryonic development, and in differentially methylated regions between sperm samples with long and short gestation length. We reveal that gestation length shares genetic and epigenetic architecture in sperm with calving ability, body depth, and conception rate. While several candidate genes are detected in our fine-mapping analysis, we provide evidence indicating ZNF613 as a promising candidate for cattle gestation length. Collectively, our findings support that the paternal genome and epigenome can impact gestation length potentially through regulation of the embryonic development. Lingzhao Fang et al. studied the paternal genetic variants that affect gestational length in cattle. They found that paternal genes from pathways involved in embryonic development were associated with gestation length, and that these were often found in differentially methylated regions of the genome.
Collapse
Affiliation(s)
- Lingzhao Fang
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.,Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Jicai Jiang
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Bingjie Li
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Yang Zhou
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Education Ministry of China, Huazhong Agricultural University, 430070, Wuhan, Hubei, China
| | - Ellen Freebern
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Paul M Vanraden
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - John B Cole
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - George E Liu
- Animal Genomics and Improvement Laboratory, BARC, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.
| |
Collapse
|
37
|
MicroRNA-guided prioritization of genome-wide association signals reveals the importance of microRNA-target gene networks for complex traits in cattle. Sci Rep 2018; 8:9345. [PMID: 29921979 PMCID: PMC6008395 DOI: 10.1038/s41598-018-27729-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Accepted: 06/06/2018] [Indexed: 02/07/2023] Open
Abstract
MicroRNAs (miRNA) are key modulators of gene expression and so act as putative fine-tuners of complex phenotypes. Here, we hypothesized that causal variants of complex traits are enriched in miRNAs and miRNA-target networks. First, we conducted a genome-wide association study (GWAS) for seven functional and milk production traits using imputed sequence variants (13~15 million) and >10,000 animals from three dairy cattle breeds, i.e., Holstein (HOL), Nordic red cattle (RDC) and Jersey (JER). Second, we analyzed for enrichments of association signals in miRNAs and their miRNA-target networks. Our results demonstrated that genomic regions harboring miRNA genes were significantly (P < 0.05) enriched with GWAS signals for milk production traits and mastitis, and that enrichments within miRNA-target gene networks were significantly higher than in random gene-sets for the majority of traits. Furthermore, most between-trait and across-breed correlations of enrichments with miRNA-target networks were significantly greater than with random gene-sets, suggesting pleiotropic effects of miRNAs. Intriguingly, genes that were differentially expressed in response to mammary gland infections were significantly enriched in the miRNA-target networks associated with mastitis. All these findings were consistent across three breeds. Collectively, our observations demonstrate the importance of miRNAs and their targets for the expression of complex traits.
Collapse
|
38
|
Rohde PD, Østergaard S, Kristensen TN, Sørensen P, Loeschcke V, Mackay TFC, Sarup P. Functional Validation of Candidate Genes Detected by Genomic Feature Models. G3 (BETHESDA, MD.) 2018; 8:1659-1668. [PMID: 29519937 PMCID: PMC5940157 DOI: 10.1534/g3.118.200082] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Accepted: 03/07/2018] [Indexed: 12/11/2022]
Abstract
Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait variability to pinpoint loci that contribute to the quantitative trait. Because stringent genome-wide significance thresholds are applied to control the false positive rate, many true causal variants can remain undetected. To ameliorate this problem, many alternative approaches have been developed, such as genomic feature models (GFM). The GFM approach tests for association of set of genomic markers, and predicts genomic values from genomic data utilizing prior biological knowledge. We investigated to what degree the findings from GFM have biological relevance. We used the Drosophila Genetic Reference Panel to investigate locomotor activity, and applied genomic feature prediction models to identify gene ontology (GO) categories predictive of this phenotype. Next, we applied the covariance association test to partition the genomic variance of the predictive GO terms to the genes within these terms. We then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated with the magnitude of the phenotypic consequence of gene knockdown. This study provides evidence for five new candidate genes for locomotor activity, and provides support for the reliability of the GFM approach.
Collapse
Affiliation(s)
- Palle Duun Rohde
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8000 Aarhus, Denmark
- Center for Integrative Sequencing, Aarhus University, 8000 Aarhus, Denmark
| | - Solveig Østergaard
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Torsten Nygaard Kristensen
- Section for Genetics, Ecology and Evolution, Department of Bioscience, Aarhus University, 8000 Aarhus, Denmark
- Section for Biology and Environmental Science, Department of Chemistry and Bioscience, Aalborg University, 9220 Aalborg, Denmark
| | - Peter Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Volker Loeschcke
- Section for Genetics, Ecology and Evolution, Department of Bioscience, Aarhus University, 8000 Aarhus, Denmark
| | - Trudy F C Mackay
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695
- Program in Genetics, North Carolina State University, Raleigh, North Carolina 27695
- W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina 27695
| | - Pernille Sarup
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| |
Collapse
|
39
|
Gebreyesus G, Lund MS, Buitenhuis B, Bovenhuis H, Poulsen NA, Janss LG. Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits. Genet Sel Evol 2017; 49:89. [PMID: 29207947 PMCID: PMC5718071 DOI: 10.1186/s12711-017-0364-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Accepted: 11/28/2017] [Indexed: 11/10/2022] Open
Abstract
Background Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. Results BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions. Conclusions Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome.
Collapse
Affiliation(s)
- Grum Gebreyesus
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark. .,Animal Breeding and Genomics Centre, Wageningen University, PO Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Mogens S Lund
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| | - Bart Buitenhuis
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| | - Henk Bovenhuis
- Animal Breeding and Genomics Centre, Wageningen University, PO Box 338, 6700 AH, Wageningen, The Netherlands
| | - Nina A Poulsen
- Department of Food Science, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| | - Luc G Janss
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark
| |
Collapse
|
40
|
Ørsted M, Rohde PD, Hoffmann AA, Sørensen P, Kristensen TN. Environmental variation partitioned into separate heritable components. Evolution 2017; 72:136-152. [DOI: 10.1111/evo.13391] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 10/30/2017] [Accepted: 10/31/2017] [Indexed: 12/16/2022]
Affiliation(s)
- Michael Ørsted
- Section of Biology and Environmental Science, Department of Chemistry and Bioscience; Aalborg University; Fredrik Bajers Vej 7H 9220 Aalborg E Denmark
- School of Biosciences, Bio21 Molecular Science and Biotechnology Institute; The University of Melbourne; Parkville Victoria 3052 Australia
| | - Palle Duun Rohde
- Center for Quantitative Genetics and Genomics; Department of Molecular Biology and Genetics; Aarhus University; Blichers Allé 20 8830 Tjele Denmark
- i PSYCH; The Lundbeck Foundation Initiative for Integrative Psychiatric Research; 8000 Aarhus C Denmark
- i SEQ, Center for Integrative Sequencing; Aarhus University; Bartholins Allé 6 8000 Aarhus C Denmark
| | - Ary Anthony Hoffmann
- Section of Biology and Environmental Science, Department of Chemistry and Bioscience; Aalborg University; Fredrik Bajers Vej 7H 9220 Aalborg E Denmark
- School of Biosciences, Bio21 Molecular Science and Biotechnology Institute; The University of Melbourne; Parkville Victoria 3052 Australia
| | - Peter Sørensen
- Center for Quantitative Genetics and Genomics; Department of Molecular Biology and Genetics; Aarhus University; Blichers Allé 20 8830 Tjele Denmark
| | - Torsten Nygaard Kristensen
- Section of Biology and Environmental Science, Department of Chemistry and Bioscience; Aalborg University; Fredrik Bajers Vej 7H 9220 Aalborg E Denmark
- Section of Genetics, Ecology and Evolution, Department of Bioscience; Aarhus University; 8000 Aarhus C Denmark
| |
Collapse
|
41
|
Mangin B, Bonnafous F, Blanchet N, Boniface MC, Bret-Mestries E, Carrère S, Cottret L, Legrand L, Marage G, Pegot-Espagnet P, Munos S, Pouilly N, Vear F, Vincourt P, Langlade NB. Genomic Prediction of Sunflower Hybrids Oil Content. FRONTIERS IN PLANT SCIENCE 2017; 8:1633. [PMID: 28983306 DOI: 10.3389/fpls.2017.01633d] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 09/06/2017] [Indexed: 05/26/2023]
Abstract
Prediction of hybrid performance using incomplete factorial mating designs is widely used in breeding programs including different heterotic groups. Based on the general combining ability (GCA) of the parents, predictions are accurate only if the genetic variance resulting from the specific combining ability is small and both parents have phenotyped descendants. Genomic selection (GS) can predict performance using a model trained on both phenotyped and genotyped hybrids that do not necessarily include all hybrid parents. Therefore, GS could overcome the issue of unknown parent GCA. Here, we compared the accuracy of classical GCA-based and genomic predictions for oil content of sunflower seeds using several GS models. Our study involved 452 sunflower hybrids from an incomplete factorial design of 36 female and 36 male lines. Re-sequencing of parental lines allowed to identify 468,194 non-redundant SNPs and to infer the hybrid genotypes. Oil content was observed in a multi-environment trial (MET) over 3 years, leading to nine different environments. We compared GCA-based model to different GS models including female and male genomic kinships with the addition of the female-by-male interaction genomic kinship, the use of functional knowledge as SNPs in genes of oil metabolic pathways, and with epistasis modeling. When both parents have descendants in the training set, the predictive ability was high even for GCA-based prediction, with an average MET value of 0.782. GS performed slightly better (+0.2%). Neither the inclusion of the female-by-male interaction, nor functional knowledge of oil metabolism, nor epistasis modeling improved the GS accuracy. GS greatly improved predictive ability when one or both parents were untested in the training set, increasing GCA-based predictive ability by 10.4% from 0.575 to 0.635 in the MET. In this scenario, performing GS only considering SNPs in oil metabolic pathways did not improve whole genome GS prediction but increased GCA-based prediction ability by 6.4%. Our results show that GS is a major improvement to breeding efficiency compared to the classical GCA modeling when either one or both parents are not well-characterized. This finding could therefore accelerate breeding through reducing phenotyping efforts and more effectively targeting for the most promising crosses.
Collapse
Affiliation(s)
- Brigitte Mangin
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Fanny Bonnafous
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Nicolas Blanchet
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Marie-Claude Boniface
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | | | - Sébastien Carrère
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Ludovic Cottret
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Ludovic Legrand
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Gwenola Marage
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Prune Pegot-Espagnet
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Stéphane Munos
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Nicolas Pouilly
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Felicity Vear
- GDEC, INRA, Université Clermont II Blaise PascalClermont-Ferrand, France
| | - Patrick Vincourt
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Nicolas B Langlade
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| |
Collapse
|
42
|
Fang L, Sahana G, Ma P, Su G, Yu Y, Zhang S, Lund MS, Sørensen P. Use of biological priors enhances understanding of genetic architecture and genomic prediction of complex traits within and between dairy cattle breeds. BMC Genomics 2017; 18:604. [PMID: 28797230 PMCID: PMC5553760 DOI: 10.1186/s12864-017-4004-z] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2016] [Accepted: 08/02/2017] [Indexed: 02/08/2023] Open
Abstract
Background A better understanding of the genetic architecture underlying complex traits (e.g., the distribution of causal variants and their effects) may aid in the genomic prediction. Here, we hypothesized that the genomic variants of complex traits might be enriched in a subset of genomic regions defined by genes grouped on the basis of “Gene Ontology” (GO), and that incorporating this independent biological information into genomic prediction models might improve their predictive ability. Results Four complex traits (i.e., milk, fat and protein yields, and mastitis) together with imputed sequence variants in Holstein (HOL) and Jersey (JER) cattle were analysed. We first carried out a post-GWAS analysis in a HOL training population to assess the degree of enrichment of the association signals in the gene regions defined by each GO term. We then extended the genomic best linear unbiased prediction model (GBLUP) to a genomic feature BLUP (GFBLUP) model, including an additional genomic effect quantifying the joint effect of a group of variants located in a genomic feature. The GBLUP model using a single random effect assumes that all genomic variants contribute to the genomic relationship equally, whereas GFBLUP attributes different weights to the individual genomic relationships in the prediction equation based on the estimated genomic parameters. Our results demonstrate that the immune-relevant GO terms were more associated with mastitis than milk production, and several biologically meaningful GO terms improved the prediction accuracy with GFBLUP for the four traits, as compared with GBLUP. The improvement of the genomic prediction between breeds (the average increase across the four traits was 0.161) was more apparent than that it was within the HOL (the average increase across the four traits was 0.020). Conclusions Our genomic feature modelling approaches provide a framework to simultaneously explore the genetic architecture and genomic prediction of complex traits by taking advantage of independent biological knowledge. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-4004-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lingzhao Fang
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark. .,Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Goutam Sahana
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Peipei Ma
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Guosheng Su
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Ying Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Shengli Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Mogens Sandø Lund
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Peter Sørensen
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
43
|
Rohde PD, Gaertner B, Ward K, Sørensen P, Mackay TFC. Genomic Analysis of Genotype-by-Social Environment Interaction for Drosophila melanogaster Aggressive Behavior. Genetics 2017; 206:1969-1984. [PMID: 28550016 PMCID: PMC5560801 DOI: 10.1534/genetics.117.200642] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2017] [Accepted: 05/22/2017] [Indexed: 02/06/2023] Open
Abstract
Human psychiatric disorders such as schizophrenia, bipolar disorder, and attention-deficit/hyperactivity disorder often include adverse behaviors including increased aggressiveness. Individuals with psychiatric disorders often exhibit social withdrawal, which can further increase the probability of conducting a violent act. Here, we used the inbred, sequenced lines of the Drosophila Genetic Reference Panel (DGRP) to investigate the genetic basis of variation in male aggressive behavior for flies reared in a socialized and socially isolated environment. We identified genetic variation for aggressive behavior, as well as significant genotype-by-social environmental interaction (GSEI); i.e., variation among DGRP genotypes in the degree to which social isolation affected aggression. We performed genome-wide association (GWA) analyses to identify genetic variants associated with aggression within each environment. We used genomic prediction to partition genetic variants into gene ontology (GO) terms and constituent genes, and identified GO terms and genes with high prediction accuracies in both social environments and for GSEI. The top predictive GO terms significantly increased the proportion of variance explained, compared to prediction models based on all segregating variants. We performed genomic prediction across environments, and identified genes in common between the social environments that turned out to be enriched for genome-wide associated variants. A large proportion of the associated genes have previously been associated with aggressive behavior in Drosophila and mice. Further, many of these genes have human orthologs that have been associated with neurological disorders, indicating partially shared genetic mechanisms underlying aggression in animal models and human psychiatric disorders.
Collapse
Affiliation(s)
- Palle Duun Rohde
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, 8000 Aarhus, Denmark
- ISEQ, Center for Integrative Sequencing, Aarhus University, 8000 Aarhus, Denmark
| | - Bryn Gaertner
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695
- Program in Genetics, North Carolina State University, Raleigh, North Carolina 27695
- W.M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina 27695
| | - Kirsty Ward
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695
- Program in Genetics, North Carolina State University, Raleigh, North Carolina 27695
- W.M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina 27695
| | - Peter Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Trudy F C Mackay
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695
- Program in Genetics, North Carolina State University, Raleigh, North Carolina 27695
- W.M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina 27695
| |
Collapse
|
44
|
Sørensen IF, Edwards SM, Rohde PD, Sørensen P. Multiple Trait Covariance Association Test Identifies Gene Ontology Categories Associated with Chill Coma Recovery Time in Drosophila melanogaster. Sci Rep 2017; 7:2413. [PMID: 28546557 PMCID: PMC5445101 DOI: 10.1038/s41598-017-02281-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 04/10/2017] [Indexed: 12/29/2022] Open
Abstract
The genomic best linear unbiased prediction (GBLUP) model has proven to be useful for prediction of complex traits as well as estimation of population genetic parameters. Improved inference and prediction accuracy of GBLUP may be achieved by identifying genomic regions enriched for causal genetic variants. We aimed at searching for patterns in GBLUP-derived single-marker statistics, by including them in genetic marker set tests, that could reveal associations between a set of genetic markers (genomic feature) and a complex trait. GBLUP-derived set tests proved to be powerful for detecting genomic features, here defined by gene ontology (GO) terms, enriched for causal variants affecting a quantitative trait in a population with low degree of relatedness. Different set test approaches were compared using simulated data illustrating the impact of trait- and genomic feature-specific factors on detection power. We extended the most powerful single trait set test, covariance association test (CVAT), to a multiple trait setting. The multiple trait CVAT (MT-CVAT) identified functionally relevant GO categories associated with the quantitative trait, chill coma recovery time, in the unrelated, sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel.
Collapse
Affiliation(s)
- Izel Fourie Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830, Tjele, Denmark.
| | - Stefan M Edwards
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830, Tjele, Denmark.,The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Palle Duun Rohde
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830, Tjele, Denmark.,Centre for Integrative Sequencing, iSEQ, Aarhus University, 8000, Aarhus, Denmark.,iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, 8000, Aarhus, Denmark
| | - Peter Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
45
|
Fang L, Sahana G, Ma P, Su G, Yu Y, Zhang S, Lund MS, Sørensen P. Exploring the genetic architecture and improving genomic prediction accuracy for mastitis and milk production traits in dairy cattle by mapping variants to hepatic transcriptomic regions responsive to intra-mammary infection. Genet Sel Evol 2017; 49:44. [PMID: 28499345 PMCID: PMC5427631 DOI: 10.1186/s12711-017-0319-0] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Accepted: 05/03/2017] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND A better understanding of the genetic architecture of complex traits can contribute to improve genomic prediction. We hypothesized that genomic variants associated with mastitis and milk production traits in dairy cattle are enriched in hepatic transcriptomic regions that are responsive to intra-mammary infection (IMI). Genomic markers [e.g. single nucleotide polymorphisms (SNPs)] from those regions, if included, may improve the predictive ability of a genomic model. RESULTS We applied a genomic feature best linear unbiased prediction model (GFBLUP) to implement the above strategy by considering the hepatic transcriptomic regions responsive to IMI as genomic features. GFBLUP, an extension of GBLUP, includes a separate genomic effect of SNPs within a genomic feature, and allows differential weighting of the individual marker relationships in the prediction equation. Since GFBLUP is computationally intensive, we investigated whether a SNP set test could be a computationally fast way to preselect predictive genomic features. The SNP set test assesses the association between a genomic feature and a trait based on single-SNP genome-wide association studies. We applied these two approaches to mastitis and milk production traits (milk, fat and protein yield) in Holstein (HOL, n = 5056) and Jersey (JER, n = 1231) cattle. We observed that a majority of genomic features were enriched in genomic variants that were associated with mastitis and milk production traits. Compared to GBLUP, the accuracy of genomic prediction with GFBLUP was marginally improved (3.2 to 3.9%) in within-breed prediction. The highest increase (164.4%) in prediction accuracy was observed in across-breed prediction. The significance of genomic features based on the SNP set test were correlated with changes in prediction accuracy of GFBLUP (P < 0.05). CONCLUSIONS GFBLUP provides a framework for integrating multiple layers of biological knowledge to provide novel insights into the biological basis of complex traits, and to improve the accuracy of genomic prediction. The SNP set test might be used as a first-step to improve GFBLUP models. Approaches like GFBLUP and SNP set test will become increasingly useful, as the functional annotations of genomes keep accumulating for a range of species and traits.
Collapse
Affiliation(s)
- Lingzhao Fang
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark. .,Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Goutam Sahana
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Peipei Ma
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Guosheng Su
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Ying Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Shengli Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Mogens Sandø Lund
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Peter Sørensen
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
46
|
Fang L, Sahana G, Su G, Yu Y, Zhang S, Lund MS, Sørensen P. Integrating Sequence-based GWAS and RNA-Seq Provides Novel Insights into the Genetic Basis of Mastitis and Milk Production in Dairy Cattle. Sci Rep 2017; 7:45560. [PMID: 28358110 PMCID: PMC5372096 DOI: 10.1038/srep45560] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Accepted: 02/28/2017] [Indexed: 02/06/2023] Open
Abstract
Connecting genome-wide association study (GWAS) to biological mechanisms underlying complex traits is a major challenge. Mastitis resistance and milk production are complex traits of economic importance in the dairy sector and are associated with intra-mammary infection (IMI). Here, we integrated IMI-relevant RNA-Seq data from Holstein cattle and sequence-based GWAS data from three dairy cattle breeds (i.e., Holstein, Nordic red cattle, and Jersey) to explore the genetic basis of mastitis resistance and milk production using post-GWAS analyses and a genomic feature linear mixed model. At 24 h post-IMI, genes responsive to IMI in the mammary gland were preferentially enriched for genetic variants associated with mastitis resistance rather than milk production. Response genes in the liver were mainly enriched for variants associated with mastitis resistance at an early time point (3 h) post-IMI, whereas responsive genes at later stages were enriched for associated variants with milk production. The up- and down-regulated genes were enriched for associated variants with mastitis resistance and milk production, respectively. The patterns were consistent across breeds, indicating that different breeds shared similarities in the genetic basis of these traits. Our approaches provide a framework for integrating multiple layers of data to understand the genetic architecture underlying complex traits.
Collapse
Affiliation(s)
- Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark.,Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture &National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, China
| | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Ying Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture &National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, China
| | - Shengli Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture &National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, China
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| | - Peter Sørensen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, 8830 Tjele, Denmark
| |
Collapse
|
47
|
|
48
|
Mangin B, Bonnafous F, Blanchet N, Boniface MC, Bret-Mestries E, Carrère S, Cottret L, Legrand L, Marage G, Pegot-Espagnet P, Munos S, Pouilly N, Vear F, Vincourt P, Langlade NB. Genomic Prediction of Sunflower Hybrids Oil Content. FRONTIERS IN PLANT SCIENCE 2017; 8:1633. [PMID: 28983306 PMCID: PMC5613134 DOI: 10.3389/fpls.2017.01633] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Accepted: 09/06/2017] [Indexed: 05/18/2023]
Abstract
Prediction of hybrid performance using incomplete factorial mating designs is widely used in breeding programs including different heterotic groups. Based on the general combining ability (GCA) of the parents, predictions are accurate only if the genetic variance resulting from the specific combining ability is small and both parents have phenotyped descendants. Genomic selection (GS) can predict performance using a model trained on both phenotyped and genotyped hybrids that do not necessarily include all hybrid parents. Therefore, GS could overcome the issue of unknown parent GCA. Here, we compared the accuracy of classical GCA-based and genomic predictions for oil content of sunflower seeds using several GS models. Our study involved 452 sunflower hybrids from an incomplete factorial design of 36 female and 36 male lines. Re-sequencing of parental lines allowed to identify 468,194 non-redundant SNPs and to infer the hybrid genotypes. Oil content was observed in a multi-environment trial (MET) over 3 years, leading to nine different environments. We compared GCA-based model to different GS models including female and male genomic kinships with the addition of the female-by-male interaction genomic kinship, the use of functional knowledge as SNPs in genes of oil metabolic pathways, and with epistasis modeling. When both parents have descendants in the training set, the predictive ability was high even for GCA-based prediction, with an average MET value of 0.782. GS performed slightly better (+0.2%). Neither the inclusion of the female-by-male interaction, nor functional knowledge of oil metabolism, nor epistasis modeling improved the GS accuracy. GS greatly improved predictive ability when one or both parents were untested in the training set, increasing GCA-based predictive ability by 10.4% from 0.575 to 0.635 in the MET. In this scenario, performing GS only considering SNPs in oil metabolic pathways did not improve whole genome GS prediction but increased GCA-based prediction ability by 6.4%. Our results show that GS is a major improvement to breeding efficiency compared to the classical GCA modeling when either one or both parents are not well-characterized. This finding could therefore accelerate breeding through reducing phenotyping efforts and more effectively targeting for the most promising crosses.
Collapse
Affiliation(s)
- Brigitte Mangin
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
- *Correspondence: Brigitte Mangin
| | - Fanny Bonnafous
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Nicolas Blanchet
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Marie-Claude Boniface
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | | | - Sébastien Carrère
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Ludovic Cottret
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Ludovic Legrand
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Gwenola Marage
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Prune Pegot-Espagnet
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Stéphane Munos
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Nicolas Pouilly
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Felicity Vear
- GDEC, INRA, Université Clermont II Blaise PascalClermont-Ferrand, France
| | - Patrick Vincourt
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| | - Nicolas B. Langlade
- LIPM, Université de Toulouse, INRA, Centre National de la Recherche ScientifiqueCastanet-Tolosan, France
| |
Collapse
|
49
|
Fang L, Hou Y, An J, Li B, Song M, Wang X, Sørensen P, Dong Y, Liu C, Wang Y, Zhu H, Zhang S, Yu Y. Genome-Wide Transcriptional and Post-transcriptional Regulation of Innate Immune and Defense Responses of Bovine Mammary Gland to Staphylococcus aureus. Front Cell Infect Microbiol 2016; 6:193. [PMID: 28083515 PMCID: PMC5183581 DOI: 10.3389/fcimb.2016.00193] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Accepted: 12/09/2016] [Indexed: 12/28/2022] Open
Abstract
Staphylococcus aureus (S. aureus) is problematic for lactating mammals and public health. Understanding of mechanisms by which the hosts respond to severe invasion of S. aureus remains elusive. In this study, the genome-wide expression of mRNAs and miRNAs in bovine mammary gland cells were interrogated at 24 h after intra-mammary infection (IMI) with high or low concentrations of S. aureus. Compared to the negative control quarters, 194 highly-confident responsive genes were identified in the quarters with high concentration (109 cfu/mL) of S. aureus, which were predominantly implicated in pathways and biological processes pertaining to innate immune system, such as cytokine-cytokine receptor interaction and inflammatory response. In contrast, only 21 highly-confident genes were significantly differentially expressed in face of low concentration (106 cfu/mL) of S. aureus, which slightly perturbed the cell signaling and invoked corresponding responses like vasoconstriction, indicating limited perturbations and immunological evading. Additionally, the significant up-regulations of bta-mir-223 and bta-mir-21-3p were observed in the quarters infected by high concentration of S. aureus. Network analysis suggested that the two miRNAs' pivotal roles in defending hosts against bacterial infection probably through inhibiting CXCL14 and KIT. The significant down-regulation of CXCL14 was also observed in bovine mammary epithelial cells at 24 h post-infection of S. aureus (108 cfu/mL) in vitro. Integrated analysis with QTL database further suggested 28 genes (e.g., CXCL14, KIT, and SLC4A11) as candidates of bovine mastitis. This study first systematically revealed transcriptional and post-transcriptional responses of bovine mammary gland cells to invading S. aureus in a dosage-dependent pattern, and highlighted a complicated responsive mechanism in a network of miRNA-gene-pathway interplay.
Collapse
Affiliation(s)
- Lingzhao Fang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural UniversityBeijing, China; Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus UniversityTjele, Denmark
| | - Yali Hou
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences Beijing, China
| | - Jing An
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| | - Bingjie Li
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University Tjele, Denmark
| | - Minyan Song
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| | - Xiao Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural UniversityBeijing, China; Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus UniversityTjele, Denmark
| | - Peter Sørensen
- Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University Tjele, Denmark
| | - Yichun Dong
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| | - Chao Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| | - Huabin Zhu
- Department of Animal Biotechnology and Reproduction, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences Beijing, China
| | - Shengli Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| | - Ying Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University Beijing, China
| |
Collapse
|
50
|
Covariance Association Test (CVAT) Identifies Genetic Markers Associated with Schizophrenia in Functionally Associated Biological Processes. Genetics 2016; 203:1901-13. [PMID: 27317683 DOI: 10.1534/genetics.116.189498] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Accepted: 06/09/2016] [Indexed: 12/12/2022] Open
Abstract
Schizophrenia is a psychiatric disorder with large personal and social costs, and understanding the genetic etiology is important. Such knowledge can be obtained by testing the association between a disease phenotype and individual genetic markers; however, such single-marker methods have limited power to detect genetic markers with small effects. Instead, aggregating genetic markers based on biological information might increase the power to identify sets of genetic markers of etiological significance. Several set test methods have been proposed: Here we propose a new set test derived from genomic best linear unbiased prediction (GBLUP), the covariance association test (CVAT). We compared the performance of CVAT to other commonly used set tests. The comparison was conducted using a simulated study population having the same genetic parameters as for schizophrenia. We found that CVAT was among the top performers. When extending CVAT to utilize a mixture of SNP effects, we found an increase in power to detect the causal sets. Applying the methods to a Danish schizophrenia case-control data set, we found genomic evidence for association of schizophrenia with vitamin A metabolism and immunological responses, which previously have been implicated with schizophrenia based on experimental and observational studies.
Collapse
|