1
|
Li Q, Bian J, Qian Y, Kossinna P, Gau C, Gordon PMK, Zhou X, Guo X, Yan J, Wu J, Long Q. An expression-directed linear mixed model discovering low-effect genetic variants. Genetics 2024; 226:iyae018. [PMID: 38314848 DOI: 10.1093/genetics/iyae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 11/29/2023] [Accepted: 01/05/2024] [Indexed: 02/07/2024] Open
Abstract
Detecting genetic variants with low-effect sizes using a moderate sample size is difficult, hindering downstream efforts to learn pathology and estimating heritability. In this work, by utilizing informative weights learned from training genetically predicted gene expression models, we formed an alternative approach to estimate the polygenic term in a linear mixed model. Our linear mixed model estimates the genetic background by incorporating their relevance to gene expression. Our protocol, expression-directed linear mixed model, enables the discovery of subtle signals of low-effect variants using moderate sample size. By applying expression-directed linear mixed model to cohorts of around 5,000 individuals with either binary (WTCCC) or quantitative (NFBC1966) traits, we demonstrated its power gain at the low-effect end of the genetic etiology spectrum. In aggregate, the additional low-effect variants detected by expression-directed linear mixed model substantially improved estimation of missing heritability. Expression-directed linear mixed model moves precision medicine forward by accurately detecting the contribution of low-effect genetic variants to human diseases.
Collapse
Affiliation(s)
- Qing Li
- Department of Biochemistry & Molecular Biology, University of Calgary, Calgary T2N 1N4, Canada
| | - Jiayi Bian
- Department of Mathematics and Statistics, University of Calgary, Calgary T2N 1N4, Canada
| | - Yanzhao Qian
- Department of Mathematics and Statistics, University of Calgary, Calgary T2N 1N4, Canada
| | - Pathum Kossinna
- Department of Biochemistry & Molecular Biology, University of Calgary, Calgary T2N 1N4, Canada
| | - Cooper Gau
- Department of Mathematics and Statistics, University of Calgary, Calgary T2N 1N4, Canada
| | - Paul M K Gordon
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary T2N 1N4, Canada
| | - Xiang Zhou
- School of Public Health, University of Michigan, Ann Arbor 48109, USA
| | - Xingyi Guo
- Department of Medicine & Biomedical Informatics, Vanderbilt University Medical Center, Nashville 37203, USA
| | - Jun Yan
- Physiology and Pharmacology, University of Calgary, Calgary T2N 1N4, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary T2N 1N4, Canada
| | - Jingjing Wu
- Department of Mathematics and Statistics, University of Calgary, Calgary T2N 1N4, Canada
| | - Quan Long
- Department of Biochemistry & Molecular Biology, University of Calgary, Calgary T2N 1N4, Canada
- Department of Mathematics and Statistics, University of Calgary, Calgary T2N 1N4, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary T2N 1N4, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary T2N 1N4, Canada
- Department of Medical Genetics, University of Calgary, Calgary T2N 1N4, Canada
| |
Collapse
|
2
|
Feldmann MJ, Pincot DDA, Cole GS, Knapp SJ. Genetic gains underpinning a little-known strawberry Green Revolution. Nat Commun 2024; 15:2468. [PMID: 38504104 PMCID: PMC10951273 DOI: 10.1038/s41467-024-46421-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 02/26/2024] [Indexed: 03/21/2024] Open
Abstract
The annual production of strawberry has increased by one million tonnes in the US and 8.4 million tonnes worldwide since 1960. Here we show that the US expansion was driven by genetic gains from Green Revolution breeding and production advances that increased yields by 2,755%. Using a California population with a century-long breeding history and phenotypes of hybrids observed in coastal California environments, we estimate that breeding has increased fruit yields by 2,974-6,636%, counts by 1,454-3,940%, weights by 228-504%, and firmness by 239-769%. Using genomic prediction approaches, we pinpoint the origin of the Green Revolution to the early 1950s and uncover significant increases in additive genetic variation caused by transgressive segregation and phenotypic diversification. Lastly, we show that the most consequential Green Revolution breeding breakthrough was the introduction of photoperiod-insensitive, PERPETUAL FLOWERING hybrids in the 1970s that doubled yields and drove the dramatic expansion of strawberry production in California.
Collapse
Affiliation(s)
- Mitchell J Feldmann
- Department of Plant Sciences, University of California Davis, One Shields Avenue, Davis, CA, 95616, USA
| | - Dominique D A Pincot
- Department of Plant Sciences, University of California Davis, One Shields Avenue, Davis, CA, 95616, USA
| | - Glenn S Cole
- Department of Plant Sciences, University of California Davis, One Shields Avenue, Davis, CA, 95616, USA
| | - Steven J Knapp
- Department of Plant Sciences, University of California Davis, One Shields Avenue, Davis, CA, 95616, USA.
| |
Collapse
|
3
|
Bilton TP, Sharma SK, Schofield MR, Black MA, Jacobs JME, Bryan GJ, Dodds KG. Construction of relatedness matrices in autopolyploid populations using low-depth high-throughput sequencing data. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:64. [PMID: 38430392 PMCID: PMC10908621 DOI: 10.1007/s00122-024-04568-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 01/30/2024] [Indexed: 03/03/2024]
Abstract
KEY MESSAGE An improved estimator of genomic relatedness using low-depth high-throughput sequencing data for autopolyploids is developed. Its outputs strongly correlate with SNP array-based estimates and are available in the package GUSrelate. High-throughput sequencing (HTS) methods have reduced sequencing costs and resources compared to array-based tools, facilitating the investigation of many non-model polyploid species. One important quantity that can be computed from HTS data is the genetic relatedness between all individuals in a population. However, HTS data are often messy, with multiple sources of errors (i.e. sequencing errors or missing parental alleles) which, if not accounted for, can lead to bias in genomic relatedness estimates. We derive a new estimator for constructing a genomic relationship matrix (GRM) from HTS data for autopolyploid species that accounts for errors associated with low sequencing depths, implemented in the R package GUSrelate. Simulations revealed that GUSrelate performed similarly to existing GRM methods at high depth but reduced bias in self-relatedness estimates when the sequencing depth was low. Using a panel consisting of 351 tetraploid potato genotypes, we found that GUSrelate produced GRMs from genotyping-by-sequencing (GBS) data that were highly correlated with a GRM computed from SNP array data, and less biased than existing methods when benchmarking against the array-based GRM estimates. GUSrelate provides researchers with a tool to reliably construct GRMs from low-depth HTS data.
Collapse
Affiliation(s)
- Timothy P Bilton
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand.
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
| | - Sanjeev Kumar Sharma
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Matthew R Schofield
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand
| | - Michael A Black
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | | | - Glenn J Bryan
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Ken G Dodds
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| |
Collapse
|
4
|
Lange A, Medugorac I, Ali A, Kessler B, Kurome M, Zakhartchenko V, Hammer SE, Hauser A, Denner J, Dobenecker B, Wess G, Tan PLJ, Garkavenko O, Reichart B, Wolf E, Kemter E. Genetic diversity, growth and heart function of Auckland Island pigs, a potential source for organ xenotransplantation. Xenotransplantation 2024; 31:e12858. [PMID: 38646921 DOI: 10.1111/xen.12858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 02/23/2024] [Accepted: 03/28/2024] [Indexed: 04/23/2024]
Abstract
One of the prerequisites for successful organ xenotransplantation is a reasonable size match between the porcine organ and the recipient's organ to be replaced. Therefore, the selection of a suitable genetic background of source pigs is important. In this study, we investigated body and organ growth, cardiac function, and genetic diversity of a colony of Auckland Island pigs established at the Center for Innovative Medical Models (CiMM), LMU Munich. Male and female Auckland Island pig kidney cells (selected to be free of porcine endogenous retrovirus C) were imported from New Zealand, and founder animals were established by somatic cell nuclear transfer (SCNT). Morphologically, Auckland Island pigs have smaller body stature compared to many domestic pig breeds, rendering their organ dimensions well-suited for human transplantation. Furthermore, echocardiography assessments of Auckland Island pig hearts indicated normal structure and functioning across various age groups throughout the study. Single nucleotide polymorphism (SNP) analysis revealed higher runs of homozygosity (ROH) in Auckland Island pigs compared to other domestic pig breeds and demonstrated that the entire locus coding the swine leukocyte antigens (SLAs) was homozygous. Based on these findings, Auckland Island pigs represent a promising genetic background for organ xenotransplantation.
Collapse
Affiliation(s)
- Andreas Lange
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Ivica Medugorac
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Munich, Germany
| | - Asghar Ali
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Barbara Kessler
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Mayuko Kurome
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Valeri Zakhartchenko
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Sabine E Hammer
- Institute of Immunology, Department of Pathobiology, University of Veterinary Medicine, Vienna, Austria
| | - Andreas Hauser
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
| | - Joachim Denner
- Institute of Virology, Free University of Berlin, Berlin, Germany
| | - Britta Dobenecker
- Chair for Animal Nutrition, Department of Veterinary Sciences, LMU Munich, Munich, Germany
| | - Gerhard Wess
- Clinic of Small Animal Medicine, Center for Clinical Veterinary Medicine, LMU Munich, Munich, Germany
| | | | | | - Bruno Reichart
- Walter-Brendel-Center for Experimental Medicine, LMU Munich, Munich, Germany
| | - Eckhard Wolf
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| | - Elisabeth Kemter
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany
- Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| |
Collapse
|
5
|
Fraimout A, Guillaume F, Li Z, Sillanpää MJ, Rastas P, Merilä J. Dissecting the genetic architecture of quantitative traits using genome-wide identity-by-descent sharing. Mol Ecol 2024; 33:e17299. [PMID: 38380534 DOI: 10.1111/mec.17299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 01/08/2024] [Accepted: 01/22/2024] [Indexed: 02/22/2024]
Abstract
Additive and dominance genetic variances underlying the expression of quantitative traits are important quantities for predicting short-term responses to selection, but they are notoriously challenging to estimate in most non-model wild populations. Specifically, large-sized or panmictic populations may be characterized by low variance in genetic relatedness among individuals which, in turn, can prevent accurate estimation of quantitative genetic parameters. We used estimates of genome-wide identity-by-descent (IBD) sharing from autosomal SNP loci to estimate quantitative genetic parameters for ecologically important traits in nine-spined sticklebacks (Pungitius pungitius) from a large, outbred population. Using empirical and simulated datasets, with varying sample sizes and pedigree complexity, we assessed the performance of different crossing schemes in estimating additive genetic variance and heritability for all traits. We found that low variance in relatedness characteristic of wild outbred populations with high migration rate can impair the estimation of quantitative genetic parameters and bias heritability estimates downwards. On the other hand, the use of a half-sib/full-sib design allowed precise estimation of genetic variance components and revealed significant additive variance and heritability for all measured traits, with negligible dominance contributions. Genome-partitioning and QTL mapping analyses revealed that most traits had a polygenic basis and were controlled by genes at multiple chromosomes. Furthermore, different QTL contributed to variation in the same traits in different populations suggesting heterogeneous underpinnings of parallel evolution at the phenotypic level. Our results provide important guidelines for future studies aimed at estimating adaptive potential in the wild, particularly for those conducted in outbred large-sized populations.
Collapse
Affiliation(s)
- Antoine Fraimout
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Helsinki, Finland
| | - Frédéric Guillaume
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Helsinki, Finland
| | - Zitong Li
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Helsinki, Finland
| | - Mikko J Sillanpää
- Research Unit of Mathematical Sciences, FI-90014 University of Oulu, Oulu, Finland
| | - Pasi Rastas
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Helsinki, Finland
- Institute of Biotechnology, FI-00014 University of Helsinki, Helsinki, Finland
| | - Juha Merilä
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Helsinki, Finland
- Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
6
|
El-Kassaby YA, Cappa EP, Chen C, Ratcliffe B, Porth IM. Efficient genomics-based 'end-to-end' selective tree breeding framework. Heredity (Edinb) 2024; 132:98-105. [PMID: 38172577 PMCID: PMC10844606 DOI: 10.1038/s41437-023-00667-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 12/07/2023] [Accepted: 12/21/2023] [Indexed: 01/05/2024] Open
Abstract
Since their initiation in the 1950s, worldwide selective tree breeding programs followed the recurrent selection scheme of repeated cycles of selection, breeding (mating), and testing phases and essentially remained unchanged to accelerate this process or address environmental contingencies and concerns. Here, we introduce an "end-to-end" selective tree breeding framework that: (1) leverages strategically preselected GWAS-based sequence data capturing trait architecture information, (2) generates unprecedented resolution of genealogical relationships among tested individuals, and (3) leads to the elimination of the breeding phase through the utilization of readily available wind-pollinated (OP) families. Individuals' breeding values generated from multi-trait multi-site analysis were also used in an optimum contribution selection protocol to effectively manage genetic gain/co-ancestry trade-offs and traits' correlated response to selection. The proof-of-concept study involved a 40-year-old spruce OP testing population growing on three sites in British Columbia, Canada, clearly demonstrating our method's superiority in capturing most of the available genetic gains in a substantially reduced timeline relative to the traditional approach. The proposed framework is expected to increase the efficiency of existing selective breeding programs, accelerate the start of new programs for ecologically and environmentally important tree species, and address climate-change caused biotic and abiotic stress concerns more effectively.
Collapse
Affiliation(s)
- Yousry A El-Kassaby
- Faculty of Forestry, The University of British Columbia, Vancouver, BC, Canada.
| | - Eduardo P Cappa
- Instituto Nacional de Tecnología Agropecuaria (INTA), Instituto de Recursos Biológicos, Centro de Investigación en Recursos Naturales, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Charles Chen
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Oklahoma, OK, USA
| | - Blaise Ratcliffe
- Faculty of Forestry, The University of British Columbia, Vancouver, BC, Canada
| | - Ilga M Porth
- Department of Wood and Forest Sciences, Université Laval, Quebec, QC, Canada
| |
Collapse
|
7
|
Freudiger A, Jovanovic VM, Huang Y, Snyder-Mackler N, Conrad DF, Miller B, Montague MJ, Westphal H, Stadler PF, Bley S, Horvath JE, Brent LJN, Platt ML, Ruiz-Lambides A, Tung J, Nowick K, Ringbauer H, Widdig A. Taking identity-by-descent analysis into the wild: Estimating realized relatedness in free-ranging macaques. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.09.574911. [PMID: 38260273 PMCID: PMC10802400 DOI: 10.1101/2024.01.09.574911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Biological relatedness is a key consideration in studies of behavior, population structure, and trait evolution. Except for parent-offspring dyads, pedigrees capture relatedness imperfectly. The number and length of DNA segments that are identical-by-descent (IBD) yield the most precise estimates of relatedness. Here, we leverage novel methods for estimating locus-specific IBD from low coverage whole genome resequencing data to demonstrate the feasibility and value of resolving fine-scaled gradients of relatedness in free-living animals. Using primarily 4-6× coverage data from a rhesus macaque (Macaca mulatta) population with available long-term pedigree data, we show that we can call the number and length of IBD segments across the genome with high accuracy even at 0.5× coverage. The resulting estimates demonstrate substantial variation in genetic relatedness within kin classes, leading to overlapping distributions between kin classes. They identify cryptic genetic relatives that are not represented in the pedigree and reveal elevated recombination rates in females relative to males, which allows us to discriminate maternal and paternal kin using genotype data alone. Our findings represent a breakthrough in the ability to understand the predictors and consequences of genetic relatedness in natural populations, contributing to our understanding of a fundamental component of population structure in the wild.
Collapse
Affiliation(s)
- Annika Freudiger
- Behavioral Ecology Research Group, Faculty of Life Sciences, Institute of Biology, Leipzig University, Leipzig, Germany
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Vladimir M Jovanovic
- Human Biology and Primate Evolution, Institut für Zoologie, Freie Universität Berlin, Berlin, Germany
- Bioinformatics Solution Center, Freie Universität Berlin, Berlin, Germany
| | - Yilei Huang
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Bioinformatics Group, Institute of Computer Science, and Interdisciplinary Center for Bioinformatics, Leipzig University, Leipzig, Germany
| | - Noah Snyder-Mackler
- Center for Evolution & Medicine, School of Life Sciences, Arizona State University, Tempe, USA
| | - Donald F Conrad
- Division of Genetics, Oregon National Primate Research Center, Portland, Oregon, USA
| | - Brian Miller
- Division of Genetics, Oregon National Primate Research Center, Portland, Oregon, USA
| | - Michael J Montague
- Department of Neuroscience, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Hendrikje Westphal
- Behavioral Ecology Research Group, Faculty of Life Sciences, Institute of Biology, Leipzig University, Leipzig, Germany
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Bioinformatics Group, Institute of Computer Science, and Interdisciplinary Center for Bioinformatics, Leipzig University, Leipzig, Germany
| | - Peter F Stadler
- Bioinformatics Group, Institute of Computer Science, and Interdisciplinary Center for Bioinformatics, Leipzig University, Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Austria
- Facultad de Ciencias, Universidad Nacional de Colombia, Bogotá, Colombia
- Santa Fe Institute, Santa Fe, NM, USA
| | - Stefanie Bley
- Behavioral Ecology Research Group, Faculty of Life Sciences, Institute of Biology, Leipzig University, Leipzig, Germany
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Julie E Horvath
- Department of Biological and Biomedical Sciences, North Carolina Central University, North Carolina, Durham, USA
- Research and Collections Section, North Carolina Museum of Natural Sciences, North Carolina, Raleigh, USA
- Department of Biological Sciences, North Carolina State University, North Carolina, Raleigh, USA
- Department of Evolutionary Anthropology, Duke University, North Carolina, Durham, USA
- Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Lauren J N Brent
- Centre for Research in Animal Behaviour, University of Exeter, Exeter, UK
| | - Michael L Platt
- Department of Neuroscience, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Marketing Department, the Wharton School of Business, University of Pennsylvania, Philadelphia, PA, USA
- Department of Psychology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA
| | - Angelina Ruiz-Lambides
- Cayo Santiago Field Station, Caribbean Primate Research Center, University of Puerto Rico, Punta Santiago, Puerto Rico
| | - Jenny Tung
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Department of Evolutionary Anthropology, Duke University, North Carolina, Durham, USA
- Department of Biology, Duke University, Durham, North Carolina, USA
- Duke University Population Research Institute, Durham, North Carolina, USA
| | - Katja Nowick
- Human Biology and Primate Evolution, Institut für Zoologie, Freie Universität Berlin, Berlin, Germany
- Bioinformatics Solution Center, Freie Universität Berlin, Berlin, Germany
| | - Harald Ringbauer
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Anja Widdig
- Behavioral Ecology Research Group, Faculty of Life Sciences, Institute of Biology, Leipzig University, Leipzig, Germany
- Department of Primate Behavior and Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Germany
| |
Collapse
|
8
|
Zhang QX, Liu T, Guo X, Zhen J, Yang MY, Khederzadeh S, Zhou F, Han X, Zheng Q, Jia P, Ding X, He M, Zou X, Liao JK, Zhang H, He J, Zhu X, Lu D, Chen H, Zeng C, Liu F, Zheng HF, Liu S, Xu HM, Chen GB. Searching across-cohort relatives in 54,092 GWAS samples via encrypted genotype regression. PLoS Genet 2024; 20:e1011037. [PMID: 38206971 PMCID: PMC10783776 DOI: 10.1371/journal.pgen.1011037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 12/13/2023] [Indexed: 01/13/2024] Open
Abstract
Explicitly sharing individual level data in genomics studies has many merits comparing to sharing summary statistics, including more strict QCs, common statistical analyses, relative identification and improved statistical power in GWAS, but it is hampered by privacy or ethical constraints. In this study, we developed encG-reg, a regression approach that can detect relatives of various degrees based on encrypted genomic data, which is immune of ethical constraints. The encryption properties of encG-reg are based on the random matrix theory by masking the original genotypic matrix without sacrificing precision of individual-level genotype data. We established a connection between the dimension of a random matrix, which masked genotype matrices, and the required precision of a study for encrypted genotype data. encG-reg has false positive and false negative rates equivalent to sharing original individual level data, and is computationally efficient when searching relatives. We split the UK Biobank into their respective centers, and then encrypted the genotype data. We observed that the relatives estimated using encG-reg was equivalently accurate with the estimation by KING, which is a widely used software but requires original genotype data. In a more complex application, we launched a finely devised multi-center collaboration across 5 research institutes in China, covering 9 cohorts of 54,092 GWAS samples. encG-reg again identified true relatives existing across the cohorts with even different ethnic backgrounds and genotypic qualities. Our study clearly demonstrates that encrypted genomic data can be used for data sharing without loss of information or data sharing barrier.
Collapse
Affiliation(s)
- Qi-Xin Zhang
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, China
- Center for Reproductive Medicine, Department of Genetic and Genomic Medicine, and Clinical Research Institute, Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College, Hangzhou, Zhejiang, China
| | - Tianzi Liu
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Xinxin Guo
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, China
| | - Jianxin Zhen
- Central Laboratory, Shenzhen Baoan Women’s and Children’s Hospital, Shenzhen, Guangdong, China
| | - Meng-yuan Yang
- Diseases & Population (DaP) Geninfo Lab, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Saber Khederzadeh
- Diseases & Population (DaP) Geninfo Lab, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Fang Zhou
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Xiaotong Han
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
| | - Qiwen Zheng
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Peilin Jia
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Xiaohu Ding
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
| | - Mingguang He
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
- Centre for Eye Research Australia, Royal Victorian Eye and Ear Hospital, Melbourne, Victoria, Australia
- Ophthalmology, Department of Surgery, University of Melbourne, Melbourne, Victoria, Australia
| | - Xin Zou
- State Key Laboratory of CAD & GC, Zhejiang University, Hangzhou, Zhejiang, China
| | - Jia-Kai Liao
- School of Mathematics and Statistics and Research Institute of Mathematical Sciences (RIMS), Jiangsu Provincial Key Laboratory of Educational Big Data Science and Engineering, Jiangsu Normal University, Xuzhou, Jiangsu, China
- Ningbo Institute of Life and Health Industry, University of Chinese Academy of Sciences, Ningbo, Zhejiang, China
| | - Hongxin Zhang
- State Key Laboratory of CAD & GC, Zhejiang University, Hangzhou, Zhejiang, China
| | - Ji He
- Department of Neurology, Peking University Third Hospital, Beijing, China
| | - Xiaofeng Zhu
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Daru Lu
- State Key Laboratory of Genetic Engineering and MOE Engineering Research Center of Gene Technology, School of Life Sciences and Zhongshan Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Birth Defects and Reproductive Health, Chongqing Population and Family Planning Science and Technology Research Institute, Chongqing, China
| | - Hongyan Chen
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Changqing Zeng
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- Henan Academy of Sciences, Zhengzhou, Henan, China
| | - Fan Liu
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- Department of Forensic Sciences, College of Criminal Justice, Naif Arab University of Security Sciences, Riyadh, Kingdom of Saudi Arabia
| | - Hou-Feng Zheng
- Diseases & Population (DaP) Geninfo Lab, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Siyang Liu
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, China
| | - Hai-Ming Xu
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, China
| | - Guo-Bo Chen
- Center for Reproductive Medicine, Department of Genetic and Genomic Medicine, and Clinical Research Institute, Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College, Hangzhou, Zhejiang, China
- Key Laboratory of Endocrine Gland Diseases of Zhejiang Province, Hangzhou, Zhejiang, China
| |
Collapse
|
9
|
Arias KD, Gutiérrez JP, Fernández I, Álvarez I, Goyache F. Approaching autozygosity in a small pedigree of Gochu Asturcelta pigs. Genet Sel Evol 2023; 55:74. [PMID: 37880572 PMCID: PMC10601182 DOI: 10.1186/s12711-023-00846-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 10/03/2023] [Indexed: 10/27/2023] Open
Abstract
BACKGROUND In spite of the availability of single nucleotide polymorphism (SNP) array data, differentiation between observed homozygosity and that caused by mating between relatives (autozygosity) introduces major difficulties. Homozygosity estimators show large variation due to different causes, namely, Mendelian sampling, population structure, and differences among chromosomes. Therefore, the ascertainment of how inbreeding is reflected in the genome is still an issue. The aim of this research was to study the usefulness of genomic information for the assessment of genetic diversity in the highly endangered Gochu Asturcelta pig breed. Pedigree depth varied from 0 (founders) to 4 equivalent discrete generations (t). Four homozygosity parameters (runs of homozygosity, FROH; heterozygosity-rich regions, FHRR; Li and Horvitz's, FLH; and Yang and colleague's FYAN) were computed for each individual, adjusted for the variability in the base population (BP; six individuals) and further jackknifed over autosomes. Individual increases in homozygosity (depending on t) and increases in pairwise homozygosity (i.e., increase in the parents' mean) were computed for each individual in the pedigree, and effective population size (Ne) was computed for five subpopulations (cohorts). Genealogical parameters (individual inbreeding, individual increase in inbreeding, and Ne) were used for comparisons. RESULTS The mean F was 0.120 ± 0.074 and the mean BP-adjusted homozygosity ranged from 0.099 ± 0.081 (FLH) to 0.152 ± 0.075 (FYAN). After jackknifing, the mean values were slightly lower. The increase in pairwise homozygosity tended to be twofold higher than the corresponding individual increase in homozygosity values. When compared with genealogical estimates, estimates of Ne obtained using FYAN tended to have low root-mean-squared errors. However, Ne estimates based on increases in pairwise homozygosity using both FROH and FHRR estimates of genomic inbreeding had lower root-mean-squared errors. CONCLUSIONS Parameters characterizing homozygosity may not accurately depict losses of variability in small populations in which breeding policy prohibits matings between close relatives. After BP adjustment, the performance of FROH and FHRR was highly consistent. Assuming that an increase in homozygosity depends only on pedigree depth can lead to underestimating it in populations with shallow pedigrees. An increase in pairwise homozygosity computed from either FROH or FHRR is a promising approach for characterizing autozygosity.
Collapse
Affiliation(s)
- Katherine D Arias
- Área de Genética y Reproducción Animal, SERIDA-Deva, Camino de Rioseco 1225, 33394, Gijón, Spain
| | - Juan Pablo Gutiérrez
- Departamento de Producción Animal, Universidad Complutense de Madrid, Avda. Puerta de Hierro S/N, 28040, Madrid, Spain
| | - Iván Fernández
- Área de Genética y Reproducción Animal, SERIDA-Deva, Camino de Rioseco 1225, 33394, Gijón, Spain
| | - Isabel Álvarez
- Área de Genética y Reproducción Animal, SERIDA-Deva, Camino de Rioseco 1225, 33394, Gijón, Spain
| | - Félix Goyache
- Área de Genética y Reproducción Animal, SERIDA-Deva, Camino de Rioseco 1225, 33394, Gijón, Spain.
| |
Collapse
|
10
|
Kim EH, Kang HC, Myung CH, Kim JY, Sun DW, Lee DH, Lee SH, Lim HT. Comparison on genomic prediction using pedigree BLUP and single step GBLUP through the Hanwoo full-sib family. Anim Biosci 2023; 36:1327-1335. [PMID: 37170517 PMCID: PMC10472147 DOI: 10.5713/ab.22.0327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 01/03/2023] [Accepted: 03/16/2023] [Indexed: 05/13/2023] Open
Abstract
OBJECTIVE When evaluating individuals with the same parent and no phenotype by pedigree best linear unbiased prediction (BLUP), it is difficult to explain carcass grade difference and select individuals because they have the same value in pedigree BLUP (PBLUP). However, single step GBLUP (ssGBLUP), which can estimate the breeding value suitable for the individual by adding genotype, is more accurate than the existing method. METHODS The breeding value and accuracy were estimated with pedigree BLUP and ssGBLUP using pedigree and genotype of 408 Hanwoo cattle from 16 families with the same parent among siblings produced by fertilized egg transplantation. A total of 14,225 Hanwoo cattle with pedigree, genotype and phenotype were used as the reference population. PBLUP obtained estimated breeding value (EBV) using the pedigree of the test and reference populations, and ssGBLUP obtained genomic EBV (GEBV) after constructing and H-matrix by integrating the pedigree and genotype of the test and reference populations. RESULTS For all traits, the accuracy of GEBV using ssGBLUP is 0.18 to 0.20 higher than the accuracy of EBV obtained with PBLUP. Comparison of EBV and GEBV of individuals without phenotype, since the value of EBV is estimated based on expected values of alleles passed down from common ancestors. It does not take Mendelian sampling into consideration, so the EBV of all individuals within the same family is estimated to be the same value. However, GEBV makes estimating true kinship coefficient based on different genotypes of individuals possible, so GEBV that corresponds to each individual is estimated rather than a uniform GEBV for each individual. CONCLUSION Since Hanwoo cows bred through embryo transfer have a high possibility of having the same parent, if ssGBLUP after adding genotype is used, estimating true kinship coefficient corresponding to each individual becomes possible, allowing for more accurate estimation of breeding value.
Collapse
Affiliation(s)
- Eun-Ho Kim
- Department of Animal Science, Gyeongsang National University, Jinju 52828,
Korea
| | - Ho-Chan Kang
- Department of Animal Science and Biotechnology, Gyeongsang National University, Jinju 52828,
Korea
| | - Cheol-Hyun Myung
- Department of Animal Science, Gyeongsang National University, Jinju 52828,
Korea
| | - Ji-Yeong Kim
- Department of Animal Science, Gyeongsang National University, Jinju 52828,
Korea
| | - Du-Won Sun
- Gyeongnam Animal Science and Technology, Gyeongsang National University, Jinju 52828,
Korea
| | - Doo-Ho Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134,
Korea
| | - Seung-Hwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon 34134,
Korea
| | - Hyun-Tae Lim
- Department of Animal Science, Gyeongsang National University, Jinju 52828,
Korea
- Department of Animal Science and Biotechnology, Gyeongsang National University, Jinju 52828,
Korea
| |
Collapse
|
11
|
Labroo MR, Endelman JB, Gemenet DC, Werner CR, Gaynor RC, Covarrubias-Pazaran GE. Clonal diploid and autopolyploid breeding strategies to harness heterosis: insights from stochastic simulation. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:147. [PMID: 37291402 DOI: 10.1007/s00122-023-04377-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 05/05/2023] [Indexed: 06/10/2023]
Abstract
KEY MESSAGE Reciprocal recurrent selection sometimes increases genetic gain per unit cost in clonal diploids with heterosis due to dominance, but it typically does not benefit autopolyploids. Breeding can change the dominance as well as additive genetic value of populations, thus utilizing heterosis. A common hybrid breeding strategy is reciprocal recurrent selection (RRS), in which parents of hybrids are typically recycled within pools based on general combining ability. However, the relative performances of RRS and other breeding strategies have not been thoroughly compared. RRS can have relatively increased costs and longer cycle lengths, but these are sometimes outweighed by its ability to harness heterosis due to dominance. Here, we used stochastic simulation to compare genetic gain per unit cost of RRS, terminal crossing, recurrent selection on breeding value, and recurrent selection on cross performance considering different amounts of population heterosis due to dominance, relative cycle lengths, time horizons, estimation methods, selection intensities, and ploidy levels. In diploids with phenotypic selection at high intensity, whether RRS was the optimal breeding strategy depended on the initial population heterosis. However, in diploids with rapid-cycling genomic selection at high intensity, RRS was the optimal breeding strategy after 50 years over almost all amounts of initial population heterosis under the study assumptions. Diploid RRS required more population heterosis to outperform other strategies as its relative cycle length increased and as selection intensity and time horizon decreased. The optimal strategy depended on selection intensity, a proxy for inbreeding rate. Use of diploid fully inbred parents vs. outbred parents with RRS typically did not affect genetic gain. In autopolyploids, RRS typically did not outperform one-pool strategies regardless of the initial population heterosis.
Collapse
Affiliation(s)
- Marlee R Labroo
- Excellence in Breeding Platform, Consultative Group of International Agricultural Research, Texcoco, Mexico
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | - Jeffrey B Endelman
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Dorcus C Gemenet
- Excellence in Breeding Platform, Consultative Group of International Agricultural Research, Texcoco, Mexico
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | - Christian R Werner
- Excellence in Breeding Platform, Consultative Group of International Agricultural Research, Texcoco, Mexico
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| | | | - Giovanny E Covarrubias-Pazaran
- Excellence in Breeding Platform, Consultative Group of International Agricultural Research, Texcoco, Mexico.
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico.
| |
Collapse
|
12
|
Oliveira TP, Obšteter J, Pocrnic I, Heslot N, Gorjanc G. A method for partitioning trends in genetic mean and variance to understand breeding practices. Genet Sel Evol 2023; 55:36. [PMID: 37268883 DOI: 10.1186/s12711-023-00804-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 04/17/2023] [Indexed: 06/04/2023] Open
Abstract
BACKGROUND In breeding programmes, the observed genetic change is a sum of the contributions of different selection paths represented by groups of individuals. Quantifying these sources of genetic change is essential for identifying the key breeding actions and optimizing breeding programmes. However, it is difficult to disentangle the contribution of individual paths due to the inherent complexity of breeding programmes. Here we extend the previously developed method for partitioning genetic mean by paths of selection to work both with the mean and variance of breeding values. METHODS First, we extended the partitioning method to quantify the contribution of different paths to genetic variance assuming that the breeding values are known. Second, we combined the partitioning method with the Markov Chain Monte Carlo approach to draw samples from the posterior distribution of breeding values and use these samples for computing the point and interval estimates of partitions for the genetic mean and variance. We implemented the method in the R package AlphaPart. We demonstrated the method with a simulated cattle breeding programme. RESULTS We show how to quantify the contribution of different groups of individuals to genetic mean and variance and that the contributions of different selection paths to genetic variance are not necessarily independent. Finally, we observed that the partitioning method under the pedigree-based model has some limitations, which suggests the need for a genomic extension. CONCLUSIONS We presented a partitioning method to quantify sources of change in genetic mean and variance in breeding programmes. The method can help breeders and researchers understand the dynamics in genetic mean and variance in a breeding programme. The developed method for partitioning genetic mean and variance is a powerful method for understanding how different selection paths interact within a breeding programme and how they can be optimised.
Collapse
Affiliation(s)
- Thiago P Oliveira
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, UK.
| | - Jana Obšteter
- Agricultural Institute of Slovenia, Ljubljana, Slovenia
| | - Ivan Pocrnic
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, UK
| | | | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
13
|
Abstract
Prior to the development of genome-wide arrays and whole genome sequencing technologies, heritability estimation mainly relied on the study of related individuals. Over the past decade, various approaches have been developed to estimate SNP-based narrow-sense heritability (h SNP 2 ${\rm{h}}_{{\rm{SNP}}}^2$ ) in unrelated individuals. These latter approaches use either individual-level genetic variations or summary results from genome-wide association studies (GWAS). Recently, several studies compared these approaches using extensive simulations and empirical datasets. However, sparse information on hands-on training necessitates revisiting these approaches from the perspective of a stepwise guide for practical applications. Here, we provide an overview of the commonly used SNP-heritability estimation approaches utilizing genome-wide array, imputed or whole genome data from unrelated individuals, or summary results. We not only discuss these approaches based on their statistical concepts, utility, advantages, and limitations, but also provide step-by-step protocols to apply these approaches. For illustration purposes, we estimateh SNP 2 ${\rm{h}}_{{\rm{SNP}}}^2$ of height and BMI utilizing individual-level data from The Northern Finland Birth Cohort (NFBC) and summary results from the Genetic Investigation of ANthropometric Traits (GIANT;) consortium. We present this review as a template for the researchers who estimate and use heritability in their studies and as a reference for geneticists who develop or extend heritability estimation approaches. © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: GREML (GCTA) Alternate Protocol 1: Stratified GREML Basic Protocol 2: LDAK Alternate Protocol 2: Stratified LDAK Basic Protocol 3: Threshold GREML Basic Protocol 4: LD score (LDSC) regression Basic Protocol 5: SumHer.
Collapse
Affiliation(s)
- Amit K. Srivastava
- Division of Human Genetics, Cincinnati Children’s Hospital Medical Center, USA; The Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, USA; March of Dimes Prematurity Research Center Ohio Collaborative, USA; Department of Pediatrics, University of Cincinnati College of Medicine, USA
| | - Scott M. Williams
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, USA; Department of Genetics and Genome Sciences, School of Medicine, Case Western Reserve University, USA; Institute of Computational Biology, Case Western Reserve University, USA
| | - Ge Zhang
- Division of Human Genetics, Cincinnati Children’s Hospital Medical Center, USA; The Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children’s Hospital Medical Center, USA; March of Dimes Prematurity Research Center Ohio Collaborative, USA; Department of Pediatrics, University of Cincinnati College of Medicine, USA
| |
Collapse
|
14
|
Liu S, Yao T, Chen D, Xiao S, Chen L, Zhang Z. Genomic prediction in pigs using data from a commercial crossbred population: insights from the Duroc x (Landrace x Yorkshire) three-way crossbreeding system. Genet Sel Evol 2023; 55:21. [PMID: 36977978 PMCID: PMC10053053 DOI: 10.1186/s12711-023-00794-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 03/20/2023] [Indexed: 03/30/2023] Open
Abstract
BACKGROUND Genomic selection is widely applied for genetic improvement in livestock crossbreeding systems to select excellent nucleus purebred (PB) animals and to improve the performance of commercial crossbred (CB) animals. Most current predictions are based solely on PB performance. Our objective was to explore the potential application of genomic selection of PB animals using genotypes of CB animals with extreme phenotypes in a three-way crossbreeding system as the reference population. Using real genotyped PB as ancestors, we simulated the production of 100,000 pigs for a Duroc x (Landrace x Yorkshire) DLY crossbreeding system. The predictive performance of breeding values of PB animals for CB performance using genotypes and phenotypes of (1) PB animals, (2) DLY animals with extreme phenotypes, and (3) random DLY animals for traits of different heritabilities ([Formula: see text] = 0.1, 0.3, and 0.5) was compared across different reference population sizes (500 to 6500) and prediction models (genomic best linear unbiased prediction (GBLUP) and Bayesian sparse linear mixed model (BSLMM)). RESULTS Using a reference population consisting of CB animals with extreme phenotypes showed a definite predictive advantage for medium- and low-heritability traits and, in combination with the BSLMM model, significantly improved selection response for CB performance. For high-heritability traits, the predictive performance of a reference population of extreme CB phenotypes was comparable to that of PB phenotypes when the effect of the genetic correlation between PB and CB performance ([Formula: see text]) on the accuracy obtained with a PB reference population was considered, and the former could exceed the latter if the reference size was large enough. For the selection of the first and terminal sires in a three-way crossbreeding system, prediction using extreme CB phenotypes outperformed the use of PB phenotypes, while the optimal design of the reference group for the first dam depended on the percentage of individuals from the corresponding breed that the PB reference data comprised and on the heritability of the target trait. CONCLUSIONS A commercial crossbred population is promising for the design of the reference population for genomic prediction, and selective genotyping of CB animals with extreme phenotypes has the potential for maximizing genetic improvement for CB performance in the pig industry.
Collapse
Affiliation(s)
- Siyi Liu
- National Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Tianxiong Yao
- National Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Dong Chen
- National Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Shijun Xiao
- National Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Liqing Chen
- National Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Zhiyan Zhang
- National Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China.
| |
Collapse
|
15
|
Pedigree reconstruction and population structure using SNP markers in Gir cattle. J Appl Genet 2023; 64:329-340. [PMID: 36645582 DOI: 10.1007/s13353-023-00747-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 11/24/2022] [Accepted: 01/03/2023] [Indexed: 01/17/2023]
Abstract
Our objective was to establish a SNPs panel for pedigree reconstruction using microarrays of different densities and evaluate the genomic relationship coefficient of the inferred pedigree, in addition to analyzing the population structure based on genomic analyses in Gir cattle. For parentage analysis and genomic relationship, 16,205 genotyped Gir animals (14,458 females and 1747 males) and 1810 common markers to the four SNP microarrays were used. For population structure analyses, including linkage disequilibrium, effective population size, and runs of homozygosity (ROH), genotypes from 21,656 animals were imputed. Likelihood ratio (LR) approach was used to reconstruct the pedigree, deepening the pedigree and showing it is well established in terms of recent information. Coefficients for each relationship category of the inferred pedigree were adequate. Linkage disequilibrium showed rapid decay. We detected a decrease in the effective population size over the last 50 generations, with the average generation interval around 9.08 years. Higher ROH-based inbreeding coefficient in a class of short ROH segments, with moderate to high values, was also detected, suggesting bottlenecks in the Gir genome. Breeding strategies to minimize inbreeding and avoid massive use of few proven sires with high genetic value are suggested to maintain genetic variability in future generations. In addition, we recommend reducing the generation interval to maximize genetic progress and increase effective population size.
Collapse
|
16
|
Jibrila I, Ten Napel J, Vandenplas J, Bergsma R, Veerkamp RF, Calus MPL. Impact of genomic preselection on subsequent ssGBLUP evaluation of preselected animals for scarcely recorded feed intake in pigs. J Anim Breed Genet 2023; 140:253-263. [PMID: 36637041 DOI: 10.1111/jbg.12754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 12/23/2022] [Indexed: 01/14/2023]
Abstract
We have previously shown that single-step genomic best linear unbiased prediction (ssGBLUP) estimates breeding values of genomically preselected animals without preselection bias for widely recorded traits, that is traits recorded for the majority of animals in the breeding population. This study investigated the impact of genomic preselection (GPS) on accuracy and bias in ssGBLUP evaluation of genomically preselected animals for a scarcely recorded trait, that is a trait recorded for only a small proportion of the animals, which generally has a lower prediction accuracy than widely recorded traits, mainly due to having a much smaller number of phenotypes available. We used data from a commercial pig breeding program, considering feed intake as a scarcely recorded target trait, being available for ~30% of the animals with phenotypes for any trait, and average daily gain, backfat thickness and loin depth as widely recorded predictor traits, being available for >95% of the animals with phenotypes for any trait. The data contained the routine GPS implemented by commercial animal breeding programs, and we retrospectively implemented two scenarios with additional layers of GPS by discarding pedigree, genotypes and phenotypes of animals without progeny. The ssGBLUP evaluation following GPS used records only from the target trait, only from the predictor traits, or both. Accuracy for feed intake did not differ statistically across GPS scenarios, although it tended to decrease with more intense GPS. The accuracy had average values of 0.37, 0.44, and 0.45 across all GPS scenarios when, respectively, records from only the target trait, only the predictor traits, or both were used in the ssGBLUP evaluation. Considerable deflation of the genomic breeding values for feed intake was observed in the most stringent GPS scenario, due to the variance components being underestimated as a result of the limited amount of strongly preselected data. As long as (co)variance components were unbiased, no or only marginal bias was observed. These results for accuracy and bias were observed whether records of the scarcely recorded target trait, of the predictor traits, or both were used in the ssGBLUP evaluation. Our results show that for the scarcely recorded feed intake in pigs, ssGBLUP is able to estimate breeding values of preselected animals without preselection bias, similarly as previously observed for widely recorded traits.
Collapse
Affiliation(s)
- Ibrahim Jibrila
- Wageningen University and Research Animal Breeding and Genomics, Wageningen, the Netherlands
| | - Jan Ten Napel
- Wageningen University and Research Animal Breeding and Genomics, Wageningen, the Netherlands
| | - Jeremie Vandenplas
- Wageningen University and Research Animal Breeding and Genomics, Wageningen, the Netherlands
| | - Rob Bergsma
- Topigs Norsvin Research Center B.V., Beuningen, the Netherlands
| | - Roel F Veerkamp
- Wageningen University and Research Animal Breeding and Genomics, Wageningen, the Netherlands
| | - Mario P L Calus
- Wageningen University and Research Animal Breeding and Genomics, Wageningen, the Netherlands
| |
Collapse
|
17
|
Jiang W, Zhang X, Li S, Song S, Zhao H. An unbiased kinship estimation method for genetic data analysis. BMC Bioinformatics 2022; 23:525. [PMID: 36474154 PMCID: PMC9727941 DOI: 10.1186/s12859-022-05082-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 11/25/2022] [Indexed: 12/13/2022] Open
Abstract
Accurate estimate of relatedness is important for genetic data analyses, such as heritability estimation and association mapping based on data collected from genome-wide association studies. Inaccurate relatedness estimates may lead to biased heritability estimations and spurious associations. Individual-level genotype data are often used to estimate kinship coefficient between individuals. The commonly used sample correlation-based genomic relationship matrix (scGRM) method estimates kinship coefficient by calculating the average sample correlation coefficient among all single nucleotide polymorphisms (SNPs), where the observed allele frequencies are used to calculate both the expectations and variances of genotypes. Although this method is widely used, a substantial proportion of estimated kinship coefficients are negative, which are difficult to interpret. In this paper, through mathematical derivation, we show that there indeed exists bias in the estimated kinship coefficient using the scGRM method when the observed allele frequencies are regarded as true frequencies. This leads to negative bias for the average estimate of kinship among all individuals, which explains the estimated negative kinship coefficients. Based on this observation, we propose an unbiased estimation method, UKin, which can reduce kinship estimation bias. We justify our improved method with rigorous mathematical proof. We have conducted simulations as well as two real data analyses to compare UKin with scGRM and three other kinship estimating methods: rGRM, tsGRM, and KING. Our results demonstrate that both bias and root mean square error in kinship coefficient estimation could be reduced by using UKin. We further investigated the performance of UKin, KING, and three GRM-based methods in calculating the SNP-based heritability, and show that UKin can improve estimation accuracy for heritability regardless of the scale of SNP panel.
Collapse
Affiliation(s)
- Wei Jiang
- Department of Biostatistics, School of Public Health, Yale University, New Haven, USA
| | - Xiangyu Zhang
- Department of Biostatistics, School of Public Health, Yale University, New Haven, USA
| | - Siting Li
- Department of Biomedical Data Science, Geisel School of Medicine, Dartmouth College, Hanover, USA
| | - Shuang Song
- Center for Statistical Science, Tsinghua University, Beijing, China
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Hongyu Zhao
- Department of Biostatistics, School of Public Health, Yale University, New Haven, USA.
| |
Collapse
|
18
|
Lagler DK, Hannemann E, Eck K, Klawatsch J, Seichter D, Russ I, Mendel C, Lühken G, Krebs S, Blum H, Upadhyay M, Medugorac I. Fine-mapping and identification of candidate causal genes for tail length in the Merinolandschaf breed. Commun Biol 2022; 5:918. [PMID: 36068271 PMCID: PMC9448734 DOI: 10.1038/s42003-022-03854-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 08/16/2022] [Indexed: 12/14/2022] Open
Abstract
Docking the tails of lambs in long-tailed sheep breeds is a common practice worldwide. But this practice is associated with pain. Breeding for a shorter tail could offer an alternative. Therefore, this study aimed to analyze the natural tail length variation in the Merinolandschaf and to identify causal alleles for the short tail phenotype segregating within long-tailed breeds. We used SNP-based association analysis and haplotype-based mapping in 362 genotyped (Illumina OvineSNP50) and phenotyped Merinolandschaf lambs. Genome-wide significant regions were capture sequenced in 48 lambs and comparatively analyzed in various long and short-tailed sheep breeds and wild sheep subspecies. Here we show a SNP located in the first exon of HOXB13 and a SINE element located in the promotor of HOXB13 as promising candidates. These results enable more precise breeding towards shorter tails, improve animal welfare by amplification of ancestral alleles and contribute to a better understanding of differential embryonic development. Using SNP-association analysis and genetic mapping, a SNP and an insertion in and close to HOXB13 associated with short tail length is identified in Merino sheep, which may be a target for safely selecting shorter tails and improving sheep welfare.
Collapse
Affiliation(s)
- Dominik Karl Lagler
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152, Martinsried, Germany.,Tierzuchtforschung e.V. München, Senator-Gerauer-Str. 23, 85586, Poing, Germany
| | - Elisabeth Hannemann
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152, Martinsried, Germany
| | - Kim Eck
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152, Martinsried, Germany.,Tierzuchtforschung e.V. München, Senator-Gerauer-Str. 23, 85586, Poing, Germany
| | - Jürgen Klawatsch
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152, Martinsried, Germany.,Tierzuchtforschung e.V. München, Senator-Gerauer-Str. 23, 85586, Poing, Germany
| | - Doris Seichter
- Tierzuchtforschung e.V. München, Senator-Gerauer-Str. 23, 85586, Poing, Germany
| | - Ingolf Russ
- Tierzuchtforschung e.V. München, Senator-Gerauer-Str. 23, 85586, Poing, Germany
| | - Christian Mendel
- Institute for Animal Breeding, Bavarian State Research Center for Agriculture, Prof.-Dürrwaechter-Platz 1, 85586, Poing, Germany
| | - Gesine Lühken
- Institute of Animal Breeding and Genetics, JLU Gießen, Ludwigstr. 21, 35390, Gießen, Germany
| | - Stefan Krebs
- Laboratory for Functional Genome Analysis, Gene Center, Ludwig-Maximilians-University Munich, 80539, Munich, Germany
| | - Helmut Blum
- Laboratory for Functional Genome Analysis, Gene Center, Ludwig-Maximilians-University Munich, 80539, Munich, Germany
| | - Maulik Upadhyay
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152, Martinsried, Germany
| | - Ivica Medugorac
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Lena-Christ-Str. 48, 82152, Martinsried, Germany.
| |
Collapse
|
19
|
Jibrila I, Vandenplas J, Ten Napel J, Bergsma R, Veerkamp RF, Calus MPL. Impact of genomic preselection on subsequent genetic evaluations with ssGBLUP using real data from pigs. Genet Sel Evol 2022; 54:48. [PMID: 35764921 PMCID: PMC9238012 DOI: 10.1186/s12711-022-00727-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 05/02/2022] [Indexed: 11/29/2022] Open
Abstract
Background Empirically assessing the impact of preselection on genetic evaluation of preselected animals requires comparing scenarios that take different approaches into account, including scenarios without preselection. However, preselection is almost always performed in animal breeding programs, so it is difficult to have a dataset without preselection. Hence, most studies on preselection have used simulated datasets, and have concluded that genomic estimated breeding values (GEBV) from subsequent single-step genomic best linear unbiased prediction (ssGBLUP) evaluations are unbiased. The aim of this study was to investigate the impact of genomic preselection (GPS) on accuracy and bias in subsequent ssGBLUP evaluations, using data from a commercial pig breeding program. Methods We used data on average daily gain during performance testing, average daily gain throughout life, backfat thickness, and loin depth from one sire line and one dam line of pigs. As these traits have different weights in the breeding goals of the two lines, we analyzed the lines separately. For each line, we implemented a reference GPS scenario that kept all available data, against which the next two scenarios were compared. We then implemented two other scenarios with additional layers of GPS by removing all animals without progeny either (i) only in the validation generation, or (ii) in all generations. We conducted subsequent ssGBLUP evaluations for each GPS scenario, using all the data remaining after implementing the GPS scenario. Accuracy and bias were computed by comparing GEBV against progeny yield deviations of validation animals. Results Results for all traits and in both lines showed a marginal loss in accuracy due to the additional layers of GPS. Average accuracies across all GPS scenarios in the two lines were 0.39, 0.47, 0.56, and 0.60, for average daily gain during performance testing and throughout life, backfat thickness, and loin depth, respectively. Biases were largely absent, and when present, did not differ greatly between the GPS scenarios. Conclusions We conclude that the impact of preselection on accuracy and bias in subsequent ssGBLUP evaluations of selection candidates in pigs is generally minimal. We expect this conclusion to apply for other animal breeding programs as well, since preselection of any type or intensity generally has the same effect in animal breeding programs. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00727-5.
Collapse
Affiliation(s)
- Ibrahim Jibrila
- Animal Breeding and Genomics Group, Wageningen University & Research, PO Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Jeremie Vandenplas
- Animal Breeding and Genomics Group, Wageningen University & Research, PO Box 338, 6700 AH, Wageningen, The Netherlands
| | - Jan Ten Napel
- Animal Breeding and Genomics Group, Wageningen University & Research, PO Box 338, 6700 AH, Wageningen, The Netherlands
| | - Rob Bergsma
- Topigs Norsvin Research Center B.V., Schoenaker 6, 6641 SZ, Beuningen, The Netherlands
| | - Roel F Veerkamp
- Animal Breeding and Genomics Group, Wageningen University & Research, PO Box 338, 6700 AH, Wageningen, The Netherlands
| | - Mario P L Calus
- Animal Breeding and Genomics Group, Wageningen University & Research, PO Box 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
20
|
Belay TK, Eikje LS, Gjuvsland AB, Nordbø Ø, Tribout T, Meuwissen T. Correcting for base-population differences and unknown parent groups in single-step genomic predictions of Norwegian Red Cattle. J Anim Sci 2022; 100:6618053. [PMID: 35752161 PMCID: PMC9467032 DOI: 10.1093/jas/skac227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 06/23/2022] [Indexed: 11/15/2022] Open
Abstract
Bias and inflation in genomic evaluation with the single-step methods have been reported in several studies. Incompatibility between the base-populations of the pedigree-based and the genomic relationship matrix (G) could be a reason for these biases. Inappropriate ways of accounting for missing parents could be another reason for biases in genetic evaluations with or without genomic information. To handle these problems, we fitted and evaluated a fixed covariate (J) that contains ones for genotyped animals and zeros for unrelated non-genotyped animals, or pedigree-based regression coefficients for related non-genotyped animals. We also evaluated alternative ways of fitting the J covariate together with genetic groups on biases and stability of breeding value estimates, and of including it into G as a random effect. In a whole vs. partial data set comparison, four scenarios were investigated for the partial data: genotypes missing, phenotypes missing, both genotypes and phenotypes missing, and pedigree missing. Fitting J either as fixed or random reduced level-bias and inflation and increased stability of genomic predictions as compared to the basic model where neither J nor genetic groups were fitted. In most models, genomic predictions were largely biased for scenarios with missing genotype and phenotype information. The biases were reduced for models which combined group and J effects. Models with these corrected group covariates performed better than the recently published model where genetic groups were encapsulated and fitted as random via the Quaas and Pollak transformation. In our Norwegian Red cattle data, a model which combined group and J regression coefficients was preferred because it showed least bias and highest stability of genomic predictions across the scenarios.
Collapse
Affiliation(s)
- Tesfaye K Belay
- Department of animal and aquacultural Sciences, Norwegian University of Life Sciences, NMBU, Norway
| | | | | | | | - Thierry Tribout
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, JouyenJosas, France
| | - Theo Meuwissen
- Department of animal and aquacultural Sciences, Norwegian University of Life Sciences, NMBU, Norway
| |
Collapse
|
21
|
Chasing genetic correlation breakers to stimulate population resilience to climate change. Sci Rep 2022; 12:8238. [PMID: 35581288 PMCID: PMC9114142 DOI: 10.1038/s41598-022-12320-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 05/09/2022] [Indexed: 11/29/2022] Open
Abstract
Global climate change introduces new combinations of environmental conditions, which is expected to increase stress on plants. This could affect many traits in multiple ways that are as yet unknown but will likely require the modification of existing genetic relationships among functional traits potentially involved in local adaptation. Theoretical evolutionary studies have determined that it is an advantage to have an excess of recombination events under heterogeneous environmental conditions. Our study, conducted on a population of radiata pine (Pinus radiata D. Don), was able to identify individuals that show high genetic recombination at genomic regions, which potentially include pleiotropic or collocating QTLs responsible for the studied traits, reaching a prediction accuracy of 0.80 in random cross-validation and 0.72 when whole family was removed from the training population and predicted. To identify these highly recombined individuals, a training population was constructed from correlation breakers, created through tandem selection of parents in the previous generation and their consequent mating. Although the correlation breakers showed lower observed heterogeneity possibly due to direct selection in both studied traits, the genomic regions with statistically significant differences in the linkage disequilibrium pattern showed higher level of heretozygosity, which has the effect of decomposing unfavourable genetic correlation. We propose undertaking selection of correlation breakers under current environmental conditions and using genomic predictions to increase the frequency of these ’recombined’ individuals in future plantations, ensuring the resilience of planted forests to changing climates. The increased frequency of such individuals will decrease the strength of the population-level genetic correlations among traits, increasing the opportunity for new trait combinations to be developed in the future.
Collapse
|
22
|
Fan C, Mancuso N, Chiang CW. A genealogical estimate of genetic relationships. Am J Hum Genet 2022; 109:812-824. [PMID: 35417677 DOI: 10.1016/j.ajhg.2022.03.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 03/25/2022] [Indexed: 12/23/2022] Open
Abstract
The application of genetic relationships among individuals, characterized by a genetic relationship matrix (GRM), has far-reaching effects in human genetics. However, the current standard to calculate the GRM treats linked markers as independent and does not explicitly model the underlying genealogical history of the study sample. Here, we propose a coalescent-informed framework, namely the expected GRM (eGRM), to infer the expected relatedness between pairs of individuals given an ancestral recombination graph (ARG) of the sample. Through extensive simulations, we show that the eGRM is an unbiased estimate of latent pairwise genome-wide relatedness and is robust when computed with ARG inferred from incomplete genetic data. As a result, the eGRM better captures the structure of a population than the canonical GRM, even when using the same genetic information. More importantly, our framework allows a principled approach to estimate the eGRM at different time depths of the ARG, thereby revealing the time-varying nature of population structure in a sample. When applied to SNP array genotypes from a population sample from Northern and Eastern Finland, we find that clustering analysis with the eGRM reveals population structure driven by subpopulations that would not be apparent via the canonical GRM and that temporally the population model is consistent with recent divergence and expansion. Taken together, our proposed eGRM provides a robust tree-centric estimate of relatedness with wide application to genetic studies.
Collapse
|
23
|
Zhang L, Sun L. Linear Mixed-Effect Models Through the Lens of Hardy–Weinberg Disequilibrium. Front Genet 2022; 13:856872. [PMID: 35495131 PMCID: PMC9039721 DOI: 10.3389/fgene.2022.856872] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 03/09/2022] [Indexed: 11/13/2022] Open
Abstract
For genetic association studies with related individuals, the linear mixed-effect model is the most commonly used method. In this report, we show that contrary to the popular belief, this standard method can be sensitive to departure from Hardy–Weinberg equilibrium (i.e., Hardy–Weinberg disequilibrium) at the causal SNPs in two ways. First, when the trait heritability is treated as a nuisance parameter, although the association test has correct type I error control, the resulting heritability estimate can be biased, often upward, in the presence of Hardy–Weinberg disequilibrium. Second, if the true heritability is used in the linear mixed-effect model, then the corresponding association test can be biased in the presence of Hardy–Weinberg disequilibrium. We provide some analytical insights along with supporting empirical results from simulation and application studies.
Collapse
Affiliation(s)
- Lin Zhang
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
| | - Lei Sun
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- *Correspondence: Lei Sun ,
| |
Collapse
|
24
|
Bermann M, Cesarani A, Misztal I, Lourenco D. Past, present, and future developments in single-step genomic models. ITALIAN JOURNAL OF ANIMAL SCIENCE 2022. [DOI: 10.1080/1828051x.2022.2053366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Matias Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Alberto Cesarani
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
- Dipartimento di Agraria, Università degli Studi di Sassari, Sassari, Italy
| | - Ignacy Misztal
- Dipartimento di Agraria, Università degli Studi di Sassari, Sassari, Italy
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| |
Collapse
|
25
|
Tan X, Liu R, Li W, Zheng M, Zhu D, Liu D, Feng F, Li Q, Liu L, Wen J, Zhao G. Assessment the effect of genomic selection and detection of selective signature in broilers. Poult Sci 2022; 101:101856. [PMID: 35413593 PMCID: PMC9018145 DOI: 10.1016/j.psj.2022.101856] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 02/01/2022] [Accepted: 02/28/2022] [Indexed: 12/02/2022] Open
Abstract
Due to high selection advances and shortened generation interval, genomic selection (GS) is now an effective animal breeding scheme. In broilers, many studies have compared the accuracy of different GS prediction methods, but few reports have demonstrated phenotypic or genetic changes using GS. In this study, the paternal chicken line B underwent continuous selection for 3 generations. The chicken 55 k SNP chip was used to estimate the genetic parameters and detect genomic response regions by selective sweep analysis. The heritability for body weight (BW), meat production, and abdominal fat traits were ranged from 0.12 to 0.38. A high genetic correlation was found between BW and meat production traits, while a low genetic correlation (<0.1) was found between meat production and abdominal fat traits. Selection resulted in an increase of about 516 g in BW and 140 g in breast muscle weight. Percentage of breast muscle and whole thigh were increased 0.8 to 1.5%. No change was observed in abdominal fat percentage. The genomic estimated breeding value advances was positive for BW and meat production (except whole thigh percentage), while negative for abdominal fat percentage. By selective sweep analysis, 39 common chromosomal regions and 102 protein coding genes were found to be influenced, including MYH1A, MYH1B, and MYH1D of the MYH gene family. Tight junction pathway as well as myosin complex related terms were enriched. This study demonstrates the effective use of GS for improvements in BW and meat production in chicken line B. Further, genomic regions, responsive to intensive genetic selection, were identified to contain genes of the MYH family.
Collapse
|
26
|
Abstract
Traditional tree improvement is cumbersome and costly. Our main objective was to assess the extent to which genomic data can currently accelerate and improve decision making in this field. We used diameter at breast height (DBH) and wood density (WD) data for 4430 tree genotypes and single-nucleotide polymorphism (SNP) data for 2446 tree genotypes. Pedigree reconstruction was performed using a combination of maximum likelihood parentage assignment and matching based on identity-by-state (IBS) similarity. In addition, we used best linear unbiased prediction (BLUP) methods to predict phenotypes using SNP markers (GBLUP), recorded pedigree information (ABLUP), and single-step “blended” BLUP (HBLUP) combining SNP and pedigree information. We substantially improved the accuracy of pedigree records, resolving the inconsistent parental information of 506 tree genotypes. This led to substantially increased predictive ability (i.e., by up to 87%) in HBLUP analyses compared to a baseline from ABLUP. Genomic prediction was possible across populations and within previously untested families with moderately large training populations (N = 800–1200 tree genotypes) and using as few as 2000–5000 SNP markers. HBLUP was generally more effective than traditional ABLUP approaches, particularly after dealing appropriately with pedigree uncertainties. Our study provides evidence that genome-wide marker data can significantly enhance tree improvement. The operational implementation of genomic selection has started in radiata pine breeding in New Zealand, but further reductions in DNA extraction and genotyping costs may be required to realise the full potential of this approach.
Collapse
|
27
|
Aase K, Jensen H, Muff S. Genomic estimation of quantitative genetic parameters in wild admixed populations. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13810] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Kenneth Aase
- Centre for Biodiversity Dynamics, Department of Biology Norwegian University of Science and Technology Trondheim Norway
| | - Henrik Jensen
- Centre for Biodiversity Dynamics, Department of Biology Norwegian University of Science and Technology Trondheim Norway
| | - Stefanie Muff
- Centre for Biodiversity Dynamics, Department of Biology Norwegian University of Science and Technology Trondheim Norway
- Department of Mathematical Sciences, Norwegian University of Science and Technology Trondheim Norway
| |
Collapse
|
28
|
Zhang M, Luo H, Xu L, Shi Y, Zhou J, Wang D, Zhang X, Huang X, Wang Y. Genomic Selection for Milk Production Traits in Xinjiang Brown Cattle. Animals (Basel) 2022; 12:ani12020136. [PMID: 35049759 PMCID: PMC8772551 DOI: 10.3390/ani12020136] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 12/22/2021] [Accepted: 12/30/2021] [Indexed: 11/16/2022] Open
Abstract
One-step genomic selection is a method for improving the reliability of the breeding value estimation. This study aimed to compare the reliability of pedigree-based best linear unbiased prediction (PBLUP) and single-step genomic best linear unbiased prediction (ssGBLUP), single-trait and multitrait models, and the restricted maximum likelihood (REML) and Bayesian methods. Data were collected from the production performance records of 2207 Xinjiang Brown cattle in Xinjiang from 1983 to 2018. A cross test was designed to calculate the genetic parameters and reliability of the breeding value of 305 daily milk yield (305 dMY), milk fat yield (MFY), milk protein yield (MPY), and somatic cell score (SCS) of Xinjiang Brown cattle. The heritability of 305 dMY, MFY, MPY, and SCS estimated using the REML and Bayesian multitrait models was approximately 0.39 (0.02), 0.40 (0.03), 0.49 (0.02), and 0.07 (0.02), respectively. The heritability and estimated breeding value (EBV) and the reliability of milk production traits of these cattle calculated based on PBLUP and ssGBLUP using the multitrait model REML and Bayesian methods were higher than those of the single-trait model REML method; the ssGBLUP method was significantly better than the PBLUP method. The reliability of the estimated breeding value can be improved from 0.9% to 3.6%, and the reliability of the genomic estimated breeding value (GEBV) for the genotyped population can reach 83%. Therefore, the genetic evaluation of the multitrait model is better than that of the single-trait model. Thus, genomic selection can be applied to small population varieties such as Xinjiang Brown cattle, in improving the reliability of the genomic estimated breeding value.
Collapse
Affiliation(s)
- Menghua Zhang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Hanpeng Luo
- Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture of China, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China;
| | - Lei Xu
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Yuangang Shi
- School of Agriculture, Ningxia University, Yinchuan 750021, China; (Y.S.); (J.Z.)
| | - Jinghang Zhou
- School of Agriculture, Ningxia University, Yinchuan 750021, China; (Y.S.); (J.Z.)
| | - Dan Wang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Xiaoxue Zhang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
| | - Xixia Huang
- College of Animal Science, Xinjiang Agricultural University, Urumqi 830052, China; (M.Z.); (L.X.); (D.W.); (X.Z.)
- Correspondence: (X.H.); (Y.W.); Tel.: +86-1399-999-6861 (X.H.); +86-1580-159-5851 (Y.W.)
| | - Yachun Wang
- Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture of China, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China;
- Correspondence: (X.H.); (Y.W.); Tel.: +86-1399-999-6861 (X.H.); +86-1580-159-5851 (Y.W.)
| |
Collapse
|
29
|
Ismael A, Xue J, Meason DF, Klápště J, Gallart M, Li Y, Bellè P, Gomez-Gallego M, Bradford KT, Telfer E, Dungey H. Genetic Variation in Drought-Tolerance Traits and Their Relationships to Growth in Pinus radiata D. Don Under Water Stress. FRONTIERS IN PLANT SCIENCE 2022; 12:766803. [PMID: 35058945 PMCID: PMC8764257 DOI: 10.3389/fpls.2021.766803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 11/29/2021] [Indexed: 05/08/2023]
Abstract
The selection of drought-tolerant genotypes is globally recognized as an effective strategy to maintain the growth and survival of commercial tree species exposed to future drought periods. New genomic selection tools that reduce the time of progeny trials are required to substitute traditional tree breeding programs. We investigated the genetic variation of water stress tolerance in New Zealand-grown Pinus radiata D. Don using 622 commercially-used genotypes from 63 families. We used quantitative pedigree-based (Genomic Best Linear Unbiased Prediction or ABLUP) and genomic-based (Genomic Best Linear Unbiased Prediction or GBLUP) approaches to examine the heritability estimates associated with water stress tolerance in P. radiata. Tree seedling growth traits, foliar carbon isotope composition (δ13C), and dark-adapted chlorophyll fluorescence (Y) were monitored before, during and after 10 months of water stress. Height growth showed a constant and moderate heritability level, while the heritability estimate for diameter growth and δ13C decreased with water stress. In contrast, chlorophyll fluorescence exhibited low heritability after 5 and 10 months of water stress. The GBLUP approach provided less breeding value accuracy than ABLUP, however, the relative selection efficiency of GBLUP was greater compared with ABLUP selection techniques. Although there was no significant relationship directly between δ13C and Y, the genetic correlations were significant and stronger for GBLUP. The positive genetic correlations between δ13C and tree biomass traits under water stress indicated that intraspecific variation in δ13C was likely driven by differences in the genotype's photosynthetic capacity. The results show that foliar δ13C can predict P. radiata genotype tolerance to water stress using ABLUP and GBLUP approaches and that such approaches can provide a faster screening and selection of drought-tolerant genotypes for forestry breeding programs.
Collapse
Affiliation(s)
- Ahmed Ismael
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
- Research and Development, Livestock Improvement Corporation, Hamilton, New Zealand
| | - Jianming Xue
- Scion (New Zealand Forest Research Institute Ltd.), Christchurch, New Zealand
| | | | - Jaroslav Klápště
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Marta Gallart
- Centre for Planetary Health and Food Security, Griffith University, Nathan, QLD, Australia
| | - Yongjun Li
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
- Agriculture Victoria, AgriBio Center, Bundoora, VIC, Australia
| | - Pierre Bellè
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Mireia Gomez-Gallego
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
- INRAE, IAM, Université de Lorraine, Nancy, France
| | | | - Emily Telfer
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Heidi Dungey
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| |
Collapse
|
30
|
Kim EH, Kang HC, Sun DW, Myung CH, Kim JY, Lee DH, Lee SH, Lim HT. Estimation of breeding value and accuracy using pedigree and genotype of Hanwoo cows (Korean cattle). J Anim Breed Genet 2021; 139:281-291. [PMID: 34902178 DOI: 10.1111/jbg.12661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 11/03/2021] [Accepted: 11/22/2021] [Indexed: 11/29/2022]
Abstract
The genetic improvement of Hanwoo is dependent on the estimated breeding value (EBV) of pedigree-based Korean proven bull's number, and the genetic evaluation for cows is difficult due to insufficient pedigree and test records. Genomic selection involves utilizing the individual's genotype to estimate the breeding value (BV) and is determined to be an appropriate evaluation method for cows who lack test information. This study used pedigree and genotype to estimate and analyse BV and accuracy of Hanwoo cows in the Gyeongnam area using pedigree best linear unbiased prediction (PBLUP) and genomic best linear unbiased prediction (GBLUP). The test group acquired pedigree and genotype of 919 Hanwoo cows in the Gyeongnam area. The traits used for analysis were carcass weight (CWT), eye muscle areas (EMA), backfat thickness (BFT) and marbling score (MS). PBLUP used Reference group 1 containing the pedigree and phenotype of 919 Hanwoo cows and 545,483 heads to construct the numeric relationship matrix and estimated the EBV and accuracy. GBLUP used Reference group 2 containing the genotype and phenotype of 919 Hanwoo cows and 17,226 heads to construct the genomic relationship matrix and estimated the genomic EBV (GEBV) and accuracy. In the order of CWT, EMA, BFT and MS, the accuracy of PBLUP was 0.488, 0.480, 0.482 and 0.486 while the accuracy of GBLUP was higher with 0.779, 0.758, 0.766 and 0.791. And for 104 cows without relationship coefficient on pedigree to the reference group, the accuracy as PBLUP was estimated to be 0, but for GBLUP, it was possible to estimate the accuracy for all individuals. If GBLUP is applied to cows raised in general farms, the genetic evaluation can be performed even on animals without pedigree and high-accuracy estimation, enabling selection of excellent cows. Accordingly, by securing the genetic diversity of cows, it is expected to increase the profitability of farms by decreasing the inbreeding rate and increasing efficiency of elite calf production.
Collapse
Affiliation(s)
- Eun-Ho Kim
- Department of Animal Science, Gyeongsang National University, Jinju, Korea
| | - Ho-Chan Kang
- Department of Animal Science and Biotechnology, Gyeongsang National University, Jinju, Korea
| | - Du-Won Sun
- Institute of Agriculture and Life Science, Gyeongsang National University, Jinju, Korea
| | - Cheol-Hyun Myung
- Department of Animal Science, Gyeongsang National University, Jinju, Korea
| | - Ji-Yeong Kim
- Department of Animal Science, Gyeongsang National University, Jinju, Korea
| | - Doo-Ho Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, Korea
| | - Seung-Hwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, Korea
| | - Hyun-Tae Lim
- Department of Animal Science, Gyeongsang National University, Jinju, Korea.,Department of Animal Science and Biotechnology, Gyeongsang National University, Jinju, Korea.,Institute of Agriculture and Life Science, Gyeongsang National University, Jinju, Korea
| |
Collapse
|
31
|
Legarra A, Garcia-Baccino CA, Wientjes YCJ, Vitezica ZG. The correlation of substitution effects across populations and generations in the presence of nonadditive functional gene action. Genetics 2021; 219:iyab138. [PMID: 34718531 PMCID: PMC8664574 DOI: 10.1093/genetics/iyab138] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 08/19/2021] [Indexed: 11/14/2022] Open
Abstract
Allele substitution effects at quantitative trait loci (QTL) are part of the basis of quantitative genetics theory and applications such as association analysis and genomic prediction. In the presence of nonadditive functional gene action, substitution effects are not constant across populations. We develop an original approach to model the difference in substitution effects across populations as a first order Taylor series expansion from a "focal" population. This expansion involves the difference in allele frequencies and second-order statistical effects (additive by additive and dominance). The change in allele frequencies is a function of relationships (or genetic distances) across populations. As a result, it is possible to estimate the correlation of substitution effects across two populations using three elements: magnitudes of additive, dominance, and additive by additive variances; relationships (Nei's minimum distances or Fst indexes); and assumed heterozygosities. Similarly, the theory applies as well to distinct generations in a population, in which case the distance across generations is a function of increase of inbreeding. Simulation results confirmed our derivations. Slight biases were observed, depending on the nonadditive mechanism and the reference allele. Our derivations are useful to understand and forecast the possibility of prediction across populations and the similarity of GWAS effects.
Collapse
Affiliation(s)
- Andres Legarra
- INRAE/INP, UMR 1388 GenPhySE, Castanet-Tolosan 31326, France
| | - Carolina A. Garcia-Baccino
- INRAE/INP, UMR 1388 GenPhySE, Castanet-Tolosan 31326, France
- Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires C1417DSQ, Argentina
- SAS NUCLEUS, Le Rheu 35650, France
| | - Yvonne C. J. Wientjes
- Wageningen University & Research, Animal Breeding and Genomics, Wageningen 6700 AH, the Netherlands
| | | |
Collapse
|
32
|
Kardos M, Armstrong EE, Fitzpatrick SW, Hauser S, Hedrick PW, Miller JM, Tallmon DA, Funk WC. The crucial role of genome-wide genetic variation in conservation. Proc Natl Acad Sci U S A 2021; 118:e2104642118. [PMID: 34772759 PMCID: PMC8640931 DOI: 10.1073/pnas.2104642118] [Citation(s) in RCA: 124] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/02/2021] [Indexed: 12/30/2022] Open
Abstract
The unprecedented rate of extinction calls for efficient use of genetics to help conserve biodiversity. Several recent genomic and simulation-based studies have argued that the field of conservation biology has placed too much focus on conserving genome-wide genetic variation, and that the field should instead focus on managing the subset of functional genetic variation that is thought to affect fitness. Here, we critically evaluate the feasibility and likely benefits of this approach in conservation. We find that population genetics theory and empirical results show that conserving genome-wide genetic variation is generally the best approach to prevent inbreeding depression and loss of adaptive potential from driving populations toward extinction. Focusing conservation efforts on presumably functional genetic variation will only be feasible occasionally, often misleading, and counterproductive when prioritized over genome-wide genetic variation. Given the increasing rate of habitat loss and other environmental changes, failure to recognize the detrimental effects of lost genome-wide genetic variation on long-term population viability will only worsen the biodiversity crisis.
Collapse
Affiliation(s)
- Marty Kardos
- Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, WA 98112;
| | | | - Sarah W Fitzpatrick
- W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI 49060
- Department of Integrative Biology, Michigan State University, East Lansing, MI 48824
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI 48824
| | - Samantha Hauser
- Department of Biological Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI 53211
| | - Philip W Hedrick
- School of Life Sciences, Arizona State University, Tempe, AZ 85287
| | - Joshua M Miller
- San Diego Zoo Wildlife Alliance, Escondido, CA 92027
- Polar Bears International, Bozeman, MT 59772
- Department of Biological Sciences, MacEwan University, Edmonton, AB T5J 4S2, Canada
| | - David A Tallmon
- Biology and Marine Biology Program, University of Alaska Southeast, Juneau, AK 99801
| | - W Chris Funk
- Department of Biology, Graduate Degree Program in Ecology, Colorado State University, Fort Collins, CO 80523
| |
Collapse
|
33
|
Stirm M, Fonteyne LM, Shashikadze B, Lindner M, Chirivi M, Lange A, Kaufhold C, Mayer C, Medugorac I, Kessler B, Kurome M, Zakhartchenko V, Hinrichs A, Kemter E, Krause S, Wanke R, Arnold GJ, Wess G, Nagashima H, de Angelis MH, Flenkenthaler F, Kobelke LA, Bearzi C, Rizzi R, Bähr A, Reese S, Matiasek K, Walter MC, Kupatt C, Ziegler S, Bartenstein P, Fröhlich T, Klymiuk N, Blutke A, Wolf E. A scalable, clinically severe pig model for Duchenne muscular dystrophy. Dis Model Mech 2021; 14:273744. [PMID: 34796900 PMCID: PMC8688409 DOI: 10.1242/dmm.049285] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 11/11/2021] [Indexed: 11/20/2022] Open
Abstract
Large animal models for Duchenne muscular dystrophy (DMD) are crucial for evaluation of diagnostic procedures and treatment strategies. Pigs cloned from male cells lacking DMD exon 52 (DMDΔ52) resemble molecular, clinical and pathological hallmarks of DMD, but die before sexual maturity and cannot be propagated by breeding. Therefore, we generated female DMD+/- carriers. A single founder animal had 11 litters with 29 DMDY/-, 34 DMD+/- as well as 36 male and 29 female wild-type offspring. Breeding with F1 and F2 DMD+/- carriers resulted in additional 114 DMDY/- piglets. With intensive neonatal management, the majority survived for 3-4 months, providing statistically relevant cohorts for experimental studies. Pathological investigations and proteome studies of skeletal muscles and myocardium confirmed the resemblance of human disease mechanisms. Importantly, DMDY/- pigs reveal progressive myocardial fibrosis and increased expression of connexin-43, associated with significantly reduced left ventricular ejection fraction already at age 3 months. Furthermore, behavioral tests provided evidence for impaired cognitive ability. Our breeding cohort of DMDΔ52 pigs and standardized tissue repositories provide important resources for studying DMD disease mechanisms and for testing novel treatment strategies.
Collapse
Affiliation(s)
- Michael Stirm
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Lina Marie Fonteyne
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Bachuki Shashikadze
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| | - Magdalena Lindner
- Department of Nuclear Medicine, University Hospital, LMU Munich, Munich, Germany
| | - Maila Chirivi
- Fondazione Istituto Nazionale di Genetica Molecolare, Milan, Italy.,Department of Medical Surgical Sciences and Biotechnologies, Sapienza University of Rome, Latina, Italy
| | - Andreas Lange
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Clara Kaufhold
- Institute of Veterinary Pathology, Center for Clinical Veterinary Medicine, LMU Munich, Munich, Germany
| | - Christian Mayer
- Institute of Veterinary Pathology, Center for Clinical Veterinary Medicine, LMU Munich, Munich, Germany
| | - Ivica Medugorac
- Population Genomics Group, Department of Veterinary Sciences, LMU Munich, Munich, Germany
| | - Barbara Kessler
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Mayuko Kurome
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Valeri Zakhartchenko
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Arne Hinrichs
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Elisabeth Kemter
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Sabine Krause
- Friedrich Baur Institute, Department of Neurology, LMU Munich, Munich, Germany
| | - Rüdiger Wanke
- Institute of Veterinary Pathology, Center for Clinical Veterinary Medicine, LMU Munich, Munich, Germany
| | - Georg J Arnold
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| | - Gerhard Wess
- Clinic of Small Animal Medicine, Center for Clinical Veterinary Medicine, LMU Munich, Munich, Germany
| | - Hiroshi Nagashima
- Meiji University International Institute for Bio-Resource Research, Kawasaki, Japan
| | | | - Florian Flenkenthaler
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| | - Levin Arne Kobelke
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| | - Claudia Bearzi
- Fondazione Istituto Nazionale di Genetica Molecolare, Milan, Italy.,Institute of Genetic and Biomedical Research, UOS of Milan, National Research Council (IRGB-CNR), Milan, Italy
| | - Roberto Rizzi
- Fondazione Istituto Nazionale di Genetica Molecolare, Milan, Italy.,Institute for Biomedical Technologies, National Research Council (ITB-CNR), Segrate, Milan, Italy
| | - Andrea Bähr
- Klinik und Poliklinik für Innere Medizin I, Klinikum rechts der Isar, Technical University Munich and German Center for Cardiovascular Research (DZHK), Munich Heart Alliance, Munich, Germany
| | - Sven Reese
- Chair for Anatomy, Histology and Embryology, Department of Veterinary Sciences, LMU Munich, Munich, Germany
| | - Kaspar Matiasek
- Institute of Veterinary Pathology, Center for Clinical Veterinary Medicine, LMU Munich, Munich, Germany
| | - Maggie C Walter
- Friedrich Baur Institute, Department of Neurology, LMU Munich, Munich, Germany
| | - Christian Kupatt
- Klinik und Poliklinik für Innere Medizin I, Klinikum rechts der Isar, Technical University Munich and German Center for Cardiovascular Research (DZHK), Munich Heart Alliance, Munich, Germany
| | - Sibylle Ziegler
- Department of Nuclear Medicine, University Hospital, LMU Munich, Munich, Germany
| | - Peter Bartenstein
- Department of Nuclear Medicine, University Hospital, LMU Munich, Munich, Germany
| | - Thomas Fröhlich
- Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| | - Nikolai Klymiuk
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany
| | - Andreas Blutke
- Institute of Experimental Genetics, Helmholtz Zentrum Munich, Neuherberg, Germany
| | - Eckhard Wolf
- Chair for Molecular Animal Breeding and Biotechnology, Gene Center and Department of Veterinary Sciences, LMU Munich, Munich, Germany.,Center for Innovative Medical Models (CiMM), LMU Munich, Munich, Germany.,Laboratory for Functional Genome Analysis (LAFUGA), Gene Center, LMU Munich, Munich, Germany
| |
Collapse
|
34
|
Pogorevc N, Simčič M, Khayatzadeh N, Sölkner J, Berger B, Bojkovski D, Zorc M, Dovč P, Medugorac I, Horvat S. Post-genotyping optimization of dataset formation could affect genetic diversity parameters: an example of analyses with alpine goat breeds. BMC Genomics 2021; 22:546. [PMID: 34273960 PMCID: PMC8285797 DOI: 10.1186/s12864-021-07802-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Accepted: 06/13/2021] [Indexed: 12/05/2022] Open
Abstract
Background Local breeds retained unique genetic variability important for adaptive potential especially in light of challenges related to climate change. Our first objective was to perform, for the first time, a genome-wide diversity characterization using Illumina GoatSNP50 BeadChip of autochthonous Drežnica goat breed from Slovenia, and five and one local breeds from neighboring Austria and Italy, respectively. For optimal conservation and breeding programs of endangered local breeds, it is important to detect past admixture events and strive for preservation of purebred representatives of each breed with low or without admixture. In the second objective, we hence investigated the effect of inclusion or exclusion of outliers from datasets on genetic diversity and population structure parameters. Results Distinct genetic origin of the Drežnica goat was demonstrated as having closest nodes to Austrian and Italian breeds. A phylogenetic study of these breeds with other goat breeds having SNP data available in the DRYAD repository positioned them in the alpine, European and global context. Swiss breeds clustered with cosmopolitan alpine breeds and were closer to French and Spanish breeds. On the other hand, the Drežnica goat, Austrian and Italian breeds were closer to Turkish breeds. Datasets where outliers were excluded affected estimates of genetic diversity parameters within the breed and increased the pairwise genetic distances between most of the breeds. Alpine breeds, including Drežnica, Austrian and Italian goats analyzed here, still exhibit relatively high levels of genetic variability, homogeneous genetic structure and strong geographical partitioning. Conclusions Genetic diversity analyses revealed that the Slovenian Drežnica goat has a distinct genetic identity and is closely related to the neighboring Austrian and Italian alpine breeds. These results expand our knowledge on phylogeny of goat breeds from easternmost part of the European Alps. The here employed outlier test and datasets optimization approaches provided an objective and statistically powerful tool for removal of admixed outliers. Importance of this test in selecting the representatives of each breed is warranted to obtain more objective diversity parameters and phylogenetic analysis. Such parameters are often the basis of breeding and management programs and are therefore important for preserving genetic variability and uniqueness of local rare breeds. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07802-z.
Collapse
Affiliation(s)
- Neža Pogorevc
- Department of Animal science, Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, SI-1000, Ljubljana, Slovenia
| | - Mojca Simčič
- Department of Animal science, Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, SI-1000, Ljubljana, Slovenia
| | - Negar Khayatzadeh
- Division of Livestock Science, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences Vienna, Gregor Mendel Str. 33, A-1180, Vienna, Austria
| | - Johann Sölkner
- Division of Livestock Science, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences Vienna, Gregor Mendel Str. 33, A-1180, Vienna, Austria
| | - Beate Berger
- Department Animal Genetic Resources, AREC Raumberg-Gumpenstein, Institute of Organic Farming and Biodiversity of Farm Animals, 4601 Thalheim b., Wels, Austria
| | - Danijela Bojkovski
- Department of Animal science, Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, SI-1000, Ljubljana, Slovenia
| | - Minja Zorc
- Department of Animal science, Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, SI-1000, Ljubljana, Slovenia
| | - Peter Dovč
- Department of Animal science, Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, SI-1000, Ljubljana, Slovenia
| | - Ivica Medugorac
- Population Genomics Group, Department of Veterinary Sciences, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Lena-Christ-Straβe 48, 8215, Martinsried/Planegg, Germany
| | - Simon Horvat
- Department of Animal science, Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, SI-1000, Ljubljana, Slovenia.
| |
Collapse
|
35
|
Rogers AR, Dunne JC, Romay C, Bohn M, Buckler ES, Ciampitti IA, Edwards J, Ertl D, Flint-Garcia S, Gore MA, Graham C, Hirsch CN, Hood E, Hooker DC, Knoll J, Lee EC, Lorenz A, Lynch JP, McKay J, Moose SP, Murray SC, Nelson R, Rocheford T, Schnable JC, Schnable PS, Sekhon R, Singh M, Smith M, Springer N, Thelen K, Thomison P, Thompson A, Tuinstra M, Wallace J, Wisser RJ, Xu W, Gilmour AR, Kaeppler SM, De Leon N, Holland JB. The importance of dominance and genotype-by-environment interactions on grain yield variation in a large-scale public cooperative maize experiment. G3-GENES GENOMES GENETICS 2021; 11:6062399. [PMID: 33585867 DOI: 10.1093/g3journal/jkaa050] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 11/07/2020] [Indexed: 11/12/2022]
Abstract
High-dimensional and high-throughput genomic, field performance, and environmental data are becoming increasingly available to crop breeding programs, and their integration can facilitate genomic prediction within and across environments and provide insights into the genetic architecture of complex traits and the nature of genotype-by-environment interactions. To partition trait variation into additive and dominance (main effect) genetic and corresponding genetic-by-environment variances, and to identify specific environmental factors that influence genotype-by-environment interactions, we curated and analyzed genotypic and phenotypic data on 1918 maize (Zea mays L.) hybrids and environmental data from 65 testing environments. For grain yield, dominance variance was similar in magnitude to additive variance, and genetic-by-environment variances were more important than genetic main effect variances. Models involving both additive and dominance relationships best fit the data and modeling unique genetic covariances among all environments provided the best characterization of the genotype-by-environment interaction patterns. Similarity of relative hybrid performance among environments was modeled as a function of underlying weather variables, permitting identification of weather covariates driving correlations of genetic effects across environments. The resulting models can be used for genomic prediction of mean hybrid performance across populations of environments tested or for environment-specific predictions. These results can also guide efforts to incorporate high-throughput environmental data into genomic prediction models and predict values in new environments characterized with the same environmental characteristics.
Collapse
Affiliation(s)
- Anna R Rogers
- Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
| | - Jeffrey C Dunne
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, USA
| | - Cinta Romay
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA
| | - Martin Bohn
- Department of Crop Sciences, University of Illinois at Urban-Champaign, Urbana, IL 61801, USA
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA.,USDA-ARS Plant, Soil, and Nutrition Research Unit, Cornell University, Ithaca, NY 14853, USA
| | | | - Jode Edwards
- Department of Agronomy, Iowa State University, Ames, IA 50011, USA.,USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
| | - David Ertl
- Iowa Corn Promotion Board, Johnston, IA 50131, USA
| | - Sherry Flint-Garcia
- USDA-ARS Plant Genetics Research Unit, University of Missouri, Columbia, MO 65211, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Christopher Graham
- Plant Science Department, West River Agricultural Center, South Dakota State University, Rapid City, SD 57769, USA
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Elizabeth Hood
- College of Agriculture, Arkansas State University, Jonesboro, AR 72467, USA
| | - David C Hooker
- Department of Plant Agriculture, Ridgetown Campus, University of Guelph, Ridgetown, ON N0P 2C0, Canada
| | - Joseph Knoll
- USDA-ARS Crop Genetics and Breeding Research Unit, Tifton, GA 31793, USA
| | - Elizabeth C Lee
- Department of Plant Agriculture, University of Guelph, Guelph N1G 2W1, Canada
| | - Aaron Lorenz
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Jonathan P Lynch
- Department of Plant Science, Penn State University, University Park, PA 16802, USA
| | - John McKay
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523, USA
| | - Stephen P Moose
- Department of Crop Sciences, University of Illinois at Urban-Champaign, Urbana, IL 61801, USA
| | - Seth C Murray
- Department of Soil and Crop Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Rebecca Nelson
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Torbert Rocheford
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - James C Schnable
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE 68583, USA
| | - Patrick S Schnable
- Department of Agronomy, Iowa State University, Ames, IA 50011, USA.,Plant Sciences Institute, Iowa State University, Ames, IA 50011, USA
| | - Rajandeep Sekhon
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, USA
| | - Maninder Singh
- Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI 48824, USA
| | - Margaret Smith
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Nathan Springer
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE 68583, USA
| | - Kurt Thelen
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN 55108, USA
| | - Peter Thomison
- Department of Horticulture and Crop Science, The Ohio State University, Columbus, OH 43210, USA
| | - Addie Thompson
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN 55108, USA
| | - Mitch Tuinstra
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - Jason Wallace
- Department of Crop and Soil Sciences, University of Georgia, Athens GA 30602, USA
| | - Randall J Wisser
- Department of Plant and Soil Sciences, University of Delaware, Newark, DE 19716, USA
| | - Wenwei Xu
- Texas A& M AgriLife Research, Texas A& M University, Lubbock, TX 79403, USA
| | | | - Shawn M Kaeppler
- Department of Agronomy, University of Wisconsin, Madison, WI 53706, USA
| | - Natalia De Leon
- Department of Agronomy, University of Wisconsin, Madison, WI 53706, USA
| | - James B Holland
- Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA.,Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, USA.,USDA-ARS Plant Science Research Unit, North Carolina State University, Raleigh, NC 27695-7620, USA
| |
Collapse
|
36
|
Pincot DDA, Ledda M, Feldmann MJ, Hardigan MA, Poorten TJ, Runcie DE, Heffelfinger C, Dellaporta SL, Cole GS, Knapp SJ. Social network analysis of the genealogy of strawberry: retracing the wild roots of heirloom and modern cultivars. G3-GENES GENOMES GENETICS 2021; 11:6117203. [PMID: 33772307 PMCID: PMC8022721 DOI: 10.1093/g3journal/jkab015] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 12/12/2020] [Indexed: 01/22/2023]
Abstract
The widely recounted story of the origin of cultivated strawberry (Fragaria × ananassa) oversimplifies the complex interspecific hybrid ancestry of the highly admixed populations from which heirloom and modern cultivars have emerged. To develop deeper insights into the three-century-long domestication history of strawberry, we reconstructed the genealogy as deeply as possible—pedigree records were assembled for 8,851 individuals, including 2,656 cultivars developed since 1775. The parents of individuals with unverified or missing pedigree records were accurately identified by applying an exclusion analysis to array-genotyped single-nucleotide polymorphisms. We identified 187 wild octoploid and 1,171 F. × ananassa founders in the genealogy, from the earliest hybrids to modern cultivars. The pedigree networks for cultivated strawberry are exceedingly complex labyrinths of ancestral interconnections formed by diverse hybrid ancestry, directional selection, migration, admixture, bottlenecks, overlapping generations, and recurrent hybridization with common ancestors that have unequally contributed allelic diversity to heirloom and modern cultivars. Fifteen to 333 ancestors were predicted to have transmitted 90% of the alleles found in country-, region-, and continent-specific populations. Using parent–offspring edges in the global pedigree network, we found that selection cycle lengths over the past 200 years of breeding have been extraordinarily long (16.0-16.9 years/generation), but decreased to a present-day range of 6.0-10.0 years/generation. Our analyses uncovered conspicuous differences in the ancestry and structure of North American and European populations, and shed light on forces that have shaped phenotypic diversity in F. × ananassa.
Collapse
Affiliation(s)
- Dominique D A Pincot
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Mirko Ledda
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Mitchell J Feldmann
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Michael A Hardigan
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Thomas J Poorten
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Daniel E Runcie
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Christopher Heffelfinger
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA
| | - Stephen L Dellaporta
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA
| | - Glenn S Cole
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Steven J Knapp
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| |
Collapse
|
37
|
Luan P, Huo T, Ma B, Song D, Zhang X, Hu G. Genomic inbreeding and population structure of northern pike ( Esox lucius) in Xinjiang, China. Ecol Evol 2021; 11:5657-5668. [PMID: 34026037 PMCID: PMC8131772 DOI: 10.1002/ece3.7469] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 02/28/2021] [Accepted: 03/03/2021] [Indexed: 11/21/2022] Open
Abstract
Northern pike (Esox lucius) was widely distributed in the high latitudes of the northern hemisphere. In China, northern pike was originally distributed only in the upper reaches of the Irtysh River in Xinjiang and has appeared in many water bodies outside the Irtysh River Basin in Northern Xinjiang. A total of four populations were collected from north to south in Xinjiang, including Irtysh River (RIR), Ulungu Lake (LUL), a small lake nearby Ulungu River (LJD), and Bosten Lake (LBO). We estimated population genomic parameters, performed gene flow analysis, and estimated the effective population size of each population. The proportion of individuals with high inbreeding coefficient (F ≥ 0.0625) accounted for 36.4% (44/121) of all sequenced individuals, approximately 4.5% (1/22) in LUL, 25.9% (7/27) in LBO, 42.9% (18/42) in RIR, and 60% (18/30) in LJD. RIR had the highest mean of genomic relatedness (coancestry coefficient = 0.025 ± 0.040, IBD = 0.036 ± 0.078). Gene flow results showed that the population spreading was from RIR into two branches, one was LBO, and the other continued to split into LUL and LJD, and migration signal from LBO to LUL was detected. Our results suggested that the extinction risk of northern pike was very low in Xinjiang of China, and the controlled capture fishery of northern pike could be developed reasonably.
Collapse
Affiliation(s)
- Peixian Luan
- Heilongjiang River Fisheries Research InstituteChinese Academy of Fishery SciencesHarbinChina
- Key Laboratory of Freshwater Aquatic Biotechnology and BreedingMinistry of Agriculture and Rural AffairsHeilongjiang River Fisheries Research Institute, Chinese Academy of Fishery SciencesHarbinChina
| | - Tangbin Huo
- Heilongjiang River Fisheries Research InstituteChinese Academy of Fishery SciencesHarbinChina
- Key Laboratory of Freshwater Aquatic Biotechnology and BreedingMinistry of Agriculture and Rural AffairsHeilongjiang River Fisheries Research Institute, Chinese Academy of Fishery SciencesHarbinChina
| | - Bo Ma
- Heilongjiang River Fisheries Research InstituteChinese Academy of Fishery SciencesHarbinChina
- Key Laboratory of Freshwater Aquatic Biotechnology and BreedingMinistry of Agriculture and Rural AffairsHeilongjiang River Fisheries Research Institute, Chinese Academy of Fishery SciencesHarbinChina
| | - Dan Song
- Heilongjiang River Fisheries Research InstituteChinese Academy of Fishery SciencesHarbinChina
- Key Laboratory of Freshwater Aquatic Biotechnology and BreedingMinistry of Agriculture and Rural AffairsHeilongjiang River Fisheries Research Institute, Chinese Academy of Fishery SciencesHarbinChina
| | - Xiaofeng Zhang
- Heilongjiang River Fisheries Research InstituteChinese Academy of Fishery SciencesHarbinChina
- Key Laboratory of Freshwater Aquatic Biotechnology and BreedingMinistry of Agriculture and Rural AffairsHeilongjiang River Fisheries Research Institute, Chinese Academy of Fishery SciencesHarbinChina
| | - Guo Hu
- Heilongjiang River Fisheries Research InstituteChinese Academy of Fishery SciencesHarbinChina
- Key Laboratory of Freshwater Aquatic Biotechnology and BreedingMinistry of Agriculture and Rural AffairsHeilongjiang River Fisheries Research Institute, Chinese Academy of Fishery SciencesHarbinChina
| |
Collapse
|
38
|
Seppälä O, Çetin C, Cereghetti T, Feulner PGD, Adema CM. Examining adaptive evolution of immune activity: opportunities provided by gastropods in the age of 'omics'. Philos Trans R Soc Lond B Biol Sci 2021; 376:20200158. [PMID: 33813886 DOI: 10.1098/rstb.2020.0158] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Parasites threaten all free-living organisms, including molluscs. Understanding the evolution of immune defence traits in natural host populations is crucial for predicting their long-term performance under continuous infection risk. Adaptive trait evolution requires that traits are subject to selection (i.e. contribute to organismal fitness) and that they are heritable. Despite broad interest in the evolutionary ecology of immune activity in animals, the understanding of selection on and evolutionary potential of immune defence traits is far from comprehensive. For instance, empirical observations are only rarely in line with theoretical predictions of immune activity being subject to stabilizing selection. This discrepancy may be because ecoimmunological studies can typically cover only a fraction of the complexity of an animal immune system. Similarly, molecular immunology/immunogenetics studies provide a mechanistic understanding of immunity, but neglect variation that arises from natural genetic differences among individuals and from environmental conditions. Here, we review the current literature on natural selection on and evolutionary potential of immune traits in animals, signal how merging ecological immunology and genomics will strengthen evolutionary ecological research on immunity, and indicate research opportunities for molluscan gastropods for which well-established ecological understanding and/or 'immune-omics' resources are already available. This article is part of the Theo Murphy meeting issue 'Molluscan genomics: broad insights and future directions for a neglected phylum'.
Collapse
Affiliation(s)
- Otto Seppälä
- Research Department for Limnology, University of Innsbruck, Mondsee, Austria
| | - Cansu Çetin
- Department of Aquatic Ecology, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland.,Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Teo Cereghetti
- Department of Aquatic Ecology, Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland.,Institute of Integrative Biology, ETH Zürich, Zürich, Switzerland
| | - Philine G D Feulner
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland.,Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - Coen M Adema
- Department of Biology, Center for Evolutionary and Theoretical Immunology, University of New Mexico, Albuquerque, NM, USA
| |
Collapse
|
39
|
Shi R, Brito LF, Liu A, Luo H, Chen Z, Liu L, Guo G, Mulder H, Ducro B, van der Linden A, Wang Y. Genotype-by-environment interaction in Holstein heifer fertility traits using single-step genomic reaction norm models. BMC Genomics 2021; 22:193. [PMID: 33731012 PMCID: PMC7968333 DOI: 10.1186/s12864-021-07496-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Accepted: 02/26/2021] [Indexed: 01/07/2023] Open
Abstract
Background The effect of heat stress on livestock production is a worldwide issue. Animal performance is influenced by exposure to harsh environmental conditions potentially causing genotype-by-environment interactions (G × E), especially in highproducing animals. In this context, the main objectives of this study were to (1) detect the time periods in which heifer fertility traits are more sensitive to the exposure to high environmental temperature and/or humidity, (2) investigate G × E due to heat stress in heifer fertility traits, and, (3) identify genomic regions associated with heifer fertility and heat tolerance in Holstein cattle. Results Phenotypic records for three heifer fertility traits (i.e., age at first calving, interval from first to last service, and conception rate at the first service) were collected, from 2005 to 2018, for 56,998 Holstein heifers raised in 15 herds in the Beijing area (China). By integrating environmental data, including hourly air temperature and relative humidity, the critical periods in which the heifers are more sensitive to heat stress were located in more than 30 days before the first service for age at first calving and interval from first to last service, or 10 days before and less than 60 days after the first service for conception rate. Using reaction norm models, significant G × E was detected for all three traits regarding both environmental gradients, proportion of days exceeding heat threshold, and minimum temperature-humidity index. Through single-step genome-wide association studies, PLAG1, AMHR2, SP1, KRT8, KRT18, MLH1, and EOMES were suggested as candidate genes for heifer fertility. The genes HCRTR1, AGRP, PC, and GUCY1B1 are strong candidates for association with heat tolerance. Conclusions The critical periods in which the reproductive performance of heifers is more sensitive to heat stress are trait-dependent. Thus, detailed analysis should be conducted to determine this particular period for other fertility traits. The considerable magnitude of G × E and sire re-ranking indicates the necessity to consider G × E in dairy cattle breeding schemes. This will enable selection of more heat-tolerant animals with high reproductive efficiency under harsh climatic conditions. Lastly, the candidate genes identified to be linked with response to heat stress provide a better understanding of the underlying biological mechanisms of heat tolerance in dairy cattle. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07496-3.
Collapse
Affiliation(s)
- Rui Shi
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.,Animal Breeding and Genomics Group, Wageningen University & Research, P.O. Box 338, Wageningen, AH, 6700, the Netherlands.,Animal Production System Group, Wageningen University & Research, P.O. Box 338, Wageningen, AH, 6700, the Netherlands
| | - Luiz Fernando Brito
- Department of Animal Sciences, Purdue University, West Lafayette, Indiana, 47907, USA
| | - Aoxing Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.,Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Hanpeng Luo
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Ziwei Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Lin Liu
- Beijing Dairy Cattle Center, Beijing, 100192, China
| | - Gang Guo
- Beijing Sunlon Livestock Development Co. Ltd, Beijing, 100176, China.
| | - Herman Mulder
- Animal Breeding and Genomics Group, Wageningen University & Research, P.O. Box 338, Wageningen, AH, 6700, the Netherlands.
| | - Bart Ducro
- Animal Breeding and Genomics Group, Wageningen University & Research, P.O. Box 338, Wageningen, AH, 6700, the Netherlands
| | - Aart van der Linden
- Animal Production System Group, Wageningen University & Research, P.O. Box 338, Wageningen, AH, 6700, the Netherlands.,Cooperation CRV, Arnhem, AL, 6800, the Netherlands
| | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
40
|
Sapin E, Keller MC. Novel Approach for Parallelizing Pairwise Comparison Problems as Applied to Detecting Segments Identical By Decent in Whole-Genome Data. Bioinformatics 2021; 37:2121-2125. [PMID: 33705528 PMCID: PMC8352502 DOI: 10.1093/bioinformatics/btab084] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 11/09/2020] [Accepted: 03/09/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Pairwise comparison problems arise in many areas of science. In genomics, datasets are already large and getting larger, and so operations that require pairwise comparisons-either on pairs of SNPs or pairs of individuals-are extremely computationally challenging. We propose a generic algorithm for addressing pairwise comparison problems that breaks a large problem (of order n2 comparisons) into multiple smaller ones (each of order n comparisons), allowing for massive parallelization. RESULTS We demonstrated that this approach is very efficient for calling identical by descent (IBD) segments between all pairs of individuals in the UK Biobank dataset, with a 250-fold savings in time and 750-fold savings in memory over the standard approach to detecting such segments across the full dataset. This efficiency should extend to other methods of IBD calling and, more generally, to other pairwise comparison tasks in genomics or other areas of science.
Collapse
Affiliation(s)
- Emmanuel Sapin
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
| | - Matthew C Keller
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
| |
Collapse
|
41
|
Kemper KE, Yengo L, Zheng Z, Abdellaoui A, Keller MC, Goddard ME, Wray NR, Yang J, Visscher PM. Phenotypic covariance across the entire spectrum of relatedness for 86 billion pairs of individuals. Nat Commun 2021; 12:1050. [PMID: 33594080 PMCID: PMC7886899 DOI: 10.1038/s41467-021-21283-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 12/10/2020] [Indexed: 01/18/2023] Open
Abstract
Attributing the similarity between individuals to genetic and non-genetic factors is central to genetic analyses. In this paper we use the genomic relationship ([Formula: see text]) among 417,060 individuals to investigate the phenotypic covariance between pairs of individuals for 32 traits across the spectrum of relatedness, from unrelated pairs through to identical twins. We find linear relationships between phenotypic covariance and [Formula: see text] that agree with the SNP-based heritability ([Formula: see text]) in unrelated pairs ([Formula: see text]), and with pedigree-estimated heritability in close relatives ([Formula: see text]). The covariance increases faster than [Formula: see text] in distant relatives ([Formula: see text]), and we attribute this to imperfect linkage disequilibrium between causal variants and the common variants used to construct [Formula: see text]. We also examine the effect of assortative mating on heritability estimates from different experimental designs. We find that full-sib identity-by-descent regression estimates for height (0.66 s.e. 0.07) are consistent with estimates from close relatives (0.82 s.e. 0.04) after accounting for the effect of assortative mating.
Collapse
Affiliation(s)
- Kathryn E Kemper
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia.
| | - Loic Yengo
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| | - Zhili Zheng
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| | - Abdel Abdellaoui
- Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Matthew C Keller
- Department of Psychology & Neuroscience, University of Colorado at Boulder, Boulder, CO, USA.,Institute for Behavioral Genetics, University of Colorado at Boulder, Boulder, CO, USA
| | - Michael E Goddard
- Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, VIC, Australia.,Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, VIC, Australia
| | - Naomi R Wray
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia.,Queensland Brain Institute, University of Queensland, Brisbane, QLD, Australia
| | - Jian Yang
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| | - Peter M Visscher
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia. .,Queensland Brain Institute, University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
42
|
Li W, Zheng M, Zhao G, Wang J, Liu J, Wang S, Feng F, Liu D, Zhu D, Li Q, Guo L, Guo Y, Liu R, Wen J. Identification of QTL regions and candidate genes for growth and feed efficiency in broilers. Genet Sel Evol 2021; 53:13. [PMID: 33549052 PMCID: PMC7866652 DOI: 10.1186/s12711-021-00608-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 01/26/2021] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Feed accounts for about 70% of the total cost of poultry meat production. Residual feed intake (RFI) has become the preferred measure of feed efficiency because it is phenotypically independent of growth rate and body weight. In this study, our aim was to estimate genetic parameters and identify quantitative trait loci (QTL) for feed efficiency in 3314 purebred broilers using a genome-wide association study. Broilers were genotyped using a custom 55 K single nucleotide polymorphism (SNP) array. RESULTS Estimates of genomic heritability for seven growth and feed efficiency traits, including body weight at 28 days of age (BW28), BW42, average daily feed intake (ADFI), RFI, and RFI adjusted for weight of abdominal fat (RFIa), ranged from 0.12 to 0.26. Eleven genome-wide significant SNPs and 15 suggestively significant SNPs were detected, of which 19 clustered around two genomic regions. A region on chromosome 16 (2.34-2.66 Mb) was associated with both BW28 and BW42, and the most significant SNP in this region, AX_101003762, accounted for 7.6% of the genetic variance of BW28. The other region, on chromosome 1 (91.27-92.43 Mb) was associated with RFI and ADFI, and contains the NSUN3 and EPHA6 as candidate genes. The most significant SNP in this region, AX_172588157, accounted for 4.4% of the genetic variance of RFI. In addition, a genomic region containing the gene AGK on chromosome 1 was found to be associated with RFIa. The NSUN3 and AGK genes were found to be differentially expressed in breast muscle, thigh muscle, and abdominal fat between male broilers with high and low RFI. CONCLUSIONS We identified QTL regions for BW28 and BW42 (spanning 0.32 Mb) and RFI (spanning 1.16 Mb). The NSUN3, EPHA6, and AGK were identified as the most likely candidate genes for these QTL. These genes are involved in mitochondrial function and behavioral regulation. These results contribute to the identification of candidate genes and variants for growth and feed efficiency in poultry.
Collapse
Affiliation(s)
- Wei Li
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Maiqing Zheng
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Guiping Zhao
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Jie Wang
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Jie Liu
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Shunli Wang
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Furong Feng
- Foshan Gaoming Xinguang Agricultural and Animal Industrials Corporation, Foshan, 528515 China
| | - Dawei Liu
- Foshan Gaoming Xinguang Agricultural and Animal Industrials Corporation, Foshan, 528515 China
| | - Dan Zhu
- Foshan Gaoming Xinguang Agricultural and Animal Industrials Corporation, Foshan, 528515 China
| | - Qinghe Li
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Liping Guo
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Yuming Guo
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Ranran Liu
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| | - Jie Wen
- State Key Laboratory of Animal Nutrition; Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193 China
| |
Collapse
|
43
|
Jibrila I, Vandenplas J, Ten Napel J, Veerkamp RF, Calus MPL. Avoiding preselection bias in subsequent single-step genomic BLUP evaluations of genomically preselected animals. J Anim Breed Genet 2020; 138:432-441. [PMID: 33372707 PMCID: PMC8246977 DOI: 10.1111/jbg.12533] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 11/21/2020] [Accepted: 12/09/2020] [Indexed: 11/30/2022]
Abstract
In animal breeding, parents of the next generation are usually selected in multiple stages, and the initial stages of this selection are called preselection. Preselection reduces the information available for subsequent evaluation of preselected animals and this sometimes leads to bias. The objective of this study was to establish the minimum information required to subsequently evaluate genomically preselected animals without bias arising from preselection, with single-step genomic best linear unbiased prediction (ssGBLUP). We simulated a nucleus of a breeding program in which a recent population of 15 generations was produced. In each generation, parents of the next generation were selected in a single-stage selection based on pedigree BLUP. However, in generation 15, 10% of male and 15% of female offspring were preselected on their genomic estimated breeding values (GEBV). These GEBV were estimated using ssGBLUP, including the pedigree of all animals in generations 0-15, genotypes of all animals in generations 13-15 and phenotypes of all animals in generations 11-14. In subsequent ssGBLUP evaluation of these preselected animals, genotypes and phenotypes from various groups of animals were excluded one after another. We found that GEBV of the preselected animals were only estimated without preselection bias when genotypes and phenotypes of all animals in generations 13 and 14 and of the preselected animals were included in the subsequent evaluation. We also found that genotypes of the animals discarded at preselection only helped in reducing preselection bias in GEBV of their preselected sibs when genotypes of their parents were absent or excluded from the subsequent evaluation. We concluded that to prevent preselection bias in subsequent ssGBLUP evaluation of genomically preselected animals, information representative of the reference data used in the evaluation at preselection and genotypes and phenotypes of the preselected animals are needed in the subsequent evaluation.
Collapse
Affiliation(s)
- Ibrahim Jibrila
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| | - Jeremie Vandenplas
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| | - Jan Ten Napel
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| | - Roel F Veerkamp
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| | - Mario P L Calus
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| |
Collapse
|
44
|
Peters SO, Kızılkaya K, Ibeagha-Awemu EM, Sinecen M, Zhao X. Comparative accuracies of genetic values predicted for economically important milk traits, genome-wide association, and linkage disequilibrium patterns of Canadian Holstein cows. J Dairy Sci 2020; 104:1900-1916. [PMID: 33358789 DOI: 10.3168/jds.2020-18489] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 08/10/2020] [Indexed: 11/19/2022]
Abstract
Genomic selection methodologies and genome-wide association studies use powerful statistical procedures that correlate large amounts of high-density SNP genotypes and phenotypic data. Actual 305-d milk (MY), fat (FY), and protein (PY) yield data on 695 cows and 76,355 genotyping-by-sequencing-generated SNP marker genotypes from Canadian Holstein dairy cows were used to characterize linkage disequilibrium (LD) structure of Canadian Holstein cows. Also, the comparison of pedigree-based BLUP, genomic BLUP (GBLUP), and Bayesian (BayesB) statistical methods in the genomic selection methodologies and the comparison of Bayesian ridge regression and BayesB statistical methods in the genome-wide association studies were carried out for MY, FY, and PY. Results from LD analysis revealed that as marker distance decreases, LD increases through chromosomes. However, unexpected high peaks in LD were observed between marker pairs with larger marker distances on all chromosomes. The GBLUP and BayesB models resulted in similar heritability estimates through 10-fold cross-validation for MY and PY; however, the GBLUP model resulted in higher heritability estimates than BayesB model for FY. The predictive ability of GBLUP model was significantly lower than that of BayesB for MY, FY, and PY. Association analyses indicated that 28 high-effect markers and markers on Bos taurus autosome 14 located within 6 genes (DOP1B, TONSL, CPSF1, ADCK5, PARP10, and GRINA) associated significantly with FY.
Collapse
Affiliation(s)
- Sunday O Peters
- Department of Animal Science, Berry College, Mount Berry, GA 30149; Department of Animal and Dairy Science, University of Georgia, Athens 30602.
| | - Kadir Kızılkaya
- Department of Animal Science, Faculty of Agriculture, Aydin Adnan Menderes University, Aydin, 09100, Turkey
| | - Eveline M Ibeagha-Awemu
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, 2000 Rue College, Sherbrooke, QC, J1M 0C8 Canada
| | - Mahmut Sinecen
- Department of Computer Engineering, Faculty of Engineering, Aydin Adnan Menderes University, Aydin, 09100, Turkey
| | - Xin Zhao
- Department of Animal Science, McGill University, 21,111 Lakeshore Road, Ste-Anne-De-Bellevue, QC, H9S 3V9 Canada
| |
Collapse
|
45
|
Bermann M, Lourenco D, Misztal I. Technical note: Automatic scaling in single-step genomic BLUP. J Dairy Sci 2020; 104:2027-2031. [PMID: 33309381 DOI: 10.3168/jds.2020-18969] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 09/16/2020] [Indexed: 11/19/2022]
Abstract
Single-step genomic BLUP (ssGBLUP) requires compatibility between genomic and pedigree relationships for unbiased and accurate predictions. Scaling the genomic relationship matrix (G) to have the same averages as the pedigree relationship matrix (i.e., scaling by averages) is one way to ensure compatibility. This requires computing both relationship matrices, calculating averages, and changing G, whereas only the inverses of those matrices are needed in the mixed model equations. Therefore, the compatibility process can add extra computing burden. In the single-step Bayesian regression, the scaling is done by including a mean (μg) as a fixed effect in the model. The parameter μg can be interpreted as the average of the breeding values of the genotyped animals. In this study, such scaling, called automatic, was implemented in ssGBLUP via Quaas-Pollak transformation of the inverse of the relationship matrix used in ssGBLUP (H), which combines the inverses of the pedigree and genomic relationship matrices. Comparisons involved a simulated data set, and the genomic relationship matrix was computed using different allele frequencies either from the current population (i.e., realized allele frequencies), equal among all the loci, or from the base population. For all of the scenarios, we computed bias [defined as the average difference between true breeding values (TBV) and genomic estimated breeding values (GEBV)], accuracy (defined as the correlation between TBV and GEBV), and dispersion (defined as the regression coefficient of GEBV on TBV). With no scaling, the bias expressed in terms of genetic standard deviations was 0.86, 0.64, and 0.58 with realized, equal, and base population allele frequencies, respectively. With scaling by averages, which is currently used in ssGBLUP, bias was 0.07, 0.08, and 0.03, respectively. With automatic scaling, bias was 0.18 regardless of allele frequencies. Accuracies were similar among scaling methods, but about 0.1 lower in the scenario without scaling. The GEBV were more inflated without any scaling, whereas the automatic scaling performed similarly to the scaling by averages. The average dispersion for those methods was 0.94. When μg was treated as random, with the variance equal to differences between pedigree and genomic relationships, the bias was the same as with the scaling by averages. The automatic scaling is biased, especially when μg is treated as a fixed effect. The bias may be small in real data with fewer generations, when traits are undergoing weak selection, or when the number of genotyped animals is large.
Collapse
Affiliation(s)
- M Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602.
| | - D Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602
| | - I Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602
| |
Collapse
|
46
|
Alves AAC, da Costa RM, Bresolin T, Fernandes Júnior GA, Espigolan R, Ribeiro AMF, Carvalheiro R, de Albuquerque LG. Genome-wide prediction for complex traits under the presence of dominance effects in simulated populations using GBLUP and machine learning methods. J Anim Sci 2020; 98:5849339. [PMID: 32474602 DOI: 10.1093/jas/skaa179] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 05/22/2020] [Indexed: 01/05/2023] Open
Abstract
The aim of this study was to compare the predictive performance of the Genomic Best Linear Unbiased Predictor (GBLUP) and machine learning methods (Random Forest, RF; Support Vector Machine, SVM; Artificial Neural Network, ANN) in simulated populations presenting different levels of dominance effects. Simulated genome comprised 50k SNP and 300 QTL, both biallelic and randomly distributed across 29 autosomes. A total of six traits were simulated considering different values for the narrow and broad-sense heritability. In the purely additive scenario with low heritability (h2 = 0.10), the predictive ability obtained using GBLUP was slightly higher than the other methods whereas ANN provided the highest accuracies for scenarios with moderate heritability (h2 = 0.30). The accuracies of dominance deviations predictions varied from 0.180 to 0.350 in GBLUP extended for dominance effects (GBLUP-D), from 0.06 to 0.185 in RF and they were null using the ANN and SVM methods. Although RF has presented higher accuracies for total genetic effect predictions, the mean-squared error values in such a model were worse than those observed for GBLUP-D in scenarios with large additive and dominance variances. When applied to prescreen important regions, the RF approach detected QTL with high additive and/or dominance effects. Among machine learning methods, only the RF was capable to cover implicitly dominance effects without increasing the number of covariates in the model, resulting in higher accuracies for the total genetic and phenotypic values as the dominance ratio increases. Nevertheless, whether the interest is to infer directly on dominance effects, GBLUP-D could be a more suitable method.
Collapse
Affiliation(s)
- Anderson Antonio Carvalho Alves
- Department of Animal Science, Faculty of Agricultural and Veterinary Sciences, Sao Paulo State University (UNESP), Jaboticabal, SP, Brazil
| | - Rebeka Magalhães da Costa
- Department of Animal Science, Faculty of Agricultural and Veterinary Sciences, Sao Paulo State University (UNESP), Jaboticabal, SP, Brazil
| | - Tiago Bresolin
- Department of Animal Sciences, University of Wisconsin, Madison, WI
| | - Gerardo Alves Fernandes Júnior
- Department of Animal Science, Faculty of Agricultural and Veterinary Sciences, Sao Paulo State University (UNESP), Jaboticabal, SP, Brazil
| | - Rafael Espigolan
- Department of Animal Science, Faculty of Animal Science and Food Engineering, University of Sao Paulo, Pirassununga, SP, Brazil
| | | | - Roberto Carvalheiro
- Department of Animal Science, Faculty of Agricultural and Veterinary Sciences, Sao Paulo State University (UNESP), Jaboticabal, SP, Brazil.,National Council of Technological and Scientific Development (CNPq), Brasilia, Brazil
| | - Lucia Galvão de Albuquerque
- Department of Animal Science, Faculty of Agricultural and Veterinary Sciences, Sao Paulo State University (UNESP), Jaboticabal, SP, Brazil.,National Council of Technological and Scientific Development (CNPq), Brasilia, Brazil
| |
Collapse
|
47
|
Klápště J, Dungey HS, Telfer EJ, Suontama M, Graham NJ, Li Y, McKinley R. Marker Selection in Multivariate Genomic Prediction Improves Accuracy of Low Heritability Traits. Front Genet 2020; 11:499094. [PMID: 33193595 PMCID: PMC7662070 DOI: 10.3389/fgene.2020.499094] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 09/18/2020] [Indexed: 11/13/2022] Open
Abstract
Multivariate analysis using mixed models allows for the exploration of genetic correlations between traits. Additionally, the transition to a genomic based approach is simplified by substituting classic pedigrees with a marker-based relationship matrix. It also enables the investigation of correlated responses to selection, trait integration and modularity in different kinds of populations. This study investigated a strategy for the construction of a marker-based relationship matrix that prioritized markers using Partial Least Squares. The efficiency of this strategy was found to depend on the correlation structure between investigated traits. In terms of accuracy, we found no benefit of this strategy compared with the all-marker-based multivariate model for the primary trait of diameter at breast height (DBH) in a radiata pine (Pinus radiata) population, possibly due to the presence of strong and well-estimated correlation with other highly heritable traits. Conversely, we did see benefit in a shining gum (Eucalyptus nitens) population, where the primary trait had low or only moderate genetic correlation with other low/moderately heritable traits. Marker selection in multivariate analysis can therefore be an efficient strategy to improve prediction accuracy for low heritability traits due to improved precision in poorly estimated low/moderate genetic correlations. Additionally, our study identified the genetic diversity as a factor contributing to the efficiency of marker selection in multivariate approaches due to higher precision of genetic correlation estimates.
Collapse
Affiliation(s)
- Jaroslav Klápště
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Heidi S Dungey
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Emily J Telfer
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Mari Suontama
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand.,Skogforsk, Umeå, Sweden
| | - Natalie J Graham
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| | - Yongjun Li
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand.,Agriculture Victoria, AgriBio Center, Bundoora, VIC, Australia
| | - Russell McKinley
- Scion (New Zealand Forest Research Institute Ltd.), Rotorua, New Zealand
| |
Collapse
|
48
|
A de novo frameshift mutation in ZEB2 causes polledness, abnormal skull shape, small body stature and subfertility in Fleckvieh cattle. Sci Rep 2020; 10:17032. [PMID: 33046754 PMCID: PMC7550345 DOI: 10.1038/s41598-020-73807-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 09/15/2020] [Indexed: 01/17/2023] Open
Abstract
Polledness in cattle is an autosomal dominant trait. Previous studies have revealed allelic heterogeneity at the polled locus and four different variants were identified, all in intergenic regions. In this study, we report a case of polled bull (FV-Polled1) born to horned parents, indicating a de novo origin of this polled condition. Using 50K genotyping and whole genome sequencing data, we identified on chromosome 2 an 11-bp deletion (AC_000159.1:g.52364063_52364073del; Del11) in the second exon of ZEB2 gene as the causal mutation for this de novo polled condition. We predicted that the deletion would shorten the protein product of ZEB2 by almost 91%. Moreover, we showed that all animals carrying Del11 mutation displayed symptoms similar to Mowat-Wilson syndrome (MWS) in humans, which is also associated with genetic variations in ZEB2. The symptoms in cattle include delayed maturity, small body stature and abnormal shape of skull. This is the first report of a de novo dominant mutation affecting only ZEB2 and associated with a genetic absence of horns. Therefore our results demonstrate undoubtedly that ZEB2 plays an important role in the process of horn ontogenesis as well as in the regulation of overall development and growth of animals.
Collapse
|
49
|
Genomic Studies Reveal Substantial Dominant Effects and Improved Genomic Predictions in an Open-Pollinated Breeding Population of Eucalyptus pellita. G3-GENES GENOMES GENETICS 2020; 10:3751-3763. [PMID: 32788286 PMCID: PMC7534421 DOI: 10.1534/g3.120.401601] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Most of the genomic studies in plants and animals have used additive models for studying genetic parameters and prediction accuracies. In this study, we used genomic models with additive and nonadditive effects to analyze the genetic architecture of growth and wood traits in an open-pollinated (OP) population of Eucalyptus pellita. We used two progeny trials consisting of 5742 trees from 244 OP families to estimate genetic parameters and to test genomic prediction accuracies of three growth traits (diameter at breast height - DBH, total height - Ht and tree volume - Vol) and kraft pulp yield (KPY). From 5742 trees, 468 trees from 28 families were genotyped with 2023 pre-selected markers from candidate genes. We used the pedigree-based additive best linear unbiased prediction (ABLUP) model and two marker-based models (single-step genomic BLUP – ssGBLUP and genomic BLUP – GBLUP) to estimate the genetic parameters and compare the prediction accuracies. Analyses with the two genomic models revealed large dominant effects influencing the growth traits but not KPY. Theoretical breeding value accuracies were higher with the dominance effect in ssGBLUP model for the three growth traits. Accuracies of cross-validation with random folding in the genotyped trees have ranged from 0.60 to 0.82 in different models. Accuracies of ABLUP were lower than the genomic models. Accuracies ranging from 0.50 to 0.76 were observed for within family cross-validation predictions with low relationships between training and validation populations indicating part of the functional variation is captured by the markers through short-range linkage disequilibrium (LD). Within-family phenotype predictive abilities and prediction accuracies of genetic values with dominance effects are higher than the additive models for growth traits indicating the importance of dominance effects in predicting phenotypes and genetic values. This study demonstrates the importance of genomic approaches in OP families to study nonadditive effects. To capture the LD between markers and the quantitative trait loci (QTL) it may be important to use informative markers from candidate genes.
Collapse
|
50
|
Meuwissen THE, Sonesson AK, Gebregiwergis G, Woolliams JA. Management of Genetic Diversity in the Era of Genomics. Front Genet 2020; 11:880. [PMID: 32903415 PMCID: PMC7438563 DOI: 10.3389/fgene.2020.00880] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 07/17/2020] [Indexed: 12/27/2022] Open
Abstract
Management of genetic diversity aims to (i) maintain heterozygosity, which ameliorates inbreeding depression and loss of genetic variation at loci that may become of importance in the future; and (ii) avoid genetic drift, which prevents deleterious recessives (e.g., rare disease alleles) from drifting to high frequency, and prevents random drift of (functional) traits. In the genomics era, genomics data allow for many alternative measures of inbreeding and genomic relationships. Genomic relationships/inbreeding can be classified into (i) homozygosity/heterozygosity based (e.g., molecular kinship matrix); (ii) genetic drift-based, i.e., changes of allele frequencies; or (iii) IBD-based, i.e., SNPs are used in linkage analyses to identify IBD segments. Here, alternative measures of inbreeding/relationship were used to manage genetic diversity in genomic optimal contribution (GOC) selection schemes. Contrary to classic inbreeding theory, it was found that drift and homozygosity-based inbreeding could differ substantially in GOC schemes unless diversity management was based upon IBD. When using a homozygosity-based measure of relationship, the inbreeding management resulted in allele frequency changes toward 0.5 giving a low rate of increase in homozygosity for the panel used for management, but not for unmanaged neutral loci, at the expense of a high genetic drift. When genomic relationship matrices were based on drift, following VanRaden and as in GCTA, drift was low at the expense of a high rate of increase in homozygosity. The use of IBD-based relationship matrices for inbreeding management limited both drift and the homozygosity-based rate of inbreeding to their target values. Genetic improvement per percent of inbreeding was highest when GOC used IBD-based relationships irrespective of the inbreeding measure used. Genomic relationships based on runs of homozygosity resulted in very high initial improvement per percent of inbreeding, but also in substantial discrepancies between drift and homozygosity-based rates of inbreeding, and resulted in a drift that exceeded its target value. The discrepancy between drift and homozygosity-based rates of inbreeding was caused by a covariance between initial allele frequency and the subsequent change in frequency, which becomes stronger when using data from whole genome sequence.
Collapse
Affiliation(s)
- Theo H E Meuwissen
- Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, Ås, Norway
| | | | | | - John A Woolliams
- The Roslin Institute and R(D)SVS, The University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|