1
|
Xia L, Shi M, Li H, Zhang W, Cheng Y, Xia XQ. PMSeeker: A Scheme Based on the Greedy Algorithm and the Exhaustive Algorithm to Screen Low-Redundancy Marker Sets for Large-Scale Parentage Assignment with Full Parental Genotyping. BIOLOGY 2024; 13:100. [PMID: 38392318 PMCID: PMC10886308 DOI: 10.3390/biology13020100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 01/30/2024] [Accepted: 01/31/2024] [Indexed: 02/24/2024]
Abstract
Parentage assignment is a genetic test that utilizes genetic characteristics, such as molecular markers, to identify the parental relationships within populations, which, in commercial fish farming, are almost always large and where full information on potential parents is known. To accurately find the true parents, the genotypes of all loci in the parentage marker set (PMS) are required for each individual being tested. With the same accuracy, a PMS containing a smaller number of markers will undoubtedly save experimental costs. Thus, this study established a scheme to screen low-redundancy PMSs using the exhaustive algorithm and greedy algorithm. When screening PMSs, the greedy algorithm selects markers based on the parental dispersity index (PDI), a uniquely defined metric that outperforms the probability of exclusion (PE). With the conjunctive use of the two algorithms, non-redundant PMSs were found for more than 99.7% of solvable cases in three groups of random sample experiments in this study. Then, a low-redundancy PMS can be composed using two or more of these non-redundant PMSs. This scheme effectively reduces the number of markers in PMSs, thus conserving human and experimental resources and laying the groundwork for the widespread implementation of parentage assignment technology in economic species breeding.
Collapse
Affiliation(s)
- Lei Xia
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture and Rural Affairs, The Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mijuan Shi
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture and Rural Affairs, The Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Heng Li
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture and Rural Affairs, The Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Wanting Zhang
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture and Rural Affairs, The Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Yingyin Cheng
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture and Rural Affairs, The Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Xiao-Qin Xia
- State Key Laboratory of Freshwater Ecology and Biotechnology, Hubei Hongshan Laboratory, Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture and Rural Affairs, The Innovation Academy of Seed Design, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
2
|
Analysis of Family Structure and Paternity Test of Tan Sheep in Yanchi Area, China. Animals (Basel) 2022; 12:ani12223099. [PMID: 36428327 PMCID: PMC9686711 DOI: 10.3390/ani12223099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Revised: 11/02/2022] [Accepted: 11/04/2022] [Indexed: 11/12/2022] Open
Abstract
Tan sheep is a special breed of locally protected sheep in China, one of the best quality meat sheep in the world. Due to the unclear pedigree of the rams on the Ningxia Tan sheep breeding farm, we investigated 74 rams in the field and explored a new method for family division. Genomic DNA was extracted from the blood of breeding rams. Using Plink software, GCTA tools and R language, we analyzed the genetic structure, kinship, and inbreeding coefficient of the breeding sheep, which revealed the genetic relationship between the individuals. The results showed that there was no obvious clustering phenomenon in the PCA, and the genetic background of the samples was similar. The G matrix and IBS distance matrix indicated that most individuals were far away from each other. Paternity testing identified 24 pairs of unknown parent-child pairs, and all the Tan sheep could be divided into 12 families, which provided a reference for sheep breeding. The average inbreeding coefficient based on the ROH of this population was 0.049, so there was a low degree of inbreeding and the rams in the field were able to maintain high genetic diversity. Overall, we explored a more accurate method through paternity and kinship analysis; it provides a scientific basis for pedigree construction, which has an important application value for Tan sheep breeding.
Collapse
|
3
|
Miller-Crews I, Matz MV, Hofmann HA. A 2b-RAD parentage analysis pipeline for complex and mixed DNA samples. Forensic Sci Int Genet 2021; 55:102590. [PMID: 34509741 DOI: 10.1016/j.fsigen.2021.102590] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 08/13/2021] [Accepted: 08/30/2021] [Indexed: 11/24/2022]
Abstract
Next-generation sequencing technology has revolutionized genotyping in many fields of study, yet parentage analysis often still relies on microsatellite markers that are costly to generate and are currently available only for a limited number of species. 2b-RAD sequencing (2b-RAD) is a DNA sequencing technique developed for ecological population genomics that utilizes type IIB restriction enzymes to generate consistent, uniform fragments across samples. This technology is inexpensive, effective with low DNA inputs, and robust to DNA degradation. Here, we developed a probabilistic genotyping-by-sequencing genetic testing pipeline for parentage analysis by using 2b-RAD for inferring familial relationships from mixed DNA samples and populations. Our approach to partial paternity assignment utilizes a novel weighted outlier paternity index (WOPI) adapted for next-generation sequencing data and an identity-by-state (IBS) matrix-based clustering method for pedigree reconstruction. The combination of these two parentage assignment methods overcomes two major obstacles faced by other genetic testing methods: 1) It allows detection of parentage when closely related or inbred individuals are in the alleged parent population (e.g., in laboratory strains); and 2) it resolves mixed DNA samples. We successfully demonstrate this novel approach by correctly inferring paternity for samples pooled from multiple offspring (i.e., entire clutches) in a highly inbred population of an East African cichlid fish. The unique advantages of 2b-RAD in combination with our bioinformatics pipeline enable straightforward and cost-effective parentage analysis in any species regardless of genomic resources available.
Collapse
Affiliation(s)
- Isaac Miller-Crews
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USA
| | - Mikhail V Matz
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USA
| | - Hans A Hofmann
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USA; Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, TX 78712, USA; Institue for Neuroscience, The University of Texas at Austin, Austin, TX 78712, USA.
| |
Collapse
|
4
|
Gebrehiwot NZ, Strucken EM, Marshall K, Aliloo H, Gibson JP. SNP panels for the estimation of dairy breed proportion and parentage assignment in African crossbred dairy cattle. Genet Sel Evol 2021; 53:21. [PMID: 33653262 PMCID: PMC7923343 DOI: 10.1186/s12711-021-00615-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 02/17/2021] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Understanding the relationship between genetic admixture and phenotypic performance is crucial for the optimization of crossbreeding programs. The use of small sets of informative ancestry markers can be a cost-effective option for the estimation of breed composition and for parentage assignment in situations where pedigree recording is difficult. The objectives of this study were to develop small single nucleotide polymorphism (SNP) panels that can accurately estimate the total dairy proportion and assign parentage in both West and East African crossbred dairy cows. METHODS Medium- and high-density SNP genotype data (Illumina BovineSNP50 and BovineHD Beadchip) for 4231 animals sampled from African crossbreds, African Bos taurus, European Bos taurus, Bos indicus, and African indigenous populations were used. For estimating breed composition, the absolute differences in allele frequency were calculated between pure ancestral breeds to identify SNPs with the highest discriminating power, and different combinations of SNPs weighted by ancestral origin were tested against estimates based on all available SNPs. For parentage assignment, informative SNPs were selected based on the highest minor allele frequency (MAF) in African crossbred populations assuming two Scenarios: (1) parents were selected among all the animals with known genotypes, and (2) parents were selected only among the animals known to be a parent of at least one progeny. RESULTS For the medium-density genotype data, SNPs selected for the largest differences in allele frequency between West African indigenous and European Bos taurus breeds performed best for most African crossbred populations and achieved a prediction accuracy (r2) for breed composition of 0.926 to 0.961 with 200 SNPs. For the high-density dataset, a panel with 70% of the SNPs selected on their largest difference in allele frequency between African and European Bos taurus performed best or very near best across all crossbred populations with r2 ranging from 0.978 to 0.984 with 200 SNPs. In all African crossbred populations, unambiguous parentage assignment was possible with ≥ 300 SNPs for the majority of the panels for Scenario 1 and ≥ 200 SNPs for Scenario 2. CONCLUSIONS The identified low-cost SNP assays could overcome incomplete or inaccurate pedigree records in African smallholder systems and allow effective breeding decisions to produce progeny of desired breed composition.
Collapse
Affiliation(s)
- Netsanet Z. Gebrehiwot
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW 2351 Australia
| | - Eva M. Strucken
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW 2351 Australia
| | - Karen Marshall
- International Livestock Research Institute and Centre for Tropical Livestock Genetics and Health, Nairobi, Kenya
| | - Hassan Aliloo
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW 2351 Australia
| | - John P. Gibson
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW 2351 Australia
| |
Collapse
|
5
|
Hu LR, Li D, Chu Q, Wang YC, Zhou L, Yu Y, Zhang Y, Zhang SL, Usman T, Xie ZQ, Hou SY, Liu L, Shi WH. Selection and implementation of single nucleotide polymorphism markers for parentage analysis in crossbred cattle population. Animal 2020; 15:100066. [PMID: 33516033 DOI: 10.1016/j.animal.2020.100066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 08/30/2020] [Accepted: 09/07/2020] [Indexed: 12/01/2022] Open
Abstract
Crossbreeding is an essential way of improving herd performance. However, frequent parentage record errors appear, which results in the lower accuracy of genetic parameter estimation and genetic evaluation. This study aims to build a single nucleotide polymorphism (SNP) panel with sufficient power for parentage testing in the crossbred population of Simmental and Holstein cattle. The direct sequencing technique in PCR products of pooling DNA along with matrix-assisted laser desorption/ionization time-of-flight MS method for genotyping the individuals was applied. A panel comprising 50 highly informative SNPs for parentage analysis was developed in the crossbred population. The average minor allele frequency for SNPs was 0.43, and the cumulative probability of exclusion for single-parent and both-parent inference met 0.99797 and 0.999999, respectively. The maker-set for parentage verification was then used in a group of 81 trios with aid of the likelihood-based parentage-assignment program of Cervus software. Reconfirmation with on-farm records showed that this 50-SNP system could provide sufficient and reliable information for parentage testing with the parental errors for mother-offspring and sire-offspring being 8.6 and 18.5%, respectively. In conclusion, a set of low-cost and efficient SNPs for the paternity testing in the Simmental and Holstein crossbred population are provided.
Collapse
Affiliation(s)
- L R Hu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China
| | - D Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China; Beijing Xiangzhong Biotechnology Co. LTD, No. 1 Nongda South Road, Haidian District, Beijing 100080, PR China
| | - Q Chu
- Institute of Animal Husbandry and Veterinary Medicine, Beijing Academy of Agriculture and Forestry Sciences, No. 33 Zhanghua Road, Haidian District, Beijing 100097, PR China
| | - Y C Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China.
| | - L Zhou
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China
| | - Y Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China
| | - Y Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China
| | - S L Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China
| | - T Usman
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory for Animal Breeding, Beijing Engineering Technology Research Center of Raw Milk Quality and Safety Control, College of Animal Science and Technology, China Agricultural University, No. 4 Yuanmingyuan West Road, Haidian District, Beijing 100193, PR China; College of Veterinary Sciences and Animal Husbandry, Abdul Wali Khan University, Turu Road, Near Sheikh Maltoon Town, Mardan 23200, Pakistan
| | - Z Q Xie
- Anshan Hengli Dairy Farm, Shanchengzi Village, Anshan, Liaoning 114200, PR China
| | - S Y Hou
- Anshan Hengli Dairy Farm, Shanchengzi Village, Anshan, Liaoning 114200, PR China
| | - L Liu
- Beijing Dairy Cattle Center, Qinghe Road, Haidian District, Beijing 100192, PR China
| | - W H Shi
- Beijing Dairy Cattle Center, Qinghe Road, Haidian District, Beijing 100192, PR China
| |
Collapse
|
6
|
Sanarana YP, Maiwashe A, Berry DP, Banga C, van Marle-Köster E. Evaluation of the International Society for Animal Genetics bovine single nucleotide polymorphism parentage panel in South African Bonsmara and Drakensberger cattle. Trop Anim Health Prod 2020; 53:32. [PMID: 33230675 DOI: 10.1007/s11250-020-02481-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 11/11/2020] [Indexed: 10/22/2022]
Abstract
A panel of 200 single nucleotide polymorphisms (SNPs) have been recommended by the International Society for Animal Genetics (ISAG) for use in parentage verification of cattle. While the SNPs included on the ISAG panel are segregating in European Bos taurus and Bos indicus breeds, their applicability in South African (SA) Sanga cattle has never been evaluated. This study, therefore, assessed the usefulness of the ISAG panel in SA Bonsmara (BON) and Drakensberger (DRB) cattle. Genotypes of 185 ISAG SNPs from 64 BON and 97 DRB sire-offspring pairs were available, all of which were validated with 119,375 SNPs. Of the 185 ISAG SNPs, 14 and 18 in the BON and DRB, respectively (9 in common to both breeds), were either monomorphic, exhibited at least one discordance between validated sire-offspring pairs, or had poor call rate or clustering issue. The mean minor allele frequency of the 185 ISAG SNPs was 0.331 in the BON and 0.359 in the DRB. The combined probability of parentage exclusion (PE) was the same (99.46%) for both breeds, while the probability of identity varied from 1.61 × 10-48 (BON) to 1.11 × 10-54 (DRB). Fifteen (23.4%) and 32 (33%) of the already validated sire-offspring pairs for the BON and DRB, respectively, were determined by the ISAG panel to be false-negatives based on a threshold of having at least two discordant SNPs. In comparison to sire discovery using the 119,375 SNPs, sire discovery using only the ISAG panel identified correctly 44 (out of 64 identified using the 119,375 SNPs) unique sire-offspring BON pairs and 62 (out of 97 identified using the 119,375 SNPs) unique sire-offspring DRB when all sires were masked. Five BON and three DRB offspring had > 1 sire nominated. This study demonstrated that the use of the ISAG panel may result in incorrect exclusions and multiple candidate sires for a given animal. Selection of more informative SNPs is, therefore, necessary in the pursuit of a low-cost and effective SNP panel for indigenous cattle breeds in SA.
Collapse
Affiliation(s)
- Yandisiwe P Sanarana
- Department of Animal and Wildlife Science, University of Pretoria, Hatfield, Pretoria, 0002, South Africa. .,Agricultural Research Council-Animal Production, Irene, Pretoria, 0062, South Africa.
| | - Azwihangwisi Maiwashe
- Agricultural Research Council-Animal Production, Irene, Pretoria, 0062, South Africa
| | - Donagh P Berry
- Department of Animal and Wildlife Science, University of Pretoria, Hatfield, Pretoria, 0002, South Africa.,Teagasc, Animal & Grassland Research and Innovation Center, Moorepark, Fermoy, Co. Cork, Ireland
| | - Cuthbert Banga
- Agricultural Research Council-Animal Production, Irene, Pretoria, 0062, South Africa
| | - Este van Marle-Köster
- Department of Animal and Wildlife Science, University of Pretoria, Hatfield, Pretoria, 0002, South Africa
| |
Collapse
|
7
|
Whalen A, Gorjanc G, Hickey JM. Parentage assignment with genotyping-by-sequencing data. J Anim Breed Genet 2019; 136:102-112. [PMID: 30548685 PMCID: PMC6392119 DOI: 10.1111/jbg.12370] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/09/2018] [Accepted: 11/10/2018] [Indexed: 01/04/2023]
Abstract
In this paper, we evaluate using genotype-by-sequencing (GBS) data to perform parentage assignment in lieu of traditional array data. The use of GBS data raises two issues: First, for low-coverage (e.g., <2×) GBS data, it may not be possible to call the genotype at many loci, a critical first step for detecting opposing homozygous markers. Second, the amount of sequencing coverage may vary across individuals, making it challenging to directly compare the likelihood scores between putative parents. To address these issues, we extend the probabilistic framework of Huisman (Molecular Ecology Resources, 2017, 17, 1009) and evaluate putative parents by comparing their (potentially noisy) genotypes to a series of proposal distributions. These distributions describe the expected genotype probabilities for the relatives of an individual. We assign putative parents as a parent if they are classified as a parent (as opposed to e.g., an unrelated individual), and if the assignment score passes a threshold. We evaluated this method on simulated data and found that (a) high-coverage (>2×) GBS data performs similarly to array data and requires only a small number of markers to correctly assign parents and (b) low-coverage GBS data (as low as 0.1×) can also be used, provided that it is obtained across a large number of markers. When analysing the low-coverage GBS data, we also found a high number of false positives if the true parent is not contained within the list of candidate parents, but that this false positive rate can be greatly reduced by hand tuning the assignment threshold. We provide this parentage assignment method as a standalone program called AlphaAssign.
Collapse
Affiliation(s)
- Andrew Whalen
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianUK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianUK
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianUK
| |
Collapse
|
8
|
Flanagan SP, Jones AG. The future of parentage analysis: From microsatellites to SNPs and beyond. Mol Ecol 2019; 28:544-567. [PMID: 30575167 DOI: 10.1111/mec.14988] [Citation(s) in RCA: 76] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 11/30/2018] [Accepted: 12/03/2018] [Indexed: 12/14/2022]
Abstract
Parentage analysis is a cornerstone of molecular ecology that has delivered fundamental insights into behaviour, ecology and evolution. Microsatellite markers have long been the king of parentage, their hypervariable nature conferring sufficient power to correctly assign offspring to parents. However, microsatellite markers have seen a sharp decline in use with the rise of next-generation sequencing technologies, especially in the study of population genetics and local adaptation. The time is ripe to review the current state of parentage analysis and see how it stands to be affected by the emergence of next-generation sequencing approaches. We find that single nucleotide polymorphisms (SNPs), the typical next-generation sequencing marker, remain underutilized in parentage analysis but are gaining momentum, with 58 SNP-based parentage analyses published thus far. Many of these papers, particularly the earlier ones, compare the power of SNPs and microsatellites in a parentage context. In virtually every case, SNPs are at least as powerful as microsatellite markers. As few as 100-500 SNPs are sufficient to resolve parentage completely in most situations. We also provide an overview of the analytical programs that are commonly used and compatible with SNP data. As the next-generation parentage enterprise grows, a reliance on likelihood and Bayesian approaches, as opposed to strict exclusion, will become increasingly important. We discuss some of the caveats surrounding the use of next-generation sequencing data for parentage analysis and conclude that the future is bright for this important realm of molecular ecology.
Collapse
Affiliation(s)
- Sarah P Flanagan
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Adam G Jones
- Department of Biological Sciences, University of Idaho, Moscow, Idaho
| |
Collapse
|
9
|
Oleński K, Kamiński S, Tokarska M, Hering DM. Subset of SNPs for parental identification in European bison Lowland-Białowieża line (Bison bonasus bonasus). CONSERV GENET RESOUR 2018. [DOI: 10.1007/s12686-017-0768-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
10
|
Purfield DC, McClure M, Berry DP. Justification for setting the individual animal genotype call rate threshold at eighty-five percent. J Anim Sci 2017; 94:4558-4569. [PMID: 27898963 DOI: 10.2527/jas.2016-0802] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Data quality of SNP arrays impacts the accuracy and precision of downstream data analyses. One such quality control measure often imposed is a threshold on individual animal call rate. Different call rate thresholds have been applied across studies; little is known, however, about the impact of these thresholds on the quality of the genotype data. The objective of the present study was to investigate the effect of different call rate thresholds on the integrity of the genotypes but also to quantify the contribution of different factors to the variability in animal call rate. Data included 142,342 samples genotyped on a custom Illumina genotype panel from 141,591 dairy and beef cattle; the number of Illumina SNP on the panel was 14,371. The mean animal call rate across all samples was 99.09%; 487 animals had both a low call rate (<99%) and a subsequent high call rate (≥99%) after resampling and regenotyping. Several factors were associated ( < 0.001) with individual call rate including animal sex, the sampling herd, the date of genotyping, the genotyping plate, and the plate well. The genotype and allele concordance between the genotypes of the 487 low- and high-call rate individuals improved at a diminishing rate as mean animal call rate increased. Mean genotype and allele concordance rates of 0.987 and 0.997, respectively, existed when animal call rate was between 85 and 90%, increasing to 0.998 and 0.999, respectively, when animal call rate was between 95 and <99%. The mean within-animal allele concordance rate of rare variants (i.e., minor allele frequency < 0.05) between low and high genotype call rate animals increased when animal call rate improved; an allele concordance rate of 1.00 was achieved when animal call rate was between 85 and <99%. The accuracy of imputation of the nonobserved genotypes in the low-call rate animals improved as animal call rate increased; the mean genotype concordance rate of the imputed nonobserved SNP was 0.41 when animal call rate was <40% but increased to 0.95 when animal call rate was between 95 and <99%. Parentage validation, determined by the count of opposing homozygotes in a parent-progeny pair, was unreliable when animal call rate was <85%. Therefore, to ensure the provision of high-quality genotypes while also considering the cost and inconvenience of resampling and regenotyping, we suggest a minimum animal call rate threshold of 85%.
Collapse
|
11
|
Talenti A, Nicolazzi EL, Chessa S, Frattini S, Moretti R, Coizet B, Nicoloso L, Colli L, Pagnacco G, Stella A, Ajmone-Marsan P, Ptak G, Crepaldi P. A method for single nucleotide polymorphism selection for parentage assessment in goats. J Dairy Sci 2016; 99:3646-3653. [PMID: 26971153 DOI: 10.3168/jds.2015-10077] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 01/25/2016] [Indexed: 11/19/2022]
Abstract
Accurate pedigrees are essential to optimize genetic improvement and conservation of animal genetic resources. In goats, the use of mating groups and kidding management procedures hamper the identification of parentage. Small panels of single nucleotide polymorphisms (SNP) have been proposed in other species to substitute microsatellites for parentage assessment. Using data from the current GoatSNP50 chip, we developed a new 3-step procedure to identify a low-density SNP panel for highly accurate parentage assessment. Methodologies for SNP selection used in other species are less suitable in the goat because of uncertainties in the genome assembly. The procedure developed in this study is based on parent-offspring identification and on estimation of Mendelian errors, followed by canonical discriminant analysis identification and stepwise regression reduction. Starting from a reference sample of 109 Alpine goats with known pedigree relationships, we first identified a panel of 200 SNP that was further reduced to 2 final panels of 130 and 114 SNP with random coincidental match inclusion of 1.51×10(-57) and 2.94×10(-34), respectively. In our reference data set, all panels correctly identified all parent-offspring combinations, revealing a 40% pedigree error rate in the information provided by breeders. All reference trios were confirmed by official tests based on microsatellites. Panels were also tested on Saanen and Teramana breeds. Although the testing on a larger set of breeds in the reference population is still needed to validate these results, our findings suggest that our procedure could identify SNP panels for accurate parentage assessment in goats or in other species with unreliable marker positioning.
Collapse
Affiliation(s)
- A Talenti
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, Università di Milano, via Celoria 10, Milano, 20133, Italy.
| | - E L Nicolazzi
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - S Chessa
- Istituto di Biologia e Biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - S Frattini
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, Università di Milano, via Celoria 10, Milano, 20133, Italy
| | - R Moretti
- Istituto di Biologia e Biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - B Coizet
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, Università di Milano, via Celoria 10, Milano, 20133, Italy
| | - L Nicoloso
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, Università di Milano, via Celoria 10, Milano, 20133, Italy
| | - L Colli
- Istituto di Zootecnica, Università Cattolica del Sacro Cuore Via Emilia Parmense, Piacenza, 29122, Italy
| | - G Pagnacco
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, Università di Milano, via Celoria 10, Milano, 20133, Italy
| | - A Stella
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy; Istituto di Biologia e Biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - P Ajmone-Marsan
- Istituto di Zootecnica, Università Cattolica del Sacro Cuore Via Emilia Parmense, Piacenza, 29122, Italy
| | - G Ptak
- Dipartimento di Scienze Biomediche Comparate, Università di Teramo, Piazza Aldo Moro 45, Teramo, 64100, Italy; Institute of Animal Production, ul. Sarego 2, Krakow 31-047, Poland
| | - P Crepaldi
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, Università di Milano, via Celoria 10, Milano, 20133, Italy
| |
Collapse
|
12
|
Pozzi A, Previtali C, Cenadelli S, Gandini L, Galli A, Bongioni G. Genetic traceability of cattle using an OpenArray genotyping platform. Anim Genet 2015; 47:133-4. [PMID: 26427605 DOI: 10.1111/age.12359] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/07/2015] [Indexed: 11/28/2022]
Affiliation(s)
- Anna Pozzi
- Istituto Sperimentale Italiano Lazzaro Spallanzani, Loc. La Quercia, 26027, Rivolta d'Adda, CR, Italy
| | - Cristina Previtali
- Istituto Sperimentale Italiano Lazzaro Spallanzani, Loc. La Quercia, 26027, Rivolta d'Adda, CR, Italy
| | - Silvia Cenadelli
- Istituto Sperimentale Italiano Lazzaro Spallanzani, Loc. La Quercia, 26027, Rivolta d'Adda, CR, Italy
| | - Luca Gandini
- Istituto Sperimentale Italiano Lazzaro Spallanzani, Loc. La Quercia, 26027, Rivolta d'Adda, CR, Italy
| | - Andrea Galli
- Istituto Sperimentale Italiano Lazzaro Spallanzani, Loc. La Quercia, 26027, Rivolta d'Adda, CR, Italy.,Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Centro di Ricerca per le Produzioni Foraggere e Lattiero Casearie (CRA-FLC), Viale Piacenza 29, 26900, Lodi, Italy
| | - Graziella Bongioni
- Istituto Sperimentale Italiano Lazzaro Spallanzani, Loc. La Quercia, 26027, Rivolta d'Adda, CR, Italy
| |
Collapse
|
13
|
Strucken EM, Lee SH, Lee HK, Song KD, Gibson JP, Gondro C. How many markers are enough? Factors influencing parentage testing in different livestock populations. J Anim Breed Genet 2015; 133:13-23. [PMID: 26234440 DOI: 10.1111/jbg.12179] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 05/12/2015] [Indexed: 01/13/2023]
Abstract
Reliability of parentage test panels is usually based on its power to exclude wrong parentage assignments based on allele frequencies. We evaluated the rates of false exclusions and inclusions in parentage assignments, and how these results are affected by allele frequencies, panel sizes and the number of allowed mismatches. We also evaluated the reliability of parentage testing by comparing populations with distinct genetic backgrounds using pure and composite families of cattle and sheep. Allowing for 1% genotype mismatches in true parent-offspring relations provided the best compromise between false-positive and false-negative assignments. Pure breeds needed at least 200-210 single-nucleotide polymorphism (SNP) markers to correctly assign relations, but between 700 and 890 markers to avoid assigning incorrect relationships. Composite breeds needed between 220 (sheep) and 500 (cattle) markers for correct assignment; 680 (cattle) to 4400 (sheep) SNPs were needed to eliminate false-positive assignments. Allowing 0% genotype mismatches decreased false-positive but increased false-negative assignments, whilst a higher threshold of 2% showed the opposite effects. Panels with high minor allele frequencies (0.35-0.45) provided the best chance for correct parentage resolutions requiring fewer markers. Further, we propose that a dynamic threshold would allow adapting to population specific error rates. A comparison to the performance of the official International Society for Animal Genetics SNP panel for cattle and a recently published SNP panel for sheep showed that randomly selected markers performed only slightly worse for the applied parentage test based on opposing homozygotes. This suggests that even with carefully selected panels, only marginal assignment improvements are obtainable for a particular number of SNPs. The main point for improvement is the number of markers used. We recommend using at least 200 SNP markers for parentage testing if the aim is to reduce false-negative results. To fully exclude false positives at least 700 markers are required.
Collapse
Affiliation(s)
- E M Strucken
- The Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - S H Lee
- Hanwoo Experiment Station, National Institute of Animal Science, RDA, Pyeonchang, Korea
| | - H K Lee
- Department of Animal Biotechnology, Chonbuk National University, Jeonju, Republic of Korea
| | - K D Song
- Department of Animal Biotechnology, Chonbuk National University, Jeonju, Republic of Korea
| | - J P Gibson
- The Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - C Gondro
- The Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| |
Collapse
|
14
|
Garritsen C, Van Marle-Köster E, Snyman M, Visser C. The impact of DNA parentage verification on breeding value estimation and sire ranking in South African Angora goats. Small Rumin Res 2015. [DOI: 10.1016/j.smallrumres.2015.01.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
15
|
Mastrangelo S, Di Gerlando R, Tolone M, Tortorici L, Sardina MT, Portolano B. Genome wide linkage disequilibrium and genetic structure in Sicilian dairy sheep breeds. BMC Genet 2014; 15:108. [PMID: 25928374 PMCID: PMC4197223 DOI: 10.1186/s12863-014-0108-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 09/29/2014] [Indexed: 11/19/2022] Open
Abstract
Background The recent availability of sheep genome-wide SNP panels allows providing background information concerning genome structure in domestic animals. The aim of this work was to investigate the patterns of linkage disequilibrium (LD), the genetic diversity and population structure in Valle del Belice, Comisana, and Pinzirita dairy sheep breeds using the Illumina Ovine SNP50K Genotyping array. Results Average r2 between adjacent SNPs across all chromosomes was 0.155 ± 0.204 for Valle del Belice, 0.156 ± 0.208 for Comisana, and 0.128 ± 0.188 for Pinzirita breeds, and some variations in LD value across chromosomes were observed, in particular for Valle del Belice and Comisana breeds. Average values of r2 estimated for all pairwise combinations of SNPs pooled over all autosomes were 0.058 ± 0.023 for Valle del Belice, 0.056 ± 0.021 for Comisana, and 0.037 ± 0.017 for Pinzirita breeds. The LD declined as a function of distance and average r2 was lower than the values observed in other sheep breeds. Consistency of results among the several used approaches (Principal component analysis, Bayesian clustering, FST, Neighbor networks) showed that while Valle del Belice and Pinzirita breeds formed a unique cluster, Comisana breed showed the presence of substructure. In Valle del Belice breed, the high level of genetic differentiation within breed, the heterogeneous cluster in Admixture analysis, but at the same time the highest inbreeding coefficient, suggested that the breed had a wide genetic base with inbred individuals belonging to the same flock. The Sicilian breeds were characterized by low genetic differentiation and high level of admixture. Pinzirita breed displayed the highest genetic diversity (He, Ne) whereas the lowest value was found in Valle del Belice breed. Conclusions This study has reported for the first time estimates of LD and genetic diversity from a genome-wide perspective in Sicilian dairy sheep breeds. Our results indicate that breeds formed non-overlapping clusters and are clearly separated populations and that Comisana sheep breed does not constitute a homogenous population. The information generated from this study has important implications for the design and applications of association studies as well as for development of conservation and/or selection breeding programs. Electronic supplementary material The online version of this article (doi:10.1186/s12863-014-0108-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Salvatore Mastrangelo
- Dipartimento Scienze Agrarie e Forestali, Università degli Studi di Palermo, Viale delle Scienze, 90128, Palermo, Italy.
| | - Rosalia Di Gerlando
- Dipartimento Scienze Agrarie e Forestali, Università degli Studi di Palermo, Viale delle Scienze, 90128, Palermo, Italy.
| | - Marco Tolone
- Dipartimento Scienze Agrarie e Forestali, Università degli Studi di Palermo, Viale delle Scienze, 90128, Palermo, Italy.
| | - Lina Tortorici
- Dipartimento Scienze Agrarie e Forestali, Università degli Studi di Palermo, Viale delle Scienze, 90128, Palermo, Italy.
| | - Maria Teresa Sardina
- Dipartimento Scienze Agrarie e Forestali, Università degli Studi di Palermo, Viale delle Scienze, 90128, Palermo, Italy.
| | - Baldassare Portolano
- Dipartimento Scienze Agrarie e Forestali, Università degli Studi di Palermo, Viale delle Scienze, 90128, Palermo, Italy.
| | | |
Collapse
|
16
|
Silva MV, dos Santos DJ, Boison SA, Utsunomiya AT, Carmo AS, Sonstegard TS, Cole JB, Van Tassell CP. The development of genomics applied to dairy breeding. Livest Sci 2014. [DOI: 10.1016/j.livsci.2014.05.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
17
|
Fernández ME, Rogberg-Muñoz A, Lirón JP, Goszczynski DE, Ripoli MV, Carino MH, Peral-García P, Giovambattista G. Effectiveness of single-nucleotide polymorphisms to investigate cattle rustling. J Forensic Sci 2014; 59:1607-13. [PMID: 25039316 DOI: 10.1111/1556-4029.12562] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 10/05/2013] [Accepted: 10/26/2013] [Indexed: 11/26/2022]
Abstract
Short tandem repeats (STR)s have been the eligible markers for forensic animal genetics, despite single-nucleotide polymorphisms (SNP)s became acceptable. The technology, the type, and amount of markers could limit the investigation in degraded forensic samples. The performance of a 32-SNP panel genotyped through OpenArrays(TM) (real-time PCR based) was evaluated to resolve cattle-specific forensic cases. DNA from different biological sources was used, including samples from an alleged instance of cattle rustling. SNPs and STRs performance and repeatability were compared. SNP call rate was variable among sample type (average = 80.18%), while forensic samples showed the lowest value (70.94%). The repeatability obtained (98.7%) supports the used technology. SNPs had better call rates than STRs in 12 of 20 casework samples, while forensic index values were similar for both panels. In conclusion, the 32-SNPs used are as informative as the standard bovine STR battery and hence are suitable to resolve cattle rustling investigations.
Collapse
Affiliation(s)
- María E Fernández
- Instituto de Genética Veterinaria (IGEVET), CCT La Plata - CONICET - Fac Cs Veterinarias, UNLP, 60 Y 118 S/N, 1900, La Plata, Argentina
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Heaton MP, Leymaster KA, Kalbfleisch TS, Kijas JW, Clarke SM, McEwan J, Maddox JF, Basnayake V, Petrik DT, Simpson B, Smith TPL, Chitko-McKown CG. SNPs for parentage testing and traceability in globally diverse breeds of sheep. PLoS One 2014; 9:e94851. [PMID: 24740156 PMCID: PMC3989260 DOI: 10.1371/journal.pone.0094851] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2013] [Accepted: 03/19/2014] [Indexed: 01/02/2023] Open
Abstract
DNA-based parentage determination accelerates genetic improvement in sheep by increasing pedigree accuracy. Single nucleotide polymorphism (SNP) markers can be used for determining parentage and to provide unique molecular identifiers for tracing sheep products to their source. However, the utility of a particular "parentage SNP" varies by breed depending on its minor allele frequency (MAF) and its sequence context. Our aims were to identify parentage SNPs with exceptional qualities for use in globally diverse breeds and to develop a subset for use in North American sheep. Starting with genotypes from 2,915 sheep and 74 breed groups provided by the International Sheep Genomics Consortium (ISGC), we analyzed 47,693 autosomal SNPs by multiple criteria and selected 163 with desirable properties for parentage testing. On average, each of the 163 SNPs was highly informative (MAF≥0.3) in 48±5 breed groups. Nearby polymorphisms that could otherwise confound genetic testing were identified by whole genome and Sanger sequencing of 166 sheep from 54 breed groups. A genetic test with 109 of the 163 parentage SNPs was developed for matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry. The scoring rates and accuracies for these 109 SNPs were greater than 99% in a panel of North American sheep. In a blinded set of 96 families (sire, dam, and non-identical twin lambs), each parent of every lamb was identified without using the other parent's genotype. In 74 ISGC breed groups, the median estimates for probability of a coincidental match between two animals (PI), and the fraction of potential adults excluded from parentage (PE) were 1.1×10(-39) and 0.999987, respectively, for the 109 SNPs combined. The availability of a well-characterized set of 163 parentage SNPs facilitates the development of high-throughput genetic technologies for implementing accurate and economical parentage testing and traceability in many of the world's sheep breeds.
Collapse
Affiliation(s)
- Michael P. Heaton
- U.S. Meat Animal Research Center (USMARC), Clay Center, Nebraska, United States of America
- * E-mail:
| | - Kreg A. Leymaster
- U.S. Meat Animal Research Center (USMARC), Clay Center, Nebraska, United States of America
| | - Theodore S. Kalbfleisch
- Department of Biochemistry and Molecular Biology, School of Medicine, University of Louisville, Louisville, Kentucky, United States of America
| | - James W. Kijas
- Division of Animal, Food and Health Sciences, CSIRO, Brisbane, Australia
| | | | - John McEwan
- AgResearch, Invermay Agricultural Center, Mosgiel, New Zealand
| | | | | | - Dustin T. Petrik
- GeneSeek, a Neogen company, Lincoln, Nebraska, United States of America
| | - Barry Simpson
- GeneSeek, a Neogen company, Lincoln, Nebraska, United States of America
| | - Timothy P. L. Smith
- U.S. Meat Animal Research Center (USMARC), Clay Center, Nebraska, United States of America
| | - Carol G. Chitko-McKown
- U.S. Meat Animal Research Center (USMARC), Clay Center, Nebraska, United States of America
| | | |
Collapse
|
19
|
Clarke SM, Henry HM, Dodds KG, Jowett TWD, Manley TR, Anderson RM, McEwan JC. A high throughput single nucleotide polymorphism multiplex assay for parentage assignment in New Zealand sheep. PLoS One 2014; 9:e93392. [PMID: 24740141 PMCID: PMC3989167 DOI: 10.1371/journal.pone.0093392] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Accepted: 03/04/2014] [Indexed: 11/19/2022] Open
Abstract
Accurate pedigree information is critical to animal breeding systems to ensure the highest rate of genetic gain and management of inbreeding. The abundance of available genomic data, together with development of high throughput genotyping platforms, means that single nucleotide polymorphisms (SNPs) are now the DNA marker of choice for genomic selection studies. Furthermore the superior qualities of SNPs compared to microsatellite markers allows for standardization between laboratories; a property that is crucial for developing an international set of markers for traceability studies. The objective of this study was to develop a high throughput SNP assay for use in the New Zealand sheep industry that gives accurate pedigree assignment and will allow a reduction in breeder input over lambing. This required two phases of development- firstly, a method of extracting quality DNA from ear-punch tissue performed in a high throughput cost efficient manner and secondly a SNP assay that has the ability to assign paternity to progeny resulting from mob mating. A likelihood based approach to infer paternity was used where sires with the highest LOD score (log of the ratio of the likelihood given parentage to likelihood given non-parentage) are assigned. An 84 “parentage SNP panel” was developed that assigned, on average, 99% of progeny to a sire in a problem where there were 3,000 progeny from 120 mob mated sires that included numerous half sib sires. In only 6% of those cases was there another sire with at least a 0.02 probability of paternity. Furthermore dam information (either recorded, or by genotyping possible dams) was absent, highlighting the SNP test’s suitability for paternity testing. Utilization of this parentage SNP assay will allow implementation of progeny testing into large commercial farms where the improved accuracy of sire assignment and genetic evaluations will increase genetic gain in the sheep industry.
Collapse
Affiliation(s)
- Shannon M. Clarke
- Animal Genomics, AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
- * E-mail:
| | - Hannah M. Henry
- Animal Genomics, AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| | - Ken G. Dodds
- Animal Genomics, AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| | | | - Tim R. Manley
- Animal Genomics, AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| | - Rayna M. Anderson
- Animal Genomics, AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| | - John C. McEwan
- Animal Genomics, AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| |
Collapse
|
20
|
Lopes MS, Silva FF, Harlizius B, Duijvesteijn N, Lopes PS, Guimarães SE, Knol EF. Improved estimation of inbreeding and kinship in pigs using optimized SNP panels. BMC Genet 2013; 14:92. [PMID: 24063757 PMCID: PMC3849284 DOI: 10.1186/1471-2156-14-92] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 09/19/2013] [Indexed: 01/14/2023] Open
Abstract
Background Traditional breeding programs consider an average pairwise kinship between sibs. Based on pedigree information, the relationship matrix is used for genetic evaluations disregarding variation due to Mendelian sampling. Therefore, inbreeding and kinship coefficients are either over or underestimated resulting in reduction of accuracy of genetic evaluations and genetic progress. Single nucleotide polymorphism (SNPs) can be used to estimate pairwise kinship and individual inbreeding more accurately. The aim of this study was to optimize the selection of markers and determine the required number of SNPs for estimation of kinship and inbreeding. Results A total of 1,565 animals from three commercial pig populations were analyzed for 28,740 SNPs from the PorcineSNP60 Beadchip. Mean genomic inbreeding was higher than pedigree-based estimates in lines 2 and 3, but lower in line 1. As expected, a larger variation of genomic kinship estimates was observed for half and full sibs than for pedigree-based kinship reflecting Mendelian sampling. Genomic kinship between father-offspring pairs was lower (0.23) than the estimate based on pedigree (0.26). Bootstrap analyses using six reduced SNP panels (n = 500, 1000, 1500, 2000, 2500 and 3000) showed that 2,000 SNPs were able to reproduce the results very close to those obtained using the full set of unlinked markers (n = 7,984-10,235) with high correlations (inbreeding r > 0.82 and kinship r > 0.96) and low variation between different sets with the same number of SNPs. Conclusions Variation of kinship between sibs due to Mendelian sampling is better captured using genomic information than the pedigree-based method. Therefore, the reduced sets of SNPs could generate more accurate kinship coefficients between sibs than the pedigree-based method. Variation of genomic kinship of father-offspring pairs is recommended as a parameter to determine accuracy of the method rather than correlation with pedigree-based estimates. Inbreeding and kinship coefficients can be estimated with high accuracy using ≥2,000 unlinked SNPs within all three commercial pig lines evaluated. However, a larger number of SNPs might be necessary in other populations or across lines.
Collapse
Affiliation(s)
- Marcos S Lopes
- TOPIGS Research Center IPG B,V,, P,O, Box 43, 6640 AA, Beuningen, the Netherlands.
| | | | | | | | | | | | | |
Collapse
|
21
|
Gudex B, Walker M, Fisher P, Spelman R. Validation of a single-nucleotide polymorphism panel for parentage testing of farmed red deer. Anim Genet 2013; 45:142-3. [PMID: 24033517 DOI: 10.1111/age.12085] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/26/2013] [Indexed: 11/30/2022]
Abstract
Red deer behaviour prevents the accurate physical matching of dams to fawns and, therefore, breeders have to rely on DNA-based parentage testing. A panel consisting of 100 single nucleotide polymorphism markers, with an average minor allele frequency of 0.25, was able to resolve 92% of fawns to both parents. In comparison, an existing 12-marker microsatellite panel was able to resolve 68% of fawns to both parents. When excluding dam DNA information, the single nucleotide polymorphism panel matched 81% of the fawns to their sires and the microsatellite panel 71%.
Collapse
Affiliation(s)
- B Gudex
- Livestock Improvement Corporation Ltd, Hamilton, 3240, New Zealand
| | | | | | | |
Collapse
|
22
|
Fernández ME, Goszczynski DE, Lirón JP, Villegas-Castagnasso EE, Carino MH, Ripoli MV, Rogberg-Muñoz A, Posik DM, Peral-García P, Giovambattista G. Comparison of the effectiveness of microsatellites and SNP panels for genetic identification, traceability and assessment of parentage in an inbred Angus herd. Genet Mol Biol 2013; 36:185-91. [PMID: 23885200 PMCID: PMC3715284 DOI: 10.1590/s1415-47572013000200008] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2012] [Accepted: 09/28/2012] [Indexed: 11/22/2022] Open
Abstract
During the last decade, microsatellites (short tandem repeats or STRs) have been successfully used for animal genetic identification, traceability and paternity, although in recent year single nucleotide polymorphisms (SNPs) have been increasingly used for this purpose. An efficient SNP identification system requires a marker set with enough power to identify individuals and their parents. Genetic diagnostics generally include the analysis of related animals. In this work, the degree of information provided by SNPs for a consanguineous herd of cattle was compared with that provided by STRs. Thirty-six closely related Angus cattle were genotyped for 18 STRs and 116 SNPs. Cumulative SNPs exclusion power values (Q) for paternity and sample matching probability (MP) yielded values greater than 0.9998 and 4.32E(-42), respectively. Generally 2-3 SNPs per STR were needed to obtain an equivalent Q value. The MP showed that 24 SNPs were equivalent to the ISAG (International Society for Animal Genetics) minimal recommended set of 12 STRs (MP ∼ 10(-11)). These results provide valuable genetic data that support the consensus SNP panel for bovine genetic identification developed by the Parentage Recording Working Group of ICAR (International Committee for Animal Recording).
Collapse
Affiliation(s)
- María E Fernández
- Facultad de Ciencias Veterinarias, Instituto de Genética Veterinaria, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional de La Plata, La Plata, Argentina
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
VanRaden PM, Cooper TA, Wiggans GR, O'Connell JR, Bacheller LR. Confirmation and discovery of maternal grandsires and great-grandsires in dairy cattle. J Dairy Sci 2013; 96:1874-9. [PMID: 23332849 DOI: 10.3168/jds.2012-6176] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2012] [Accepted: 11/22/2012] [Indexed: 11/19/2022]
Abstract
Selection, mating, and improvement of dairy animals have required accurate pedigrees. Genomic tools allow paternal ancestors to be easily confirmed or discovered because most sires are genotyped for many markers, but maternal ancestors are more difficult to discover because most female ancestors are not genotyped. Three methods to discover maternal grandsires (MGS) were developed and compared. Conflicts were counted one single nucleotide polymorphism (SNP) at a time between genotypes of the animal and potential MGS (duo method) or also using the sire's genotype (trio method). Alternatively, haplotypes of a potential MGS were matched to the animal's maternal haplotype, obtained by using linkage across loci (HAP method). The duo and trio methods can be performed as soon as a genotype is received because no imputation is required. The HAP method improved accuracy because genotypes with 2,683 (3 K) SNP were imputed to the 45,187 (50K) SNP used for genomic evaluation. The HAP method was tested using modified pedigrees with 5% of true MGS replaced by a random genotyped bull from the same birth year and 5% of MGS set to missing for 4,134 Holsteins, 552 Jerseys, and 142 Brown Swiss that had confirmed, genotyped sires. Those same animals were used to test the duo and trio methods, except that some animals had multiple genotypes and imputed dams were excluded. Accuracy measured how often the correct MGS was selected from among 12,152 genotyped Holstein, 2,265 Jersey, and 1,605 Brown Swiss potential MGS. Accuracies were 61, 60, and 65%, respectively, with the duo method; 95, 91, and 94% with the trio method; and 97, 95, and 97% with the HAP method. Accuracy of the duo method was poor (only 52% for animals genotyped with 3 K and 65% with 50K) because additional information from the paternal genotype is not used. Accuracy of the trio method was 97% with 50K but only 78% with 3K because the missing SNP were not imputed. Accuracy of the HAP method was 94% with 3 K genotypes, 98% with 50K, and 92% with nongenotyped, imputed dams. When the HAP method was extended to great-grandsires, the accuracy of maternal great-grandsire discovery was 92% for 652 Holsteins, 95% for 33 Jerseys, and 85% for 20 Brown Swiss. Accuracy was even higher using simulated genotypes. Because most dairy bulls over several generations have been genotyped, percentages of haplotypes shared with candidate males can accurately confirm, correct, or discover the sires, MGS, and even more distant ancestors of most animals.
Collapse
Affiliation(s)
- P M VanRaden
- Animal Improvement Programs Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350, USA.
| | | | | | | | | |
Collapse
|
24
|
Sevane N, Crespo I, Cañón J, Dunner S. A Primer-Extension Assay for simultaneous use in cattle Genotype Assisted Selection, parentage and traceability analysis. Livest Sci 2011. [DOI: 10.1016/j.livsci.2010.10.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
25
|
Hayes BJ. Technical note: Efficient parentage assignment and pedigree reconstruction with dense single nucleotide polymorphism data. J Dairy Sci 2011; 94:2114-7. [PMID: 21427003 DOI: 10.3168/jds.2010-3896] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2010] [Accepted: 01/06/2011] [Indexed: 11/19/2022]
Affiliation(s)
- B J Hayes
- Biosciences Research Division, Department of Primary Industries Victoria, Melbourne, Victoria, Australia 3083.
| |
Collapse
|
26
|
Harlizius B, Lopes MS, Duijvesteijn N, van de Goor LHP, van Haeringen WA, Panneman H, Guimarães SEF, Merks JWM, Knol EF. A single nucleotide polymorphism set for paternal identification to reduce the costs of trait recording in commercial pig breeding. J Anim Sci 2011; 89:1661-8. [PMID: 21239666 DOI: 10.2527/jas.2010-3347] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In animal breeding, recording of correct pedigrees is essential to achieve genetic progress. Markers on DNA are useful to verify the on-farm pedigree records (parental verification) but can also be used to assign parents retrospectively (parental identification). This approach could reduce the costs of recording for traits with low incidence, such as those related to diseases or mortality. In this study, SNP were used to assign the true sires of 368 purebred animals from a Duroc-based sire line and 140 crossbred offspring from a commercial pig population. Some of the sires were closely related. There were 3 full sibs and 17 half sibs among the true fathers and 4 full sibs and 35 half sibs among all putative fathers. To define the number of SNP necessary, 5 SNP panels (40, 60, 80, 100, and 120 SNP) were assembled from the Illumina PorcineSNP60 Beadchip (Illumina, San Diego, CA) based on minor allele frequency (>0.3), high genotyping call rate (≥90%), and equal spacing across the genome. For paternal identification considering only the 66 true sires in the data set, 60 SNP resulted in 100% correct assignment of the sire. By including additional putative sires (n = 304), 80 SNP were sufficient for 100% correct assignment of the sire. The following criteria were derived to identify the correct sire for the current data set: the logarithm of odds (LOD) score for assigning the correct sire was ≥5, the number of mismatches was ≤1, and the difference in the LOD score between the first and the second most likely sire was >5. If the correct sire was not present among all putative sires, the mean LOD for the most likely sire was close to zero or negative when using 100 SNP. More SNP would be needed for paternal identification if the number of putative sires increased and the degree of relatedness was greater than in the data set used here. The threshold for the number of mismatches can be adjusted according to the practical situation to account for the trade-off between false negatives and false positives. The latter can be avoided efficiently, ensuring that the correct father is being sampled. Nevertheless, a restriction on the number of putative sires is advisable to reduce the risk of assigning close relatives.
Collapse
Affiliation(s)
- B Harlizius
- Institute for Pig Genetics BV, PO Box 43, 6640 AA Beuningen, The Netherlands.
| | | | | | | | | | | | | | | | | |
Collapse
|