1
|
Biscarini F, Nazzicari N, Bink M, Arús P, Aranzana MJ, Verde I, Micali S, Pascal T, Quilot-Turion B, Lambert P, da Silva Linge C, Pacheco I, Bassi D, Stella A, Rossini L. Genome-enabled predictions for fruit weight and quality from repeated records in European peach progenies. BMC Genomics 2017; 18:432. [PMID: 28583089 PMCID: PMC5460546 DOI: 10.1186/s12864-017-3781-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 05/10/2017] [Indexed: 11/16/2022] Open
Abstract
Background Highly polygenic traits such as fruit weight, sugar content and acidity strongly influence the agroeconomic value of peach varieties. Genomic Selection (GS) can accelerate peach yield and quality gain if predictions show higher levels of accuracy compared to phenotypic selection. The available IPSC 9K SNP array V1 allows standardized and highly reliable genotyping, preparing the ground for GS in peach. Results A repeatability model (multiple records per individual plant) for genome-enabled predictions in eleven European peach populations is presented. The analysis included 1147 individuals derived from both commercial and non-commercial peach or peach-related accessions. Considered traits were average fruit weight (FW), sugar content (SC) and titratable acidity (TA). Plants were genotyped with the 9K IPSC array, grown in three countries (France, Italy, Spain) and phenotyped for 3–5 years. An analysis of imputation accuracy of missing genotypic data was conducted using the software Beagle, showing that two of the eleven populations were highly sensitive to increasing levels of missing data. The regression model produced, for each trait and each population, estimates of heritability (FW:0.35, SC:0.48, TA:0.53, on average) and repeatability (FW:0.56, SC:0.63, TA:0.62, on average). Predictive ability was estimated in a five-fold cross validation scheme within population as the correlation of true and predicted phenotypes. Results differed by populations and traits, but predictive abilities were in general high (FW:0.60, SC:0.72, TA:0.65, on average). Conclusions This study assessed the feasibility of Genomic Selection in peach for highly polygenic traits linked to yield and fruit quality. The accuracy of imputing missing genotypes was as high as 96%, and the genomic predictive ability was on average 0.65, but could be as high as 0.84 for fruit weight or 0.83 for titratable acidity. The estimated repeatability may prove very useful in the management of the typical long cycles involved in peach productions. All together, these results are very promising for the application of genomic selection to peach breeding programmes. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3781-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Filippo Biscarini
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy.,IBBA-CNR, Via Edoardo Bassini, 15, Milan, 20133, Italy
| | - Nelson Nazzicari
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy.,Council for Agricultural Research and Economics (CREA) Research Centre for Fodder Crops and Dairy Productions, Lodi, Italy
| | - Marco Bink
- Wageningen UR Biometris, Wageningen, The Netherlands.,Present Address: Hendrix Genetics Research, Technology & Services B.V., P.O. Box 114, Boxmeer NL, 5830AC, The Netherlands
| | - Pere Arús
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallés), Barcelona, Spain
| | - Maria José Aranzana
- IRTA, Centre de Recerca en Agrigenòmica CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra (Cerdanyola del Vallés), Barcelona, Spain
| | - Ignazio Verde
- Consiglio per la ricerca in agricoltura e l'analisi dell'economia agraria (CREA) - Centro di Ricerca per la Frutticoltura (CREA-FRU), Via di Fioranello 52, Roma, Italy
| | - Sabrina Micali
- Consiglio per la ricerca in agricoltura e l'analisi dell'economia agraria (CREA) - Centro di Ricerca per la Frutticoltura (CREA-FRU), Via di Fioranello 52, Roma, Italy
| | | | | | - Patrick Lambert
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy
| | | | - Igor Pacheco
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy.,Institute of Nutrition and Food Technology - INTA, Universidad de Chile, Av El Líbano 5524, Santiago, Chile
| | - Daniele Bassi
- Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy
| | - Alessandra Stella
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy.,IBBA-CNR, Via Edoardo Bassini, 15, Milan, 20133, Italy
| | - Laura Rossini
- PTP Science Park, Via Einstein - Loc. Cascina Codazza, Lodi, Italy. .,Università degli Studi di Milano - DiSAA, Via Celoria 2, Milano, Italy.
| |
Collapse
|
2
|
Biffani S, Dimauro C, Macciotta N, Rossoni A, Stella A, Biscarini F. Predicting haplotype carriers from SNP genotypes in Bos taurus through linear discriminant analysis. Genet Sel Evol 2015; 47:4. [PMID: 25651874 PMCID: PMC4318450 DOI: 10.1186/s12711-015-0094-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2014] [Accepted: 01/16/2015] [Indexed: 11/24/2022] Open
Abstract
Background SNP (single nucleotide polymorphisms) genotype data are increasingly available in cattle populations and, among other things, can be used to predict carriers of specific haplotypes. It is therefore convenient to have a practical statistical method for the accurate classification of individuals into carriers and non-carriers. In this paper, we present a procedure combining variable selection (i.e. the selection of predictive SNPs) and linear discriminant analysis for the identification of carriers of a haplotype on BTA19 (Bos taurus autosome 19) known to be associated with reduced cow fertility. A population of 3645 Brown Swiss cows and bulls genotyped with the 54K SNP-chip was available for the analysis. Results The overall error rate for the prediction of haplotype carriers was on average very low (∼≤1%). The error rate was found to depend on the number of SNPs in the model and their density around the region of the haplotype on BTA19. The minimum set of SNPs to still achieve accurate predictions was 5, with a total test error rate of 1.59. Conclusions The paper describes a procedure to accurately identify haplotype carriers from SNP genotypes in cattle populations. Very few misclassifications were observed, which indicates that this is a very reliable approach for potential applications in cattle breeding.
Collapse
Affiliation(s)
| | | | | | | | | | - Filippo Biscarini
- Department of Bioinformatics, PTP, Via Einstein - Loc, Cascina Codazza, Lodi 26900, Italy.
| |
Collapse
|
4
|
Zhan B, Fadista J, Thomsen B, Hedegaard J, Panitz F, Bendixen C. Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping. BMC Genomics 2011; 12:557. [PMID: 22082336 PMCID: PMC3248099 DOI: 10.1186/1471-2164-12-557] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2011] [Accepted: 11/14/2011] [Indexed: 11/10/2022] Open
Abstract
Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants.
Collapse
Affiliation(s)
- Bujie Zhan
- Group of Molecular Genetics and Systems Biology, Department of Molecular Biology and Genetics, Faculty of Science and Technology, Aarhus University, Blichers Allé 20, DK-8830 Tjele, Denmark
| | | | | | | | | | | |
Collapse
|