1
|
Integration of Infinium and Axiom SNP array data in the outcrossing species Malus × domestica and causes for seemingly incompatible calls. BMC Genomics 2021; 22:246. [PMID: 33827434 PMCID: PMC8028180 DOI: 10.1186/s12864-021-07565-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 03/30/2021] [Indexed: 11/23/2022] Open
Abstract
Background Single nucleotide polymorphism (SNP) array technology has been increasingly used to generate large quantities of SNP data for use in genetic studies. As new arrays are developed to take advantage of new technology and of improved probe design using new genome sequence and panel data, a need to integrate data from different arrays and array platforms has arisen. This study was undertaken in view of our need for an integrated high-quality dataset of Illumina Infinium® 20 K and Affymetrix Axiom® 480 K SNP array data in apple (Malus × domestica). In this study, we qualify and quantify the compatibility of SNP calling, defined as SNP calls that are both accurate and concordant, across both arrays by two approaches. First, the concordance of SNP calls was evaluated using a set of 417 duplicate individuals genotyped on both arrays starting from a set of 10,295 robust SNPs on the Infinium array. Next, the accuracy of the SNP calls was evaluated on additional germplasm (n = 3141) from both arrays using Mendelian inconsistent and consistent errors across thousands of pedigree links. While performing this work, we took the opportunity to evaluate reasons for probe failure and observed discordant SNP calls. Results Concordance among the duplicate individuals was on average of 97.1% across 10,295 SNPs. Of these SNPs, 35% had discordant call(s) that were further curated, leading to a final set of 8412 (81.7%) SNPs that were deemed compatible. Compatibility was highly influenced by the presence of alternate probe binding locations and secondary polymorphisms. The impact of the latter was highly influenced by their number and proximity to the 3′ end of the probe. Conclusions The Infinium and Axiom SNP array data were mostly compatible. However, data integration required intense data filtering and curation. This work resulted in a workflow and information that may be of use in other data integration efforts. Such an in-depth analysis of array concordance and accuracy as ours has not been previously described in the literature and will be useful in future work on SNP array data integration and interpretation, and in probe/platform development. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07565-7.
Collapse
|
2
|
Genetic characterization of an almond germplasm collection and volatilome profiling of raw and roasted kernels. HORTICULTURE RESEARCH 2021; 8:27. [PMID: 33518710 PMCID: PMC7848010 DOI: 10.1038/s41438-021-00465-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 12/11/2020] [Accepted: 12/17/2020] [Indexed: 05/16/2023]
Abstract
Almond is appreciated for its nutraceutical value and for the aromatic profile of the kernels. In this work, an almond collection composed of 96 Sicilian accessions complemented with 10 widely cultivated cultivars was phenotyped for the production of volatile organic compounds using a proton-transfer time-of-flight mass spectrometer and genotyped using the Illumina Infinium®18 K Peach SNP array. The profiling of the aroma was carried out on fresh and roasted kernels enabling the detection of 150 mass peaks. Sixty eight, for the most related with sulfur compounds, furan containing compounds, and aldehydes formed by Strecker degradation, significantly increased during roasting, while the concentration of fifty-four mass peaks, for the most belonging to alcohols and terpenes, significantly decreased. Four hundred and seventy-one robust SNPs were selected and employed for population genetic studies. Structure analysis detected three subpopulations with the Sicilian accessions characterized by a different genetic stratification compared to those collected in Apulia (South Italy) and the International cultivars. The linkage-disequilibrium (LD) decay across the genome was equal to r2 = 0.083. Furthermore, a high level of collinearity (r2 = 0.96) between almond and peach was registered confirming the high synteny between the two genomes. A preliminary application of a genome-wide association analysis allowed the detection of significant marker-trait associations for 31 fresh and 33 roasted almond mass peaks respectively. An accurate genetic and phenotypic characterization of novel germplasm can represent a valuable tool for the set-up of marker-assisted selection of novel cultivars with an enhanced aromatic profile.
Collapse
|
3
|
High-quality chromosome-scale assembly of the walnut (Juglans regia L.) reference genome. Gigascience 2020; 9:giaa050. [PMID: 32432329 PMCID: PMC7238675 DOI: 10.1093/gigascience/giaa050] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Revised: 03/13/2020] [Accepted: 04/20/2020] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND The release of the first reference genome of walnut (Juglans regia L.) enabled many achievements in the characterization of walnut genetic and functional variation. However, it is highly fragmented, preventing the integration of genetic, transcriptomic, and proteomic information to fully elucidate walnut biological processes. FINDINGS Here, we report the new chromosome-scale assembly of the walnut reference genome (Chandler v2.0) obtained by combining Oxford Nanopore long-read sequencing with chromosome conformation capture (Hi-C) technology. Relative to the previous reference genome, the new assembly features an 84.4-fold increase in N50 size, with the 16 chromosomal pseudomolecules assembled and representing 95% of its total length. Using full-length transcripts from single-molecule real-time sequencing, we predicted 37,554 gene models, with a mean gene length higher than the previous gene annotations. Most of the new protein-coding genes (90%) present both start and stop codons, which represents a significant improvement compared with Chandler v1.0 (only 48%). We then tested the potential impact of the new chromosome-level genome on different areas of walnut research. By studying the proteome changes occurring during male flower development, we observed that the virtual proteome obtained from Chandler v2.0 presents fewer artifacts than the previous reference genome, enabling the identification of a new potential pollen allergen in walnut. Also, the new chromosome-scale genome facilitates in-depth studies of intraspecies genetic diversity by revealing previously undetected autozygous regions in Chandler, likely resulting from inbreeding, and 195 genomic regions highly differentiated between Western and Eastern walnut cultivars. CONCLUSION Overall, Chandler v2.0 will serve as a valuable resource to better understand and explore walnut biology.
Collapse
|
4
|
High-quality chromosome-scale assembly of the walnut (Juglans regia L.) reference genome. Gigascience 2020. [PMID: 32432329 DOI: 10.1101/80979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2023] Open
Abstract
BACKGROUND The release of the first reference genome of walnut (Juglans regia L.) enabled many achievements in the characterization of walnut genetic and functional variation. However, it is highly fragmented, preventing the integration of genetic, transcriptomic, and proteomic information to fully elucidate walnut biological processes. FINDINGS Here, we report the new chromosome-scale assembly of the walnut reference genome (Chandler v2.0) obtained by combining Oxford Nanopore long-read sequencing with chromosome conformation capture (Hi-C) technology. Relative to the previous reference genome, the new assembly features an 84.4-fold increase in N50 size, with the 16 chromosomal pseudomolecules assembled and representing 95% of its total length. Using full-length transcripts from single-molecule real-time sequencing, we predicted 37,554 gene models, with a mean gene length higher than the previous gene annotations. Most of the new protein-coding genes (90%) present both start and stop codons, which represents a significant improvement compared with Chandler v1.0 (only 48%). We then tested the potential impact of the new chromosome-level genome on different areas of walnut research. By studying the proteome changes occurring during male flower development, we observed that the virtual proteome obtained from Chandler v2.0 presents fewer artifacts than the previous reference genome, enabling the identification of a new potential pollen allergen in walnut. Also, the new chromosome-scale genome facilitates in-depth studies of intraspecies genetic diversity by revealing previously undetected autozygous regions in Chandler, likely resulting from inbreeding, and 195 genomic regions highly differentiated between Western and Eastern walnut cultivars. CONCLUSION Overall, Chandler v2.0 will serve as a valuable resource to better understand and explore walnut biology.
Collapse
|
5
|
Using whole-genome SNP data to reconstruct a large multi-generation pedigree in apple germplasm. BMC PLANT BIOLOGY 2020; 20:2. [PMID: 31898487 PMCID: PMC6941274 DOI: 10.1186/s12870-019-2171-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Accepted: 11/27/2019] [Indexed: 05/02/2023]
Abstract
BACKGROUND Apple (Malus x domestica Borkh.) is one of the most important fruit tree crops of temperate areas, with great economic and cultural value. Apple cultivars can be maintained for centuries in plant collections through grafting, and some are thought to date as far back as Roman times. Molecular markers provide a means to reconstruct pedigrees and thus shed light on the recent history of migration and trade of biological materials. The objective of the present study was to identify relationships within a set of over 1400 mostly old apple cultivars using whole-genome SNP data (~ 253 K SNPs) in order to reconstruct pedigrees. RESULTS Using simple exclusion tests, based on counting the number of Mendelian errors, more than one thousand parent-offspring relations and 295 complete parent-offspring families were identified. Additionally, a grandparent couple was identified for the missing parental side of 26 parent-offspring pairings. Among the 407 parent-offspring relations without a second identified parent, 327 could be oriented because one of the individuals was an offspring in a complete family or by using historical data on parentage or date of recording. Parents of emblematic cultivars such as 'Ribston Pippin', 'White Transparent' and 'Braeburn' were identified. The overall pedigree combining all the identified relationships encompassed seven generations and revealed a major impact of two Renaissance cultivars of French and English origin, namely 'Reinette Franche' and 'Margil', and one North-Eastern Europe cultivar from the 1700s, 'Alexander'. On the contrary, several older cultivars, from the Middle Ages or the Roman times, had no, or only single, identifiable offspring in the set of studied accessions. Frequent crosses between cultivars originating from different European regions were identified, especially from the nineteenth century onwards. CONCLUSIONS The availability of over 1400 apple genotypes, previously filtered for genetic uniqueness and providing a broad representation of European germplasm, has been instrumental for the success of this large pedigree reconstruction. It enlightens the history of empirical selection and recent breeding of apple cultivars in Europe and provides insights to speed-up future breeding and selection.
Collapse
|
6
|
Mapping of fire blight resistance in Malus ×robusta 5 flowers following artificial inoculation. BMC PLANT BIOLOGY 2019; 19:532. [PMID: 31791233 PMCID: PMC6889339 DOI: 10.1186/s12870-019-2154-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Accepted: 11/21/2019] [Indexed: 05/20/2023]
Abstract
BACKGROUND Although the most common path of infection for fire blight, a severe bacterial disease on apple, is via host plant flowers, quantitative trait loci (QTLs) for fire blight resistance to date have exclusively been mapped following shoot inoculation. It is not known whether the same mechanism underlies flower and shoot resistance. RESULTS We report the detection of a fire blight resistance QTL following independent artificial inoculation of flowers and shoots on two F1 segregating populations derived from crossing resistant Malus ×robusta 5 (Mr5) with susceptible 'Idared' and 'Royal Gala' in experimental orchards in Germany and New Zealand, respectively. QTL mapping of phenotypic datasets from artificial flower inoculation of the 'Idared' × Mr5 population with Erwinia amylovora over several years, and of the 'Royal Gala' × Mr5 population in a single year, revealed a single major QTL controlling floral fire blight resistance on linkage group 3 (LG3) of Mr5. This QTL corresponds to the QTL on LG3 reported previously for the 'Idared' × Mr5 and an 'M9' × Mr5 population following shoot inoculation in the glasshouse. Interval mapping of phenotypic data from shoot inoculations of subsets from both flower resistance populations re-confirmed that the resistance QTL is in the same position on LG3 of Mr5 as that for flower inoculation. These results provide strong evidence that fire blight resistance in Mr5 is controlled by a major QTL on LG3, independently of the mode of infection, rootstock and environment. CONCLUSIONS This study demonstrates for the first time that resistance to fire blight caused by Erwinia amylovora is independent of the mode of inoculation at least in Malus ×robusta 5.
Collapse
|
7
|
Pseudo-chromosome-length genome assembly of a double haploid "Bartlett" pear (Pyrus communis L.). Gigascience 2019; 8:giz138. [PMID: 31816089 PMCID: PMC6901071 DOI: 10.1093/gigascience/giz138] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 10/18/2019] [Accepted: 10/30/2019] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND We report an improved assembly and scaffolding of the European pear (Pyrus communis L.) genome (referred to as BartlettDHv2.0), obtained using a combination of Pacific Biosciences RSII long-read sequencing, Bionano optical mapping, chromatin interaction capture (Hi-C), and genetic mapping. The sample selected for sequencing is a double haploid derived from the same "Bartlett" reference pear that was previously sequenced. Sequencing of di-haploid plants makes assembly more tractable in highly heterozygous species such as P. communis. FINDINGS A total of 496.9 Mb corresponding to 97% of the estimated genome size were assembled into 494 scaffolds. Hi-C data and a high-density genetic map allowed us to anchor and orient 87% of the sequence on the 17 pear chromosomes. Approximately 50% (247 Mb) of the genome consists of repetitive sequences. Gene annotation confirmed the presence of 37,445 protein-coding genes, which is 13% fewer than previously predicted. CONCLUSIONS We showed that the use of a doubled-haploid plant is an effective solution to the problems presented by high levels of heterozygosity and duplication for the generation of high-quality genome assemblies. We present a high-quality chromosome-scale assembly of the European pear Pyrus communis and demostrate its high degree of synteny with the genomes of Malus x Domestica and Pyrus x bretschneideri.
Collapse
|
8
|
A new genomic tool for walnut (Juglans regia L.): development and validation of the high-density Axiom™ J. regia 700K SNP genotyping array. PLANT BIOTECHNOLOGY JOURNAL 2019; 17:1027-1036. [PMID: 30515952 PMCID: PMC6523593 DOI: 10.1111/pbi.13034] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 10/22/2018] [Accepted: 10/28/2018] [Indexed: 05/02/2023]
Abstract
Over the last 20 years, global production of Persian walnut (Juglans regia L.) has grown enormously, likely reflecting increased consumption due to its numerous benefits to human health. However, advances in genome-wide association (GWA) studies and genomic selection (GS) for agronomically important traits in walnut remain limited due to the lack of powerful genomic tools. Here, we present the development and validation of a high-density 700K single nucleotide polymorphism (SNP) array in Persian walnut. Over 609K high-quality SNPs have been thoroughly selected from a set of 9.6 m genome-wide variants, previously identified from the high-depth re-sequencing of 27 founders of the Walnut Improvement Program (WIP) of University of California, Davis. To validate the effectiveness of the array, we genotyped a collection of 1284 walnut trees, including 1167 progeny of 48 WIP families and 26 walnut cultivars. More than half of the SNPs (55.7%) fell in the highest quality class of 'Poly High Resolution' (PHR) polymorphisms, which were used to assess the WIP pedigree integrity. We identified 151 new parent-offspring relationships, all confirmed with the Mendelian inheritance test. In addition, we explored the genetic variability among cultivars of different origin, revealing how the varieties from Europe and California were differentiated from Asian accessions. Both the reconstruction of the WIP pedigree and population structure analysis confirmed the effectiveness of the Applied Biosystems™ Axiom™ J. regia 700K SNP array, which initiates a novel genomic and advanced phase in walnut genetics and breeding.
Collapse
|
9
|
Development of a highly efficient Axiom™ 70 K SNP array for Pyrus and evaluation for high-density mapping and germplasm characterization. BMC Genomics 2019; 20:331. [PMID: 31046664 PMCID: PMC6498479 DOI: 10.1186/s12864-019-5712-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 04/17/2019] [Indexed: 12/20/2022] Open
Abstract
Background Both a source of diversity and the development of genomic tools, such as reference genomes and molecular markers, are equally important to enable faster progress in plant breeding. Pear (Pyrus spp.) lags far behind other fruit and nut crops in terms of employment of available genetic resources for new cultivar development. To address this gap, we designed a high-density, high-efficiency and robust single nucleotide polymorphism (SNP) array for pear, with the main objectives of conducting genetic diversity and genome-wide association studies. Results By applying a two-step design process, which consisted of the construction of a first ‘draft’ array for the screening of a small subset of samples, we were able to identify the most robust and informative SNPs to include in the Applied Biosystems™ Axiom™ Pear 70 K Genotyping Array, currently the densest SNP array for pear. Preliminary evaluation of this 70 K array in 1416 diverse pear accessions from the USDA National Clonal Germplasm Repository (NCGR) in Corvallis, OR identified 66,616 SNPs (93% of all the tiled SNPs) as high quality and polymorphic (PolyHighResolution). We further used the Axiom Pear 70 K Genotyping Array to construct high-density linkage maps in a bi-parental population, and to make a direct comparison with available genotyping-by-sequencing (GBS) data, which suggested that the SNP array is a more robust method of screening for SNPs than restriction enzyme reduced representation sequence-based genotyping. Conclusions The Axiom Pear 70 K Genotyping Array, with its high efficiency in a widely diverse panel of Pyrus species and cultivars, represents a valuable resource for a multitude of molecular studies in pear. The characterization of the USDA-NCGR collection with this array will provide important information for pear geneticists and breeders, as well as for the optimization of conservation strategies for Pyrus. Electronic supplementary material The online version of this article (10.1186/s12864-019-5712-3) contains supplementary material, which is available to authorized users.
Collapse
|
10
|
Apple whole genome sequences: recent advances and new prospects. HORTICULTURE RESEARCH 2019; 6:59. [PMID: 30962944 PMCID: PMC6450873 DOI: 10.1038/s41438-019-0141-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 03/15/2019] [Accepted: 03/15/2019] [Indexed: 05/19/2023]
Abstract
In 2010, a major scientific milestone was achieved for tree fruit crops: publication of the first draft whole genome sequence (WGS) for apple (Malus domestica). This WGS, v1.0, was valuable as the initial reference for sequence information, fine mapping, gene discovery, variant discovery, and tool development. A new, high quality apple WGS, GDDH13 v1.1, was released in 2017 and now serves as the reference genome for apple. Over the past decade, these apple WGSs have had an enormous impact on our understanding of apple biological functioning, trait physiology and inheritance, leading to practical applications for improving this highly valued crop. Causal gene identities for phenotypes of fundamental and practical interest can today be discovered much more rapidly. Genome-wide polymorphisms at high genetic resolution are screened efficiently over hundreds to thousands of individuals with new insights into genetic relationships and pedigrees. High-density genetic maps are constructed efficiently and quantitative trait loci for valuable traits are readily associated with positional candidate genes and/or converted into diagnostic tests for breeders. We understand the species, geographical, and genomic origins of domesticated apple more precisely, as well as its relationship to wild relatives. The WGS has turbo-charged application of these classical research steps to crop improvement and drives innovative methods to achieve more durable, environmentally sound, productive, and consumer-desirable apple production. This review includes examples of basic and practical breakthroughs and challenges in using the apple WGSs. Recommendations for "what's next" focus on necessary upgrades to the genome sequence data pool, as well as for use of the data, to reach new frontiers in genomics-based scientific understanding of apple.
Collapse
|
11
|
An integrated approach for increasing breeding efficiency in apple and peach in Europe. HORTICULTURE RESEARCH 2018; 5:11. [PMID: 29507735 PMCID: PMC5830435 DOI: 10.1038/s41438-018-0016-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Accepted: 12/23/2017] [Indexed: 05/02/2023]
Abstract
Despite the availability of whole genome sequences of apple and peach, there has been a considerable gap between genomics and breeding. To bridge the gap, the European Union funded the FruitBreedomics project (March 2011 to August 2015) involving 28 research institutes and private companies. Three complementary approaches were pursued: (i) tool and software development, (ii) deciphering genetic control of main horticultural traits taking into account allelic diversity and (iii) developing plant materials, tools and methodologies for breeders. Decisive breakthroughs were made including the making available of ready-to-go DNA diagnostic tests for Marker Assisted Breeding, development of new, dense SNP arrays in apple and peach, new phenotypic methods for some complex traits, software for gene/QTL discovery on breeding germplasm via Pedigree Based Analysis (PBA). This resulted in the discovery of highly predictive molecular markers for traits of horticultural interest via PBA and via Genome Wide Association Studies (GWAS) on several European genebank collections. FruitBreedomics also developed pre-breeding plant materials in which multiple sources of resistance were pyramided and software that can support breeders in their selection activities. Through FruitBreedomics, significant progresses were made in the field of apple and peach breeding, genetics, genomics and bioinformatics of which advantage will be made by breeders, germplasm curators and scientists. A major part of the data collected during the project has been stored in the FruitBreedomics database and has been made available to the public. This review covers the scientific discoveries made in this major endeavour, and perspective in the apple and peach breeding and genomics in Europe and beyond.
Collapse
|
12
|
Integrated QTL detection for key breeding traits in multiple peach progenies. BMC Genomics 2017; 18:404. [PMID: 28583082 PMCID: PMC5460339 DOI: 10.1186/s12864-017-3783-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 05/10/2017] [Indexed: 01/23/2023] Open
Abstract
Background Peach (Prunus persica (L.) Batsch) is a major temperate fruit crop with an intense breeding activity. Breeding is facilitated by knowledge of the inheritance of the key traits that are often of a quantitative nature. QTLs have traditionally been studied using the phenotype of a single progeny (usually a full-sib progeny) and the correlation with a set of markers covering its genome. This approach has allowed the identification of various genes and QTLs but is limited by the small numbers of individuals used and by the narrow transect of the variability analyzed. In this article we propose the use of a multi-progeny mapping strategy that used pedigree information and Bayesian approaches that supports a more precise and complete survey of the available genetic variability. Results Seven key agronomic characters (data from 1 to 3 years) were analyzed in 18 progenies from crosses between occidental commercial genotypes and various exotic lines including accessions of other Prunus species. A total of 1467 plants from these progenies were genotyped with a 9 k SNP array. Forty-seven QTLs were identified, 22 coinciding with major genes and QTLs that have been consistently found in the same populations when studied individually and 25 were new. A substantial part of the QTLs observed (47%) would not have been detected in crosses between only commercial materials, showing the high value of exotic lines as a source of novel alleles for the commercial gene pool. Our strategy also provided estimations on the narrow sense heritability of each character, and the estimation of the QTL genotypes of each parent for the different QTLs and their breeding value. Conclusions The integrated strategy used provides a broader and more accurate picture of the variability available for peach breeding with the identification of many new QTLs, information on the sources of the alleles of interest and the breeding values of the potential donors of such valuable alleles. These results are first-hand information for breeders and a step forward towards the implementation of DNA-informed strategies to facilitate selection of new cultivars with improved productivity and quality. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3783-6) contains supplementary material, which is available to authorized users.
Collapse
|
13
|
The Peach v2.0 release: high-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genomics 2017. [PMID: 28284188 DOI: 10.1186/s12864-017-3606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023] Open
Abstract
BACKGROUND The availability of the peach genome sequence has fostered relevant research in peach and related Prunus species enabling the identification of genes underlying important horticultural traits as well as the development of advanced tools for genetic and genomic analyses. The first release of the peach genome (Peach v1.0) represented a high-quality WGS (Whole Genome Shotgun) chromosome-scale assembly with high contiguity (contig L50 214.2 kb), large portions of mapped sequences (96%) and high base accuracy (99.96%). The aim of this work was to improve the quality of the first assembly by increasing the portion of mapped and oriented sequences, correcting misassemblies and improving the contiguity and base accuracy using high-throughput linkage mapping and deep resequencing approaches. RESULTS Four linkage maps with 3,576 molecular markers were used to improve the portion of mapped and oriented sequences (from 96.0% and 85.6% of Peach v1.0 to 99.2% and 98.2% of v2.0, respectively) and enabled a more detailed identification of discernible misassemblies (10.4 Mb in total). The deep resequencing approach fixed 859 homozygous SNPs (Single Nucleotide Polymorphisms) and 1347 homozygous indels. Moreover, the assembled NGS contigs enabled the closing of 212 gaps with an improvement in the contig L50 of 19.2%. CONCLUSIONS The improved high quality peach genome assembly (Peach v2.0) represents a valuable tool for the analysis of the genetic diversity, domestication, and as a vehicle for genetic improvement of peach and related Prunus species. Moreover, the important phylogenetic position of peach and the absence of recent whole genome duplication (WGD) events make peach a pivotal species for comparative genomics studies aiming at elucidating plant speciation and diversification processes.
Collapse
|
14
|
The Peach v2.0 release: high-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genomics 2017; 18:225. [PMID: 28284188 PMCID: PMC5346207 DOI: 10.1186/s12864-017-3606-9] [Citation(s) in RCA: 196] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Accepted: 03/03/2017] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The availability of the peach genome sequence has fostered relevant research in peach and related Prunus species enabling the identification of genes underlying important horticultural traits as well as the development of advanced tools for genetic and genomic analyses. The first release of the peach genome (Peach v1.0) represented a high-quality WGS (Whole Genome Shotgun) chromosome-scale assembly with high contiguity (contig L50 214.2 kb), large portions of mapped sequences (96%) and high base accuracy (99.96%). The aim of this work was to improve the quality of the first assembly by increasing the portion of mapped and oriented sequences, correcting misassemblies and improving the contiguity and base accuracy using high-throughput linkage mapping and deep resequencing approaches. RESULTS Four linkage maps with 3,576 molecular markers were used to improve the portion of mapped and oriented sequences (from 96.0% and 85.6% of Peach v1.0 to 99.2% and 98.2% of v2.0, respectively) and enabled a more detailed identification of discernible misassemblies (10.4 Mb in total). The deep resequencing approach fixed 859 homozygous SNPs (Single Nucleotide Polymorphisms) and 1347 homozygous indels. Moreover, the assembled NGS contigs enabled the closing of 212 gaps with an improvement in the contig L50 of 19.2%. CONCLUSIONS The improved high quality peach genome assembly (Peach v2.0) represents a valuable tool for the analysis of the genetic diversity, domestication, and as a vehicle for genetic improvement of peach and related Prunus species. Moreover, the important phylogenetic position of peach and the absence of recent whole genome duplication (WGD) events make peach a pivotal species for comparative genomics studies aiming at elucidating plant speciation and diversification processes.
Collapse
|
15
|
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple. FRONTIERS IN PLANT SCIENCE 2017; 8:1923. [PMID: 29176988 PMCID: PMC5686452 DOI: 10.3389/fpls.2017.01923] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 10/24/2017] [Indexed: 05/17/2023]
Abstract
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs.
Collapse
|
16
|
Development and validation of the Axiom(®) Apple480K SNP genotyping array. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 86:62-74. [PMID: 26919684 DOI: 10.1111/tpj.13145] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Revised: 02/03/2016] [Accepted: 02/05/2016] [Indexed: 05/18/2023]
Abstract
Cultivated apple (Malus × domestica Borkh.) is one of the most important fruit crops in temperate regions, and has great economic and cultural value. The apple genome is highly heterozygous and has undergone a recent duplication which, combined with a rapid linkage disequilibrium decay, makes it difficult to perform genome-wide association (GWA) studies. Single nucleotide polymorphism arrays offer highly multiplexed assays at a relatively low cost per data point and can be a valid tool for the identification of the markers associated with traits of interest. Here, we describe the development and validation of a 487K SNP Affymetrix Axiom(®) genotyping array for apple and discuss its potential applications. The array has been built from the high-depth resequencing of 63 different cultivars covering most of the genetic diversity in cultivated apple. The SNPs were chosen by applying a focal points approach to enrich genic regions, but also to reach a uniform coverage of non-genic regions. A total of 1324 apple accessions, including the 92 progenies of two mapping populations, have been genotyped with the Axiom(®) Apple480K to assess the effectiveness of the array. A large majority of SNPs (359 994 or 74%) fell in the stringent class of poly high resolution polymorphisms. We also devised a filtering procedure to identify a subset of 275K very robust markers that can be safely used for germplasm surveys in apple. The Axiom(®) Apple480K has now been commercially released both for public and proprietary use and will likely be a reference tool for GWA studies in apple.
Collapse
|
17
|
A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species. HORTICULTURE RESEARCH 2016; 3:16057. [PMID: 27917289 PMCID: PMC5120355 DOI: 10.1038/hortres.2016.57] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 10/25/2016] [Accepted: 10/25/2016] [Indexed: 05/18/2023]
Abstract
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.
Collapse
|
18
|
ASSIsT: an automatic SNP scoring tool for in- and outbreeding species. Bioinformatics 2015; 31:3873-4. [PMID: 26249809 PMCID: PMC4653386 DOI: 10.1093/bioinformatics/btv446] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2015] [Accepted: 07/25/2015] [Indexed: 11/13/2022] Open
Abstract
UNLABELLED ASSIsT (Automatic SNP ScorIng Tool) is a user-friendly customized pipeline for efficient calling and filtering of SNPs from Illumina Infinium arrays, specifically devised for custom genotyping arrays. Illumina has developed an integrated software for SNP data visualization and inspection called GenomeStudio (GS). ASSIsT builds on GS-derived data and identifies those markers that follow a bi-allelic genetic model and show reliable genotype calls. Moreover, ASSIsT re-edits SNP calls with null alleles or additional SNPs in the probe annealing site. ASSIsT can be employed in the analysis of different population types such as full-sib families and mating schemes used in the plant kingdom (backcross, F1, F2), and unrelated individuals. The final result can be directly exported in the format required by the most common software for genetic mapping and marker-trait association analysis. ASSIsT is developed in Python and runs in Windows and Linux. AVAILABILITY AND IMPLEMENTATION The software, example data sets and tutorials are freely available at http://compbiotoolbox.fmach.it/assist/. CONTACT eric.vandeweg@wur.nl.
Collapse
|
19
|
New candidate genes for the fine regulation of the colour of grapes. JOURNAL OF EXPERIMENTAL BOTANY 2015; 66:4427-40. [PMID: 26071528 PMCID: PMC4507754 DOI: 10.1093/jxb/erv159] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
In the last decade, great progress has been made in clarifying the main determinants of anthocyanin accumulation in grape berry skin. However, the molecular details of the fine variation among cultivars, which ultimately contributes to wine typicity, are still not completely understood. To shed light on this issue, the grapes of 170 F1 progeny from the cross 'Syrah'×'Pinot Noir' were characterized at the mature stage for the content of 15 anthocyanins during four growing seasons. This huge data set was used in combination with a dense genetic map to detect genomic regions controlling the anthocyanin pathway both at key enzymatic points and at particular branches. Genes putatively involved in fine tuning the global regulation of anthocyanin biosynthesis were identified by exploring the gene predictions in the QTL (quantitative trait locus) confidence intervals and their expression profile during berry development in offspring with contrasting anthocyanin accumulation. New information on some aspects which had scarcely been investigated so far, such as anthocyanin transport into the vacuole, or completely neglected, such as acylation, is provided. These genes represent a valuable resource in grapevine molecular-based breeding programmes to improve both fruit and wine quality and to tailor wine sensory properties according to consumer demand.
Collapse
|
20
|
A major QTL controlling apple skin russeting maps on the linkage group 12 of 'Renetta Grigia di Torriana'. BMC PLANT BIOLOGY 2015; 15:150. [PMID: 26084469 PMCID: PMC4472412 DOI: 10.1186/s12870-015-0507-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Accepted: 04/27/2015] [Indexed: 05/23/2023]
Abstract
BACKGROUND Russeting is a disorder developed by apple fruits that consists of cuticle cracking followed by the replacement of the epidermis by a corky layer that protects the fruit surface from water loss and pathogens. Although influenced by many environmental conditions and orchard management practices, russeting is under genetic control. The difficulty in classifying offspring and consequent variable segregation ratios have led several authors to conclude that more than one genetic determinant could be involved, although some evidence favours a major gene (Ru). RESULTS In this study we report the mapping of a major genetic russeting determinant on linkage group 12 of apple as inferred from the phenotypic observation in a segregating progeny derived from 'Renetta Grigia di Torriana', the construction of a 20 K Illumina SNP chip based genetic map, and QTL analysis. Recombination analysis in two mapping populations restricted the region of interest to approximately 400 Kb. Of the 58 genes predicted from the Golden Delicious sequence, a putative ABCG family transporter has been identified. Within a small set of russeted cultivars tested with markers of the region, only six showed the same haplotype of 'Renetta Grigia di Torriana'. CONCLUSIONS A major determinant (Ru_RGT) for russeting development putatively involved in cuticle organization is proposed as a candidate for controlling the trait. SNP and SSR markers tightly co-segregating with the Ru_RGT locus may assist the breeder selection. The observed segregations and the analysis of the 'Renetta Grigia di Torriana' haplotypic region in a panel of russeted and non-russeted cultivars may suggest the presence of other determinants for russeting in apple.
Collapse
|
21
|
Accuracy and responses of genomic selection on key traits in apple breeding. HORTICULTURE RESEARCH 2015; 2:15060. [PMID: 26744627 PMCID: PMC4688998 DOI: 10.1038/hortres.2015.60] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
The application of genomic selection in fruit tree crops is expected to enhance breeding efficiency by increasing prediction accuracy, increasing selection intensity and decreasing generation interval. The objectives of this study were to assess the accuracy of prediction and selection response in commercial apple breeding programmes for key traits. The training population comprised 977 individuals derived from 20 pedigreed full-sib families. Historic phenotypic data were available on 10 traits related to productivity and fruit external appearance and genotypic data for 7829 SNPs obtained with an Illumina 20K SNP array. From these data, a genome-wide prediction model was built and subsequently used to calculate genomic breeding values of five application full-sib families. The application families had genotypes at 364 SNPs from a dedicated 512 SNP array, and these genotypic data were extended to the high-density level by imputation. These five families were phenotyped for 1 year and their phenotypes were compared to the predicted breeding values. Accuracy of genomic prediction across the 10 traits reached a maximum value of 0.5 and had a median value of 0.19. The accuracies were strongly affected by the phenotypic distribution and heritability of traits. In the largest family, significant selection response was observed for traits with high heritability and symmetric phenotypic distribution. Traits that showed non-significant response often had reduced and skewed phenotypic variation or low heritability. Among the five application families the accuracies were uncorrelated to the degree of relatedness to the training population. The results underline the potential of genomic prediction to accelerate breeding progress in outbred fruit tree crops that still need to overcome long generation intervals and extensive phenotyping costs.
Collapse
|
22
|
Whole-Genome Analysis of Diversity and SNP-Major Gene Association in Peach Germplasm. PLoS One 2015; 10:e0136803. [PMID: 26352671 PMCID: PMC4564248 DOI: 10.1371/journal.pone.0136803] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2015] [Accepted: 08/07/2015] [Indexed: 12/24/2022] Open
Abstract
Peach was domesticated in China more than four millennia ago and from there it spread world-wide. Since the middle of the last century, peach breeding programs have been very dynamic generating hundreds of new commercial varieties, however, in most cases such varieties derive from a limited collection of parental lines (founders). This is one reason for the observed low levels of variability of the commercial gene pool, implying that knowledge of the extent and distribution of genetic variability in peach is critical to allow the choice of adequate parents to confer enhanced productivity, adaptation and quality to improved varieties. With this aim we genotyped 1,580 peach accessions (including a few closely related Prunus species) maintained and phenotyped in five germplasm collections (four European and one Chinese) with the International Peach SNP Consortium 9K SNP peach array. The study of population structure revealed the subdivision of the panel in three main populations, one mainly made up of Occidental varieties from breeding programs (POP1OCB), one of Occidental landraces (POP2OCT) and the third of Oriental accessions (POP3OR). Analysis of linkage disequilibrium (LD) identified differential patterns of genome-wide LD blocks in each of the populations. Phenotypic data for seven monogenic traits were integrated in a genome-wide association study (GWAS). The significantly associated SNPs were always in the regions predicted by linkage analysis, forming haplotypes of markers. These diagnostic haplotypes could be used for marker-assisted selection (MAS) in modern breeding programs.
Collapse
|
23
|
Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh). PLoS One 2014; 9:e110377. [PMID: 25303088 PMCID: PMC4193858 DOI: 10.1371/journal.pone.0110377] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2014] [Accepted: 09/12/2014] [Indexed: 01/08/2023] Open
Abstract
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.
Collapse
|
24
|
QTL analysis and candidate gene mapping for the polyphenol content in cider apple. PLoS One 2014; 9:e107103. [PMID: 25271925 PMCID: PMC4182701 DOI: 10.1371/journal.pone.0107103] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Accepted: 08/12/2014] [Indexed: 11/25/2022] Open
Abstract
Polyphenols have favorable antioxidant potential on human health suggesting that their high content is responsible for the beneficial effects of apple consumption. They control the quality of ciders as they predominantly account for astringency, bitterness, color and aroma. In this study, we identified QTLs controlling phenolic compound concentrations and the average polymerization degree of flavanols in a cider apple progeny. Thirty-two compounds belonging to five groups of phenolic compounds were identified and quantified by reversed phase liquid chromatography on both fruit extract and juice, over three years. The average polymerization degree of flavanols was estimated in fruit by phloroglucinolysis coupled to HPLC. Parental maps were built using SSR and SNP markers and used for the QTL analysis. Sixty-nine and 72 QTLs were detected on 14 and 11 linkage groups of the female and male maps, respectively. A majority of the QTLs identified in this study are specific to this population, while others are consistent with previous studies. This study presents for the first time in apple, QTLs for the mean polymerization degree of procyanidins, for which the mechanisms involved remains unknown to this day. Identification of candidate genes underlying major QTLs was then performed in silico and permitted the identification of 18 enzymes of the polyphenol pathway and six transcription factors involved in the apple anthocyanin regulation. New markers were designed from sequences of the most interesting candidate genes in order to confirm their co-localization with underlying QTLs by genetic mapping. Finally, the potential use of these QTLs in breeding programs is discussed.
Collapse
|
25
|
The draft genome sequence of European pear (Pyrus communis L. 'Bartlett'). PLoS One 2014; 9:e92644. [PMID: 24699266 PMCID: PMC3974708 DOI: 10.1371/journal.pone.0092644] [Citation(s) in RCA: 198] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2013] [Accepted: 02/25/2014] [Indexed: 01/03/2023] Open
Abstract
We present a draft assembly of the genome of European pear (Pyrus communis) 'Bartlett'. Our assembly was developed employing second generation sequencing technology (Roche 454), from single-end, 2 kb, and 7 kb insert paired-end reads using Newbler (version 2.7). It contains 142,083 scaffolds greater than 499 bases (maximum scaffold length of 1.2 Mb) and covers a total of 577.3 Mb, representing most of the expected 600 Mb Pyrus genome. A total of 829,823 putative single nucleotide polymorphisms (SNPs) were detected using re-sequencing of 'Louise Bonne de Jersey' and 'Old Home'. A total of 2,279 genetically mapped SNP markers anchor 171 Mb of the assembled genome. Ab initio gene prediction combined with prediction based on homology searching detected 43,419 putative gene models. Of these, 1219 proteins (556 clusters) are unique to European pear compared to 12 other sequenced plant genomes. Analysis of the expansin gene family provided an example of the quality of the gene prediction and an insight into the relationships among one class of cell wall related genes that control fruit softening in both European pear and apple (Malus × domestica). The 'Bartlett' genome assembly v1.0 (http://www.rosaceae.org/species/pyrus/pyrus_communis/genome_v1.0) is an invaluable tool for identifying the genetic control of key horticultural traits in pear and will enable the wide application of marker-assisted and genomic selection that will enhance the speed and efficiency of pear cultivar development.
Collapse
|
26
|
Identification of Pyrus single nucleotide polymorphisms (SNPs) and evaluation for genetic mapping in European pear and interspecific Pyrus hybrids. PLoS One 2013; 8:e77022. [PMID: 24155917 PMCID: PMC3796552 DOI: 10.1371/journal.pone.0077022] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Accepted: 08/26/2013] [Indexed: 11/18/2022] Open
Abstract
We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear ('Old Home'×'Louise Bon Jersey') and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality.
Collapse
|
27
|
Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PLoS One 2013; 8:e67407. [PMID: 23826289 PMCID: PMC3694884 DOI: 10.1371/journal.pone.0067407] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2013] [Accepted: 05/18/2013] [Indexed: 01/21/2023] Open
Abstract
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Collapse
|
28
|
Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape. BMC PLANT BIOLOGY 2013; 13:39. [PMID: 23497049 PMCID: PMC3610244 DOI: 10.1186/1471-2229-13-39] [Citation(s) in RCA: 150] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2012] [Accepted: 02/27/2013] [Indexed: 05/18/2023]
Abstract
BACKGROUND The economic importance of grapevine has driven significant efforts in genomics to accelerate the exploitation of Vitis resources for development of new cultivars. However, although a large number of clonally propagated accessions are maintained in grape germplasm collections worldwide, their use for crop improvement is limited by the scarcity of information on genetic diversity, population structure and proper phenotypic assessment. The identification of representative and manageable subset of accessions would facilitate access to the diversity available in large collections. A genome-wide germplasm characterization using molecular markers can offer reliable tools for adjusting the quality and representativeness of such core samples. RESULTS We investigated patterns of molecular diversity at 22 common microsatellite loci and 384 single nucleotide polymorphisms (SNPs) in 2273 accessions of domesticated grapevine V. vinifera ssp. sativa, its wild relative V. vinifera ssp. sylvestris, interspecific hybrid cultivars and rootstocks. Despite the large number of putative duplicates and extensive clonal relationships among the accessions, we observed high level of genetic variation. In the total germplasm collection the average genetic diversity, as quantified by the expected heterozygosity, was higher for SSR loci (0.81) than for SNPs (0.34). The analysis of the genetic structure in the grape germplasm collection revealed several levels of stratification. The primary division was between accessions of V. vinifera and non-vinifera, followed by the distinction between wild and domesticated grapevine. Intra-specific subgroups were detected within cultivated grapevine representing different eco-geographic groups. The comparison of a phenological core collection and genetic core collections showed that the latter retained more genetic diversity, while maintaining a similar phenotypic variability. CONCLUSIONS The comprehensive molecular characterization of our grape germplasm collection contributes to the knowledge about levels and distribution of genetic diversity in the existing resources of Vitis and provides insights into genetic subdivision within the European germplasm. Genotypic and phenotypic information compared in this study may efficiently guide further exploration of this diversity for facilitating its practical use.
Collapse
|
29
|
A multi-population consensus genetic map reveals inconsistent marker order among maps likely attributed to structural variations in the apple genome. PLoS One 2012; 7:e47864. [PMID: 23144832 PMCID: PMC3489900 DOI: 10.1371/journal.pone.0047864] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 09/19/2012] [Indexed: 12/26/2022] Open
Abstract
Genetic maps serve as frameworks for determining the genetic architecture of quantitative traits, assessing structure of a genome, as well as aid in pursuing association mapping and comparative genetic studies. In this study, a dense genetic map was constructed using a high-throughput 1,536 EST-derived SNP GoldenGate genotyping platform and a global consensus map established by combining the new genetic map with four existing reliable genetic maps of apple. The consensus map identified markers with both major and minor conflicts in positioning across all five maps. These major inconsistencies among marker positions were attributed either to structural variations within the apple genome, or among mapping populations, or genotyping technical errors. These also highlighted problems in assembly and anchorage of the reference draft apple genome sequence in regions with known segmental duplications. Markers common across all five apple genetic maps resulted in successful positioning of 2875 markers, consisting of 2033 SNPs and 843 SSRs as well as other specific markers, on the global consensus map. These markers were distributed across all 17 linkage groups, with an average of 169±33 marker per linkage group and with an average distance of 0.70±0.14 cM between markers. The total length of the consensus map was 1991.38 cM with an average length of 117.14±24.43 cM per linkage group. A total of 569 SNPs were mapped onto the genetic map, consisting of 140 recombinant individuals, from our recently developed apple Oligonucleotide pool assays (OPA). The new functional SNPs, along with the dense consensus genetic map, will be useful for high resolution QTL mapping of important traits in apple and for pursuing comparative genetic studies in Rosaceae.
Collapse
|
30
|
Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array. BMC Genomics 2012; 13:203. [PMID: 22631220 PMCID: PMC3410780 DOI: 10.1186/1471-2164-13-203] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2012] [Accepted: 05/25/2012] [Indexed: 11/18/2022] Open
Abstract
Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
Collapse
|
31
|
Development and evaluation of a 9K SNP array for peach by internationally coordinated SNP detection and validation in breeding germplasm. PLoS One 2012; 7:e35668. [PMID: 22536421 PMCID: PMC3334984 DOI: 10.1371/journal.pone.0035668] [Citation(s) in RCA: 114] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2012] [Accepted: 03/19/2012] [Indexed: 01/14/2023] Open
Abstract
Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species.
Collapse
|
32
|
Whole genome comparisons of Fragaria, Prunus and Malus reveal different modes of evolution between Rosaceous subfamilies. BMC Genomics 2012; 13:129. [PMID: 22475018 PMCID: PMC3368713 DOI: 10.1186/1471-2164-13-129] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2011] [Accepted: 04/04/2012] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND Rosaceae include numerous economically important and morphologically diverse species. Comparative mapping between the member species in Rosaceae have indicated some level of synteny. Recently the whole genome of three crop species, peach, apple and strawberry, which belong to different genera of the Rosaceae family, have been sequenced, allowing in-depth comparison of these genomes. RESULTS Our analysis using the whole genome sequences of peach, apple and strawberry identified 1399 orthologous regions between the three genomes, with a mean length of around 100 kb. Each peach chromosome showed major orthology mostly to one strawberry chromosome, but to more than two apple chromosomes, suggesting that the apple genome went through more chromosomal fissions in addition to the whole genome duplication after the divergence of the three genera. However, the distribution of contiguous ancestral regions, identified using the multiple genome rearrangements and ancestors (MGRA) algorithm, suggested that the Fragaria genome went through a greater number of small scale rearrangements compared to the other genomes since they diverged from a common ancestor. Using the contiguous ancestral regions, we reconstructed a hypothetical ancestral genome for the Rosaceae 7 composed of nine chromosomes and propose the evolutionary steps from the ancestral genome to the extant Fragaria, Prunus and Malus genomes. CONCLUSION Our analysis shows that different modes of evolution may have played major roles in different subfamilies of Rosaceae. The hypothetical ancestral genome of Rosaceae and the evolutionary steps that lead to three different lineages of Rosaceae will facilitate our understanding of plant genome evolution as well as have a practical impact on knowledge transfer among member species of Rosaceae.
Collapse
|
33
|
Genome-wide SNP detection, validation, and development of an 8K SNP array for apple. PLoS One 2012; 7:e31745. [PMID: 22363718 PMCID: PMC3283661 DOI: 10.1371/journal.pone.0031745] [Citation(s) in RCA: 135] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2011] [Accepted: 01/12/2012] [Indexed: 01/07/2023] Open
Abstract
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple.
Collapse
|
34
|
QTL and candidate gene mapping for polyphenolic composition in apple fruit. BMC PLANT BIOLOGY 2012; 12:12. [PMID: 22269060 PMCID: PMC3285079 DOI: 10.1186/1471-2229-12-12] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2011] [Accepted: 01/23/2012] [Indexed: 05/03/2023]
Abstract
BACKGROUND The polyphenolic products of the phenylpropanoid pathway, including proanthocyanidins, anthocyanins and flavonols, possess antioxidant properties that may provide health benefits. To investigate the genetic architecture of control of their biosynthesis in apple fruit, various polyphenolic compounds were quantified in progeny from a 'Royal Gala' × 'Braeburn' apple population segregating for antioxidant content, using ultra high performance liquid chromatography of extracts derived from fruit cortex and skin. RESULTS Construction of genetic maps for 'Royal Gala' and 'Braeburn' enabled detection of 79 quantitative trait loci (QTL) for content of 17 fruit polyphenolic compounds. Seven QTL clusters were stable across two years of harvest and included QTLs for content of flavanols, flavonols, anthocyanins and hydroxycinnamic acids. Alignment of the parental genetic maps with the apple whole genome sequence in silico enabled screening for co-segregation with the QTLs of a range of candidate genes coding for enzymes in the polyphenolic biosynthetic pathway. This co-location was confirmed by genetic mapping of markers derived from the gene sequences. Leucoanthocyanidin reductase (LAR1) co-located with a QTL cluster for the fruit flavanols catechin, epicatechin, procyanidin dimer and five unknown procyanidin oligomers identified near the top of linkage group (LG) 16, while hydroxy cinnamate/quinate transferase (HCT/HQT) co-located with a QTL for chlorogenic acid concentration mapping near the bottom of LG 17. CONCLUSION We conclude that LAR1 and HCT/HQT are likely to influence the concentration of these compounds in apple fruit and provide useful allele-specific markers for marker assisted selection of trees bearing fruit with healthy attributes.
Collapse
|
35
|
Genetic diversity of the genus Malus and implications for linkage mapping with SNPs. TREE GENETICS & GENOMES 2011; 7:857-868. [PMID: 0 DOI: 10.1007/s11295-011-0380-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
|
36
|
High temperature reduces apple fruit colour via modulation of the anthocyanin regulatory complex. PLANT, CELL & ENVIRONMENT 2011; 34:1176-90. [PMID: 21410713 DOI: 10.1111/j.1365-3040.2011.02316.x] [Citation(s) in RCA: 179] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
The biosynthesis of anthocyanin in many plants is affected by environmental conditions. In apple (Malus × domestica Borkh.), concentrations of fruit anthocyanins are lower under hot climatic conditions. We examined the anthocyanin accumulation in the peel of maturing 'Mondial Gala' and 'Royal Gala' apples, grown in both temperate and hot climates, and using artificial heating of on-tree fruit. Heat caused a dramatic reduction of both peel anthocyanin concentration and transcripts of the genes of the anthocyanin biosynthetic pathway. Heating fruit rapidly reduced expression of the R2R3 MYB transcription factor (MYB10) responsible for coordinative regulation for red skin colour, as well as expression of other genes in the transcriptional activation complex. A single night of low temperatures is sufficient to elicit a large increase in transcription of MYB10 and consequently the biosynthetic pathway. Candidate genes that can repress anthocyanin biosynthesis did not appear to be responsible for reductions in anthocyanin content. We propose that temperature-induced regulation of anthocyanin biosynthesis is primarily caused by altered transcript levels of the activating anthocyanin regulatory complex.
Collapse
|
37
|
Comparative analysis of rosaceous genomes and the reconstruction of a putative ancestral genome for the family. BMC Evol Biol 2011; 11:9. [PMID: 21226921 PMCID: PMC3033827 DOI: 10.1186/1471-2148-11-9] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2010] [Accepted: 01/12/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Comparative genome mapping studies in Rosaceae have been conducted until now by aligning genetic maps within the same genus, or closely related genera and using a limited number of common markers. The growing body of genomics resources and sequence data for both Prunus and Fragaria permits detailed comparisons between these genera and the recently released Malus × domestica genome sequence. RESULTS We generated a comparative analysis using 806 molecular markers that are anchored genetically to the Prunus and/or Fragaria reference maps, and physically to the Malus genome sequence. Markers in common for Malus and Prunus, and Malus and Fragaria, respectively were 784 and 148. The correspondence between marker positions was high and conserved syntenic blocks were identified among the three genera in the Rosaceae. We reconstructed a proposed ancestral genome for the Rosaceae. CONCLUSIONS A genome containing nine chromosomes is the most likely candidate for the ancestral Rosaceae progenitor. The number of chromosomal translocations observed between the three genera investigated was low. However, the number of inversions identified among Malus and Prunus was much higher than any reported genome comparisons in plants, suggesting that small inversions have played an important role in the evolution of these two genera or of the Rosaceae.
Collapse
|
38
|
The genome of the domesticated apple (Malus × domestica Borkh.). Nat Genet 2010; 42:833-9. [DOI: 10.1038/ng.654] [Citation(s) in RCA: 1538] [Impact Index Per Article: 109.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2009] [Accepted: 08/03/2010] [Indexed: 11/09/2022]
|
39
|
Physical mapping in highly heterozygous genomes: a physical contig map of the Pinot Noir grapevine cultivar. BMC Genomics 2010; 11:204. [PMID: 20346114 PMCID: PMC2865496 DOI: 10.1186/1471-2164-11-204] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2008] [Accepted: 03/26/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Most of the grapevine (Vitis vinifera L.) cultivars grown today are those selected centuries ago, even though grapevine is one of the most important fruit crops in the world. Grapevine has therefore not benefited from the advances in modern plant breeding nor more recently from those in molecular genetics and genomics: genes controlling important agronomic traits are practically unknown. A physical map is essential to positionally clone such genes and instrumental in a genome sequencing project. RESULTS We report on the first whole genome physical map of grapevine built using high information content fingerprinting of 49,104 BAC clones from the cultivar Pinot Noir. Pinot Noir, as most grape varieties, is highly heterozygous at the sequence level. This resulted in the two allelic haplotypes sometimes assembling into separate contigs that had to be accommodated in the map framework or in local expansions of contig maps. We performed computer simulations to assess the effects of increasing levels of sequence heterozygosity on BAC fingerprint assembly and showed that the experimental assembly results are in full agreement with the theoretical expectations, given the heterozygosity levels reported for grape. The map is anchored to a dense linkage map consisting of 994 markers. 436 contigs are anchored to the genetic map, covering 342 of the 475 Mb that make up the grape haploid genome. CONCLUSIONS We have developed a resource that makes it possible to access the grapevine genome, opening the way to a new era both in grape genetics and breeding and in wine making. The effects of heterozygosity on the assembly have been analyzed and characterized by using several complementary approaches which could be easily transferred to the study of other genomes which present the same features.
Collapse
|
40
|
A SNP transferability survey within the genus Vitis. BMC PLANT BIOLOGY 2008; 8:128. [PMID: 19087337 PMCID: PMC2631481 DOI: 10.1186/1471-2229-8-128] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2008] [Accepted: 12/16/2008] [Indexed: 05/18/2023]
Abstract
BACKGROUND Efforts to sequence the genomes of different organisms continue to increase. The DNA sequence is usually decoded for one individual and its application is for the whole species. The recent sequencing of the highly heterozygous Vitis vinifera L. cultivar Pinot Noir (clone ENTAV 115) genome gave rise to several thousand polymorphisms and offers a good model to study the transferability of its degree of polymorphism to other individuals of the same species and within the genus. RESULTS This study was performed by genotyping 137 SNPs through the SNPlex Genotyping System (Applied Biosystems Inc.) and by comparing the SNPlex sequencing results across 35 (of the 137) regions from 69 grape accessions. A heterozygous state transferability of 31.5% across the unrelated cultivars of V. vinifera, of 18.8% across the wild forms of V. vinifera, of 2.3% among non-vinifera Vitis species, and of 0% with Muscadinia rotundifolia was found. In addition, mean allele frequencies were used to evaluate SNP informativeness and develop useful subsets of markers. CONCLUSION Using SNPlex application and corroboration from the sequencing analysis, the informativeness of SNP markers from the heterozygous grape cultivar Pinot Noir was validated in V. vinifera (including cultivars and wild forms), but had a limited application for non-vinifera Vitis species where a resequencing strategy may be preferred, knowing that homology at priming sites is sufficient. This work will allow future applications such as mapping and diversity studies, accession identification and genomic-research assisted breeding within V. vinifera.
Collapse
|
41
|
Sequencing and assembly of highly heterozygous genome of Vitis vinifera L. cv Pinot Noir: Problems and solutions. J Biotechnol 2008; 136:38-43. [DOI: 10.1016/j.jbiotec.2008.04.013] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2007] [Revised: 04/10/2008] [Accepted: 04/18/2008] [Indexed: 10/22/2022]
|
42
|
A reference integrated map for cultivated grapevine (Vitis vinifera L.) from three crosses, based on 283 SSR and 501 SNP-based markers. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008. [PMID: 18504538 DOI: 10.1007/s00122-008-07943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
We have developed an integrated map from five elite cultivars of Vitis vinifera L.; Syrah, Pinot Noir, Grenache, Cabernet Sauvignon and Riesling which are parents of three segregating populations. A new source of markers, SNPs, identified in ESTs and unique BAC-end sequences was added to the available IGGP reference set of SSRs. The complete integrated map comprises 1,134 markers (350 AFLP, 332 BESs, 169 ESTs, 283 SSRs) spanning 1,443 cM over 19 linkage groups and shows a mean distance between neighbouring loci of 1.27 cM. Marker order was mainly conserved between the integrated map and the highly dense SyrahxPinot Noir consensus map except for few inversions. Moreover, the marker order has been validated through the assembled genome sequence of Pinot Noir. We have also assessed the transferability of SNP-based markers among five V. vinifera varieties, enabling marker validation across different genotypes. This integrated map can serve as a fundamental tool for molecular breeding in V. vinifera and related species and provide a basis for studies of genome organization and evolution in grapevines.
Collapse
|
43
|
A reference integrated map for cultivated grapevine (Vitis vinifera L.) from three crosses, based on 283 SSR and 501 SNP-based markers. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008; 117:499-511. [PMID: 18504538 DOI: 10.1007/s00122-008-0794-3] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2008] [Accepted: 05/02/2008] [Indexed: 05/17/2023]
Abstract
We have developed an integrated map from five elite cultivars of Vitis vinifera L.; Syrah, Pinot Noir, Grenache, Cabernet Sauvignon and Riesling which are parents of three segregating populations. A new source of markers, SNPs, identified in ESTs and unique BAC-end sequences was added to the available IGGP reference set of SSRs. The complete integrated map comprises 1,134 markers (350 AFLP, 332 BESs, 169 ESTs, 283 SSRs) spanning 1,443 cM over 19 linkage groups and shows a mean distance between neighbouring loci of 1.27 cM. Marker order was mainly conserved between the integrated map and the highly dense SyrahxPinot Noir consensus map except for few inversions. Moreover, the marker order has been validated through the assembled genome sequence of Pinot Noir. We have also assessed the transferability of SNP-based markers among five V. vinifera varieties, enabling marker validation across different genotypes. This integrated map can serve as a fundamental tool for molecular breeding in V. vinifera and related species and provide a basis for studies of genome organization and evolution in grapevines.
Collapse
|
44
|
A grapevine (Vitis vinifera L.) genetic map integrating the position of 139 expressed genes. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008; 116:1129-43. [PMID: 18347774 DOI: 10.1007/s00122-008-0741-3] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2007] [Accepted: 03/01/2008] [Indexed: 05/20/2023]
Abstract
Grapevine molecular maps based on microsatellites, AFLP and RAPD markers are now available. SSRs are essential to allow cross-talks between maps, thus upgrading any growing grapevine maps. In this work, single nucleotide polymorphisms (SNPs) were developed from coding sequences and from unique BAC-end sequences, and nested in a SSR framework map of grapevine. Genes participating to flavonoids metabolism and defence, and signal transduction pathways related genes were also considered. Primer pairs for 351 loci were developed from ESTs present on public databases and screened for polymorphism in the "Merzling" (a complex genotype Freiburg 993-60 derived from multiple crosses also involving wild Vitis species) x Vitis vinifera (cv. Teroldego) cross population. In total 138 SNPs, 108 SSR markers and a phenotypic trait (berry colour) were mapped in 19 major linkage groups of the consensus map. In specific cases, ESTs with putatively related functions mapped near QTLs previously identified for resistance and berry ripening. Genes related to anthocyanin metabolism mapped in different linkage groups. A myb gene, which has been correlated with anthocyanin biosynthesis, cosegregated with berry colour on linkage group 2. The possibility of associating candidate genes to known position of QTL is discussed for this plant.
Collapse
|
45
|
SNP high-throughput screening in grapevine using the SNPlex genotyping system. BMC PLANT BIOLOGY 2008; 8:12. [PMID: 18226250 PMCID: PMC2268689 DOI: 10.1186/1471-2229-8-12] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2007] [Accepted: 01/28/2008] [Indexed: 05/24/2023]
Abstract
BACKGROUND Until recently, only a small number of low- and mid-throughput methods have been used for single nucleotide polymorphism (SNP) discovery and genotyping in grapevine (Vitis vinifera L.). However, following completion of the sequence of the highly heterozygous genome of Pinot Noir, it has been possible to identify millions of electronic SNPs (eSNPs) thus providing a valuable source for high-throughput genotyping methods. RESULTS Herein we report the first application of the SNPlexgenotyping system in grapevine aiming at the anchoring of an eukaryotic genome. This approach combines robust SNP detection with automated assay readout and data analysis. 813 candidate eSNPs were developed from non-repetitive contigs of the assembled genome of Pinot Noir and tested in 90 progeny of Syrah x Pinot Noir cross. 563 new SNP-based markers were obtained and mapped. The efficiency rate of 69% was enhanced to 80% when multiple displacement amplification (MDA) methods were used for preparation of genomic DNA for the SNPlex assay. CONCLUSION Unlike other SNP genotyping methods used to investigate thousands of SNPs in a few genotypes, or a few SNPs in around a thousand genotypes, the SNPlex genotyping system represents a good compromise to investigate several hundred SNPs in a hundred or more samples simultaneously. Therefore, the use of the SNPlex assay, coupled with whole genome amplification (WGA), is a good solution for future applications in well-equipped laboratories.
Collapse
|
46
|
A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS One 2007; 2:e1326. [PMID: 18094749 PMCID: PMC2147077 DOI: 10.1371/journal.pone.0001326] [Citation(s) in RCA: 579] [Impact Index Per Article: 34.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2007] [Accepted: 11/21/2007] [Indexed: 01/11/2023] Open
Abstract
Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape.
Collapse
|
47
|
A dense single-nucleotide polymorphism-based genetic linkage map of grapevine (Vitis vinifera L.) anchoring Pinot Noir bacterial artificial chromosome contigs. Genetics 2007; 176:2637-50. [PMID: 17603124 PMCID: PMC1950661 DOI: 10.1534/genetics.106.067462] [Citation(s) in RCA: 111] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2006] [Accepted: 06/14/2007] [Indexed: 11/18/2022] Open
Abstract
The construction of a dense genetic map for Vitis vinifera and its anchoring to a BAC-based physical map is described: it includes 994 loci mapped onto 19 linkage groups, corresponding to the basic chromosome number of Vitis. Spanning 1245 cM with an average distance of 1.3 cM between adjacent markers, the map was generated from the segregation of 483 single-nucleotide polymorphism (SNP)-based genetic markers, 132 simple sequence repeats (SSRs), and 379 AFLP markers in a mapping population of 94 F(1) individuals derived from a V. vinifera cross of the cultivars Syrah and Pinot Noir. Of these markers, 623 were anchored to 367 contigs that are included in a physical map produced from the same clone of Pinot Noir and covering 352 Mbp. On the basis of contigs containing two or more genetically mapped markers, region-dependent estimations of physical and recombinational distances are presented. The markers used in this study include 118 SSRs common to an integrated map derived from five segregating populations of V. vinifera. The positions of these SSR markers in the two maps are conserved across all Vitis linkage groups. The addition of SNP-based markers introduces polymorphisms that are easy to database, are useful for evolutionary studies, and significantly increase the density of the map. The map provides the most comprehensive view of the Vitis genome reported to date and will be relevant for future studies on structural and functional genomics and genetic improvement.
Collapse
|
48
|
Abstract
Genetic maps are built using the genotypes of many related individuals. Genotyping errors in these data sets can distort genetic maps, especially by inflating the distances. We have extended the traditional likelihood model used for genetic mapping to include the possibility of genotyping errors. Each individual marker is assigned an error rate, which is inferred from the data, just as the genetic distances are. We have developed a software package, called TMAP, which uses this model to find maximum-likelihood maps for phase-known pedigrees. We have tested our methods using a data set in Vitis and on simulated data and confirmed that our method dramatically reduces the inflationary effect caused by increasing the number of markers and leads to more accurate orders.
Collapse
|
49
|
An improved protocol for the production of AFLPTM markers in complex genomes by means of capillary electrophoresis. J Anim Breed Genet 2005; 122:62-8. [PMID: 16130490 DOI: 10.1111/j.1439-0388.2004.00476.x] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The amplified fragment-length polymorphism (AFLP) technology is a recently introduced method to investigate genomes of different complexity, from microbial to higher organisms. It is applied to purposes as diverse as identification of species, strain and varieties, investigation of genetic diversity within and between populations, simple and complex trait mapping, and construction of linkage and physical maps. This technology has been designed on the use of primers labelled with radioactivity and on AFLP fragment separation on sequencing gel. We show that the original EcoRI/TaqI AFLP protocol does not perform appropriately when transferred to fluorescent labelling and capillary electrophoresis (CE), and propose an improved protocol for the production of high-quality AFLP markers in fish, rodents and artiodactyles by means of the Beckman-Coulter CEQ2000 automatic DNA sequencer. In addition, we describe the procedure routinely used in our laboratory to obtain binary matrices from AFLP profiles with the aid of Genographer free-share software (vers. 1.6.0, J.J. Benham, Montana State University), able to elaborate original fragment data and convert them to standard graphical formats for phylogenetic analyses. Comparison with radioactive AFLPs in goats confirmed the reliability of the protocol developed for CE. In fact, 107 fragments generated by two primer combinations and identified by both techniques were attributed the same scoring. Compared with traditional methods, the use of capillary systems and automated analysis increases data throughput and scoring reliability, decreasing the overall experimental error.
Collapse
|
50
|
Abstract
A comparative genetic map was constructed between two important genera of the family Pinaceae. Ten homologous linkage groups in loblolly pine (Pinus taeda L.) and Douglas fir (Pseudotsuga menziesii [Mirb.] Franco) were identified using orthologous expressed sequence tag polymorphism (ESTP) and restriction fragment length polymorphism (RFLP) markers. The comparative mapping revealed extensive synteny and colinearity between genomes of the Pinaceae, consistent with the hypothesis of conservative chromosomal evolution in this important plant family. This study reports the first comparative map in forest trees at the family taxonomic level and establishes a framework for comparative genomics in Pinaceae.
Collapse
|