1
|
Abbasi Holasou H, Panahi B, Shahi A, Nami Y. Integration of machine learning models with microsatellite markers: New avenue in world grapevine germplasm characterization. Biochem Biophys Rep 2024; 38:101678. [PMID: 38495412 PMCID: PMC10940787 DOI: 10.1016/j.bbrep.2024.101678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 02/09/2024] [Accepted: 02/27/2024] [Indexed: 03/19/2024] Open
Abstract
Development of efficient analytical techniques is required for effective interpretation of biological data to take novel hypotheses and finding the critical predictive patterns. Machine Learning algorithms provide a novel opportunity for development of low-cost and practical solutions in biology. In this study, we proposed a new integrated analytical approach using supervised machine learning algorithms and microsatellites data of worldwide vitis populations. A total of 1378 wild (V. vinifera spp. sylvestris) and cultivated (V. vinifera spp. sativa) accessions of grapevine were investigated using 20 microsatellite markers. Data cleaning, feature selection, and supervised machine learning classification models vis, Naive Bayes, Support Vector Machine (SVM) and Tree Induction methods were implied to find most indicative and diagnostic alleles to represent wild/cultivated and originated geography of each population. Our combined approaches showed microsatellite markers with the highest differentiating capacity and proved efficiency for our pipeline of classification and prediction of vitis accessions. Moreover, our study proposed the best combination of markers for better distinguishing of populations, which can be exploited in future germplasm conservation and breeding programs.
Collapse
Affiliation(s)
- Hossein Abbasi Holasou
- Department of Plant Breeding and Biotechnology, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
| | - Bahman Panahi
- Department of Genomics, Branch for Northwest and West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Ali Shahi
- Faculty of Agriculture (Meshgin Shahr Campus), Mohaghegh Ardabili University, Ardabil, Iran
| | - Yousef Nami
- Department of Food Biotechnology, Branch for Northwest and West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| |
Collapse
|
2
|
Tympakianakis S, Trantas E, Avramidou EV, Ververidis F. Vitis vinifera genotyping toolbox to highlight diversity and germplasm identification. FRONTIERS IN PLANT SCIENCE 2023; 14:1139647. [PMID: 37180393 PMCID: PMC10169827 DOI: 10.3389/fpls.2023.1139647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Accepted: 03/27/2023] [Indexed: 05/16/2023]
Abstract
The contribution of vine cultivation to human welfare as well as the stimulation of basic social and cultural features of civilization has been great. The wide temporal and regional distribution created a wide array of genetic variants that have been used as propagating material to promote cultivation. Information on the origin and relationships among cultivars is of great interest from a phylogenetics and biotechnology perspective. Fingerprinting and exploration of the complicated genetic background of varieties may contribute to future breeding programs. In this review, we present the most frequently used molecular markers, which have been used on Vitis germplasm. We discuss the scientific progress that led to the new strategies being implemented utilizing state-of-the-art next generation sequencing technologies. Additionally, we attempted to delimit the discussion on the algorithms used in phylogenetic analyses and differentiation of grape varieties. Lastly, the contribution of epigenetics is highlighted to tackle future roadmaps for breeding and exploitation of Vitis germplasm. The latter will remain in the top of the edge for future breeding and cultivation and the molecular tools presented herein, will serve as a reference point in the challenging years to come.
Collapse
Affiliation(s)
- Stylianos Tympakianakis
- Laboratory of Biological and Biotechnological Applications, Department of Agriculture, School of Agricultural Sciences, Hellenic Mediterranean University, Heraklion, Greece
| | - Emmanouil Trantas
- Laboratory of Biological and Biotechnological Applications, Department of Agriculture, School of Agricultural Sciences, Hellenic Mediterranean University, Heraklion, Greece
- Institute of Agri-Food and Life Sciences, Research Center of the Hellenic Mediterranean University, Heraklion, Greece
| | - Evangelia V. Avramidou
- Institute of Mediterranean Forest Ecosystems, Hellenic Agricultural Organisation “DIMITRA“, Athens, Greece
| | - Filippos Ververidis
- Laboratory of Biological and Biotechnological Applications, Department of Agriculture, School of Agricultural Sciences, Hellenic Mediterranean University, Heraklion, Greece
- Institute of Agri-Food and Life Sciences, Research Center of the Hellenic Mediterranean University, Heraklion, Greece
| |
Collapse
|
3
|
T. V. N, S. RP, R. L. R. Population structure and genetic diversity characterization of soybean for seed longevity. PLoS One 2022; 17:e0278631. [PMID: 36472991 PMCID: PMC9725150 DOI: 10.1371/journal.pone.0278631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 11/19/2022] [Indexed: 12/12/2022] Open
Abstract
Seed longevity is an important trait in the context of germplasm conservation and economics of seed production. The identification of populations with high level of genetic variability for seed longevity and associated traits will become a valuable resource for superior alleles for seed longevity. In this study, Genotyping-by-sequencing (GBS)-single nucleotide polymorphism (SNP) approach, simple sequence repeats (SSR) markers and agro-morphological traits have been explored to investigate the diversity and population structure of assembled 96 genotypes. The GBS technique performed on 96 genotypes of soybean (Glycine max (L.) Merrill) resulted in 37,897 SNPs on sequences aligned to the reference genome sequence. The average genome coverage was 6.81X with a mapping rate of 99.56% covering the entire genome. Totally, 29,955 high quality SNPs were identified after stringent filtering and most of them were detected in non-coding regions. The 96 genotypes were phenotyped for eight quantitative and ten qualitative traits by growing in field by following augmented design. The STRUCTURE (Bayesian-model based algorithm), UPGMA (Un-weighed Pair Group Method with Arithmetic mean) and principal component analysis (PCA) approaches using SSR, SNP as well as quantitative and qualitative traits revealed population structure and diversity in assembled population. The Bayesian-model based STRUCTURE using SNP markers could effectively identify clusters with higher seed longevity associated with seed coat colour and size which were subsequently validated by UPGMA and PCA based on SSR and agro-morphological traits. The results of STRUCTURE, PCA and UPGMA cluster analysis showed high degree of similarity and provided complementary data that helped to identify genotypes with higher longevity. Six black colour genotypes, viz., Local black soybean, Kalitur, ACC Nos. 39, 109, 101 and 37 showed higher seed longevity during accelerated ageing. Higher coefficient of variability observed for plant height, number of pods per plant, seed yield per plant, 100 seed weight and seed longevity confirms the diversity in assembled population and its suitability for quantitative trait loci (QTL) mapping.
Collapse
Affiliation(s)
- Naflath T. V.
- Department of Seed Science and Technology, College of Agriculture, UAS, GKVK, Bangalore, Karnataka, India
| | - Rajendra Prasad S.
- Department of Seed Science and Technology, College of Agriculture, UAS, GKVK, Bangalore, Karnataka, India
| | - Ravikumar R. L.
- Department of Plant Biotechnology, College of Agriculture, UAS, GKVK, Bangalore, Karnataka, India
- * E-mail:
| |
Collapse
|
4
|
Vervalle JA, Costantini L, Lorenzi S, Pindo M, Mora R, Bolognesi G, Marini M, Lashbrooke JG, Tobutt KR, Vivier MA, Roodt-Wilding R, Grando MS, Bellin D. A high-density integrated map for grapevine based on three mapping populations genotyped by the Vitis18K SNP chip. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:4371-4390. [PMID: 36271055 PMCID: PMC9734222 DOI: 10.1007/s00122-022-04225-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
We present a high-density integrated map for grapevine, allowing refinement and improved understanding of the grapevine genome, while demonstrating the applicability of the Vitis18K SNP chip for linkage mapping. The improvement of grapevine through biotechnology requires identification of the molecular bases of target traits by studying marker-trait associations. The Vitis18K SNP chip provides a useful genotyping tool for genome-wide marker analysis. Most linkage maps are based on single mapping populations, but an integrated map can increase marker density and show order conservation. Here we present an integrated map based on three mapping populations. The parents consist of the well-known wine cultivars 'Cabernet Sauvignon', 'Corvina' and 'Rhine Riesling', the lesser-known wine variety 'Deckrot', and a table grape selection, G1-7720. Three high-density population maps with an average inter-locus gap ranging from 0.74 to 0.99 cM were developed. These maps show high correlations (0.9965-0.9971) with the reference assembly, containing only 93 markers with large order discrepancies compared to expected physical positions, of which a third is consistent across multiple populations. Moreover, the genetic data aid the further refinement of the grapevine genome assembly, by anchoring 104 yet unanchored scaffolds. From these population maps, an integrated map was constructed which includes 6697 molecular markers and reduces the inter-locus gap distance to 0.60 cM, resulting in the densest integrated map for grapevine thus far. A small number of discrepancies, mainly of short distance, involve 88 markers that remain conflictual across maps. The integrated map shows similar collinearity to the reference assembly (0.9974) as the single maps. This high-density map increases our understanding of the grapevine genome and provides a useful tool for its further characterization and the dissection of complex traits.
Collapse
Affiliation(s)
- Jessica A Vervalle
- Department of Genetics, Stellenbosch University, Stellenbosch, 7600, South Africa
- ARC Infruitec-Nietvoorbij, Stellenbosch, 7599, South Africa
| | - Laura Costantini
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy
| | - Silvia Lorenzi
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy
| | - Massimo Pindo
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy
| | - Riccardo Mora
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Giada Bolognesi
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Martina Marini
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Justin G Lashbrooke
- South African Grape and Wine Research Institute, Stellenbosch University, Stellenbosch, 7600, South Africa
| | - Ken R Tobutt
- ARC Infruitec-Nietvoorbij, Stellenbosch, 7599, South Africa
| | - Melané A Vivier
- South African Grape and Wine Research Institute, Stellenbosch University, Stellenbosch, 7600, South Africa
| | - Rouvay Roodt-Wilding
- Department of Genetics, Stellenbosch University, Stellenbosch, 7600, South Africa
| | - Maria Stella Grando
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Italy
- Center Agriculture Food and Environment (C3A), University of Trento, San Michele all'Adige, Italy
| | - Diana Bellin
- Department of Biotechnology, University of Verona, Verona, Italy.
| |
Collapse
|
5
|
Abstract
The pip, as the most common grapevine archaeological remain, is extensively used to document past viticulture dynamics. This paper uses state of the art morphological analyses to analyse the largest reference collection of modern pips to date, representative of the present-day diversity of the domesticated grapevine from Western Eurasia. We tested for a costructure between the form of the modern pips and the: destination use (table/wine), geographical origins, and populational labels obtained through two molecular approaches. Significant structuring is demonstrated for each of these cofactors and for the first time it is possible to infer properties of varieties without going through the parallel with modern varieties. These results provide a unique tool that can be applied to archaeological pips in order to reconstruct the spatio-temporal dynamics of grape diversity on a large scale and to better understand viticulture history. The models obtained were then used to infer the affiliations with archaeobotanical remains recovered in Mas de Vignoles XIV (Nîmes, France). The results show a twofold shift between the Late Iron Age and the Middle Ages, from table to wine grape varieties and from eastern to western origins which correlates with previous palaeogenomic results.
Collapse
|
6
|
Recent Large-Scale Genotyping and Phenotyping of Plant Genetic Resources of Vegetatively Propagated Crops. PLANTS 2021; 10:plants10020415. [PMID: 33672381 PMCID: PMC7926561 DOI: 10.3390/plants10020415] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 02/19/2021] [Accepted: 02/19/2021] [Indexed: 12/12/2022]
Abstract
Several recent national and international projects have focused on large-scale genotyping of plant genetic resources in vegetatively propagated crops like fruit and berries, potatoes and woody ornamentals. The primary goal is usually to identify true-to-type plant material, detect possible synonyms, and investigate genetic diversity and relatedness among accessions. A secondary goal may be to create sustainable databases that can be utilized in research and breeding for several years ahead. Commonly applied DNA markers (like microsatellite DNA and SNPs) and next-generation sequencing each have their pros and cons for these purposes. Methods for large-scale phenotyping have lagged behind, which is unfortunate since many commercially important traits (yield, growth habit, storability, and disease resistance) are difficult to score. Nevertheless, the analysis of gene action and development of robust DNA markers depends on environmentally controlled screening of very large sets of plant material. Although more time-consuming, co-operative projects with broad-scale data collection are likely to produce more reliable results. In this review, we will describe some of the approaches taken in genotyping and/or phenotyping projects concerning a wide variety of vegetatively propagated crops.
Collapse
|
7
|
Bianchi D, Caramanico L, Grossi D, Brancadoro L, Lorenzis GD. How Do Novel M-Rootstock ( Vitis Spp.) Genotypes Cope with Drought? PLANTS 2020; 9:plants9101385. [PMID: 33080884 PMCID: PMC7603061 DOI: 10.3390/plants9101385] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 12/31/2022]
Abstract
Most of the vineyards around the world are in areas characterized by seasonal drought, where water deficits and high temperatures represent severe constraints on the regular grapevine growth cycle. Although grapevines are well adapted to arid and semi-arid environments, water stress can cause physiological changes, from mild to irreversible. Screening of available Vitis spp. genetic diversity for new rootstock breeding programs has been proposed as a way for which new viticulture challenges may be faced. In 2014, novel genotypes (M-rootstocks) were released from the University of Milan. In this work, the behavior of M1, M3 and M4 in response to decreasing water availabilities (80%, 50% and 20% soil water content, SWC) was investigated at the physiological and gene expression levels, evaluating gas exchange, stem water potential and transcript abundances of key genes related to ABA (abscisic acid) biosynthesis (VvZEP, VvNCED1 and VvNCED2) and signaling (VvPP2C4, VvSnRK2.6 and VvABF2), and comparing them to those of cuttings of nine commercial rootstocks widely used in viticulture. M-rootstocks showed a change at physiological levels in severe water-stressed conditions (20% soil water content, SWC), reducing the stomatal conductance and stem water potential, but maintaining high photosynthetic activity. Water use efficiency was high in water-limiting conditions. The transcriptional changes were observed at 50% SWC, with an increment of transcripts of VvNCED1 and VvNCED2 genes. M-rootstocks showed similar behavior to 1103P and 110R rootstocks, two highly tolerant commercial genotypes. These rootstocks adopted a tolerant strategy to face water-stressed conditions.
Collapse
Affiliation(s)
| | | | | | - Lucio Brancadoro
- Correspondence: (L.B.); (G.D.L.); Tel.: +39-02-503-16559 (L.B.); +39-02-503-16565 (G.D.L.)
| | - Gabriella De Lorenzis
- Correspondence: (L.B.); (G.D.L.); Tel.: +39-02-503-16559 (L.B.); +39-02-503-16565 (G.D.L.)
| |
Collapse
|
8
|
Raimondi S, Tumino G, Ruffa P, Boccacci P, Gambino G, Schneider A. DNA-based genealogy reconstruction of Nebbiolo, Barbera and other ancient grapevine cultivars from northwestern Italy. Sci Rep 2020; 10:15782. [PMID: 32978486 PMCID: PMC7519648 DOI: 10.1038/s41598-020-72799-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 09/01/2020] [Indexed: 01/08/2023] Open
Abstract
Northwestern Italy is a wine region of the world with the highest of reputations, where top quality wines of remarkable economic value are produced from traditional, long-cultivated varieties. Kinship analyses were performed using 32 microsatellite loci and more than 10 K single-nucleotide polymorphism markers on 227 traditional grapes mostly from Northwestern Italy—including those that have been neglected or are threatened. This was done to better understand the genetic grapevine origins and history of this reputable wine producing area, thus enhancing its cultural value and the marketing appeal of its wines. The work revealed a complex network of genetic relationships among varieties, with little contribution of genotypes from other areas. It revealed the major role played by a few ancient grape varieties as parents of numerous offspring, including some that are endangered today. The ancestry of many cultivars is proposed. Among these are Dolcetto, Barbera and Riesling italico. Through the inference of parent–offspring and sibling relations, marker profiles of ungenotyped putative parents were reconstructed, suggesting kinship relations and a possible parentage for Nebbiolo, one of the most ancient wine grapes worldwide. Historic and geographic implications from the resulting kinships are discussed.
Collapse
Affiliation(s)
- Stefano Raimondi
- National Research Council of Italy-Institute for Sustainable Plant Protection (CNR-IPSP), Strada delle Cacce 73, 10135, Turin, Italy
| | - Giorgio Tumino
- Council for Agricultural Research and Economics-Research Centre for Genomics and Bioinformatics, Via San Protaso 302, 29017, Fiorenzuola d'Arda, PC, Italy
| | - Paola Ruffa
- National Research Council of Italy-Institute for Sustainable Plant Protection (CNR-IPSP), Strada delle Cacce 73, 10135, Turin, Italy.,University of Turin-Department of Agricultural, Forest and Food Sciences (UNITO-DiSAFA), L. Braccini 2, 10095, Grugliasco, Turin, Italy
| | - Paolo Boccacci
- National Research Council of Italy-Institute for Sustainable Plant Protection (CNR-IPSP), Strada delle Cacce 73, 10135, Turin, Italy
| | - Giorgio Gambino
- National Research Council of Italy-Institute for Sustainable Plant Protection (CNR-IPSP), Strada delle Cacce 73, 10135, Turin, Italy
| | - Anna Schneider
- National Research Council of Italy-Institute for Sustainable Plant Protection (CNR-IPSP), Strada delle Cacce 73, 10135, Turin, Italy.
| |
Collapse
|