1
|
Arrones A, Antar O, Pereira-Dias L, Solana A, Ferrante P, Aprea G, Plazas M, Prohens J, Díez MJ, Giuliano G, Gramazio P, Vilanova S. A novel tomato interspecific ( Solanum lycopersicum var. cerasiforme and Solanum pimpinellifolium) MAGIC population facilitates trait association and candidate gene discovery in untapped exotic germplasm. HORTICULTURE RESEARCH 2024; 11:uhae154. [PMID: 39005998 PMCID: PMC11246243 DOI: 10.1093/hr/uhae154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 05/23/2024] [Indexed: 07/16/2024]
Abstract
We developed a novel eight-way tomato multiparental advanced generation intercross (MAGIC) population to improve the accessibility of tomato relatives genetic resources to geneticists and breeders. The interspecific tomato MAGIC population (ToMAGIC) was obtained by intercrossing four accessions each of Solanum lycopersicum var. cerasiforme and Solanum pimpinellifolium, which are the weedy relative and the ancestor of cultivated tomato, respectively. The eight exotic ToMAGIC founders were selected based on a representation of the genetic diversity and geographical distribution of the two taxa. The resulting MAGIC population comprises 354 lines, which were genotyped using a new 12k tomato single primer enrichment technology panel and yielded 6488 high-quality single-nucleotide polymorphism (SNPs). The genotyping data revealed a high degree of homozygosity, an absence of genetic structure, and a balanced representation of the founder genomes. To evaluate the potential of the ToMAGIC population, a proof of concept was conducted by phenotyping it for fruit size, plant pigmentation, leaf morphology, and earliness. Genome-wide association studies identified strong associations for the studied traits, pinpointing both previously identified and novel candidate genes near or within the linkage disequilibrium blocks. Domesticated alleles for fruit size were recessive and were found, at low frequencies, in wild/ancestral populations. Our findings demonstrate that the newly developed ToMAGIC population is a valuable resource for genetic research in tomato, offering significant potential for identifying new genes that govern key traits in tomato. ToMAGIC lines displaying a pyramiding of traits of interest could have direct applicability for integration into breeding pipelines providing untapped variation for tomato breeding.
Collapse
Affiliation(s)
- Andrea Arrones
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Oussama Antar
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Leandro Pereira-Dias
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Andrea Solana
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Paola Ferrante
- Agenzia Nazionale Per Le Nuove Tecnologie, L’energia e Lo Sviluppo Economico Sostenibile (ENEA), Casaccia Research Centre, Via Anguillarese 301, 00123 Rome, Italy
| | - Giuseppe Aprea
- Agenzia Nazionale Per Le Nuove Tecnologie, L’energia e Lo Sviluppo Economico Sostenibile (ENEA), Casaccia Research Centre, Via Anguillarese 301, 00123 Rome, Italy
| | - Mariola Plazas
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Jaime Prohens
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - María José Díez
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Giovanni Giuliano
- Agenzia Nazionale Per Le Nuove Tecnologie, L’energia e Lo Sviluppo Economico Sostenibile (ENEA), Casaccia Research Centre, Via Anguillarese 301, 00123 Rome, Italy
| | - Pietro Gramazio
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| | - Santiago Vilanova
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022 Valencia, Spain
| |
Collapse
|
2
|
Lin YC, Mayer M, Valle Torres D, Pook T, Hölker AC, Presterl T, Ouzunova M, Schön CC. Genomic prediction within and across maize landrace derived populations using haplotypes. FRONTIERS IN PLANT SCIENCE 2024; 15:1351466. [PMID: 38584949 PMCID: PMC10995330 DOI: 10.3389/fpls.2024.1351466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/05/2024] [Indexed: 04/09/2024]
Abstract
Genomic prediction (GP) using haplotypes is considered advantageous compared to GP solely reliant on single nucleotide polymorphisms (SNPs), owing to haplotypes' enhanced ability to capture ancestral information and their higher linkage disequilibrium with quantitative trait loci (QTL). Many empirical studies supported the advantages of haplotype-based GP over SNP-based approaches. Nevertheless, the performance of haplotype-based GP can vary significantly depending on multiple factors, including the traits being studied, the genetic structure of the population under investigation, and the particular method employed for haplotype construction. In this study, we compared haplotype and SNP based prediction accuracies in four populations derived from European maize landraces. Populations comprised either doubled haploid lines (DH) derived directly from landraces, or gamete capture lines (GC) derived from crosses of the landraces with an inbred line. For two different landraces, both types of populations were generated, genotyped with 600k SNPs and phenotyped as lines per se for five traits. Our study explores three prediction scenarios: (i) within each of the four populations, (ii) across DH and GC populations from the same landrace, and (iii) across landraces using either DH or GC populations. Three haplotype construction methods were evaluated: 1. fixed-window blocks (FixedHB), 2. LD-based blocks (HaploView), and 3. IBD-based blocks (HaploBlocker). In within population predictions, FixedHB and HaploView methods performed as well as or slightly better than SNPs for all traits. HaploBlocker improved accuracy for certain traits but exhibited inferior performance for others. In prediction across populations, the parameter setting from HaploBlocker which controls the construction of shared haplotypes between populations played a crucial role for obtaining optimal results. When predicting across landraces, accuracies were low for both, SNP and haplotype approaches, but for specific traits substantial improvement was observed with HaploBlocker. This study provides recommendations for optimal haplotype construction and identifies relevant parameters for constructing haplotypes in the context of genomic prediction.
Collapse
Affiliation(s)
- Yan-Cheng Lin
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Manfred Mayer
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- Bayer CropScience Deutschland GmbH, Borken, Germany
| | - Daniel Valle Torres
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
- Sugar Beet Breeding, Strube Research GmbH & Co. KG, Söllingen, Germany
| | - Torsten Pook
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, Netherlands
| | - Armin C. Hölker
- Product Development Maize and Oil Crops, KWS SAAT SE & Co. KGaA, Einbeck, Germany
| | - Thomas Presterl
- Product Development Maize and Oil Crops, KWS SAAT SE & Co. KGaA, Einbeck, Germany
| | - Milena Ouzunova
- Product Development Maize and Oil Crops, KWS SAAT SE & Co. KGaA, Einbeck, Germany
| | - Chris-Carolin Schön
- Chair of Plant Breeding, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| |
Collapse
|
3
|
Peng MS, Liu YH, Shen QK, Zhang XH, Dong J, Li JX, Zhao H, Zhang H, Zhang X, He Y, Shi H, Cui C, Ouzhuluobu, Wu TY, Liu SM, Gonggalanzi, Baimakangzhuo, Bai C, Duojizhuoma, Liu T, Dai SS, Murphy RW, Qi XB, Dong G, Su B, Zhang YP. Genetic and cultural adaptations underlie the establishment of dairy pastoralism in the Tibetan Plateau. BMC Biol 2023; 21:208. [PMID: 37798721 PMCID: PMC10557253 DOI: 10.1186/s12915-023-01707-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 09/20/2023] [Indexed: 10/07/2023] Open
Abstract
BACKGROUND Domestication and introduction of dairy animals facilitated the permanent human occupation of the Tibetan Plateau. Yet the history of dairy pastoralism in the Tibetan Plateau remains poorly understood. Little is known how Tibetans adapted to milk and dairy products. RESULTS We integrated archeological evidence and genetic analysis to show the picture that the dairy ruminants, together with dogs, were introduced from West Eurasia into the Tibetan Plateau since ~ 3600 years ago. The genetic admixture between the exotic and indigenous dogs enriched the candidate lactase persistence (LP) allele 10974A > G of West Eurasian origin in Tibetan dogs. In vitro experiments demonstrate that - 13838G > A functions as a LP allele in Tibetans. Unlike multiple LP alleles presenting selective signatures in West Eurasians and South Asians, the de novo origin of Tibetan-specific LP allele - 13838G > A with low frequency (~ 6-7%) and absence of selection corresponds - 13910C > T in pastoralists across eastern Eurasia steppe. CONCLUSIONS Results depict a novel scenario of genetic and cultural adaptations to diet and expand current understanding of the establishment of dairy pastoralism in the Tibetan Plateau.
Collapse
Affiliation(s)
- Min-Sheng Peng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- Yunnan Key Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yan-Hu Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- Yunnan Key Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Quan-Kuan Shen
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- Yunnan Key Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xiao-Hua Zhang
- State Key Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, 650091, China
- Institute of Medical Biology, Chinese Academy of Medical Science, Peking Union Medical College, Kunming, 650118, China
| | - Jiajia Dong
- Key Laboratory of Western China's Environmental Systems (Ministry of Education), College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China
| | - Jin-Xiu Li
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- Yunnan Key Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
| | - Hui Zhao
- State Key Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, 650091, China
| | - Hui Zhang
- State Key Laboratory of Primate Biomedical Research (LPBR), School of Primate Translational Medicine, Kunming University of Science and Technology (KUST), Kunming, 650000, China
| | - Xiaoming Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yaoxi He
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Hong Shi
- State Key Laboratory of Primate Biomedical Research (LPBR), School of Primate Translational Medicine, Kunming University of Science and Technology (KUST), Kunming, 650000, China
| | - Chaoying Cui
- High Altitude Medical Research Center, School of Medicine, Tibetan University, Lhasa, 850000, China
| | - Ouzhuluobu
- High Altitude Medical Research Center, School of Medicine, Tibetan University, Lhasa, 850000, China
| | - Tian-Yi Wu
- National Key Laboratory of High Altitude Medicine, High Altitude Medical Research Institute, Xining, 810000, China
| | - Shi-Ming Liu
- National Key Laboratory of High Altitude Medicine, High Altitude Medical Research Institute, Xining, 810000, China
| | - Gonggalanzi
- High Altitude Medical Research Center, School of Medicine, Tibetan University, Lhasa, 850000, China
| | - Baimakangzhuo
- High Altitude Medical Research Center, School of Medicine, Tibetan University, Lhasa, 850000, China
| | - Caijuan Bai
- The First People's Hospital of Gansu Province, Lanzhou, 730000, China
| | - Duojizhuoma
- High Altitude Medical Research Center, School of Medicine, Tibetan University, Lhasa, 850000, China
| | - Ti Liu
- State Key Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, 650091, China
| | - Shan-Shan Dai
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- Yunnan Key Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Robert W Murphy
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, Toronto, ON, M5S 2C6, Canada
| | - Xue-Bin Qi
- State Key Laboratory of Primate Biomedical Research (LPBR), School of Primate Translational Medicine, Kunming University of Science and Technology (KUST), Kunming, 650000, China.
- Tibetan Fukang Hospital, Lhasa, 850000, China.
| | - Guanghui Dong
- Key Laboratory of Western China's Environmental Systems (Ministry of Education), College of Earth and Environmental Sciences, Lanzhou University, Lanzhou, 730000, China.
| | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.
- Yunnan Key Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
- State Key Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming, 650091, China.
| |
Collapse
|
4
|
Weber SE, Frisch M, Snowdon RJ, Voss-Fels KP. Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets. FRONTIERS IN PLANT SCIENCE 2023; 14:1217589. [PMID: 37731980 PMCID: PMC10507710 DOI: 10.3389/fpls.2023.1217589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/21/2023] [Indexed: 09/22/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software "Haploview" and "HaploBlocker". The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no "best" method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Kai P. Voss-Fels
- Institute for Grapevine Breeding, Hochschule Geisenheim University, Geisenheim, Germany
| |
Collapse
|
5
|
Difabachew YF, Frisch M, Langstroff AL, Stahl A, Wittkop B, Snowdon RJ, Koch M, Kirchhoff M, Cselényi L, Wolf M, Förster J, Weber S, Okoye UJ, Zenke-Philippi C. Genomic prediction with haplotype blocks in wheat. FRONTIERS IN PLANT SCIENCE 2023; 14:1168547. [PMID: 37229104 PMCID: PMC10203549 DOI: 10.3389/fpls.2023.1168547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 04/17/2023] [Indexed: 05/27/2023]
Abstract
Haplotype blocks might carry additional information compared to single SNPs and have therefore been suggested for use as independent variables in genomic prediction. Studies in different species resulted in more accurate predictions than with single SNPs in some traits but not in others. In addition, it remains unclear how the blocks should be built to obtain the greatest prediction accuracies. Our objective was to compare the results of genomic prediction with different types of haplotype blocks to prediction with single SNPs in 11 traits in winter wheat. We built haplotype blocks from marker data from 361 winter wheat lines based on linkage disequilibrium, fixed SNP numbers, fixed lengths in cM and with the R package HaploBlocker. We used these blocks together with data from single-year field trials in a cross-validation study for predictions with RR-BLUP, an alternative method (RMLA) that allows for heterogeneous marker variances, and GBLUP performed with the software GVCHAP. The greatest prediction accuracies for resistance scores for B. graminis, P. triticina, and F. graminearum were obtained with LD-based haplotype blocks while blocks with fixed marker numbers and fixed lengths in cM resulted in the greatest prediction accuracies for plant height. Prediction accuracies of haplotype blocks built with HaploBlocker were greater than those of the other methods for protein concentration and resistances scores for S. tritici, B. graminis, and P. striiformis. We hypothesize that the trait-dependence is caused by properties of the haplotype blocks that have overlapping and contrasting effects on the prediction accuracy. While they might be able to capture local epistatic effects and to detect ancestral relationships better than single SNPs, prediction accuracy might be reduced by unfavorable characteristics of the design matrices in the models that are due to their multi-allelic nature.
Collapse
Affiliation(s)
| | - Matthias Frisch
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Anna Luise Langstroff
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | - Andreas Stahl
- Institute for Resistance Research and Stress Tolerance, Julius Kühn Institute, Quedlinburg, Germany
| | - Benjamin Wittkop
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | - Rod J. Snowdon
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | | | | | - László Cselényi
- Department of Cereal Breeding, W. von Borries-Eckendorf GmbH & Co. KG, Leopoldshöhe, Germany
| | - Markus Wolf
- German Seed Alliance GmbH, Holtsee, Germany
- Saaten-Union Biotec GmbH, Leopoldshöhe, Germany
| | | | - Sven Weber
- Institute of Agronomy and Plant Breeding I, Justus Liebig University, Gießen, Germany
| | - Uche Joshua Okoye
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Carola Zenke-Philippi
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| |
Collapse
|
6
|
Boatwright JL, Sapkota S, Jin H, Schnable JC, Brenton Z, Boyles R, Kresovich S. Sorghum Association Panel whole-genome sequencing establishes cornerstone resource for dissecting genomic diversity. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 111:888-904. [PMID: 35653240 PMCID: PMC9544330 DOI: 10.1111/tpj.15853] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 05/27/2022] [Accepted: 05/28/2022] [Indexed: 05/26/2023]
Abstract
Association mapping panels represent foundational resources for understanding the genetic basis of phenotypic diversity and serve to advance plant breeding by exploring genetic variation across diverse accessions. We report the whole-genome sequencing (WGS) of 400 sorghum (Sorghum bicolor (L.) Moench) accessions from the Sorghum Association Panel (SAP) at an average coverage of 38× (25-72×), enabling the development of a high-density genomic marker set of 43 983 694 variants including single-nucleotide polymorphisms (approximately 38 million), insertions/deletions (indels) (approximately 5 million), and copy number variants (CNVs) (approximately 170 000). We observe slightly more deletions among indels and a much higher prevalence of deletions among CNVs compared to insertions. This new marker set enabled the identification of several novel putative genomic associations for plant height and tannin content, which were not identified when using previous lower-density marker sets. WGS identified and scored variants in 5-kb bins where available genotyping-by-sequencing (GBS) data captured no variants, with half of all bins in the genome falling into this category. The predictive ability of genomic best unbiased linear predictor (GBLUP) models was increased by an average of 30% by using WGS markers rather than GBS markers. We identified 18 selection peaks across subpopulations that formed due to evolutionary divergence during domestication, and we found six Fst peaks resulting from comparisons between converted lines and breeding lines within the SAP that were distinct from the peaks associated with historic selection. This population has served and continues to serve as a significant public resource for sorghum research and demonstrates the value of improving upon existing genomic resources.
Collapse
Affiliation(s)
- J. Lucas Boatwright
- Department of Plant and Environmental SciencesClemson UniversityClemsonSouth Carolina29634USA
- Advanced Plant TechnologyClemson UniversityClemsonSouth Carolina29634USA
| | - Sirjan Sapkota
- Advanced Plant TechnologyClemson UniversityClemsonSouth Carolina29634USA
| | - Hongyu Jin
- Center for Plant Science Innovation and Department of Agronomy and HorticultureUniversity of Nebraska‐LincolnLincolnNebraska68588USA
| | - James C. Schnable
- Center for Plant Science Innovation and Department of Agronomy and HorticultureUniversity of Nebraska‐LincolnLincolnNebraska68588USA
| | | | - Richard Boyles
- Department of Plant and Environmental SciencesClemson UniversityClemsonSouth Carolina29634USA
- Pee Dee Research and Education CenterClemson UniversityFlorenceSouth Carolina29506USA
| | - Stephen Kresovich
- Department of Plant and Environmental SciencesClemson UniversityClemsonSouth Carolina29634USA
- Advanced Plant TechnologyClemson UniversityClemsonSouth Carolina29634USA
- Feed the Future Innovation Lab for Crop ImprovementCornell UniversityIthacaNew York14850USA
| |
Collapse
|
7
|
Mangino G, Arrones A, Plazas M, Pook T, Prohens J, Gramazio P, Vilanova S. Newly Developed MAGIC Population Allows Identification of Strong Associations and Candidate Genes for Anthocyanin Pigmentation in Eggplant. FRONTIERS IN PLANT SCIENCE 2022; 13:847789. [PMID: 35330873 PMCID: PMC8940277 DOI: 10.3389/fpls.2022.847789] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 01/20/2022] [Indexed: 05/17/2023]
Abstract
Multi-parent advanced generation inter-cross (MAGIC) populations facilitate the genetic dissection of complex quantitative traits in plants and are valuable breeding materials. We report the development of the first eggplant MAGIC population (S3 Magic EGGplant InCanum, S3MEGGIC; 8-way), constituted by the 420 S3 individuals developed from the intercrossing of seven cultivated eggplant (Solanum melongena) and one wild relative (S. incanum) parents. The S3MEGGIC recombinant population was genotyped with the eggplant 5k probes SPET platform and phenotyped for anthocyanin presence in vegetative plant tissues (PA) and fruit epidermis (FA), and for the light-insensitive anthocyanic pigmentation under the calyx (PUC). The 7,724 filtered high-confidence single-nucleotide polymorphisms (SNPs) confirmed a low residual heterozygosity (6.87%), a lack of genetic structure in the S3MEGGIC population, and no differentiation among subpopulations carrying a cultivated or wild cytoplasm. Inference of haplotype blocks of the nuclear genome revealed an unbalanced representation of the founder genomes, suggesting a cryptic selection in favour or against specific parental genomes. Genome-wide association study (GWAS) analysis for PA, FA, and PUC detected strong associations with two myeloblastosis (MYB) genes similar to MYB113 involved in the anthocyanin biosynthesis pathway, and with a COP1 gene which encodes for a photo-regulatory protein and may be responsible for the PUC trait. Evidence was found of a duplication of an ancestral MYB113 gene with a translocation from chromosome 10 to chromosome 1 compared with the tomato genome. Parental genotypes for the three genes were in agreement with the identification of the candidate genes performed in the S3MEGGIC population. Our new eggplant MAGIC population is the largest recombinant population in eggplant and is a powerful tool for eggplant genetics and breeding studies.
Collapse
Affiliation(s)
- Giulio Mangino
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Valencia, Spain
| | - Andrea Arrones
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Valencia, Spain
| | - Mariola Plazas
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-Universitat Politècnica de València, Valencia, Spain
| | - Torsten Pook
- Animal Breeding and Genetics Group, Department of Animal Sciences, Center for Integrated Breeding Research, University of Göttingen, Göttingin, Germany
| | - Jaime Prohens
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Valencia, Spain
| | - Pietro Gramazio
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-Universitat Politècnica de València, Valencia, Spain
| | - Santiago Vilanova
- Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Valencia, Spain
| |
Collapse
|
8
|
Pook T, Nemri A, Gonzalez Segovia EG, Valle Torres D, Simianer H, Schoen CC. Increasing calling accuracy, coverage, and read-depth in sequence data by the use of haplotype blocks. PLoS Genet 2021; 17:e1009944. [PMID: 34941872 PMCID: PMC8699914 DOI: 10.1371/journal.pgen.1009944] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 11/13/2021] [Indexed: 01/16/2023] Open
Abstract
High-throughput genotyping of large numbers of lines remains a key challenge in plant genetics, requiring geneticists and breeders to find a balance between data quality and the number of genotyped lines under a variety of different existing genotyping technologies when resources are limited. In this work, we are proposing a new imputation pipeline (“HBimpute”) that can be used to generate high-quality genomic data from low read-depth whole-genome-sequence data. The key idea of the pipeline is the use of haplotype blocks from the software HaploBlocker to identify locally similar lines and subsequently use the reads of all locally similar lines in the variant calling for a specific line. The effectiveness of the pipeline is showcased on a dataset of 321 doubled haploid lines of a European maize landrace, which were sequenced at 0.5X read-depth. The overall imputing error rates are cut in half compared to state-of-the-art software like BEAGLE and STITCH, while the average read-depth is increased to 83X, thus enabling the calling of copy number variation. The usefulness of the obtained imputed data panel is further evaluated by comparing the performance of sequence data in common breeding applications to that of genomic data generated with a genotyping array. For both genome-wide association studies and genomic prediction, results are on par or even slightly better than results obtained with high-density array data (600k). In particular for genomic prediction, we observe slightly higher data quality for the sequence data compared to the 600k array in the form of higher prediction accuracies. This occurred specifically when reducing the data panel to the set of overlapping markers between sequence and array, indicating that sequencing data can benefit from the same marker ascertainment as used in the array process to increase the quality and usability of genomic data. High-throughput genotyping of large numbers of lines remains a key challenge in plant genetics and breeding. Cost, precision, and throughput must be balanced to achieve optimal efficiency given available technologies and finite resources. Although genotyping arrays are still considered the gold standard in high-throughput quantitative genetics, recent advances in sequencing provide new opportunities. Both the quality and cost of genomic data generated based on sequencing are highly dependent on the used read-depth. In this work, we propose a new imputation pipeline (“HBimpute”) that uses haplotype blocks to detect individuals of the same genetic origin and subsequently uses all reads of those individuals in the variant calling. Thus, the obtained virtual read-depth is artificially increased, leading to higher calling accuracy, coverage, and the ability to call copy number variation based on low read-depth sequencing data. To conclude, our approach makes sequencing a cost-competitive alternative to genotyping arrays with the added benefit of allowing the calling of structural variation.
Collapse
Affiliation(s)
- Torsten Pook
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany
- * E-mail:
| | | | | | - Daniel Valle Torres
- Plant Breeding, Technical University of Munich, TUM School of Life Sciences Weihenstephan, Freising, Germany
| | - Henner Simianer
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany
| | - Chris-Carolin Schoen
- Plant Breeding, Technical University of Munich, TUM School of Life Sciences Weihenstephan, Freising, Germany
| |
Collapse
|
9
|
Vojgani E, Pook T, Martini JWR, Hölker AC, Mayer M, Schön CC, Simianer H. Accounting for epistasis improves genomic prediction of phenotypes with univariate and bivariate models across environments. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:2913-2930. [PMID: 34115154 PMCID: PMC8354961 DOI: 10.1007/s00122-021-03868-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 05/24/2021] [Indexed: 06/12/2023]
Abstract
The accuracy of genomic prediction of phenotypes can be increased by including the top-ranked pairwise SNP interactions into the prediction model. We compared the predictive ability of various prediction models for a maize dataset derived from 910 doubled haploid lines from two European landraces (Kemater Landmais Gelb and Petkuser Ferdinand Rot), which were tested at six locations in Germany and Spain. The compared models were Genomic Best Linear Unbiased Prediction (GBLUP) as an additive model, Epistatic Random Regression BLUP (ERRBLUP) accounting for all pairwise SNP interactions, and selective Epistatic Random Regression BLUP (sERRBLUP) accounting for a selected subset of pairwise SNP interactions. These models have been compared in both univariate and bivariate statistical settings for predictions within and across environments. Our results indicate that modeling all pairwise SNP interactions into the univariate/bivariate model (ERRBLUP) is not superior in predictive ability to the respective additive model (GBLUP). However, incorporating only a selected subset of interactions with the highest effect variances in univariate/bivariate sERRBLUP can increase predictive ability significantly compared to the univariate/bivariate GBLUP. Overall, bivariate models consistently outperform univariate models in predictive ability. Across all studied traits, locations and landraces, the increase in prediction accuracy from univariate GBLUP to univariate sERRBLUP ranged from 5.9 to 112.4 percent, with an average increase of 47 percent. For bivariate models, the change ranged from -0.3 to + 27.9 percent comparing the bivariate sERRBLUP to the bivariate GBLUP, with an average increase of 11 percent. This considerable increase in predictive ability achieved by sERRBLUP may be of interest for "sparse testing" approaches in which only a subset of the lines/hybrids of interest is observed at each location.
Collapse
Affiliation(s)
- Elaheh Vojgani
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany.
| | - Torsten Pook
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany
| | - Johannes W R Martini
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, State of Mexico, Mexico
| | - Armin C Hölker
- Plant Breeding, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Manfred Mayer
- Plant Breeding, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Chris-Carolin Schön
- Plant Breeding, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Henner Simianer
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany
| |
Collapse
|
10
|
Mora-Poblete F, Ballesta P, Lobos GA, Molina-Montenegro M, Gleadow R, Ahmar S, Jiménez-Aspee F. Genome-wide association study of cyanogenic glycosides, proline, sugars, and pigments in Eucalyptus cladocalyx after 18 consecutive dry summers. PHYSIOLOGIA PLANTARUM 2021; 172:1550-1569. [PMID: 33511661 DOI: 10.1111/ppl.13349] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 01/07/2021] [Accepted: 01/20/2021] [Indexed: 06/12/2023]
Abstract
Natural variation of cyanogenic glycosides, soluble sugars, proline, and nondestructive optical sensing of pigments (chlorophyll, flavonols, and anthocyanins) was examined in ex situ natural populations of Eucalyptus cladocalyx F. Muell. grown under dry environmental conditions in the southern Atacama Desert, Chile. After 18 consecutive dry seasons, considerable plant-to-plant phenotypic variation for all the traits was observed in the field. For example, leaf hydrogen cyanide (HCN) concentrations varied from 0 (two acyanogenic individuals) to 1.54 mg cyanide g-1 DW. Subsequent genome-wide association study revealed associations with several genes with a known function in plants. HCN content was associated robustly with genes encoding Cytochrome P450 proteins, and with genes involved in the detoxification mechanism of HCN in cells (β-cyanoalanine synthase and cyanoalanine nitrilase). Another important finding was that sugars, proline, and pigment content were linked to genes involved in transport, biosynthesis, and/or catabolism. Estimates of genomic heritability (based on haplotypes) ranged between 0.46 and 0.84 (HCN and proline content, respectively). Proline and soluble sugars had the highest predictive ability of genomic prediction models (PA = 0.65 and PA = 0.71, respectively). PA values for HCN content and flavonols were relatively moderate, with estimates ranging from 0.44 to 0.50. These findings provide new understanding on the genetic architecture of cyanogenic capacity, and other key complex traits in cyanogenic E. cladocalyx.
Collapse
Affiliation(s)
| | - Paulina Ballesta
- Institute of Biological Sciences, Universidad de Talca, Talca, Chile
| | - Gustavo A Lobos
- Plant Breeding and Phenomic Center, Faculty of Agricultural Sciences, Universidad de Talca, Talca, Chile
| | - Marco Molina-Montenegro
- Institute of Biological Sciences, Universidad de Talca, Talca, Chile
- Centro de Estudios Avanzados en Zonas Áridas (CEAZA), Facultad de Ciencias del Mar, Universidad Católica del Norte, Coquimbo, Chile
| | - Roslyn Gleadow
- School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
| | - Sunny Ahmar
- Institute of Biological Sciences, Universidad de Talca, Talca, Chile
- College of Plant Sciences and Technology, Huazhong Agricultural University, Wuhan, China
| | - Felipe Jiménez-Aspee
- Department of Food Biofunctionality, Institute of Nutritional Sciences, University of Hohenheim, Stuttgart, Germany
- Departamento de Ciencias Básicas Biomédicas, Facultad de Ciencias de la Salud, Universidad de Talca, Talca, Chile
| |
Collapse
|
11
|
Vojgani E, Pook T, Simianer H. Phenotype Prediction Under Epistasis. Methods Mol Biol 2021; 2212:105-120. [PMID: 33733353 DOI: 10.1007/978-1-0716-0947-7_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
Reliable methods of phenotype prediction from genomic data play an increasingly important role in many areas of plant and animal breeding. Thus, developing methods that enhance prediction accuracy is of major interest. Here, we provide three methods for this purpose: (1) Genomic Best Linear Unbiased Prediction (GBLUP) as a model just accounting for additive SNP effects; (2) Epistatic Random Regression BLUP (ERRBLUP) as a full epistatic model which incorporates all pairwise SNP interactions, and (3) selective Epistatic Random Regression BLUP (sERRBLUP) as an epistatic model which incorporates a subset of pairwise SNP interactions selected based on their absolute effect sizes or the effect variances, which is computed based on solutions from the ERRBLUP model. We compared the predictive ability obtained from GBLUP, ERRBLUP, and sERRBLUP with genotypes from a publicly available wheat dataset and respective simulated phenotypes. Results showed that sERRBLUP provides a substantial increase in prediction accuracy compared to the other methods when the optimal proportion of SNP interactions is kept in the model, especially when an optimal proportion of SNP interactions is selected based on the SNP interaction effect sizes. All methods described here are implemented in the R-package EpiGP, which is able to process large-scale genomic data in a computationally efficient way.
Collapse
Affiliation(s)
- Elaheh Vojgani
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, Department of Animal Sciences, University of Goettingen, Goettingen, Germany.
| | - Torsten Pook
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, Department of Animal Sciences, University of Goettingen, Goettingen, Germany
| | - Henner Simianer
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, Department of Animal Sciences, University of Goettingen, Goettingen, Germany
| |
Collapse
|
12
|
Tibbs Cortes L, Zhang Z, Yu J. Status and prospects of genome-wide association studies in plants. THE PLANT GENOME 2021; 14:e20077. [PMID: 33442955 DOI: 10.1002/tpg2.20077] [Citation(s) in RCA: 127] [Impact Index Per Article: 42.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 11/18/2020] [Indexed: 05/22/2023]
Abstract
Genome-wide association studies (GWAS) have developed into a powerful and ubiquitous tool for the investigation of complex traits. In large part, this was fueled by advances in genomic technology, enabling us to examine genome-wide genetic variants across diverse genetic materials. The development of the mixed model framework for GWAS dramatically reduced the number of false positives compared with naïve methods. Building on this foundation, many methods have since been developed to increase computational speed or improve statistical power in GWAS. These methods have allowed the detection of genomic variants associated with either traditional agronomic phenotypes or biochemical and molecular phenotypes. In turn, these associations enable applications in gene cloning and in accelerated crop breeding through marker assisted selection or genetic engineering. Current topics of investigation include rare-variant analysis, synthetic associations, optimizing the choice of GWAS model, and utilizing GWAS results to advance knowledge of biological processes. Ongoing research in these areas will facilitate further advances in GWAS methods and their applications.
Collapse
Affiliation(s)
| | - Zhiwu Zhang
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, 99164, USA
| | - Jianming Yu
- Department of Agronomy, Iowa State University, Ames, IA, 50010, USA
| |
Collapse
|
13
|
Nyine M, Adhikari E, Clinesmith M, Aiken R, Betzen B, Wang W, Davidson D, Yu Z, Guo Y, He F, Akhunova A, Jordan KW, Fritz AK, Akhunov E. The Haplotype-Based Analysis of Aegilops tauschii Introgression Into Hard Red Winter Wheat and Its Impact on Productivity Traits. FRONTIERS IN PLANT SCIENCE 2021; 12:716955. [PMID: 34484280 PMCID: PMC8416154 DOI: 10.3389/fpls.2021.716955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Accepted: 07/20/2021] [Indexed: 05/13/2023]
Abstract
The introgression from wild relatives have a great potential to broaden the availability of beneficial allelic diversity for crop improvement in breeding programs. Here, we assessed the impact of the introgression from 21 diverse accessions of Aegilops tauschii, the diploid ancestor of the wheat D genome, into 6 hard red winter wheat cultivars on yield and yield component traits. We used 5.2 million imputed D genome SNPs identified by the whole-genome sequencing of parental lines and the sequence-based genotyping of introgression population, including 351 BC1F3:5 lines. Phenotyping data collected from the irrigated and non-irrigated field trials revealed that up to 23% of the introgression lines (ILs) produce more grain than the parents and check cultivars. Based on 16 yield stability statistics, the yield of 12 ILs (3.4%) was stable across treatments, years, and locations; 5 of these lines were also high yielding lines, producing 9.8% more grain than the average yield of check cultivars. The most significant SNP- and haplotype-trait associations were identified on chromosome arms 2DS and 6DL for the spikelet number per spike (SNS), on chromosome arms 2DS, 3DS, 5DS, and 7DS for grain length (GL) and on chromosome arms 1DL, 2DS, 6DL, and 7DS for grain width (GW). The introgression of haplotypes from A. tauschii parents was associated with an increase in SNS, which was positively correlated with a heading date (HD), whereas the haplotypes from hexaploid wheat parents were associated with an increase in GW. We show that the haplotypes on 2DS associated with an increase in the spikelet number and HD are linked with multiple introgressed alleles of Ppd-D1 identified by the whole-genome sequencing of A. tauschii parents. Meanwhile, some introgressed haplotypes exhibited significant pleiotropic effects with the direction of effects on the yield component traits being largely consistent with the previously reported trade-offs, there were haplotype combinations associated with the positive trends in yield. The characterized repertoire of the introgressed haplotypes derived from A. tauschii accessions with the combined positive effects on yield and yield component traits in elite germplasm provides a valuable source of alleles for improving the productivity of winter wheat by optimizing the contribution of component traits to yield.
Collapse
Affiliation(s)
- Moses Nyine
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Elina Adhikari
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Marshall Clinesmith
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| | - Robert Aiken
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| | - Bliss Betzen
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Wei Wang
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Dwight Davidson
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Zitong Yu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Yuanwen Guo
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Fei He
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Alina Akhunova
- Integrated Genomics Facility, Kansas State University, Manhattan, KS, United States
| | - Katherine W. Jordan
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
- United States Department of Agriculture, Agricultural Research Service Hard Winter Wheat Genetics Research Unit, Manhattan, KS, United States
| | - Allan K. Fritz
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| | - Eduard Akhunov
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
- *Correspondence: Eduard Akhunov
| |
Collapse
|
14
|
Crouse WL, Kelada SNP, Valdar W. Inferring the Allelic Series at QTL in Multiparental Populations. Genetics 2020; 216:957-983. [PMID: 33082282 PMCID: PMC7768242 DOI: 10.1534/genetics.120.303393] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Accepted: 10/12/2020] [Indexed: 12/25/2022] Open
Abstract
Multiparental populations (MPPs) are experimental populations in which the genome of every individual is a mosaic of known founder haplotypes. These populations are useful for detecting quantitative trait loci (QTL) because tests of association can leverage inferred founder haplotype descent. It is difficult, however, to determine how haplotypes at a locus group into distinct functional alleles, termed the allelic series. The allelic series is important because it provides information about the number of causal variants at a QTL and their combined effects. In this study, we introduce a fully Bayesian model selection framework for inferring the allelic series. This framework accounts for sources of uncertainty found in typical MPPs, including the number and composition of functional alleles. Our prior distribution for the allelic series is based on the Chinese restaurant process, a relative of the Dirichlet process, and we leverage its connection to the coalescent to introduce additional prior information about haplotype relatedness via a phylogenetic tree. We evaluate our approach via simulation and apply it to QTL from two MPPs: the Collaborative Cross (CC) and the Drosophila Synthetic Population Resource (DSPR). We find that, although posterior inference of the exact allelic series is often uncertain, we are able to distinguish biallelic QTL from more complex multiallelic cases. Additionally, our allele-based approach improves haplotype effect estimation when the true number of functional alleles is small. Our method, Tree-Based Inference of Multiallelism via Bayesian Regression (TIMBR), provides new insight into the genetic architecture of QTL in MPPs.
Collapse
Affiliation(s)
- Wesley L Crouse
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina 27599
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Samir N P Kelada
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
- Marsico Lung Institute, University of North Carolina, Chapel Hill, North Carolina 27599
| | - William Valdar
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| |
Collapse
|
15
|
A haplotype-led approach to increase the precision of wheat breeding. Commun Biol 2020; 3:712. [PMID: 33239669 PMCID: PMC7689427 DOI: 10.1038/s42003-020-01413-2] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 10/15/2020] [Indexed: 12/11/2022] Open
Abstract
Crop productivity must increase at unprecedented rates to meet the needs of the growing worldwide population. Exploiting natural variation for the genetic improvement of crops plays a central role in increasing productivity. Although current genomic technologies can be used for high-throughput identification of genetic variation, methods for efficiently exploiting this genetic potential in a targeted, systematic manner are lacking. Here, we developed a haplotype-based approach to identify genetic diversity for crop improvement using genome assemblies from 15 bread wheat (Triticum aestivum) cultivars. We used stringent criteria to identify identical-by-state haplotypes and distinguish these from near-identical sequences (~99.95% identity). We showed that each cultivar shares ~59 % of its genome with other sequenced cultivars and we detected the presence of extended haplotype blocks containing hundreds to thousands of genes across all wheat chromosomes. We found that genic sequence alone was insufficient to fully differentiate between haplotypes, as were commonly used array-based genotyping chips due to their gene centric design. We successfully used this approach for focused discovery of novel haplotypes from a landrace collection and documented their potential for trait improvement in modern bread wheat. This study provides a framework for defining and exploiting haplotypes to increase the efficiency and precision of wheat breeding towards optimising the agronomic performance of this crucial crop. Brinton, Uauy and colleagues utilize genomic data from the 10+ Wheat Genome Project to develop a useful tool for studying and generating new wheat cultivars. This framework uses advanced exploitation of wheat haplotypes to bring newfound precision and efficiency to wheat breeding.
Collapse
|
16
|
Discovery of beneficial haplotypes for complex traits in maize landraces. Nat Commun 2020; 11:4954. [PMID: 33009396 PMCID: PMC7532167 DOI: 10.1038/s41467-020-18683-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 09/06/2020] [Indexed: 12/17/2022] Open
Abstract
Genetic variation is of crucial importance for crop improvement. Landraces are valuable sources of diversity, but for quantitative traits efficient strategies for their targeted utilization are lacking. Here, we map haplotype-trait associations at high resolution in ~1000 doubled-haploid lines derived from three maize landraces to make their native diversity for early development traits accessible for elite germplasm improvement. A comparative genomic analysis of the discovered haplotypes in the landrace-derived lines and a panel of 65 breeding lines, both genotyped with 600k SNPs, points to untapped beneficial variation for target traits in the landraces. The superior phenotypic performance of lines carrying favorable landrace haplotypes as compared to breeding lines with alternative haplotypes confirms these findings. Stability of haplotype effects across populations and environments as well as their limited effects on undesired traits indicate that our strategy has high potential for harnessing beneficial haplotype variation for quantitative traits from genetic resources. Genetic variations present in landraces are critical for crop genetic improvement. Here, the authors map haplotype-trait associations in ~1000 doubled haploid lines derived from three European maize landraces and identify beneficial haplotypes for quantitative traits that are not present in breeding lines.
Collapse
|
17
|
Pook T, Schlather M, Simianer H. MoBPS - Modular Breeding Program Simulator. G3 (BETHESDA, MD.) 2020; 10:1915-1918. [PMID: 32229505 PMCID: PMC7263682 DOI: 10.1534/g3.120.401193] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 03/27/2020] [Indexed: 11/22/2022]
Abstract
The R-package MoBPS provides a computationally efficient and flexible framework to simulate complex breeding programs and compare their economic and genetic impact. Simulations are performed on the base of individuals. MoBPS utilizes a highly efficient implementation with bit-wise data storage and matrix multiplications from the associated R-package miraculix allowing to handle large scale populations. Individual haplotypes are not stored but instead automatically derived based on points of recombination and mutations. The modular structure of MoBPS allows to combine rather coarse simulations, as needed to generate founder populations, with a very detailed modeling of todays' complex breeding programs, making use of all available biotechnologies. MoBPS provides pre-implemented functions for common breeding practices such as optimum genetic contributions and single-step GBLUP but also allows the user to replace certain steps with personalized and/or self-written solutions.
Collapse
Affiliation(s)
- Torsten Pook
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, 37075 Goettingen, Germany
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
| | - Martin Schlather
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
- Stochastics and Its Applications Group, University of Mannheim, 68131 Mannheim, Germany
| | - Henner Simianer
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, 37075 Goettingen, Germany
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
| |
Collapse
|